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r | YHIS volume is the sixth in a series of 
publications prepared to assist test users 
in education, industry, psychiatry, and 

psychology to locate and evaluate tests and books 

on testing. A detailed account of the initiation 
and growth of this service to test users has been 
presented in the Introduction of The Nineteen 

Forty Mental Measurements Yearbook.* Con- 

sequently, the earlier publications in the series 

will be described only briefly here. 


EARLIER PUBLICATIONS 


The first two publications ¢ in this series, 
published in 1935 and 1936, respectively, were 
noncritical bibliographies of tests. 

The third publication $ in 1937 was the first 
issue to include a bibliography of books on test- 
ing and closely related fields, and to reprint 
critical excerpts from all published reviews 
known to the Editor. 

* Buros, y Ae Krisen, editor. The Nineteen Forty Mental 
Measmenienis book, Highland Park, N. J.: Ages prit; 
"t. Ara brat "e 
in xn Toi 49. 
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Measurements Yearbook and B8s7 in The Nineteen 
tal Measurements Yearbook. 


By 1938, the publication had developed into _ 
a 430-page clothbound volume * which 
duced, in addition to the bibliographical infor- 
mation and excerpts of book reviews of the 
earlier publications, 331 original reviews of 
tests by 133 educationists, psychologists, subject 
matter specialists, teachers, and test techni- 
cians.f Since one of the two book-review- 
excerpt sections, "Research and Statistical Meth- 
odology Books," was also of value to many 
research workers riot interested in the testing 
section, it was also published as a separate mon- 
ograph.] This section reprint was favorably re- 
viewed in the journals of such varied fields as 
accounting, agriculture, biology, business, chem- 
istry, economics, eugenics, forestry, mathe- 
matics, medicine, philosophy, physics, science, 
sociology, and statistics.§ 

The fifth publication in the series, The Nine- 
teen Forty Mental Measurements Y earbook, was 
published in 1941. The section on research and 
statistical methodology books was published as 
à separate volume ** and not included in The 
1940 Yearbook. 'The number of test reviewers, 
however, was nearly doubled, and the number 
of original test reviews and excerpts from re- 
views of measurement books greatly increased. tt 
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THE THIRD YEARBOOK 


Plans for the continued publication of a new 
volume every two years had to be abandoned for 
the duration of World War II. Consequently, 
The Third Mental Measurements Yearbook 
covers the period 1940 through 1947, a period 
which normally would have been covered by 
four yearbooks. It is obvious, therefore, that 
the review coverage in the yearbooks to date is 
not as comprehensive as would have been the 
case had the publication schedulé not been in- 
terrupted by the war. Hereafter, the Editor 
hopes that this project will receive sufficient 
support to make it possible to publish a Mental 
Measurements Yearbook every two or three 
years. As in the past, each yearbook will con- 
sist entirely of new material and will, therefore, 
supplement rather than supplant earlier year- 
books in the series. 

The Third Mental Measurements Yearbook 
consists of two major sections; “Tests and Re- 
views" (pp. 1-750) and “Books and Reviews" 
(pp. 751-978). In addition there are five in- 
dexes and directories: Periodical Directory and 
Index, Publishers Directory and Index, Index 
of Titles, Index of Names, and Classified Index 
of Tests. 


TESTS AND REVIEWS 


The section “Tests and Reviews" lists 663 
tests, 713 original reviews by 320 reviewers, 66 
review excerpts, and 3,368 references on the 
construction, validity, use, and limitations of 
specific tests. Seventy per cent of the tests have 
been reviewed by one or more reviewers; 30 
per cent by two or more reviewers; 6.5 per cent 
by three or more reviewers; and 1.5 per cent 
by four or more reviewers. Some of these tests 
have been reviewed in earlier yearbooks; many 
will be again reviewed in future yearbooks. It 
is planned that most tests will eventually re- 
ceive at least three reviews each. 

à The yearbook attempts to list all commer- 
cially available tests—educational, psychological, 
and vocational—published as separates in Eng- 
lish-speaking countries between October 1940 
and December 1947. The following two, classes 
of tests are also included: (a) older tests se- 
lected for review; (b) tests published during 
the 15-year period (1933-1948) covered by 
this series of yearbooks and bibliographies but 
not previously listed. 


The objectives of the section “Tests and Re- 
views" are the same as those reported for the- 
corresponding section in The 1940 Yearbook: 


1) To make readily available comprehensive and 
up-to-date biblographies of recent tests published in all 
English-speaking countries. A 

2) To make readily available hundreds of frankly 
critical test reviews, written by persons of outst anding 
ability representing various viewpoints, which will as- 
sist test users to make more discriminating selections 
of the standard tests which will best meet th 

) To make readily available comprehensive 
curate bibliographies of references on the const 
validation, use, and limitations of specific te 

4) To impel authors and publishers to place 
but better tests on the market and to provide test users 
with detailed and accurate information on the con- 
struction, validation, uses, and limitations of their tests 
at the time that they are first placed on the market. | 

5) To suggest to test users better methods of arriv- = 
ing at their own appraisals of both standard and non- 
standard tests in light of their particular values and 
needs. 

6) To stimuluate cooperating reviewers—and oth- 
ers to a less extent—to reconsider and think through 
more carefully their beliefs and values relevant to 
testing. 

7) Fo inculcate upon test users a keener awareness 
of both the values and dangers which may accompany 
the use of standard tests. ER 

8) To impress test users with the desirability of 
suspecting all standard tests—even though prepared by 
well-known authorities—unaccompanied by detaile 
data on their construction, validation, use, and limita- 
tions. 


BOOKS AND REVIEWS 

The section “Books and Reviews" lists 549 
books on measurements and closely related 
fields and 785 excerpts from reviews of these 
books in r35 journals. An attempt has been 
made to list all measurement books published 
in English-speaking countries between October 
1940 and December 1947. In addition, older 
books are listed whenever accompanied by re- 
view excerpts not previously published in this 
series. The range of books is the same as that 
in The 1940 Yearbook with one exception. 
Books on the methodology of factor analysis 
are not included; such books will be included 
in the next Yearbook of Research and Statisti- 
cal Methodology. 

The objectives of the section “Books and 
Reviews" are the same as those reported for 
the corresponding section in The 1940 Year- 
book: 


1) To make readily available comprehensive and 
up-to-date bibliographies of recent boo published in 
all English-s ing countries in measurements ani 
closely associated fields. 

2) To make readily available evaluative excerpts 
from hundreds of book reviews appearing in a great 
variety of journals in this country and abroad in order 
to assist test users to make more discriminating sclec- 
tions of books for study and purchase. 
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3) To stimulate readers to develop more critical 
attitudes toward what they read. 

4) To make readily available important and pro- 
vocative statements which, though appearing in book 
reviews, have considerable value entirely apart from 
a consideration of the book under review. 

5) To point out books which are not being reviewed 
but which probably merit review. 

6) To improve the quality of book reviews by stimu- 
lating review editors to make greater efforts to choose 
competent reviewers who will contribute frankly criti- 
cal reviews. 

7) To improve the quality of book reviews by stimu- 
lating reviewers “to take their responsibilities more 
seriously” by refusing to review books which they can- 
not appraise competently and honestly. 


SUGGESTIONS TO TEST REVIEWERS 


The following instructions—presented on a 
mimeographed sheet entitled “Suggestions for 
Reviewing Tests for The Third Mental Meas- 
urements Yearbook”—were sent to each test 
reviewer. 


_ Y) Test reviews should be written with the follow- 
ing major objectives in mind: 

a) To provide test users with carefully prepared 
appraisals of tests for their guidance in selecting 
and using tests. 

b) To stimulate progress toward higher profes- 
sional standards in the construction and use of tests 
by commending good work, by censuring poor work, 
and by suggesting improvements. 

c) To impel test authors and publishers to present 
more detailed information on the construction, vali- 
dation, uses, and misuses of their tests. 

2) Reviews should be frankly critical with both 
strengths and weaknesses pointed out in a judicious 
manner, Descriptive comments should be the minimum 
necessary to support the critical portions of the re- 
view. Criticism should be as specific as possible; im- 
plied criticisms meaningful only to students of testing 
should be avoided. Reviews should be written pri- 
marily for the rank and file of test users. An indication 
of the importance and value of a test relative to com- 
peting tests should be presented whenever possible. If 
à reviewer considers a competing test better than one 
under review, the competing test should be specifically 
named. 

3) Concise reviews of 300 to 500 words are wanted. 
The average length of the reviews written by any one 
person should not exceed 500 words, Unless arrange- 
ments are made otherwise, no review may exceed 700 
words. The older tests—say tests published prior to 
1940—should generally receive shorter reviews than 
the more recent tests. 

4) Each test will be reviewed by two or more per- 
sons in order to secure reviews representing various 
viewpoints among test users. The editor reserves the 
right to delete noncritical, descriptive content which 
overlaps. similar material presented another re- 
viewer or which appears unessential. Reviewers will 
have an opportunity to see galley proofs of their re- 
views. 

) Reviewers may write to test authors and pub- 
lishers for additional information about the construc- 
tion, validation, and use of tests being reviewed either 
directly or through the editor. 

6) If, for any reason, a reviewer concludes that he 
is not in a position to write a frankly critical review 
in a scholarly and unbiased manner, it is suggested 


that he request the assignment of another test for 
review. 

7) The reviewer's name (as he wishes it written), 
title, and position should precede each review. Reviews - 
should be typed with double spacing and submitted in 
duplicate. test review should start on a new page. 

8) Only the title and forms of the test reviewed 
need to be indicated. The editor will prepare the full 
bibliographic entry which will include: title, date, in- 
dividual or group test, number of forms, cost, time 
to administer, author, publisher, references on the con- 
struction, validation, and use of the test, and the titles 
of parts or subtests printed as separate booklets. 

9) Reviewers may find it of interest and value to 
skim over the Preface and Introduction of The 1940 
Mental Measurements Yearbook. n 

10) The due date for your review is Ina 
few cases arrangements may be made to submit re- 
views later; especially where additional time is needed 
for trying out tests. 


HOW TO USE THIS YEARBOOK 


The following suggestions should be care- 
fully read by all persons wishing to get the 
maximum value out of The Third Mental 
Measurements Yearbook. 

1) Contents. The Contents (p. vii) should 
be consulted to get an over-all picture of the- 

-volume's contents and the classification plan 
used. The Contents lists all the headings and 
subheadings under which tests are classified in 
the section "Tests and Reviews.” 

2) Classified Index of Tests (pp. 1035-47). 
This index, an expanded table of contents, pre- 
sents a complete list of tests, books devoted ex- 
clusively to specific tests, and reviewers of these 
tests and books. The Classified Index should 
prove very useful in helping readers to locate 
material in the section "Tests and Reviews." 
Stars indicate new tests not previously listed 
in the series ; asterisks indicate tests which have 
been revised or have been issued in new form 
since last listed in this series. 

3) Test Entries. For each test, an attempt 
has been made to present in the bibliographic 
entry the following information in the order 
given: 

a) tithe. The test titles are printed in boldface 

» Secondary and series titles are set off from main titles 
by a colon. Subtest or booklet titles are printed in 


small capital letters. When the titles on the test book- 
let io the test manual differ, the better known title 
1S u: |a Pi 

b) DESCRIPTION OF THE GROUPS FOR WHICH THE TEST 
18 INTENDED. The grade, chronological age, or semes- 
ter range is usually given. "Grades 1B, 1A, 2-3, 4-12, 
1327 - indicates that there are five test booklets: a 
booklet for the first half of grade 1, a booklet for 
$e second half of grade 1, a booklet for grades 2 and 

a booklet for grades 4 to 12 inclusive, and a booklet 
for undergraduate and graduate students in colleges 
and universities. "First, second semesters” indicates 
that there are two test booklets : one covering the work 
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of the first semester, the other covering the work of 
the second semester. 

C) DATE OF COPYRIGHT OR PUBLICATION. Whenever 
known, copyright dates are reported; otherwise, pub- 
lication dates are given. Generally, for tests revised or 
published over a period of years the range of copyright 
dates is given; however, there are many exceptions to 
this rule in the case of tests which appear in a new 
form annually. 

d) MACHINE-SCORABLE TESTS. Tests which may be 
scored by the International Test Scoring Machine (see 
p. 464) are marked “IBM,” the initial letters of the 
first three words of the manufacturer, International 
Business Machines Corporation. 

€) INDIVIDUAL OR GROUP TEST. All tests are group 
tests unless otherwise indicated. 

f), FORMS, PARTS, AND LEVELS. Instead of merely re- 
porting the number of forms as in previous yearbooks, 
all available forms are identified; e.g., Forms A, B, C. 

g) cost or Tests. The yearbook usually reports the 
price of a single copy or, when the average cost per 
copy is less, the price per package of about twenty-five 
tests. Prices of specimen sets and accessory materials 
are also included. Although every precaution has been 
taken to ensure accuracy, some prices may be in error 
and other prices are likely to increase if the present 
inflationary tendency continues. For full and up-to- 
date information on test prices, especially for quantity 
orders, the latest catalog of the publisher should be 
consulted, 

h) time. The number of minutes of actual working 
time allowed examinees and the approximate length of 
time needed for administering a test are reported» 
whenever obtainable. The latter figure is always en- 
closed in parentheses. Thus, “s0(60) minutes" indi- 
cates that the examinees are allowed fifty minutes of 
working time and that a total of sixty minutes is needed 
to administer the test. ! 

i) AutHor, For most tests, all authors are reported. 
Tn the case of tests which appear in a new form each 
year, only authors of the most recent forms are listed. 
Names are reported exactly as printed on test book- 
lets. Names of editors are not reported for tests edited 
by two or more persons. 

j) PUBLISHER. The full name of the publisher or 
distributor is reported for each test. Whenever a test 
i$ available from a distributor rather than from the 
publisher, only the name of the distributor is given. 
For addresses and branch offices of the publishers, sce 
the Publishers Directory and Index (pp. 986-91). 


4) Test References. All known references, 
published and unpublished, on the construction, 
validity, use, and limitations of each test are re- 
ported immediately after the test entry. These 
references are arranged in chronological order 
by year of publication and alphabetically by 
authors within years. The chronological ar- 
rangement permits the reader to locate quickly 
the most recent references on each test. The 
asterisk following a reference indicates that the 
Editor has examined the reference to determine 
relevancy and to ensure an accurate biblio- 
graphic listing. Cross references are given to 
references previously listed in The Nineteen 
Forty Mental Measurements Yearbook; for ex- 
ample, “1-6. See 40:1211” means that six ref- 
erences are listed in The Nineteen Forty Men- 


tal Measurements Yearbook under entry 1211. 
Cross references are given also to abstracts and 
listings. in Psychological Abstracts; for exam- 
ple, “(PA 22:1643)” should be interpreted 
thus: see Psychological Abstracts, Volume 22, 
entry 1643 for an abstract of the reference in 
question ; “(PA 22:1644, title only)” indicates 
that the reference was listed but not abstracted 
in Psychological Abstracts. It is hoped that 
these cross references to Psychological Ab- 
stracts will be of considerable value to the seri- 
ous student who wishes to do selected reading 
on a particular test. 

5) Original Test Reviews. Original test re- 
views (i.e., reviews written specifically for this 
yearbook at the invitation of the Editor) of 
particular test are arranged in alphabetical 
order by reviewers. The name, position, and ad- 
dress of the cooperating reviewer are listed in 
italic type at the beginning of each review. 

6) Excerpted Test Reviews. Excerpts from 
test reviews published elsewhere appear imme- 
diately after the original test reviews in alpha 
betical order by journals. Reviews which in- 
clude no critical comment are listed but not 
excerpted. Readers should note that the critical 
portions of all test reviews, regardless of merit, 
found in professional and scholarly journals 
are included in this yearbook. Asterisks and 
ellipses within excerpts indicate the omission of 
nonevaluative material which appeared in the 
original review; asterisks indicate a break in 
continuity of reading, and ellipses indicate con- 
tinuity of reading. 

7) Cross References to Test Reviews. Cross 
references to reviews of the same test in earlier 
yearbooks and to related reviews in this volume 
follow the original and excerpted reviews. Cross 
references are to be interpreted thus: “see 497” 
means see entry 497 in this volume; "see 
38:934" means see entry 934 in The Nineteen 
Thirty Eight Mental Measurements Yearbook ; 
and “see 40:1243" means see entry 1243 in The 
Nineteen Forty Mental Measurements Year- 
book. 

8) Book Entries. The books listed in the 
section “Books and Reviews” are arranged in 
alphabetical order by authors. Rather com- 
plete bibliographic information—including both 
American and British publishers and prices— 
has been presented for each book. Stars indi- 
cate new books not previously listed in this 
series. The asterisk following a book entry in- 
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dicates that the Editor has examined the book 
to determine relevancy and to ensure an accu- 
rate bibliographic listing. 

9) Classified Index of Books. A roughly 
classified subject index (pages 1035-47) will 
assist the reader to locate books on a particular 
subject. In addition to using this index, readers 
are urged to skim over titles and excerpts in 
search of works which otherwise may be over- 
looked. 

IO) Book Reviews. Excerpts from book re- 
views originally published elsewhere are ar- 
ranged under each book in alphabetical order 
by journals. Reviews which included no critical 
comment are listed but not excerpted. Readers 
should note that the critical portions of all book 
reviews, regardless of merit, found in profes- 
sional and scholarly journals are included in 
this yearbook. Asterisks and ellipses within ex- 
cerpts indicate the omission of nonevaluative 
material which appeared in the original review ; 
asterisks indicate a break in continuity of read- 
ing, and ellipses indicate continuity of reading. 
Books dealing with specific tests are also listed 
in the section “Tests and Reviews” immediately 
following the relevant test ; review excerpts for 
these books appear only in the section “Tests 
and Reviews.” 

II) Cross References to Book Reviews. 
Cross references to reviews of the same book in 
earlier yearbooks follow the excerpted reviews. 
Cross references are to be interpreted thus: 
“see 987” means see entry 987 in this volume; 
“see 38:B326” means see entry B326 in The 
Nineteen Thirty Eight Mental Mcasurements 
Yearbook; and “see 40:B858” means see entry 
B858 in The Nineteen Forty Mental Measure- 
ments Yearbook. 

12) Catchwords. The running heads present 
catchwords to assist readers in the location of 
particular material. In the section “Tests and 
Reviews” and in the Classified Index of Tests, 
the headings and subheadings under which the 
tests are classified are used as catchwords. In 
the section “Books and Reviews,” the last 
names of book authors are used as catchwords. 
In the Periodical Directory and Index, Pub- 
lishers Directory and Index, Index of Titles, 
and Index of Names, one or more words show- 
ing the alphabetical sequence of the index are 
used as catchwords. 

13) Periodical Directory and Index (pp. 
979-85). The major purpose of this directory 
and index is to serve as a key to the abbrevia- 


tions used for excerpted journals. The names 
and addresses of the editor and review editor 
are given for each journal. Numbers preceded 
by “test” refer to test review excerpts; num- 
bers preceded by “bk” refer to book review 
excerpts. Test references are not indexed. 

14) Publishers Directory and Index (pp. 
986-91). The major purpose of the directory 
and index is to provide addresses of test and 
book publishers; branch offices are given for 
selected test publishers. Stars indicate test pub- 
lishers which issue catalogs devoted entirely or 
in large part to tests; asterisks indicate other 
publishers of one or more tests listed in this 
volume. Numbers preceded by “test” refer to 
tests ; numbers preceded by “bk” refer to books. 

15) Index of Titles (pp. 992-1006). If the 
exact title of a test or book is known, the quick- 
est way of locating it is to use the Index of 
Titles, an alphabetical listing of all books and 
tests in this volume. Book titles are set in italic 
type and test titles in roman type. 

16) Index of Names (pp. 1007-34). This in- 

*dex lists the names of all authors and reviewers 
and also serves as a directory of cooperating 
test reviewers. The name of each cooperating 
test reviewer appears in large and small capitals 
and is followed by his professional position and 
address in italic type. The abbreviations and 
numbers following the names may be interpreted 
thus: "rev, 68" indicates authorship of an 
original review of test 68 ; “test, 73” indicates au- 
thorship of test 73; “bk, 896" indicates author- 
ship of book 896; “exc, 112," indicates author- 
ship of an excerpted review of test (or book) 
112; "ref, 126(34)" indicates authorship of 
reference 34 for test 126; "cross ref, 347" in- 
dicates that a cross reference to the name is 
mentioned under test (or book) 347; "other, 
466" indicates that the name is mentioned in 
the entry for test (or book) 466 in a manner 
not classifiable under any of the above cate- 
gories, 


SPONSORSHIP 


In March 1946 the trustees of Rutgers Uni- 
versity established within. the School of Educa- 
tion an Institute of Mental Measurements to 
serve as a clearing house of information about 
educational, psychological, and vocational tests. 
The establishment of this Institute made it pos- 
sible for the Editor, as Director of the Insti- 
tute, to be relieved of teaching duties in order 
that he might devote full time to the prepara- 


»-— 
xiv] PREFACE 


tion of the first postwar Mental Measurements 
Yearbook. The Institute of Mental Measure- 
ments received additional support in the form 
of grants for the years 1946-47 and 1948-49 
from the Rutgers University Research Council. 
The sponsorship and support of the School of 
Education of Rutgers University made the prep- 
aration and publication of this volume feasible. 
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I have been most fortunate in having had the 
assistance of an exceptionally able office staff 
in the preparation of this volume. My secretary, 


Mrs. Ruth Gruner Thurlow, did outstanding 
work at every stage of progress; her efficient 
management of the editorial office and her com 
petence as a copyholder, proofreader, and edi- 
torial factotum cannot be praised too high! 

My part-time editorial staff consisted of three 
psychologists: Mrs. Jean Abel Adams, Mrs. 
Ruth Deitz Churchill, and Mrs. Olive Caldwci 
Hotaling. Mrs. Adams competently assisted in 
a variety of editorial activities—checking ac- 
curacy of data, copyholding, copyreading, and 
proofreading; her painstaking insistence upon 
accuracy and consistency has been a great com 

fort to me. Mrs. Churchill assisted in the editing 
of the major part of the manuscript and read the 
galley proofs, without copy, twice; she made 
many valuable suggestions for improving the 
manuscript and did a superb job of editing cops 
and reading proofs. Mrs. Hotaling assisted i: 
the editing of part of the manuscript and served 
as chief proofreader for the major part of the 
volume; I am deeply grateful and appreciative 
of her invaluable help. Thanks are due also to 
Mr. Warner H. Thurlow who prepared and 
checked the five indexes and to Miss Marion 
Byrne and Mrs. Doris McCann who typed a 
part of the manuscript. I wish to make a special 
acknowledgment to my wife, Luella Buros, the 
only unpaid member of my office staff, who con- 
tributed considerable secretarial and editorial 
assistance. Finally, I wish to express my appreci 
ation for the encouragement and support which ! 
have received from Dean Albert E. Meder, Jr. 
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[1] 
*American School Achievement Tests. Grades 1, 


2-3, 4-6; 1941-43 ; 3 levels; Robert V. Young and Willis 
E. Pratt; Public School Publishing Co. 

a) PRIMARY BATTERY I. Grade 1; 1941; Forms A, B; 
$3 per 100; 30¢ per specimen set, postpaid; 35(50) 
minutes; coauthor, Frank Gatto. 

b) PRIMARY BATTERY II. Grades 2-3; 1941; Forms A, 
B, C; $4.80 per 100; 36€ per specimen set, postpaid ; 
85(105) minutes. 

C) INTERMEDIATE BATTERY. Grades 4-6; 1942-43; 
Forms A, B, C; $7.20 per 100; 36¢ per specimen set, 
postpaid; 127(147) minutes. 


REFERENCES 


1, SrowE, Crarence R. "Validity of Tests in Beginning 
Reading.” El Sch J 43:361-5 F "43. * (PA 18:2605) 


Walter W. Cook, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. These tests are designed to measure a few 
of the traditionally emphasized reading, arith- 
metic, and language skills with short, sharply 
discriminating scales. To make this possible, 
three batteries are provided: one for the first 
grade, one for grades two and three, and one 
for grades four through six. The first grade 
tests measure word recognition, word meaning, 
and number concepts. The other two batteries 
measure sentence and word meaning, paragraph 
meaning, arithmetic computation, arithmetic 


problems, language (English mechanics), and 
spelling. 

The three manuals are unique in that they 
give in considerable detail the sources of the 
items, i.e, word lists, textbooks, courses of 
study, and curriculum studies. For some of the 
tests the difficulty and discriminating power of 
the items are reported. However, it is regret- 
table that more information is not given re- 
garding the proper interpretation and use of the 
scores, For example, the first grade manual 
reports the frequency levels in the Wheeler- 
Howell and Gates word lists of the vocabulary. 
in the scales. Approximately 25 per cent are 
beyond the first 500 words, The 24 words of 
each scale were selected because of the sharp 
increase in accuracy from grade 1A through 
2B and 2A and also to give a mean accuracy 
of approximately 50 per cent at the 1A level. 
All this is sound test-building procedure, but it 
should have been explained that it does not 
insure that the words in the scale have been in 
the reading curriculum of a given school. To 
the extent that the words have been taught the 
test may properly be considered an achievement 
test. To the extent that they have not been 
taught it should be considered a reading readi- 
ness test. Basal readers used in the first grade 
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vary sufficiently in vocabulary to warrant spe- 
cial tests for each set, if achievement is to be 
measured. 

The manuals place emphasis on the diagnos- 
tic value of the tests. They have this value only 
to the extent that the tests may be taught. The 
survey function is more appropriate with such 
short undifferentiated scales. The test is not 
designed to influence greatly what is taught or 
how it is taught. The basic objectives of the 
traditional elementary school are measured; 
those of the modern school are not. 

Only age and grade norms are reported with 
no attempt to prepare teachers for the fact that 
a range of from four to eight years should be 
expected on any test at a given grade level. 
Percentile norms at each grade level would tend 
to avoid faulty conceptions regarding the at- 
tainable homogeneity of grade groups and the 
tendency to use norms as standards. The chief 
value of age and grade norms is that progress 
may be measured in each skill over a period of 
years. No provision is made for either profile 
or progress charts. Hence, the values of such 
norms are lost and all their limitations are 
emphasized. 

The tests are completely objective and de- 
signed for the use of an answer sheet.and mask 
scoring. However, neither answer sheets nor 
mask scoring materials are provided. The tests 
are made consumable and panel scoring keys 
provided. Certainly there is no excuse for this 
above the fourth grade level except profit. 

The reliabilities of the tests are high consid- 
ering their length. The range is from .71 to .96 
with a median of .86. The variability of the 
groups upon which reliability coefficients are 
based is not given, nor is the standard error of 
measurement. 


Gordon N. Mackenzie, Executive Officer of 
the Horace Mann-Lincoln Institute of School 
Experimentation, Head of the Department of 
Curriculum and Teaching, and Professor of 
Education; and Glen Hass, Assistant in the 
Horace Mann-Lincoln Institute of School Ex- 
perimentation; Teachers College, Columbia 
University, New York, New York. According 
to the manual accompanying each battery, the 
purposes of the American School Achieve- 
ment Tests are as follows: to measure pupil 
progress; to assist in the classification of pupils ; 
to furnish data for remedial programs in read- 


ing and the language arts; and to diagnose 
pupils" knowledge of specific computations and 
problem-solving. The first grade battery in- 
cludes tests in word recognition, word meaning, 
and numbers. The second and third grade bat- 
tery and the intermediate battery include read- 
ing, arithmetic, language, and spelling tests. 
The manual states that the tests are confined 
to these fields because in the "content field," 
important outcomes such as habits, attitudes, 
and ideals *do not lend themselves to objective 
testing." The tests are constructed on the basis 
of word lists, courses of study from various 
communities and states, and textbooks used in 
these grades. Reliability of the tests has been 
established experimentally, and grade and age 
norms are furnished on the basis of median 
scores received by pupils in schools ranging 
from small rural schools to urban school dis- 
tricts. 

In general, it might be said that these tests 
would do an acceptable job of measuring pupil 
progress in traditional curricula which empha- 
size isolated skills and knowledge. It must al- 
ways be questioned, however, whether valid in- 
dications of achievement can be secured unless 
one measures in terms of what has been offered 
in a particular classroom. 

For those interested in achieving the much- 
needed curriculum reconstruction in American 
schools, such tests are likely to have an undesir 
able influence. If curriculum improvement is 
continuous, as it should be, such tests are likely 
to constitute a positive hindrance because they 
tend to determine content and to check the de- 
velopment of curricula based on pupil needs 
and life situations. Teachers who find it appro- 
priate to provide reading readiness rather than 
formal reading programs for many of their 
children in the first grade should not be con- 
fronted with a standard every-pupil test of 
reading to be administered during or at the end 
of this year. Teachers who postpone formal 
arithmetic until grade two or three should not 
be confronted with a first grade test contain- 
ing such material. Teachers who are attempting 
to study the maturity levels of the individual 
children and to make reading, spelling, and 
arithmetic functional to life experiences should 
not be hindered by tests based on a philosophy 
requiring that certain skills be mastered each 
year. 

Certainly the use of such tests should be op- 
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tional on the part of teachers, and they should 
be employed primarily for individual pupils 
rather than total class groups. 


[2] 

Cooperative Contempo! Affairs Test for Col- 
lege Students. Wien I 47; new form available 
annually in April; IB! orm 1947; separate answer 
sheets need not be used; $2 per 25; 25¢ per specimen 
set, postpaid; 4o¢ per 25 machine-scorable answer 
sheets; 15¢ per stencil for scoring answer sheets; 
80(90) minutes; Form 1946: E. F. Lindquist and 
Robert L. Ebel; Form 1947: E. F. Lind n Rob- 
ert L, Ebel, Goldwin Smith, and Wendell Smit ; Co- 
operative Test Service. 


REFERENCES 
1-8. See 40:1182, 
6. FLANAGAN, Jonn C. “Measuring Interests,” Abstract. 
Psychol B 36:5a9r30 JI 2% * (PA 13:6514, title only) 
7. Cureton, Epnwaxp E. “Evaluation or Guidance? The Re 
ort of the Eighth Annual National Gollege yop mper eng 
r 


‘rogram A; 17 to Mi VEJ d 8:308 
"40 Rr (PA 5 S ay S, 1939. 308-40 
8, RYANS, b» 


of the 1040 pA A d pA Ae eremo x oat 

re Tes ive 
Sone Publicnt ions in Measurement and Guidance, Series 3, 
Vol, r, No, 1. New York: Cooperative Test Service, January 
1941. Pp. 35. Paper. $0.10, * 

9. Crissy, WrLLIAM J. E, anp Ryans, Davip G, The Col. 
lege Sophomore Appraises His Curriculum: A Report of the 
1941 Sophomore Testing Program, Cooperative Test Service 
Publications in Measurement and Guidance, Series 3, Vol. a, 
No, 2, New York: Cooperative Test Service, January 1942. Pp. 


39. Paper. $0.10, * 
10, Crissy, Mura J. E., ano PxoxeseN, RuTM A. The 
War-Time Role of the College Sophomore: A Ri on the 
1942 Sophomore Testing Program. Cooperative Test Service 
Publications in Measurement and Guidance, Series 3, Vol. 2, 
No. 3. New York: Cooperative Test Service, July 1942. Pp. 40. 
Paper. $0.10. * (PA 16:4005) 

T1, Final Sophomore Percentile Tables: The 1943 College 
Sophomore Testing Program, New York: Cooperative Test 
Service, July 1943. Pp. 6. Paper. Gratis, * 

12, CARROLL, Moun B., “The Factorial Representation of Men: 

d & Psychol Meas 


Li 
Sop! € Testing Program, New York: Cooperative Test 
Service, July 1944. 6. Paper. Gratis. * 
Wy LP PD CA Percentile Tables: The 1945 College 
$e omone Testing Prem New, Soret, Cooperative Test 
Service, st 1945. Pp. 6. Paper. Gra! 
‘Final "Sophomore Percentile Tables: The 1946 Colones 


Sophomore Te: ir cen New York: Cooperative Test 
Service, July 1946, Pp. 6. Paper. Gratis. * 

16. Final Sopi re Percentile Tables: The 1947 College 
Sophomore Testing Program. New York: Cooperative Test 
Service, May 1047. Pp. 4; Paper. Gratis. * 

17. Gourp, nor. "The Predictive Value of Certain 
Selective Measures." Ed Adm & Sup 33:20812 Ap ‘a7 * 


H. T. Morse, Dean, The General College, 
University of Minnesota, Minneapolis, Minne- 
sota. [Review of Form 1946.] Previous issues 
of the Cooperative Contemporary Affairs Tests 
have been criticized on certain specific points. 
One of the most frequent criticisms has been 
that many items were apparently selected on 
the basis of frequency of mention in the news 
or popular interest rather than on real signifi- 
cance. It is always easier to criticize such a 
shortcoming in any test of contemporary affairs 
than it is to attempt to define or identify what 
news is really significant at the time it is hap- 
pening. Some topics, such as our international 
relations, are inherently of importance to the 


American public. Those events also which in- 
dicate the dynamic nature of society are by and 
large of greater significance than those refer- 
ring to its static aspects, although the latter 
are more specific and therefore are easier grist 
for the test maker's mill. It is also important 
to stimulate the student through tests, as well 
as in instruction, to attempt to get a broad grasp 
of and insight into the directions in which so- 
ciety is moving. One item which is a good ex- 
ample is the following: “What general trend 
is probably exerting the most adverse effect on 
the freedom of the press, the radio, and motion 
pictures in the United States? (1) Ownership 
is being concentrated in a progressively smaller 
group of individuals, (2) More governmental 
regulations are being imposed, (3) Irresponsible 
persons are using these agencies to spread un- 
American propaganda, (4) Wartime conditions 
have fostered intolerance for all minority opin- 
ion.” (1 is the correct response.) 

Items in the 1946 form of the test are for 
the most part carefully screened to weed out 
the ephemeral and inconsequential aspects of 
the news and to require a discriminating knowl- 
edge and understanding of the contemporary 
scene. Part III, Literature and Fine Arts, how- 
ever, is weaker in this respect, in the opinion of 
the reviewer, since it places too much emphasis 
on less important details, such as the plot of a 
specific play or the name of a certain com- 
poser. It would also seem that-6 out of 30 items 
is too large a quota to devote to the latest do- 
ings of Hollywood. 

The fundamental question in passing judg- 
ment on any test of contemporary affairs is, of 
course, what are the criteria for the selection 
of topics upon which items are to be based? A 
rough classification of the 75 items of Part T, 
Public Affairs, yields the following distribu- 
tion: International relations and politics, 36; 
U.S. economic factors, 25, of which 7 relate 
specifically to labor ; U.S. social trends, 9; U.S. 
domestic politics, 5. It would seem that there 
might be a more equitable balance among do- 
mestic economic, social, and political affairs. 

The division of the test into 75 items on 
public affairs, 30 on science and medicine (an 
excellent section), and 30 on fine arts and 
literature appears to be a good distribution. 
The time distribution, 45 minutes for Part I 
and 15 minutes each for Parts II and III, how- 
ever, is not practicable for administration, since 
it would require a double period for completion 


3 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[4 


of the whole test and as such the test is not 
suitable for most collége classes in the social 
studies. 

"There is a high degree of technical excellence 
in the make-up of the items in the 1946 form 
of the Cooperative Contemporary Affairs Test. 
Items for the most part are clearly and con- 
cisely stated and avoid ambiguities. Incorrect 
options are plausible. By and large, this edition 
measures up to the high standard maintained 
by most Cooperative tests. Its judicious use in 
social studies classes should stimulate the de- 
sirable instructional practice of illuminating the 
study of the past with illustrations of social 
processes currently in action. 


For reviews by Paul M. Limbert and Ralph 
W. Tyler of earlier: forms, see 38:048 and 
40: 1182. 


[3] 
Cooperative General Achievement Tests, Revised 
Series. Grades 10-12 and college entrants; 1942-47 ; 
TBM ; Forms S, T, X ; 3 tests; separate answer sheets 
need not be used; $2 per 25 of any one test; 50¢ per 
specimen set, postpaid; 25¢ per specimen set of any one 
test, postpaid; 40¢ per 25 machine-scorable answer 
sheets for any one test; 15¢ per stencil for scoring an- 
swer sheets for any one test; 40(45) minutes per test; 
Cooperative Test Service. 
G) TEST I, A TEST OF GENERAL PROFICIENCY IN THE FIELD 
OF THE SOCIAL STUDIES. Forms S and T: Mary Willis, 
Charlotte W. Croon (Form S), and the Committee on 
Social Studies Tests of the Educational Records Bu- 
reau: Ronald Beasley, Charles K. Cummings, Jr., 
W. Ainsworth Greene, Margaret Hastings, and Elsie 
Resor; Form X: Jeanne M. Bradford. 
b) TEST 2, A TEST OF GENERAL PROFICIENCY IN THE FIELD 
OF NATURAL SCIENCES. Form S: Charlotte W. Croon, 
Mary Willis, Alexander Calandra, Richard E. Watson, 
John G. Zimmerman; Form T: Carl A. Pearson; 
Form X: Paul J. Burke. 
€) TEST 3, A TEST OF GENERAL PROFICIENCY IN THE FIELD 
OF MATHEMATICS. Form S: Emma Spaney and Char- 
lotte W. Croon; Form T : Margaret Martin; Form X: 
Paul J. Burke. 


John V. McQuitty, University Examiner, 
University of Florida, Gainesville, Florida. 
[Review of Forms S and T.] These tests were 
developed to measure general proficiency in the 
social studies, natural sciences, and mathematics 
rather than to measure mastery of specific 
course content. The tests are intended for use 
in all classes in the senior high school and for 
college placement. There is a separate test for 
each of the three fields. Each test has the same 
general format, beginning with a 15-minute 
part covering the examinee's knowledge of 
terms and concepts followed by a 25-minute 
part covering comprehension and interpretation. 
All of the items are of the multiple-choice 


type and may be answered either in the book- 
let or on a separate answer sheet. 

The tests are mostly nonfactual—especially 
the comprehension and interpretation portions. 
In these portions the examinee is given a brief 
graphical, pictorial, or verbal selection from the 
field involved and is required to answer items 
involving his understanding and evaluation of 
the selection, 

These tests are recommended. where an esti- 
mate of the students’ general proficiency is de- 
sired. They should prove particularly hel 
where the academic background of the exami 
nees is widely divergent or where a long period 
of time has elapsed since the examinee studied 
the field involved. In comparable areas these 
tests probably serve a function similar to that 
served by the Tests of General Educational De- 
velopment of the United States Armed Forc 
Institute. When interpreting the results of tests 
of this nature, considerable caution is neces: 
because a high degree of proficiency may 
mistaken for specific course mastery. A further 
difficulty, encountered in all short tests which 
cover a broad field or area, is one of item sam- 
pling. For example, it is impossible to cover 
the entire field of the social studies in a 45 
minute test. 

The intertest correlations, as reported by the 
Cooperative Test Service, range between about 
.60 and .70. The intercorrelations between the 
part scores on each test range between .41 and 
.71, with a median of about .54. 

Total scores for each test are expressed in 
terms of scaled scores, and percentile norms are 
provided for end-of-year high school students 
at each grade level and for freshmen entering 
liberal arts colleges. 

The publishers claim that the tests are “espe- 
cially useful in placement of entering college 
freshmen.” Although the statement seems plau- 
sible, no substantiating factual information is 
given. 

The reliability of the scaled scores is re- 
corded in terms of the standard error of meas- 
urement. This is a very meaningful measure. 
However, the magnitude of the error is indi- 
cated only by a vertical line drawn along a 
column of scaled scores. This makes it difficult 
to determine precisely the magnitude of this 
variable. 

The make-up of the test is pleasing and is 
very easy for the examinee to follow. In most 
of the samples examined, the paper seemed to 
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be a sulfite stock whose lasting qualities are 
rather inferior, and the pages tend to wear and 
tear along the fold. This limits the extent to 
which the booklets can be re-used with answer 
sheets. 


For a review by Harry D. Berg of Test 1, 
see 596. For a review by Palmer O. Johnson 
of Test 2, see 548. For a review by John F. 
Randolph of Test 3, see 316. 


à [4] 
Cooperative General Culture Test, Revi 

^ » sed Se- 
nen College ; 1946-47 ; new form available annually in 

pril; IBM; Forms W, X; $3.75 per 25; 30¢ per speci- 
men set, postpaid; $1 per 25 sets of machine-scorable 
answer sheets; 45¢ per set,of stencils for scoring an- 
swer Sheets; 180(190) minutes; Form W: Jeanne M. 
Bradford, Miriam M. Bryan, Paul J. Burke, and Ger- 
aldine Spaulding; Form X: Norman J. Blair, Jeanne 
M. Bradíord, Miriam M. Bryan, Paul J. Burke, and 
Herbert Danzer ; Cooperative Test Service. 
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Benjamin S. Bloom, College Examiner, 
Board of Examinations, The University of 
Chicago, Chicago, Illinois. [Review of Form 


W.] The Cooperative General Culture Test 
has already acquired a tradition and a history 
shared by few tests. Not many achievement 
tests can boast of such care and attention as 
has fallen to the lot of this one. Started in 
1934, it has been revised periodically and, with 
Form W appearing in 1946, is in its eighth 
form. Its publishers claim that its content has 
been determined by the consensus of a number 
of: scholars in various fields. They also state 
that course boundaries are completely disre- 
garded in this examination, the purpose of 
which is to provide an indication of the stu- 
dent’s general cultural background. 

This test has been reviewed several times in 
the various Mental Measurements Yearbooks. 
Each reviewer has complained that the test is 
little more than a miscellaneous collection of 
items of information and that course boundaries 
have not really been completely disregarded. It 
is evident that these reviews have had little 
or no effect since parallel forms are ground out 
with monotonous regularity, and each new 
form is as much like the previous forms as test 
writers can devise, Like the Sorcerer’s Ap- 
prentice, someone has started a process which 
no ordinary words can stop. 

Evidently the test has been carefully made. 
Norms are reported on sizeable groups of stu- 
dents, and reliability coefficients are quite satis- 
factory for purposes of individual measure- 
ment and interpretation. Isn’t that enough? 
Could one ask for anything more than a test 
with eight carefully prepared parallel forms? 

It is not the test, in itsélf, but the ways 
in which the test is used and the possible 
effect of its use on educational institutions, 
instructors, and students that cause one to be- 
come alarmed. A test used as widely as this has 
a number of effects. First, it serves to de- 
fine to college instructors what a “general edu- 
cation” can mean. The specific test items repre- 
sent to the test user operational definitions of 
what the test purports to be. The fact that 
Form W of this test was used by more than 
130 colleges would seem to indicate that a rela- 
tively large number of college administrators 
or college instructors may be influenced to %c- 
cept this definition of general culture and may 
even, consciously or not, organize some of their 
instruction in such a way that their students 
will do reasonably well on the test. The test 
also serves to inform students of the way in 
which an authoritative test-making organiza- 
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tion defines general culture. Since their college 
faculties use the test and therefore evidently 
approve of it, students are likely to accept the 
definition of general culture provided by the 
test and to guide their study accordingly. A 
number of studies have rather clearly demon- 
strated the powerful effect of a test in influ- 
encing and motivating students. 

Since the sample of test questions included 
in the Cooperative General Culture Test repre- 
sents one definition of general culture, the ade- 
quacy of the test and of this definition can be 
best appraised by contrasting it with other 
possible definitions of general culture. If one 
can assume that general culture is synonymous 
with general education, a number of excellent 
statements on the subject are available. General 
Education in a Free Society (Harvard Univer- 
sity Press, 1945), Building a Curriculum for 
General Education (University of Minnesota 
Press, 1943), and the 1946-1947 announcement 
of the College and the Divisions of the Uni- 
versity of Chicago represent outstanding state- 
ments on the content and purposes of general 
education as conceived by particular institu- 
tions. A Design for General Education (Ameri- 
can Council on Education, 1944), prepared 
under the auspices of the Council by a rather 
large committee selected from a number of 
academic associations and institutions of higher 
education, is a statement on general education 
which more nearly represents a consensus of 
the views accepted by a great variety of edu- 
cational institutions, The design is a remark- 
ably complete and detailed statement of what 
constitutes a general education and appeared 
to the reviewer to be perhaps the most useful 
definition for contrast with the Cooperative 
General Culture Test. The design includes an 
outline of the subject matter regarded as ap- 
propriate for a general education as well as 
the objectives or outcomes to be derived from 
a study of this subject material. This enables 
one to contrast both the subject matter included 
in the Cooperative General Culture Test and 
the types of behavior required by the test items 
with relevant portions of the design. 

A contrast of the courses or subject matter 
included in the design for general education 
with the Cooperative General Culture Test re- 
veals a number of gaps. This may be most 
readily seen in the following comparison: 


'URSES Posen IN A DzsicN Pants or THE COOPERATIVE 
E eras EDUCATION GeneraL CuLTURE TEST 
Problems of American Life...Part I, Current Social Problems 
Development of American 
Thought and Institutions... 


America in International Ai 
fairs ... 


Part II, History and Social 
Studies 


..Parts I and II 
«Part III, Literature 


Part IV, Science and Part VI, 
Mathematics 

Society Part V, Fine Arts 
Music in 


«Part V, Fine Arts 


ment » 
Marriage and Family Adjust- 
ment 
Vocational Orientation 
Philosophy and Religion 
Oral and Written 
cation $ 
Personal and Community 
Health 


ommuni- 


The three areas on adjustment and orienta- 
tion—social, family, and vocational—are not 
represented in the test, although occasional 
items touch on these subjects. Philosophy and 
religion are represented by a few questions in 
literature and current social problems, although 
not very systematically. Although oral and 
written communication is not directly dealt with 
in the test, this area is rather adequately repre- 
sented in the Cooperative English Test which 
is usually given at the same time as the Cooper- 
ative General Culture Test. The personal and 
community health area is represented by 2 few 
items in the current social problems and sci- 
ence parts of the test, but only in the most 
cursory manner. By itself, then, the Coopera-* 
tive General Culture Test does not adequately 
measure achievement in all the subject areas 
deemed important by the committee which 
prepared A Design for General Education. 

Two competent individuals attempting to 
classify a given set of test items with regard to 
the types of student behavior involved would 
frequently disagree, since the mental operations 
necessary to solve various test problems have 
not been adequately studied or defined. How- 
ever, it is possible to classify objectively test 
questions as representing certain general types 
of knowledge, abilities, or skills, especially when 
the latter are defined as completely as the ob- 
jectives listed in the design. In the following 
discussion, each of the parts of the test is ana- 
lyzed by determining the extent to which the 
objectives in the appropriate subject area of 
the design for general education are repre- 
sented by test questions. No attempt has been 
appreciation objectives, since such objectives 
made to relate test items to the attitude and 
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are usually not readily appraised by achieve- 
ment test questions. 

PART I, CURRENT SOCIAL PROBLEMS. A major 
purpose in A Design for General Education is 
that the individual should “do his part as an 
active and intelligent citizen in dealing with 
the interrelated social, economic, and political 
problems of American life and in solving the 
problems of postwar international reconstruc- 
tion." Under this major purpose are listed a 
number of specific objectives. One objective is 
that the student should have a knowledge of 
the nature, history, and alternative solutions 
of the interrelated social, economic, and politi- 
cal problems of his community, state, and na- 
tion. This objective.is represented by about 64 
per cent of the test items. A second objective, 
knowledge of the geographical, cultural, eco- 
nomic, and political backgrounds of the present 
war and the factors involved in attaining a 
lasting peace, is represented by about 32 per 
cent of the items. The ability to read maps, 
charts, and statistics and to make warranted 
deductions from them accounts for 4 per cent 
of the items. i 

The following rather important objectives 
are not sampled by the test items in this part 
of the test: the habit of thinking critically, with 
the ability to define problems, collect data, es- 
tablish facts, and make discriminating use of 
materials; the ability to identify the concrete 
manifestations of basic national and interna- 
tional issues and conflicts and to recognize his 
(the student's) personal relationship to them; 
the ability to identify and appraise judgments 
and values that are involved in the choice of 
a course of action; and the ability to formulate 
explicitly and systematically a pattern of values 
as a basis of individual and social action. 

The questions in this part require little more 
than a superficial acquaintance with current 
social phenomena. The student is not required 
to demonstrate a grasp of the meaning, signifi- 
cance, causes, or consequences of these social 
phenomena. The emphasis is on description 
rather than analysis. 

PART II, HISTORY AND SOCIAL STUDIES, Little 
mention is made of history or social studies in 
A Design for General Education except as it 
enters into such objectives as knowledge of 
American thought, institutions, traditions and 
ideals, their character, backgrounds, and con- 
temporary tendencies; and knowledge of the 
nature, history, and alternative solutions of the 
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interrelated social, economic, and political prob- 
lems of community, state, and nation, All items 
in this part of the test can be related to these 
broad objectives. 

However, the subject matter of the test at- 
tempts to cover such a broad area as to permit 
only rather limited sampling of certain areas 
in history and social studies. Thus, there are 
only five questions on ancient history, one on 
the Industrial Revolution, one on geography, 
etc. An attempt to sample sich a broad range 
of subject matter with only 75 questions makes 
it difficult to interpret a score on this test. A 
slightly different selection of items might make 
for a quite different performance by individual 
students. It is a questionable measuring instru- 
ment when the average score for college sopho- 
mores is only 20 per cent of the possible score. 

PART III, LITERATURE. A major purpose in 
the design is for the individual “to find self- 
expression in literature and to share through 
literature man’s experience and his motivating 
ideas.” A specific objective under this purpose, 
knowledge of the content of a reasonable num- 
ber of outstanding works of the past and pres- 
ent through reading, accounts for practically 
all of the test items in this part of the test. 

None of the test items can be related to such 
objectives as: knowledge of the continuity and 
change in man's experience, character, moti- 
vating ideas, and ideals as reflected in litera- 
ture; knowledge of the human qualities which 
are central to the democratic tradition and ideal, 
and basic to cultural unity; knowledge of 
sources of information and guidance in choice 
of reading ; knowledge, on an elementary level, 
of the manner in which the techniques employed 
by writers aid them in accomplishing their 
particular purposes; the ability to read signifi- 
cant literature with comprehension (except in 
so far as this is indirectly measured by the 
items dealing with acquaintance with litera- 
ture) ; the ability to exercise taste in the choice 
of reading material; and the ability to recog- 
nize form and pattern in literary works as a 
means of understanding their meaning. 

This test might appropriately be called a test 
of acquaintance with literature, since many of 
the questions can be answered correctly even 
though the examinee has not read the specific 
book or work involved. An individual who had 
read a good deal about literature should score 
as high or even higher than an examinee who 
has read many specific books. This is in large 
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measure the result of using distractors which, 
in many cases, can be ruled out on the basis of 
the most superficial acquaintance with the book. 

PART IV, SCIENCE. The primary purpose of 
science in 4 Design for General Education is 
for the individual "to act in the light of an 
understanding of the natural phenomena in his 
environment in its implication for human so- 
ciety and human welfare, to use scientific 
methods in the solution of his problems, and to 
employ useful nonverbal methods of thought 
and communication." Under this major pur- 
pose, the specific objective, knowledge of the 
facts, principles, and theories that explain nat- 
ural phenomena, is represented by about 55 per 
cent of the test items; knowledge of the basic 
Scientific vocabulary necessary for the attain- 
ment of essential objectives accounts for at 
least 15 per cent of the questions ; and the abil- 
ity to apply principles to new situations is 
sampled by about 28 per cent of the items. 

Some of the objectives of science instruction 
not represented in this test are: knowledge of 
the role and function of science in our society ; 
knowledge of the questions that can be an- 
swered by science and the kinds of answers 
that science can provide and, conversely, of the 
limit to the information about the world attain- 
able through science; knowledge of the tech- 
niques and methods used by scientists in seek- 
ing to answer questions about the world and 
of the proper functions of scientific theory and 
experiment. Some additional objectives of sci- 
ence instruction not represented in this science 
test are: the ability to observe phenomena ac- 
curately; the ability to draw valid inferences 
from data; the ability to test hypotheses; the 
ability to recognize and formulate problems; 
and the ability to read graphs, diagrams, and 
blueprints. 

Although this test does not sample a number 
of rather important objectives of science in- 
struction, it is even less adequate when the sub- 
ject matter sampled is considered. It would seem 
an almost impossible task to measure achieve- 
ment in the natural sciences with a sample of 
only 60 questions. There are 20 questions cov- 
ering the biological sciences. Of the remaining 
40 questions on the physical sciences, there are 
21 on physics, 9 on chemistry, 6 on astronomy, 
and 4 on geology. 

It is difficult to see how this test could pos- 
sibly represent a valid sample of attainment in 
science for any college in the country. The 


attempt to cover such a wide variety of sub ject 
matter with such a limited sample of test items 
makes an almost impossible situation. This is 
in part demonstrated by the fact that the aver- 
age score of sophomores on this test is only 18 
per cent of the possible score. ; 

PART V, FINE ARTS. The primary purpose oí 
the fine arts in A Design for General Educa- 
tion is “to find a means of self-expression in 
music and in the various visual arts and craits, 
and to understand and appreciate art and music 
as reflections both of individual experience and 
of social patterns and movements.” Under this 
purpose, the objective, knowledge of outstand- 
ing examples of contemporary and historical 
art and music, is accounted for by 72 per cent 
of the items. Knowledge of the problems faced 
in creative work in the arts, and of the tech- 
niques and media used in solving these prob- 
lems, is sampled by 20 per cent of the items. 
Knowledge of standards of good taste as de- 
veloped by individual musicians and artists and 
reflected in the various regional and historical 
styles is, to some extent, represented by about 
8 per cent of the items. 

The objectives of instruction in the fine arts 
not represented in this test are: knowledge of 
change and continuity in man’s experience, 
character, and motivating ideas and ideals, as 
reflected in music and in works of art; knowl- 
edge of the relationship of music and art to 
everyday life and to the social and cultural char- 
acteristics of the time in which they were pro- 
duced; the ability to recognize artistic quality 
in contemporary works of music and art; and 
the ability to recognize aesthetic values and re- 
lationships in significant works of the historical 
past. 

As a test of acquaintance with artistic works, 
workers, and, to some extent, artistic tech- 
niques, this would appear to be a satisfactory 
test. However, without a sample of competence 
on some of the other objectives listed here, one 
would hesitate to say that a student with a high 
score is a cultured individual in so far as the 
fine arts are concerned. 

PART VI, MATHEMATICS. No specific objec- 
tives for mathematics in general education are 
listed in A Design for General Education, al- 
though this statement does appear, “In the solu- 
tion of his problems, the layman may frequently 
need to use mathematical operations. The gen- 
erally educated person is expected to have cer- 
tain arithmetical and algebraic knowledge and 
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skills, but these are usually to be attained before 
the student reaches the level of general educa- 
tion considered in this outline." A breakdown 
of the test reveals that about 50 per cent of the 
items involve simple mathematical operations, 
about 30 per cent deal with definitions in mathe- 
matics, 13 per cent deal with the basis for mathe- 
matics and its underlying structure, while an- 
other 7 per cent deal with specific and isolated 
facts about mathematics and its history. Thirty- 
eight per cent of the items involve algebra, 37 
per cent geometry and trigonometry, 13 per 
cent arithmetic, and the remaining 12 per cent 
deal with miscellaneous aspects of mathematics. 

Although no specific objectives are available 
for comparison, it is quite likely that this test 
would satisfy the needs of a generally edu- 
cated person in so far as the field of mathe- 
matics is concerned. 

SUMMARY. In general, the test has many 
shortcomings. As a measure of general educa- 
tion, it is decidedly inadequate in both the sub- 
ject matter sampled and the types of objectives 
listed. For the most part, the emphasis is on 
knowledge rather than on the ability and skill 
objectives which are emphasized in A Design 
for General Education, as well as in the other 
statements on general education. The knowl- 
edge is sampled by items which measure rather 
superficial acquaintance with the subject-matter 
fields listed. Although such knowledge may be 
a prerequisite to the more complex abilities and 
skills, only the most reckless test interpreter 
would claim that measurement of one provides 
a good index of the other. 

As a test of acquaintance with a great variety 
of subject matter, it is a useful test. As an index 
of the individual's general culture, general 
education, or liberal education, it is quite inade- 
quate. Perhaps the major criticism of the test 
is that it attempts to do an almost impossible 
task—the measurement of the product of two 
or more years of education in 180 minutes. 

After completing this review, the writer re- 
ceived the ninth product of this apparently 
interminable process—Form X of the Cooper- 
ative General Culture Test. Except for a few 
slight changes in percentage estimates, this re- 
view fits Form X as well as it does Forms O, 
PROS RASS andes We 


H. T. Morse, Dean, The General College, 
University of Minnesota, Minneapolis, Minne- 
sota. [Review of Form W.] A number of 


criticisms made in The Nineteen Forty Mental 
Measurements Yearbook of the 1937 form of 
the Cooperative General Culture Test would 
still be valid for the 1946 form. 

There is still a heavy emphasis on specific 
factual information, some of it certainly of 
questionable significance, although the six sec- 
tions are uneven in this respect. Part I, Current 
Social Problems, has a gratifyingly large por- 
tion of its 75 items based on generalizations 
or fundamental definitions, which highlight 
the significance of social change. Part II, His- 
tory and Social Studies, has a smaller percentage 
of such items and divides its 75 items about 
equally between more remote events and con- 
ditions and those which are related to the mod- 
ern. scene. In Part I the use of graphs is to be 
commended, although the related items would 
be improved if they weresto call for an inter- 
pretation of the graphs rather than a mere 
reading of them for facts. Identification of 
the function of reference materials (called 
"sources") is included in one item only. More 
items of this type in place of some of the more 
factual ones would have strengthened this por- 
tion of the test. 

The section on Literature, Part III, is nar- 
row in scope in that its 90 items are almost 
exclusively concerned with identification of 
works and association of titles, authors, and 
characters. Consider, for example, the follow- 
ing: “Which of the following figures of speech 
was used most effectively by Southey in The 
Cataract of Lodore? (1) Hyperbole, (2) Ono- 
matopoeia, (3) Simile, (4) Allegory, (5) An- 
tithesis.” And why is it significant to know 
which of a list of Roman emperors wrote in 
Greek? 

Part IV, Science, contains some ambiguities 
and seems to reflect little more than a textbook 
knowledge of the physical and biological sci- 
ences, with greater emphasis on the former. 
The section could be improved by the inclusion 
of items which would require some recognition 
of the significance of technological advances 
and an understanding of the scientific method 
of analyzing problems, which is generally 
stated as a basic objective in the varions scien- 
tific fields. 

Part V, Fine Arts, go items, is to be com- 
mended more on its scope than on its effort to 
test any appreciation of the arts. Most of the 
items deal with simple association of the artist 
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and his specific creation or his field of promi- 
nence, or with technical terminology. The items 
on music (21 in all) are most limited in this re- 
spect. The use of pictures, six in all, is com- 
mendable in so far as items based on them 
require identification of a period or school of 
art, rather than identification of the name of 
the painter. 

Part VI, Mathematics, 60 items, consists for 
the most part of problems in higher mathe- 
matics which require a working knowledge of 
mathematical symbols more than an understand- 
ing of quantitative thinking and of the func- 
tion of mathematics in daily living. Some of 
the items seem to force the student to look at 
a fundamentally simple problem in an unneces- 
sarily complicated manner. 

In spite of some of its limitations, the test 
as a whole possesses good technical qualities 
and is probably the best over-all measuring 
instrument of its type. While there is undoubt- 
edly a premium on factual associations, it can- 
not be denied that such materials are the warp 
and woof of “culture” as traditionally con- 
ceived, It is to be hoped that in subsequent 
editions more items may be included which re- 
quire not only factual information, but in addi- 
tion a grasp of relationships, implications, 
trends, and a more critical discrimination. 

No test of this type can be considered apart 
from its relationship to classroom teaching and 
curricular development, Too narrow an empha- 
sis upon traditional content from separate sub- 
ject-matter areas makes such a measuring in- 
strument much less suitable to the broad scope 

- courses which are developing in the trend to- 
wards general education, and little more than 
a composite achievement test. It would also be 
of value to know the uses which colleges might 
make of test results, so that by coordinated 
study there might be a mutually beneficial influ- 
ence upon both curricula and tests. 


For reviews by F. S. Beers, Lavone A. 
Hanna, Edward S. Jones, and Hilda Taba of 
earlier forms, see 38:871 and 40:1184. 


[5] 
kC rative Test on Recent Social and Scien- 
tific Developments. Grades 9-12; 1946-47; new form 
available ly in Hen for use in the Nationwide 
High School Testing Program; parti ing schools 
RM mere to Nini the tett. (0 Every pupil en- 
rolled in grades 9-12; charges include the Scor- 
ing, and reporting of results; 10f per test plus $2 reg- 


istration fee and shipping costs ; 80(90) minutes ; Form 
ms z E F Lindquist. E Robert L. Ebel; Form 1947 : 
E. F. Lindquist, Robert L. Ebel, Goldwin Smith, and 
Wendell Smith; Cooperative Test Service. 


Ernest W. Tiegs, Editor-in-Chief of the 
California Test Bureau, Los Angeles, Califor- 
nia. [Review of Form 1946.] The major pur- 
poses of this test are: (a) to make possible a 
more adequate evaluation of one important out- 
come of high school education that does not 
have a specific place in the curriculum; (5) to 
obtain data in guiding the development of indi- 
vidual pupils; (c) to motivate pupils to become 
more critical students of recent social and sci- 
entific developments; (d) to encourage teach 
ers to take advantage of current social and 
scientific developments and trends to bring the 
curriculum to life and thus better prepare 
students for more intelligent participation in 
the activities of the age in which they live. 

This test consists of a total of 116 multiple- 
choice items, the first 76 of which are devoted 
to social, economic, political, and military de 
velopments requiring 55 minutes for adminis- 
tration, and 40 items on scientific, technical, 
and medical development, requiring 25 minutes 
for administration. 

The items themselves seem to be discrimi- 
natingly selected and carefully worded ; there 
are few, if any, that can be answered by scan- 
ning the headlines of the newspapers. This is 
because it is not a test of current events, but 
rather of more fundamental developments and 
trends. Some of the items appear rather diffi- 
cult for high school students. 

The publishers recommend that the test be 
given “with little or no previous notice” in order 
to avoid any special preparation. Because of 
the nature of the test, no norms are available 
for any given form until after the tests have 
been taken and scored. There were no experi- 
mental editions with the tryouts, item analysis, 
etc., which characterize the validation of most 
tests now distributed commercially. For the 
same reason, there are no data on reliability 
and validity. However, the prospective user 
need not take this fact too seriously because 
a glance at the authorship, together with as- 
sistants and reviewers, should assure anyone 
that as far as the judgments of competent test 
makers can make it, this is a good test. 

One of the excellent features of this instru- 
ment is that of differentiated norms which are 
made available after the first administration of 
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the test. These consist of national norms, norms 
for homogeneous geographical regions, and 
norms for individual states, Percentile norms 
are also derived for both individual pupil scores 
ind for school averages by separate grades. 

The administration of this test should have 
a very salutary effect in relating the curriculum 
to the world and its activities. Evidence from 
the 1946 tests shows the need for this trend. 
While, in general, students do better as they 
get farther along in high school, a number of 
items were answered better by freshmen than 
by seniors, and in a number of other instances, 
there were apparently no significant differences 
between high school classes. 


Ralph W. Tyler, Chairman of the Depart- 
ment of Education, Professor of Education, 
and Examiner, The University of Chicago, 
Chicago, Illinois. One of the commonest criti- 
cisms of the school is its isolation from contem- 
porary life. An important task of education is 
to develop students who apply their school 
learning to the affairs of their daily lives, Rec- 
ognizing the need to measure this aspect of 
learning, many tests have included certain items 
requiring some transfer of school learning to 
situations outside the school. In recent years, 
tests of current events have appeared aimed 
primarily at appraising the student's familiarity 
with matters widely broadcast by rress and 
radio. These tests have too frequently served 
only to measure a relatively superficial recall 
of events rather than the kind of intelligent 
interpretation which is the goal of social stud- 
ies teaching. 

The authors of the Cooperative Test on Re- 
cent Social and Scientific Developments recog- 
nize the importance of a test that requires the 
student to use basic concepts, principles, and 
facts in interpreting contemporary events. 
They state, “The test questions should be con- 
cerned primarily with broad movements and 


‘trends, with basic issues and principles, with 


broad generalizations and inferences. The tak- 
ing of the test should encourage the student to 
think about contemporary problems, to make 
sound observations, to draw conclusions, and 
to exercise good judgment.” A valid test of 
this sort has a significant place in evaluating a 
secondary school program. For most schools 
there are three major objectives for which such 
a test should provide evidence of achievement. 
These are: (a) To develop the ability to apply 


basic social concepts, facts, and principles in 
the analysis of contemporary social problems; 
(b) To develop the ability to apply important 
facts and principles of science in explaining 
the scientific phenomena of daily life; (c) To 
develop broad and mature interests in social 
and scientific affairs. 

The Cooperative test contains items that can 
give some evidence on the attainment of each 
of these three objectives. A high total score 
indicates relatively broad and mature interests 
in social and scientific affairs. High scores on 
the two parts are indices of the ability to apply 
social and scientific knowledge to contemporary 
developments. In these respects the test is to 
be commended. Such weaknesses as are found 
in the test are primarily the inadequacies of 
many items in meeting the criteria set up by 
the authors. In Part I, Social, Economic, Politi- 
cal and Military Developments, about one-third 
of the items, 24 of the 76, clearly require an 
interpretation of events in terms of basic con- 
cepts or principles of the social studies, while 
32 of the 76 items can be answered merely by 
remembering the news treatment of the events, 
without the use of social studies learning. The 
remaining 20 items are borderline, involving 
some judgments or inferences but not requiring 
the use of social concepts or principles. In 
Part II, Scientific, Technical and Medical De- 
velopments, 40 per cent of the items, 16 out 
of 40, clearly require the application of scien- 
tific facts and principles to answer them, while 
45 per cent of the items, 18 of them, can be 
answered merely by recalling the news treat- 
ment. The remaining 6 items are borderline, 
involving some judgments or inferences but 
not requiring the use of scientific facts or prin- 
ciples. 

Another way of reviewing the test critically 
is to examine the fields of science and social 
studies involved in interpreting the events men- 
tioned in the items. Economic concepts, par- 
ticularly those relating to inflation, play a major 
role; concepts of sociology, anthropology, psy- 
chology, geography, and history are rarely in- 
volved. Political concepts are included to some 
extent. Similarly, the science facts and princi- 
ples relating to nucleonics, fluid mechanics, and 
thermodynamics receive major application 
while those relating to other aspects of biologi- 
cal and physical science are rarely involved. 

Obviously, it is difficult to prepare a test 
which provides a balance among the significant 
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events and among the basic fields of knowledge 
that can be applied to the interpretation of these 
events. The authors have made a good start in 
developing a test for use in modern secondary 
schools. It is hoped that the next form will 
include a larger proportion of items that meet 
their own criteria and that are also a better 
balanced opportunity for students to- apply 
school learning to significant contemporary de- 
velopments. Y 


Edgar B. Wesley, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. [Review of Form 1946.] This test, con- 
sisting of 116 items of the four-option, mul- 
tiple-choice type, was prepared for use in a 
nationwide high school testing program. Part 
I, consisting of 76 items, is devoted to social 
developments, and Part II, consisting of 40 
items, is devoted to scientific developments. 
This review is concerned primarily with Part I. 

The test is designed to avoid the obviously 
factual and to measure more significant entities 
‘than the ability to recall the name of the capital 
of Arkansas. It stresses movements, trends, 
issues, generalizations, interrelations, conclu- 
sions, and inferences. For the most part, the 
items do deal with significant ideas and call for 
discriminative insight. The authors avoid nearly 
all the errors made by novices and reveal a 
sophisticated insight and a highly developed 
aptitude. The reviewer knows no test which 
equals this one in the quality and range of its 
contents. 

The publishers of the test provide varied and 
interesting information about its performance. 
Reports are given of pupil performance in six 
geographical regions of the United States. The 
tables show, as one would expect, rather con- 
sistently lower scores for the Southern states 
and, rather surprisingly, they show the North 
Central region barely ahead of New England 
in grades ro and 11 and significantly below it 
in grade r2. One wonders why the North Cen- 
tral is not consistently ahead, as one would ex- 
pect, of both New England and the Middle 
Atlantic. Perhaps the explanation may be found 
in the rather curious inclusion of West Vir- 
ginia, Arkansas, and Oklahoma in the North 
Central, for these states undoubtedly tended to 
lower the average of the region. 

The test service also reports the per cents 
of correct responses for each item for grades 
9, 10, 11, and 12. Since no correction for guess- 


ing was made, the performance looks better 
than it probably was. One can only speculate 
as to why Items 25, 29, 41, 59, and 70 were so 
difficult. 

This test is excellent and the service in con- 
nection with it is even better. The testing service 
at Iowa City gave its reports to all schools 
within five days after receiving the tests—a 
truly remarkable performance. The school 
principal or teacher who fails to use the test 
is denying himself the use of an expert na- 
tional yardstick, one which can be bought at a 
ridiculously low price. A current national index 
of performance serves as a useful challenge to 
poorer schools and a constant stimulus to better 
schools. It is to be hoped that so many schools 
will use this service that the facilities at Iowa 
City will have to be greatly enlarged. 

The merits of this test are so outstanding 
that criticism sounds almost ungracious. Yet 
the fundamental purpose of these reviews is to 
promote good tests and to encourage the trans- 
formation of good tests into still better ones. 
The test under review is susceptible to im- 
provement. In fact, it deserves two major criti- 
cisms. 

The first major fault is wordiness. A criti- 
cally disposed editor could probably reduce the 
wordage by 50 per cent, and it should be re- 
duced by 66 per cent. The verbosity of the test 
is conspicuous in the examiner's manual as 
well as in the items themselves. The examiner 
is asked to read a wandering essay on the values 
of democracy, discrimination, and interest in 
contemporary affairs. Why such cruelty to a 
group already nervous with apprehension? 

Wordiness is evidenced in almost every item. 
Consider Item 68, which contains 53 words. 
Would not the following r4-word version be 
just as good? The most aggressive minority in 
Canada consists of (1) prairie farmers, (2) 
fishermen, (3) French-Canadians, (4) im- 
perialists. 

Or consider Item 5 which contains 91 words. 
Is anything lost in the following 21-word ver- 
sion of the item? According to postwar opin- 
ion, conscientious objectors were (1) treated 
too leniently, (2) handled rather wisely, (3) 
treated too harshly, (4) needlessly separated 
from their associates. 

Wordiness inevitably introduces a problem 
of reading, and so the authors cannot know 
what they are measuring. Wordiness consumes 
paper, occupies space, absorbs time, deflects 
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attention, lessens the sampling, wearies the stu- 
dents, and defeats the purpose of the test. 

The second major criticism concerns the offi- 
cial announcement and description rather than 
the test itself. The 1946 announcement de- 
nounces facts and says they do not contribute 
(italics theirs) to understanding. This prepos- 
terous doctrine is modified slightly in the 1947 
announcement where facts "are not essential to 
understanding." The slight retreat reminds one 
of a puppy which has released his hold on a rag 
but continues to growl. 

The absurdity of this verbalistic line of rea- 
z will appear by examining any one of 50 
or 60 of the 116 items. Let us choose Item 27 
because it is one of the shortest in the test. It 
asks where Britain, Russia, and the United 

z are most likely to clash and suggests 
e, Italy, Spain, and Iran. How can one 
v the answer without knowing some simple 
facts about oil, transportation routes, and past 
events? Curiously enough the authors recog- 
nized this requirement, for Item 69 calls for the 
simple fact that Iran is important because of 
its oil. All the generalizations, interpretations, 
and inferences required by this test rest upon 
facts—simple, complex, or compound, but none- 
theless facts. Why scorn the steps of a stairway 
because they are not on the level of the second 
floor? 

“The test contains not a single question of 
the type. . . What important meeting was held 
in San Francisco in the spring of 1945?" states 
the announcement. Item 6, however, asks stu- 
dents to select the theme of Henry Wallace’s 
book of 1945. Which is more important as 
basic material in understanding contemporary 
affairs—knowing what the San Francisco con- 
ference was or the theme of a book by Wallace? 

These confusions in the announcement are 
significant because they have caused the authors 
to think they are dealing only with the results 
of using facts rather than with the facts them- 
selves. Let them henceforth have less scorn for 
facts and still continue their emphasis upon the 
utilization of them. 

In spite of these two major faults, this is the 
best general social studies test dealing with in- 
formation and the use thereof which has so 
far been made. With some additional work it 
could be made into an ideal test. The reviewer 
hopes that the authors will make the 1948 edi- 
tion a well-nigh perfect test. It can be done. 


[6] 

XCoordinated Scales of Attainment. Grades 1, 2, 3, 
4, 5, 6, 7, 8; 1947; IBM for grades 4 and over ; 8 levels; 
Form A; separate answer sheets must be used in 
grades 4-8; $2.90 per 25 of any one level, grades 4-8; 
$1.50 per 25 machine-scorable answer booklets for any 
one level; $1.25 per 25 answer booklets for hand scor- 
Ing; 25é per set of scoring keys; $1 per specimen 
Set, postpaid; spelling: James A. Fitzgerald; English: 
Dora V. Smith; reading: M. J. Van Wagenen; pri- 
mary reading: U. W. Leavell ; history: Edgar B. Wes- 
ley; geography: M. E. Branom ; arithmetic: L, J. 

rueckner; literature: Ellen Frogner; science: Victor 
C. Smith and August Dvorak; Educational Test Bu- 
reau. 
4) BATTERY 1, Grade 1; 1947; $1.75 per 25; nontimed 
(90) minutes. 

) BATTERY 2, Grade 2; 1947; $1.75 per 25; nontimed 
(110) minutes. 
C) BATTERY 3. Grade 3; 1947 ; $1.75 per 25; nontimed 
(100) minutes. 
d) BATTERY 4. Grade 4; 1946; nontimed (256) minutes. 
€) BATTERY 5. Grade 5; 1946; nontimed (256) minutes. 
f) BATTERY 6, Grade 6; 1946; nontimed (256) minutes. 
g) BATTERY € Grade 7 ; 1947; nontimed (256) minutes. 
h) nATTERY 8. Grade 8; 1047; nontimed (256) minutes. 


Roland L. Beck, Professor of Education and 
Director of the Demonstration. School, Central 
State College, Edmond, Oklahoma. [Review of s 
Batteries 4-8.] These tests were constructed by 
subject-matter specialists and educators-from 
six universities and one public school system. 
The scales represent extensive cooperation, and 
thus a more valid content in the various subjects 
has resulted. The number of pupils at each 
grade level whose scores were included in the 
standardization of the batteries of Form A 
ranges from approximately five to ten thou- 
sand. 

The tests may be scored by a machine or by 
hand with the cutout type of key. Separate 
answer sheets and re-usable test booklets de- 
crease the cost 50 per cent after the first - 
twenty-five tests in a grade. Percentile rank 
indicators are provided for each grade. Four- 
or five-choice recognition test items are used 
throughout the scales, except for usage and 
punctuation in which alternative response is 
used. Progress norms will be possible as soon 
as Form B has been standardized. 

Charts and maps are used in geography; 
proper method of solution is used in arithmetic 
reasoning; and paragraph comprehension, in 
reading. At least 90 per cent of the pupils are 
allowed to complete the tests. This feature 
makes the answering of the tests largely a mat- 
ter of ability or power rather than speed. The 
derived scores or C-scores make possible com- 
parisons of gains made within a test or from 
one test to another. 
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The analytical outline of the tests gives test- 
item numbers and relative difficulty of the sub- 
ject-matter content used as a basis for each of 
the questions. This information is highly de- 
sirable. . 

The validity of the tests is based on expert 
judgment and research, but some form of statis- 
tical expression in addition would also be valu- 
able. Although coefficients of reliability have 
their limitations, one usually prefers to know 
what coefficients were found for a control 
group. The research back of the grade levels 
of attainment makes for far better educational 
diagnosis. However, grade attainment thus 
evaluated, if used to justify a method of teach- 
ing, might cause one to question the extent of 
the use of averages, medians, measures of vari- 
ability, and other statistical measures. 

Alternative response in usage, punctuation, 
and capitalization and multiple choice in arith- 
metic are not the most valid forms of test 
items; at least, this is the opinion of the writer. 
The test forms which are more valid cannot be 
scored by a machine, and for this reason there 
is a disadvantage in using them. 


Lavone A. Hanna, Associate Professor of 
Education, San Francisco State College, San 
Francisco, California. [Review of Batteries 4-8.] 
These tests present some new and interesting 
features. Few tests for the elementary school 
are set up with separate answer sheets so that 
machine scoring may be used. Yet, in spite of 
the authors’ claim that fourth grade children 
had little difficulty understanding the procedures 


_to be followed, the directions seem complicated 


and the answer sheet formidable. Each battery 
of subject tests is designed specifically for use 
in'a single grade, and the battery of nine tests 
covering the basic subjects of the elementary 
grades would keep attention centered on the 
total program. As a restilt of an analysis of 


_ thirty state courses of study and fifteen city 


curricula, the manual states that “at least 70% 
of the tasks attempted by a child in each sub- 
ject are concerned with material taught in his 
grade.” ; " 

* The words selected for the spelling test were 


those found by the author to be used by chil- ` 


dren outside of school and included in most 
spelling books. A unique feature of this test 


.is the two-phase technique of measurement 


whereby the pupil first writes the dictated 
words on a special answer sheet and then 


checks his own spelling of each word by com- 
paring it with a set of four variant spellings-of 
the same word. The marks thus made are all 
that count in the scoring. On the experimental 
tests, it was found that the pupils made fewer 
errors in marking their papers in this manner 
than did the teachers who scored all the papers 
by checking them with a key. A similar tech- 
nique is used in the arithmetic computation 
tests. The language arts test contains three sub- 
tests: punctuation, usage, and capitalization. 
These are of the right-wrong variety. The read- 
ing and literature tests are among the best in 
the battery. The reading selections are interest- 
ing and the abilities tested well chosen, The 
test, however, looks confusing, and the crowded, 
small, black-faced type used with the responses 
would tend to overpower a child taking the test. 
The literature test should provide data helpful 
in stimulating and directing the selection of 
books to be read. 

It is regrettable that the same functional ap- 
proach was not used with the social studies and 
science sections as was used with the spelling 
and language arts. The manual states that this 
was not done because the experimental forms 
showed that the learnings in history and geog- 
raphy are at present largely informational and 
that the functional items proved too difficul 
for even the higher grades. But if it is true 
teachers stress that which is tested, then these 
tests would tend to perpetuate informational 
rather than functional learnings. One is amazed 
to learn that social studies courses in the mid- 
dle grades of so many schools emphasize ancient 
and medieval history to the extent indicated by 
these tests. 

The authors recognize that there is no agree- 
ment on the grade placement of arithmetic 
processes and recommend adjustment in the ad- 
ministration of the tests to make them fit the 
local curriculum. The unique feature of the 
tests in arithmetic problem reasoning is that 
pupils are asked to indicate the method they 
would use in solving the problem rather than 
to work out the answer. This technique, the 
experimental evidence indicates, distinguishes 
more distinctly the measurement of problem 
reasoning abilities from the measurement of 
computational abilities. 

The tests seem to have been carefully .con- 
structed. The validity of the battery is based 
upon the expert judgment of subject-matter 
specialists, analysis of many state and city 
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a 
courses of study, and an item evaluation of the 
éxperimental form. The probable error of 
measurement of each of the attained scores is 
less than two score units. The norms were 
established on a population of 45,299 from 
schools in forty states, Raw scores are trans- 
lated into scaled scores so that scores from the 
different tests are comparable and a profile can 
be drawn. Tables are provided so that scaled 
scores may be interpreted as either grade equiv- 
alents or percentiles. The tests may be adjusted 
to the curriculum. The manual states that any 
combination of grade-subject tests may be used, 
provided that the shift is not more than one 
grade away from the grade for which the tests 
were designed. 


Gordon N. Mackenzie, Executive Officer of 
the Horace Mann-Lincoln Institute of School 
Experimentation, Head of the Department of 
Curriculum and Teaching, and Professor of 
Education; and Glen Hass, Assistant in the 
Horace Mann-Lincoln Institute of School Ex- 
perimentation ; Teachers College, Columbia Uni- 
versity, New York, New York. [Review of 
Batteries 4-8.] The manual states that these 
tests were developed for the use of teachers 
and administrators in measuring the attainment 
and progress of pupils “in the basic subjects of 
the elementary school.” Each battery of subject 
tests is “designed specifically for use in a sin- 
gle grade” and consists of items whose diffi- 
culty, it-is stated, has been determined by ex- 
perimental research. The battery for each grade 
includes tests in punctuation, usage, capitaliza- 
tion, reading, history, geography, elementary 
science, reading experience—literature, arith- 
metic computation, and arithmetic problem 
solving. Since the tests are not timed, the pupil 
is permitted to demonstrate his level of per- 
formance rather than his speed of response. 
The subject content of each test is based on an 
analysis of thirty state courses of study. and 
fifteen city curricula which, it is asserted, re- 
veal the subject content common to each grade 
in schools in all parts of the country, The 
courses of study used and their dates of publi- 
cation are not specifically identified. The re- 
liability of the tests has been established ex- 
perimentally, and ʻa careful normative study 
was made including pupils in grades four to 
eight in forty states. 

Effort was made to include words in the 
spelling tests which are used by children as 


well as taught in the grade. In the reading and 
science tests, everyday life situations assumed 
to be of interest to children are emphasized. 
The tests in history and geography emphasize 
informational rather than functional outcomes 
because “the functional outcomes are yet to be 
achieved to any measurable extent in all but a | 
very few schools." ~ 

If one is to take for granted that the area of 
agreement that may now exist in the printed 
courses of study of thirty states and fifteen 
cities is the desirable curriculum for elementary 
school children, that this curriculum should be 
crystallized, and that the objectives of the ele- 
mentary school are largely the teaching of a 
group of "basic subjects," then the general use 
of tests of this kind should be encouraged. If, 
on the other hand, one believes that each cur- 
riculum developed by a particular group of 
pupils and a teacher is in many respects unique 
and should be a result of the potentialities of 
that particular situation, he will be disposed in 
most instances to use means of testing and 
evaluating devised in terms of that particular 
situation. Standardized tests tend to set objec- 
tives and to limit teaching to the areas which 
they measure. Admitting that the functional 
outcomes of social studies, for instance, are often 
not achieved at present should not result in a 
testing program which emphasizes informa- 
tional outcomes and places an even greater 
handicap on teachers who seek to base class- 
room work on the life of the community, pupil 
needs, and life situations. It should result, in- 
stead, in a renewed effort to assist such teach- 
ers and to encourage their efforts to achieve a 
more realistic and functional curriculum. Tests 
based on the idea that certain skills must be 
mastered each year, and norms that tend to be- 
come standards, hinder such efforts. 

Curriculum change should be continuous, and 
it is so urgently needed in American schools 
that every possible hindrance to such change 
should be removed. The use of standardized 
tests should be limited to individual pupils 
rather than total class groups, and their use 
should be optional on the part of teachers. 


C. C. Ross, Late Professor of Educational 
Psychology, University of Kentucky, Lexing- 
ton, Kentucky. [Review of Batteries 4-8.] The 
manual points out that since at least 70 per cent 
of the items in each subject test have to do 
with material commonly taught in that grade, 
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they attempt to measure the pupil's attainment 
in the things he is actually being taught. This 
is a worthy ambition, but it is more easily at- 
tainable for pupils in the typical school than 
for pupils in the exceptional school. 

The manual also offers the assurance that 

the tests "were developed in accordance with 
all of the commonly accepted procedures for 
validation of attainment survey tests." The 
procedures included analyses of thirty state 
courses of study and fifteen city curricula to 
discover the subject content in each grade that 
was common among the schools in all parts of 
the country. With these analyses as guides, ex- 
perimental tests for each grade were prepared 
by recognized leaders in their respective teach- 
ing fields. 
* These experimental tests were then adminis- 
tered to pupils in the grade for which each test 
was intended and to the pupils in the grade 
above and in the grade below in a large number 
of schools “carefully selected asa representa- 
tive cross section of the whole school popula- 
tion." The item evaluation of the experimental 
tests was then based upon papers with scores in 
the middle 20 per cent of each distribution, All 
items were eliminated which failed to show a 
significantly higher proportion of correct an- 
swers in the grade for which they were de- 
signed than in the grade below, or which 
showed irregular patterns of grade-to-grade 
difficulty. Items included in the final forms of 
the tests were limited to those which fell within 
a difficulty range of two quartile deviations, 
one above and one below the median. These 
discriminating items were then arranged in or- 
der of difficulty in such a manner as to provide 
a sampling of areas within each subject propor- 
tionate to the emphasis given them in the cur- 
riculum of the typical school. a 

The manual concludes the discussion of va- 
lidity with this sentence: “The validity of the 
tests in the battery series is based, therefore, 
upon expert judgments and sound research and 
is better described by the foregoing paragraphs 
[here somewhat abridged] than by any statis- 
tical expression." This is tantamount to saying 
that curricular validity is more important than 
statistical. Possibly so. But many prospective 
users would probably like to know how the 
scores on the Coordinated Scales of Attain- 
ment correlate with acceptable criteria, such 
as scores on other well-known achievement test 
batteries. The reviewer, for example, has found 
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that the spelling scores on the Coordinated 
Scales correlate with spelling scores on various 
forms of the Stanford Achievement Tests 
from .92 to .96. 

The authors are to be commended for at- 
tempting to make some of their tests more spe- 
cific than usual. The problem-reasoning test in 
arithmetic is a good example. The ordinary 
reasoning test measures in unknown propor- 
tions reasoning analysis and computational skill. 
In the Coordinated Scales the two are kept dis- 
tinct. The problem-reasoning test requires the 
pupil to indicate which of four given methods 
will produce the correct solution to each prob- 
lem. This procedure affords a straightforward 
measure of reasoning ability as distinguished 
from computational skill, which has already been 
measured in the preceding test. 

Some of the subject tests are perhaps too 
specific or restricted in scope. History and geog- 
raphy, for example, are measured as separate 
subjects rather than as one integrated field of 
social studies. Furthermore, tests in these sub- 
jects are largely informational in content rather 
than designed to measure the functional out- 
comes of instruction. This is not in accordance 
with the wishes of the authors but is a logical 
result of the philosophy of all general achieve- 
ment tests based upon practice in the typical 
school, regardless of how expertly the job is 
done. Some of the purposes of measurement 
would be better served with tests designed to 
measure what is being taught in pioneer schools 
rather than in conventional schools. There is a 
need for tests which attempt to measure what 
ought to be and not merely what is. Of course, 
there is a legitimate place for both types of 
tests in the armory of the well-equipped school. 
The Coordinated Scales of Attainment do a 
fine job of measuring what is actually being 
taught today. 

The authors of the Coordinated Scales of 
Attainment have leaned over backward in their 
effort to avoid overstressing reliability, a com- 
mon error of test makers in the past. Such evi- 
dence as is presented is mainly indirect. A 
statement by the publishers appears to assume 
that “sufficiently long tests, composed of scaled 
items of known difficulty and administered un- 
der controlled conditions” will guarantee a high 
level of reliability. The only objective state- 
ment is as follows: “The. probable error of 
measurement of each of the subject test scores 
-in Batteries 4-8, and of each of the subject 
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averages in Batteries 1—3, is less than two Score 
units." But there are no doubt considerable 
differences among the reliabilities of the sep- 
arate tests. And a probable error of any magni- 
tude is far more serious in some tests and 
grades than in others. In terms of grade equiv- 
alents, a probable error of 2 points may vary 
from .2 to .5, depending on the test and grade. 
More detailed information would be desirable. 

On most counts these tests rank high in 
usability. Among the desirable features, the 
following may be mentioned: (a) The direc- 
tions both to the examiner and to the pupil are 
clear, concise, and complete enough to cover all 
essential points. (b) One set of answer keys is 
sufficient for all forms of the test, For hand- 
scoring the keys are of the convenient window 
type printed on heavy stock. The test is also 
adapted for machine scoring. (c) Multiple 
forms of the test will facilitate the use of the 
test for measuring school progress. (d) Norms 
based on 45,000 cases may be considered rea- 
sonably satisfactory. They show the age and 
grade equivalent through a three-year range and 
C-scores and percentile ranks throughout the 
entire distribution of scores. (e) The manual 
contains valuable information regarding the 
derivation of the tests, as well as on the 
administration and interpretation of the scores. 

Itis to be hoped that when the manual is 
revised it may include norms for various types 
of schools, for various geographical regions, 
and perhaps for pupils of varying levels of 
ability. More emphasis would be in order on 
how to use the tests for diagnosis and as a 
basis for remedial instruction. Selected refer- 
ences for further professional reading on these 
points would be helpful. 

The over-all evaluation of these tests places 
them high in the list of available measures of 
general achievement. They deserve a wide use. 


[7] 
Every Pupil Primary Achievement Test. Grades 
1-3; 1932-33; Forms A, B; 75¢ per 25, postpaid; 15 
er specimen set, postpaid; (25) minutes; 
yser and H. E. Schrammel ; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. 


C. C. Ross, Late Professor of Educational 
Psychology, University of Kentucky, Lexing- 
ton, Kentucky. The catalog describes the test 
as follows: “This test may be used for survey 
and diagnostic purposes. It is an achievement 
test consisting of six parts.....Each part is easy 
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to give and to score.” The manual is a pamphlet 
of seven pages (approximately 4 by 8 inches). 
The test itself consists of four pages (874 by 
11 inches). The arithmetic computation test 
and the paragraph comprehension test consist 
of one page each, and the other tests are half a 
page in length. The working time on the six 
parts varies from two to five minutes. 

It is too much to expect valid diagnosis from 
a “reading arithmetic" test of only 10 items or 
from an arithmetic computation test with only 
6 to 8 items in each of the four fundamental 
processes. Moreover, some of the test forms 
used are poorly adapted to diagnosis since the - 
possibility of guessing the correct response is 
very high. For example, on the sentence-read- 
ing test pupils would tend to get 10 of the 20 
true-false items right by sheer chance, and yet- 
the norm for the second grade is only 11.6, or 
slightly more than a chance score. In like man- 
ner, pupils would tend to get about 5-of the 
14 three-response items right byschance, and 
the norm for the second grade is 7.7. Under 
such conditions a valid diagnosis is highly im- 
probable. 

No validity coefficients are given. The au- 
thors rest their case on the statement that “each 
item of the test was carefully selected and 
checked against reputable criteria, such as 
courses of study, textbooks,” etc., and that “by 
this means a high degree of validity has been 
assured.” It would be more correct to say that 
because of the procedure used the validity of 
the test has been “assumed.” The validity of 
any test should be experimentally established 
and not merely assumed. - 

The reliability coefficients for first, second, 
and third grades are reported as .97, 90, and 
.94, respectively. This is surprisingly high, con- 
sidering the length of the test. 

The test lives up fairly well to the authors’ 
claims that "each part is easy to give and to 
score.” The percentile norms, “which are based 
on the scores of 6,709 pupils in 143 schools of 
17 different states,” are undoubtedly sufficient 
for this type of test. 

The test ranks among the least expensive 
general achievement tests available today. In 
fact, its principal function would seem to be to 
provide an inexpensive survey of achievement 
in the first three grades where time and cost 
are primary considerations. Most teachers will 
probably find that longer tests fully warrant 
the additional cost, however. 
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*Fife Tests of Ability. Entrants to secondary 
schools (England and Scotland); 1947; 4 parts; 1 
form; 55. 6d. (tax, 1s. 234d.) per 25 of any one part; 
4d. per manual for any one part; 165(185) minutes; 
Frank M. Earle; University of London Press Ltd. 

0) TEST I, ABILITY FOR ENGLISH (LANGUAGE). 30(35) 
minutes, 

b) TEST 2, ABILITY FOR SCIENCE. 30(35) minutes. 

C) TEST 3, ABILITY FOR ALGEBRA. 60(65) minutes. 

d) TEST 4, ABILITY FOR GEOMETRY. 45(50) minutes 


REFERENCES 
1. EARLE, FRANK M. Tests of A] for Secondary School 


Courses. Publications of the Scottish Council for Research in 
Éducation, No. 10. London: University of London Press, 1936. 
Pp. xiii, 138. ss. * 

2. EanLE, FRANK M. Reconstruction in the Secondary School. 
London: University of London Press, 1944. Pp. 188. 85. 6d. " 

James Maxwell, Principal Lecturer in Psy- 
chology, Moray House, Edinburgh, Scotland. 
The aim of this set of four tests is to predict 
a pupil’s fitness for secondary school in- 
struction in algebra, geometry, English, and 
science. The genesis of the tests is described in 
an earlier publication by the author (1). The 
material from which the tests are constructed 
was drawn from those items of intelligence 
tests and of tests of more specific abilities 
which showed highest correlation with each of 
the school subjects mentioned above. The tests 
were validated on about 1,500 pupils in Scottish 
secondary schools and a smaller number in 
English secondary schools (2), The main cri- 
terion was success or failure in certain public 
examinations. A table attached to each test 
gives the probability that pupils at different 
score levels will successfully complete their 
courses. It should be remembered in interpret- 
ing such tables that (a) they apply to certain 
specific courses in English and Scottish schools 
only and (b) in many such examinations the 
odds are on the pupil passing in any case, e.g., 
in Scottish Junior Leaving Certificate examina- 
tions the odds are 3:1 on a pass for any ran- 
domly chosen candidate, The applicability of 
the standards for the test is, therefore, some- 
what limited. 

TEST NO. I, ABILITY FOR ENGLISH (LAN- 
GUAGE). This test contains five sections, Vo- 
cabulary, Word Pairing, Sentence Completion, 
Paragraph Completion, and Sentence Con- 
struction. The total possible score is 225, and 
the time limit is 30 minutes. 

The main impression given by this test is the 
sloppiness of its construction. The five sections 
are not timed separately, with the result that 
many children are unlikely to reach the later 
sections. The median score of 93, for instance, 


can be surpassed in the first two sections alone. 
Secondly, consider an item like this [directions | 
paraphrased] : Underline the word in brackets 
which makes the best pair with “Unhealthy” 
(country, climate, weather, person). The an- 
swer given is "climate." But is there anything. 
wrong with the others? Nor is this an isolated” 
instance; the test is far too full of such am- 
biguous items and arbitrary scoring. Thirdly, - 
one must also complain about the marking key. 
Paper shortage may excuse the fact that items ^" 
arranged in columns in the test are arranged 
in rows in the key but cannot excuse such in- 
adequate marking instructions as are given in 
"Sentence Construction," which opens with. 
“The scoring is left to the examiner.” Not even 
one sample sentence is given as a standard— 
only a short note on omitted words and inade-- 
quate sentences. . 
One has no option but to pass an unfavor-- 
able judgment on this as an inefficient piece of 
test construction. More care and foresight 
could have obviated most of the more glaring 
defects. i 
One must comment on the fluctuations of the 
median scores for children attending secondary 
schools in Scotland. The median for age 11:9 
is ror, and that for 12:9 is only 88, while the 
upper quartile score for 12:9 is 100. In other 
words, the interpretation is that the average 
child of 11:9 is better than 75 per cent of the 
12:9-year-olds, Consistent interpretation of the - 
standard expected of any child is impossible. 
One would like to know the reliability of the 
test, It may also be noted that 75 per cent of 
the children fail to score half the possible ^ 
marks in the test. Is this due to excessive diffi- 
culty or to arbitrary marking? A table of pre- 
dictive probabilities is also given, but in view 
of the inconsistency of standards it is impossi- - 
ble to estimate what reliance can be placed 
upon it. 
In short, the test is faulty in construction” 
and impossible to interpret. 3 
TEST 2, ABILITY FOR SCIENCE. This test is | 
divided into six sections, which contain the 
following main types of question: arrangement - 
of series, classification, analogies, and problems, 
involving elementary logic and scientific infor- | 
mation. The possible total score is 180, and the - 
time limit, 30 minutes. E. 
This is a somewhat more neatly constructed 
test than some of the others in the series, and 
the sections are of reasonable length and diffi- 
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culty, sothat the pupil's ability is more"com- 
prehensively tested. The upper quartile scores 
approach 130 points, indicating that the test as 
a whole is within the limits of the ordinary 
child's ability. Ambiguous items are not so fre- 
quent as, for instance, in the English test. The 
marking key suffers from the same disadvan- 
tage in arrangement as the other tests, but 
specific answers are given for each item, and 
the marking is therefore considerably more ob- 
jective. 

'The tables of median and quartile scores, 
also, show more consistency than usual. A dis- 
tinct sex difference appears. The validation of 
the test has evidently presented the author with 
some difficulty, because of differences in ex- 
amination requirements in different secondary 
school courses, High scores on the test appear 
to be considerably more predictive than low 
scores, 

Of the four tests in the series this impresses 
one as the most workmanlike and reliable. 
Though the predictive value appears from the 
author’s data to be lower, yet the consistency 
of. the scores given enables one to place greater 
confidence in it. 

TEST 3, ABILITY FOR ALGEBRA. This test 
contains two sections, Number Series Comple- 
tion and Miscellaneous Problems: 55 items in 
all, with a time limit of one hour. Though in- 
tended to measure only aptitude for algebra, 
some of the later items require more than a 
modicum of knowledge, e.g., Item 54 requires 
the formulation and reduction of the expression 


a+b 1 

(a + b)( a — b) xt x CES 
This is second-year algebra.The test also con- 
tains a small number of ambiguities which a 
little care could have avoided. Besides the table 
of the odds on pupils of different scores com- 
pleting their courses successfully, there are also 
tables giving the median and quartile scores for 
English and Scottish boys and girls separately. 
These tables show certain peculiar features : 

4) They are limited to pupils in the first 
three months of a secondary course and appear 
to include young and not fully developed pupils 
at one end of the age range and older and more 
retarded pupils at the other. The age groups, 
therefore, are not by any means of universal 
application but only apply within a specifically 
selected group. 

b) The marks fluctuate rather violently 


within the age limits, The “ 
do best, the younger and 
the difference may be as m 
the median and the quartile scores; 1 
inspire confidence in the reliability of the test. 

c) The upper quartile score is of the order 
of 54. The possible total for the test is 116. 
This again is confirmation that the latter part 
of the test is too difficult. 

The test does have some predictive value and 
constitutes an interesting approach to a very 
real problem; but as it stands, the reliability is 
strongly suspect, and it suffers from the limited 
field in which its validity was established. 

TEST 4, ABILITY FOR GEOMETRY. This test 
contains 84 items of six different types. All 
involve operations with lengths, areas, and 
angles and measure to a considerable extent 
what one can probably best describe as a two- 
dimensional space factor. As in the other tests 
of this series, the sections are not timed sepa- 
rately, with the same possibility that the pupil 
may not attempt the later sections. The aver- 
age upper quartile score, in fact, is less than 
half the possible total. No formal knowledge 
of geometry is required for the test, and it is 
to that extent superior to other tests of this 
series. 

The author, however, has little mercy for the 
marker. In the last three sections, the whole 
burden of working out the answers (and many 
items have more than one correct answer) and 
of constructing a marking key is placed upon 
the marker's shoulders fairly and squarely. 
“Any solution which adequately satisfies the 
condition is accepted.” In one set of 20 geomet- 
rical figures, the reviewer calculated that there 
were at least 44 correct diagrams, not to men- 
tion the innumerable incorrect ones. The mark- 
ing takes a disproportionate amount of time, 
as each item has to be scrutinised individually. 
One must list this as a major defect of the test. 

Medians and quartile scores show the same 
tendency to considerable fluctuation as in the 
other tests. Two features, however, are peculiar 
to the present test. The standard of perform- 
ance of English children, especially girls, is 
considerably above that of their Scottish 
counterparts. National prejudice apart, it is 
difficult to see why English girls should make 
so much better geometers than their Scottish 
cousins, Or is it the unreliability of the test? 
The predictive value of this test, also, is com- 
paratively low. Recent investigation has raised 
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doubt as to the existence of a space factor in 
children under 13 years old. Perhaps this test 
is attempting to measure an ability that has not 
yet fully developed. It would be of interest to 
see the results obtained from older children. 

The test is confined to the measurement of 
abilities in a fairly compact field and is both 
consistent and comprehensive in its construc- 
tion. It is, however, at the same time one of the 
least predictive of the battery. 


[9] 
Gray-Votaw General Achievement Tests. Grades 
I-3, 4-6 or 4-7, 7-9; 1934-45; revision of the New 
South Achievement Tests; 3 editions; Hob Gray and 
David F. Votaw, Sr.; Steck Co. 
G) STANDARD EDITION. Grades 1-3, 4-7; 1939-40; 2 
levels; Forms A, B, C, D; $2.50 per 30; 25¢ per speci- 
men set of any one level; grades 1-3, 52(60) minutes; 
[gm 4-7, 149(170) minutes. 

) REVISED EDITION. Grades 1-3, 4-6, 7-9; 1942-45; 3 
levels; Forms E, F, G, H ; grades 1-3; $2.50 per 30 tests 
of any one level; 25 per specimen set of any one level; 
grades 1-2, 62(75) minutes; grade 3, 55(65) minutes ; 
grades 4-6, 147(170) minutes; grades 7-9, 150(170) 
minutes. _ 

C) ABBREVIATED EDITION. Grades 4-8; 1939-40; abbrevi- 
ated machine-scorable edition of the Standard Edition ; 
IBM; Form N; $2 per 30; 3¢ per machine-scorable 
answer sheet; 25¢ per specimen set; 73(80) minutes. 


REFERENCES 

I. See 40:1187. $ 

2. Barner, Dixie. Comparisons of the New-South Achieve- 
ment Test Scores and the New Stanford Achievement Test 
Scores. Unpublished master's thesis, Southwest Texas State 
"Teachers College, 1938. p. y 

3. Drake, Truman, The Prediction of High School Success 

Means of the Seventh Grade Scores om the New South 
chievement Test, Unpublished master's thesis, East Texas 
State Teachers College, 1940. Pp. 49. 

4. OEHLER, CHESTER A, At-Age-Grade Norms for the Gray- 
Votaw General Achievement Tests, Standard Edition, Unpub- 
lished EUMD thesis, Southwest Texas State Teachers College, 
1940. Pp. 40. 


Roland L. Beck, Professor of Education and 
Director of the Demonstration Si chool, Central 
State College, Edmond, Oklahoma. The test 
for grades 1 to 3 has large size type and covers 
reading comprehension, reading vocabulary, 
dictation (spelling), arithmetic reasoning, and 
arithmetic computation. The intermediate and 
advanced tests include the following subject 
matter areas: elementary science, choice of 

-. words, knowledge of literature, dictation (spell- 
ing), reading vocabulary and comprehension, 
social studies, physical education, and arithme- 
tic reasoning and comprehension. The individ- 
wal educational chart (primary, intermediate, 
and advanced) provides for a comparison of 
chronological age, educational age, educational 
grade, and school grade. 

Grade norms for high, low, and normal abil- 
ity groups are given. Sufficient statistical data 
are given to evaluate the reliability of the tests 


and the variability of the scores. Percentile 
scores equivalent to total average scores for 
grades r to 9 are also given. The statistical 
treatment is thorough enough to satisfy any 
reasonable critic. A sufficient number of stu- 
dents was used to make the norms meaningful 
and applicable. 

The items of the tests were supplied by ex- 
perienced teachers. The curricular validity of 
the tests plus the statistical procedure used in 
constructing the tests merits the recommenda- 
tion of their use. Although one might question 
the appropriate grade level of the vocabulary 
used in parts of the tests, the tests if properly 
used should eliminate much of the guesswork 
in our school systems. 

More use of tests which have been as care- 
fully standardized as these tests should be en- 
couraged. The writer believes the tests should 
be recommended for use in grades 1 to 9, in- 
clusive. 


For reviews by Joseph E. Moore and C. C. 
Ross, see 40:1187. 


[10] 

Iowa Every-Pupil Tests of Basic Skills, New Edi- 
tion. Grades 3-5, 5-9; 1940-45; IBM, Advanced Bat- 
tery; Forms L, M, N, O; 4 tests; 2 levels; separate 
answer sheets may be used with the Advanced Battery ; 
$5.70 per 25 sets of the Elementary Battery; $6 per 25 
sets of the Advanced Battery; $1.50 per 25 of any one 
test in the Elementary Battery ; $1.60 per 25 of any one 
test in the Advanced Battery; 35¢ per 25 cumulative 
record forms; 44¢ per specimen set of either level; 50¢ 
per 25 machine-scorable answer sheets for Tests À, B, 
or D (Form O) ; 63¢ per 25 machine-scorable answer 
sheets for Test C; 15¢ per machine-scoring stencil for 
Test A; 30¢ per set of machine-scoring stencils for 
Tests B, C, and D ; Elementary Battery : 196(230) min- 
utes; Advanced Battery : 263 or 271(325) minutes; 
F. Spitzer in collaboration with Ernest Horn, Maude 
McBroom, H. A. Greene, and E. F. Lindquist (General 
Editor); Houghton Mifflin Co. 
G) TEST A, SILENT READING COMPREHENSION. Elemen- 
tary Battery: grades 3-5; 46(50) minutes; Advanced 
Battery: grades 5-9; 68(85) minutes. 
b) TEST B, WORK-STUDY SKILLS. Elementary Battery: 
grades 3-5; 47(55) minutes; Advanced Battery: 
grades 5-9; 77(90) minutes. 
C) TEST C, BASIC LANGUAGE SKILLS. Elementary Battery: 
grades 3-5; 46(60) minutes; Advanced Battery: 
grades 5-9; 55 or 58(70) minutes. 
d) TEST D, BASIC ARITHMETIC SKILLS. Elementary Bat- 
lery: grades 3-5; 57(65) minutes; Advanced Battery: 
grades 5-9; 63 or 68(80) minutes. 

REFERENCES 


1. Rocers, J. Luoyp. A Survey of the Relative Effectiveness 
With Which Iowa Elementary Schools Are Developing Cer- 
tain Basic Study Skills. Unpublished doctor's thesis, Univer- 
sity of Iowa, 1935. 

2. HUNTER, Pas W. An Analysis of the 1936 Iowa Every- 
Pupil Test in Silent Reading Comprehension jor Grades Six, 
Seven ond Fight. Unpublished master’s thesis, University of 

owa, 1936. 

3. Victor, Grorcta C. An Analysis of Pupil Responses to 

Items in Iowa Every-Pupil Test in Basic Language Skills for 
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1935. Unpublished master's thesis, University of Iowa, 1936. 

4. Moore, Crarence Lez. A Study of the Norms om the 
New Stanford Achievement Tests and the Iowa Every-Pupil 
Tests of Basic Skills, Unpublished master’s thesis, University 
of Towa, 1938. Pp. 41. 

s. TOWNSEND, Bast, Leroy. A Comparative Study of the 
Norms of the Metropolitan Achievement Tests and the Iowa 
Every-Pupil Tests of Basic Skills. Unpublished master's the- 
sis, University of Iowa, 1938. Pp. 34. 

k Bone, PauL Lee, The lowa Every-Pupil Advanced Read- 
ing Test: Selection of Items and Validation of the Test. Un- 
published master’s thesis, University of Iowa, 1941. 

„7. Firzstmmons, THroponE D. The Selection and Valida- 
tion of Iowa Every.Pupil Reading Test Items for Grades 3, 4, 
and §. Unpublished master's thesis, University of Towa, 941. 

8. Hazard, Lucite, Comparisons of the New Stanford 
Achievement Tests and the Iowa Every-Pupil Test of Basic 
Skills. Unpublished master’s thesis, University of Iowa, 1941. 

Frederic L. Ayer, Graduate Student, Teach- 
ers College, Columbia University, New York, 
New York. This battery is intended to measure 
“certain of the skills involved in reading, work- 
study, language, and arithmetic. Skills....crucial 
in the whole educational development of the 
pupil....and which largely determine the extent 
to which he can profit from later instruction.” 
Both the Manual of General Information and 
the Examiner’s Manual state that the tests do 
not provide “standards” for individual pupils, 
schools, or classes, but give only a comparison 
with the standardization population. The au- 
thors recognize that courses of study vary in 
different schools; consequently, they urge that 
scores be considered relative to test content as 
it applies to the local program. 

An excellent discussion of validity, reliability, 
and standardization data is given in the Manual 
of General Information, including a detailed de- 
scription of how to use the four sets of norms 
which are available: grade, percentile, age-at- 
grade, and chronological age. These norms are 
based on four samples of approxiniately 50,000 
pupils each in more than 350 school systems, 
all but about 60,000 of whom were from Iowa 
graded public schools. 

With the exception of some of the items 
listed below, teachers in the elementary school 
(in Elizabeth, New Jersey) where the elemen- 
tary battery was used by this reviewer, consid- 
ered these tests to be valid in light of their 
educational objectives. 

In scoring, no correction is made for guess- 
ing; however, the directions to the pupils do 
imply a penalty for wrong answers. The tests 
are very easy to administer and score. This 
reviewer’s scoring time averaged less than one 
minute per booklet for the elementary battery. 
A tally of items missed is easily recorded di- 
rectly on the scoring stencil to determine spe- 
cific areas of class weaknesses; however, indi- 
vidual weaknesses cannot be easily determined 


since no marks are placed on the pupil’s paper, 
except in the arithmetic and language skill tests. 
The Examiner’s Manuals contain concise, help- 
ful suggestions for remedial teaching. 

The format of the elementary battery is dis- 
tinctly superior to that of the advanced. The 
type of the elementary battery is clear, easy to 
read, and well spaced on a good grade paper. 
The advanced battery uses a much smaller type 
size; the silent reading test is printed on news- 
print; too much print is crowded on a page; 
and the directions to the pupil on the front 
cover are so similar in type size and style to 
the title and copyright data that some confusion 
resulted in getting the pupils to consider only 
the directions. 

Both batteries use mainly 4-response multiple- 
choice items. Again the elementary battery 
seems superior in calling for the marking of 
answers directly on the booklet, while the ad- 
vanced battery uses the front page as an answer 
sheet. The increased tendency toward mechani- 
cal errors in recording answers and the clutter- 
ing of the page with identifying numerals is not 
compensated for by the slight gain in the ease 
of scoring. 

The elementary test on map reading uses a 
reasonably realistic map of a mythical country 
and covers general map direction, river flow 
and land slope recognition, interpretation of 
symbols, and use of a key. Five of the 18 items 
call for some type of deductive reasoning from 
the information supplied. h 

The advanced test on map reading bases 11 
of its 37 items on an imaginary plane map of 
a major portion of a spherical surface. The 
region extends from latitude 30° N. to 70° S. 
and from longitude 80° W. to 30° E. The tre- 
mendous area represented in a space 3 by 474 
inches leads to confusion in distinguishing lines 
of longitude from those of latitude, and the desig- 
nation of all points on the map by letters does not 
simplify the mechanics of answering. For a test . 
in map reading as a basic skill, this reviewer 
questions the validity of items presupposing a 
general knowledge of geography; for example, 
“Which of these cities would probably lead in 
the exporting of corn?" Despite the stress in 
the elementary manual that “A large number 
of children and many adults consider that a 
large river cannot run toward the north” 
(italics mine), the authors have not seen fit to 
include items on the advanced test to cover this 
phase. Seventeen items concern 8 different small 
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maps shaded and spotted to show rainfall, ele- 
vation, population distribution, crop areas, etc. 
Some of them use as many as six different 
types of black and white shadings on a map 134 
by 3% inches to make a jumble that would give 
trouble to a professional cartographer, 

The sections on use of references and dic- 
tionaries are simple, realistic, and effective. 
Part 3 of the use of the index section provides 
a sample index and requires the pupil to give 
page numbers as to where the answers to the 
questions might be found. The directions to the 
pupils neglect to emphasize that only one an- 
swer is expected unless otherwise indicated, and 
the pupils tended to list all page numbers which 
might apply. The manual gives no instructions 
for scoring such papers, and the key often 
covers up answers which might be counted 
correct. 

The tests of basic language skills are excel- 
lent in content. The elementary test is, how- 
ever, difficult to score accurately, and the ad- 
vanced test is cluttered with numbers and 
answer boxes interspersed with the text, The 
punctuation and capitalization sections of the 
elementary test call for simple proofreading 
directly on the test booklet. In scoring, how- 
ever, the key does not expose many of the 
points of overpunctuation and often hides cor- 
rect punctuation which is placed a little out of 
line with the text, Rescoring by hand fre- 
quently revealed as many as I5 errors uncounted 
in the total keyed score. As such overpunctua- 
tion was apparently not considered in the tabu- 
lation of norms, it is impossible to relate such 
Scores to the standardized data. In practice, 
hand scoring with a correctly marked paper 
was nearly as fast as key scoring and much more 
uccurate, An interlinear cut-out type of key 
would seem to be more efficient. 

The arithmetic test is almost evenly divided 


among vocabulary and fundamental knowledge. 


and includes distance judgment, time telling, 
and the understanding of the reasons behind 
basic computations, fundamental operations, and 
story problems. In general, the content is well 
presented and appears appropriate to grade 
levels, 

These batteries should prove excellent instru- 
ments for schools where the curriculum is not 
geared to intensive subject matter teaching, and 
xS the pd Skills are taught as tools to bet- 
er earning. They will probably not prove r- 
ticularly beneficial to the "tradition" cdd 


because they measure skills not knowledge. To 
those administrators who are interested in de- 
termining the comparative proficiency of their 
schools in the basic skills, these tests may be 
highly recommended. 


Gustav J. Froehlich, Assistant Director and 
Assistant Professor, Bureau of Institutional 
Research, University of Illinois, Urbana, Illi- 
nois. This new edition is not materially different 
from earlier editions as to content; but. its 
physical aspects are improved. This test battery 
—consisting of separate tests in (a) reading, 
(b) work-study, (c) language, and (d) arith- 
metic skills—actually samples a large number 
of specific skills in these four areas. 

Elaborate percentile, grade- and age-equiva- 
lent norms are available so that achievement 
status and growth are readily determined ; but 
in the experience of this reviewer, the Jowa 
Every-Pupil Tests of Basic Shills are most 
valuable as'diagnostic instruments. The tables 
of classification of test items according to spe- 
cific skills are an invaluable aid to the diag- 
nosis of difficulties, It is especially easy to note 
the degree of repeated failure relative to any 
one given skill sampled. 

Other valuable features of this test battery 
are: (a) inclusion of a work-study skills sec- 
tion which samples skills more complex than 
the recall of facts—the reading of maps, graphs, 
and charts and the use of reference materials, 
the index, and the dictionary; (b) ingenious 
and convenient scoring stencils; (c) adequate 
and unambiguous instructions to both the ex- 
aminer and the pupil. 

A less commendable feature of the battery 
is inadequate subtest length and too narrow a 
range of test-item difficulty (for the advertised 
range of the battery). In a number of instances 
fewer than a dozen items represent a differ- 
ence in achievement level of one whole grade. 

As of this date, the accompanying profes- 
sional manual is still in press. It is hoped that 
the new manual will continue to include the 
wise precautions for interpreting test scores 
and the valuable suggestions for remedial in- 
struction found in the older manuals. It is also 
hoped that the critical test user will be able to 
find adequate validity and reliability references 
in the forthcoming revised manual. 


Ralph C. Preston, Associate Professor of 
Education, University of Pennsylvania, Phila- 
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delphia, Pennsylvania. These tests cover with 
marked intensity the various basic school skills, 
analyzing and sampling each area with com- 
mendable thoroughness. In the silent reading 
comprehension tests, for example, short para- 
graphs are followed by from five to ten ques- 
tions ; and basic language skills are checked not 
only with the customary language usage items, 
but with tests of punctuation, capitalization, 
spelling, and sentence sense. All tests in these 
batteries lend themselves to diagnostic purposes 
more successfully than tests in most achieve- 
ment batteries, Each Examiner's Manual con- 
tains a chart from which a teacher may analyze 
readily the errors of any pupil in terms of spe- 
cific weaknesses. Reading comprehension errors 
may thus be broken down to three aspects of 
paragraph comprehension, two aspects of loca- 
tion and understanding of detail, three aspects 
of organization of ideas, and two aspects of 
grasp of total meaning. The manual contains 
sound and helpful teaching suggestions for 
developing each of these component skills. 

The tests are not without their perplexing 
features. Both the Elementary and Advanced 
Batteries include grade 5 in their range, yet a 
substantial difference in psychological effect 
upon typical fifth-grade pupils between the two 
tests was apparent in clinical use of the tests 
by the reviewer, and no suggestion is contained 
in the manuals to guide a teacher in the selec- 
tion of batteries for grade 5. The Elementary 
Battery provided several psychological hazards 
to third-grade pupils who were confused by 
certain terms in the instructions, such as title 
page, italic, and the interchangeable use of 
selections and. articles and of items and ques- 
lions, and who were quite frustrated by most 
of the tasks of Test B, Work-Study Skills. 

In general, the instructions to pupils are too 
long and involved. In Advanced Test B, Work- 
Study Skills, the pupil begins on the last page 
of the booklet before attacking the first page, 
and he is instructed to tear off the cover of 
each of the advanced tests and record all his 
responses in boxes provided for the purpose on 
the inside of the cover. There is a disconcerting 
lack of uniformity in the arrangement of re- 
sponses in the reading tests: they appear in 
single columns, double columns, and rows. 
These awkward mechanical points are scarcely 
offset by some clever mechanical improvements 
in scoring. 

Most of the items, their scaling, and their 


combinations are good, but there are excep- 
tions. For example, of seven paragraphs in one 
reading test, five deal with the subject matter 
of biology. Final judgment of the tests awaits 
a description of construction, establishment of 
norms, and establishment of equivalence of 
forms. Until these data are available and ap- 
praised, specific recommendations concerning 
the possible uses of these tests must be with- 
held. 


For reviews by Harriet M. Barthelmess, 
William A. Brownell, J. Murray Lee, and 
Charles W. Odell of an earlier edition, see 
38:872. For reviews by James R. Hobson and 
Constance M. McCullough of Test A, see 501. 
For reviews by William A. Brownell and Leroy 
H. Schnell of Test D, see 334. 


[n] 
Iowa High School Content Examination, 1943 
Edition. Grades 12-13; 1925-43; IBM; Forms L, M; 
separate answer sheets must be used; $2.25 per 25; 
$1.50 per 100 hand-scored answer sheets ; 15 per man- 
ual; 1o¢ per hand-scoring key; 3€ per machine- 
scorable answer sheets; 25¢ per set of machine-scoring 
stencils; 25¢ per specimen set; 75(85) minutes; D. B. 
Stuit, H. A. Greene, and G. M. Ruch; Bureau of Edu- 
cational Research and Service, State University of 
Iowa. 
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Dams, WinLIAM Micnart, “Prediction of Scholastic 
Success in Colleges of Law: II, An Investigation of Pre-Law 
Grades and Other Indices of Law School Aptitude," Ed & 
Psychol Meas 4:13:9 sp '44. * (PA 18:3271) 

Maurice E. Troyer, Director, Evaluation 
"Service Center, Syracuse University, Syracuse, 
New York. Forms L and M are revisions of 
the 1925 edition and include four subtests: 
English and literature, history and social studies, 
mathematics, and science. The entire test in- 
cludes 335 multiple-choice items, requires 75 
minutes, and can be divided into two sections 
for use in single periods of 40 to 45 minutes. 
An attempt has been made in the revision to 
improve and extend the sampling of subject 
matter measured. The emphasis remains on 
content, as implied by the title, rather than on 
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applications of knowledge. Since the major 
Purpose of the test is to predict college success, 
it would seem that the test would be of limited 
value in institutions that base their gtades on 
achievement that goes beyond mere knowledge 
of content. The total test and subtest scores 
correlate from .50 to .60 with academic grades 
in high school and in the freshman year of 
college. The diagnostic value of the test is lim- 
ited to interpretation from subscores in the 
subject-matter areas, 

The score on the test is number of items cor- 
rectly answered. Norms in the manual are con- 
veniently arranged, but there is no provision 
for an individual test profile as is true in a 
number of modern achievement tests, Raw 
Scores may be readily translated into standard 
Scorés or their percentile equivalent. Eleventh- 
and twelfth-grade norms are based on a sam- 
‘pling of 5,500 students from 35 schools scat- 
tered throughout the nation. Further evidence 
of the appropriateness of the sample is lacking. 
Highly tentative college norms are available 
and more substantial norms are being accumu- 
lated. Validity of the test rests on careful selec- 
tion of items by “competent analyses of the 
content of academic high school curricula” and 
the ability of each item to discriminate between 
“criterion groups selected on the basis of their 
response to the test.” Coefficients of reliability 
determined by the Kuder-Richardson footrule 
formula are above 91 for the total test and 
range from .87 to .92 for the four parts of the 
test. 

The format of the test could be markedly 

- improved, especially for use with hand-scored 
or IBM answer sheets. The items are close to- 
gether, and alternative responses are difficult to 
find on the eye excursion from the answer 
Sheet to the booklet. A number of commonly 
accepted rules of item construction are violated. 
For example, the alternative responses in multi- 
ple-choice items may appear at the end, begin- 
ning, or in the middle of the stem. 

The strength of this test seems to be in 
choice of items in relation to content. Its weak- 
nesses are in format, limited diagnostic yalue 
as Compared with some other achievement tests, 
and limited use of applicational type of items. 

[12] 
XIowa Tests of Educational Development. Grades 
9-13; 1942; Forms X-1, Y-1; tests are rented rather 


than sol ; rental service includes scoring, tabulating, 
and reporting of resulte; 75¢ per test; 440(480) min- 
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; Tests 2. 5, 6, 8, and 9), 
Julia Pd Gene d» 7), J. William Maule 
(Test 1), and Paul Blommers (Test 4) under the di- 
rection of E. F. Lindquist; Science Research Asso- 
ciates. 

Henry Chauncey, President, Educational 
Testing Service, Princeton, New Jersey. The 
Towa Tests of Educational Development are 
designed to measure the important and last- 
ing outcomes of secondary education. The 
phrase "educational development" is attractive 
but indefinite. It is an ambitious title and a. 
vague one. The tests reflect this vagueness in 
being a compromise between aptitude and 
achievement tests, Although no test is ever 
completely one thing or the other, in a battery. 
of nine tests one might have attempted to put 
in separate tests materials that largely reflect 
the abilities of the individual and materials that 
measure his knowledge. The attempt was not 
made. Tn all tests the aptitude and achievement 
elements are mixed. While the first four tests 
involve more achievement than aptitude and 
the last five emphasize aptitude, there is no 
sharp division. 

Despite this shortcoming, which is perhaps 
debatable and depends on one's philosophy of 
testing, this program of tests is probably the 
best conceived and executed battery that is 
available for use in the senior high school. Any 
subsequent criticisms should be interpreted in 
the light of this over-all estimate. 

The tests are normally administered in the 
fall so that the results may be used by teachers 
in sectioning their classes, counseling their stu- 
dents, and providing individualized instruction, 

The materials, while “standard” for the most 
part, are quite fresh and not textbookish. The 
items in general seem well chosen and carefully 
edited. The difficulty level seems a little high 
for grade 9, but appropriate for grades 10, 11, 
12, Because of the lack of homogeneity of in- 
dividual tests and the probable high intercorre- 
lation among them (no intercorrelations of the 
tests have been reported), the battery is not 
ideal for profile interpretation. 

Test 1, Understanding of Basic Social Con- 
cepts, is based on civics, political organization, 
economics, and general information. Textbook 
phraseology and approach have been avoided. 
Questions cover a broad area, stress important 
topics. Options, however, tend to be wordy, 
making questions rather awkward and decreas- 
ing efficiency. 
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Test 2, Background in the Natural Sciences, 
covers physics and biology better than chem- 
istry (probably to increase likelihood of correct 
answers from common observation). The test 
contains a smattering of geology and astronomy. 
The individual items are fair, though at times 
they tend to be somewhat specific and a little 
trivial. They call for thinking rather than 
memory, however, and avoid the classroom 
slant, 

Test 3, Correctness in Writing, measures 
punctuation; usage, capitalization, and spelling. 
It is thus very restricted. There is nothing, for 
example, on sentence structure. Capitalization 
is very heavily weighted. The questions are 
straightforward and traditional. 

Test 4, Ability to Do Quantitative Thinking, 
measures arithmetical reasoning plus a con- 
glomeration of other factors: chart interpreta- 
tion, formulation, knowledge of exponents, 
number series, knowledge of basic formulas. 
The problem settings are realistic, and the 
quality of items, in general, good. 

Test 5, Ability to Interpret Reading Mate- 
rials in the Social Studies, contains varied and 
fairly interesting passages. The questions, how- 
ever, are not particularly ingenious and run to 
type. There are a good many "spot" questions, 
which can be answered by direct reference to 
the passage. 

Test 6, Ability to Interpret Reading Mate- 
rials in the Natural Sciences, contains passages 
which, though individually good, are too largely 
drawn from the biological as opposed to the 
physical sciences. In one form there is one 
passage relating to chemistry, none to physics 
and at least six relating to biology. Four of 
these would be covered in a class in botany. 
The questions are subject to the same objec- 
tions as those in Test 5. The correlation be- 
tween Tests 5 and 6 is probably high. 

Test 7, Ability to Interpret Literary Mate- 
rials, contains an excellent variety of passages. 
The questions are good, calling for real literary 
interpretation in certain instances. The corre- 
lation with Tests 5 and 6 is probably about as 
low as is possible for a test of this type. 

Test 8, General Vocabulary, contains words 
which seem appropriate, being drawn from 
those in common usage. The type of item in 
which fragmentary context is provided is an 
improvement over the usual synonym form. 
The distractors, however, do not seem to be 
particularly ingenious. 


TESTS & REVIEWS: ACHIEVEMENT BATTERIES 12 


Test 9, Uses of Sources of Information, is a 
rather interesting test which measures: (a) 
familiarity with contents of standard reference 
sources in general (eg. dictionary, encyclo- 
pedia) and of some sources in particular (e.g. 
Popular Mechanics, Hygeia, National Geo- 
graphic), and (b) practical judgment—making 
a sensible decision as to which reference to use 
when, as in many cases, some must be elimi- 
nated on the basis of their titles. The problems 
and the items are good. The test probably could 
be coached for very successfully in a short 
time. 

The reliability of each of the tests is reported 
as being approximately .91 for a representative 
sample of pupils in a single grade. Each test 
was made of sufficient length to yield just this 
reliability. The reviewer was unable to obtain 
from either the author or the publisher of the 
battery any study of the intercorrelation among 
the nine tests, There is only one study of the 
relationship of test scores to school grades. It 
is reported in an unpublished thesis by Allen 
M. Busse. For six small schools the average 
within-school correlation between test score 
and grade-point average in the previous two or 
three years, for individual tests, ranged from 
.44 to .94. The correlation of the average of 
the first four tests with grade-point average in 
the preceding two or three years was for fresh- 
men .57, for sophomores .70, for juniors .67, 
for seniors .83. The correlation of the average 
of the three reading tests with grade-point 
average in the two or three preceding years 
was for freshmen .58, for sophomores .66, for 
juniors .71, for seniors .79. (The number of 
cases for the four classes were 120, 110, 86, 
and 97, respectively.) The very substantial re- 
lation indicated by these correlations between 
the test scores and achievement in previous 
high school years is a strong indication that the 
tests do measure the outcomes of secondary 
education, at least those that are considered 
important in the Iowa schools. 

The quality of the tests themselves is even 
more than matched by the quality of the man- 
uals that have been prepared for the school 
administrator, the examiner, the guidance offi- 
cer, and the teacher. These materials insure, if 
they are carefully studied, that the test will be 
weil administered and the results appropriately 
used, The manual, How to Use the Test Re- 
sults, contains, in addition to norms, sound and 
extensive advice on the interpretation of test 
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Scores. The principal criticism of the supple- 
mentary materials is that there are too many of 
them, a fact which might lead to confusion and 
superficial study of them. 

The tests are rented rather than sold, and a 
scoring service is provided. Four sets of profile 
cards reporting the scores are returned to the 
School, normally in about two weeks, About 
ten days later a confidential summary report is 
sent, giving the average standard score on each 
test for each grade and showing how those av- 
erages compare with averages of other schools. 

For those interested in using the tests, a 
complete specimen set of materials in a con- 
venient binder may be obtained. In general the 
claims made for the tests are not exaggerated 
except in the descriptive leaflet which is the 
first item in the specimen set, Salesmanship has 
displaced cautious and balanced presentation in 
this one instance. In contrast, a film strip that 
has been prepared describing the tests and their 
uses is straightforward and in good taste. It 
should be particularly useful in explaining to a 
School board that these tests serve a useful 
function and are worth the moderate cost. 


Gustav J. Froehlich, Assistant Director and 
Assistant Professor, Bureau of Institutional 
Research, University of Illinois, Urbana, Illi- 
nois. This test battery is designed to yield ob- 
jective evidence of complex achievement ; that 
is, to measure the degree to which concepts are 
understood rather than the degree to which iso- 
lated facts and operations are recalled. It is 
similar to the United States Armed Forces In- 
stitute's Tests of General Educational Devel- 
opment (High School Level); but the Iowa 
battery has a wider range of item difficulty. 
Thus the Iowa battery is to be preferred over 
the USAFI battery when cumulative growth 
data are wanted from repeated testings of the 
same individual. The results of the USAFI 
battery, with its regional norms, are more read- 
ily interpreted to show attainment, or lack of 
attainment, of a Specified achievement level re- 
sulting from out-of-school experiences, 

The Iowa Tests of Educational Develo pment 
consist of 784 multiple-choice items grouped 
into nine different subtests. The first four sub- 
tests deal with principles and relationships in 
the social studies, the natural sciences, the me- 
chanics of writing, and problem solving, The 
next three subtests are reading tests, designed 
to measure interpretative ability for social 
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studies, scientific, and literary content. The 
eighth subtest is a general vocabulary test. It 
is followed by a test designed to measure func- 
tional familiarity with a number of common 
reference and source materials. 

From the standpoint of the more detailed 
aspects of test construction, the Jowa Tests of 
Educational Development constitute an excel- 
lent battery. The physical make-up and arrange- 
ment of both the test booklets and the separate. 
answer sheets are good. Directions to the ex- 
aminer are specific, unambiguous, and adequate. 
Each different section has a sufficient number 
of sample items. All responses are completely 
objective. The introduction of a "none wrong" 
response in the spelling section and a "mot 
given” response in the problem-solving subtest 
is desirable. Raw scores for all subtests individ- 
ually, as well as the composite score for the 
first eight subtests, are readily translated into 
comparable scores on a 30-point standard scale, 

Beginning- and middle-of-the-year percentile 
norms for grades 9 through 12 are available. 
These norms are based on a sample of “ap- 
proximately 50,000 pupils in 290 high schools 
..located in Iowa and neighboring states. Pro- 
vision has been made for an annual revision of 
these norms.” If, in addition, regional norms 
were to be made available, the chances for na- 
tional acceptance of this test battery would be 
greatly increased. In view of the complete ab- 
sence of norms for grade 13, and by noting 
that the tests are relatively easy for middle-of- 
the-year twelfth graders, this reviewer would 
say that the useful range for this battery is 
grades 9 to 12 inclusive rather than 9 to 13 as 
printed on the test booklet, 

In spite of the elaborate brochure and speci- 
men packet prepared by the publisher, there are 
a number of basic questions which remain un- 
answered to this reviewer. What about validity ? 
In the absence of direct statements and with no 
presentation of experimental evidence to the 
contrary, the implication for this reviewer is 
that what the test measures should be obvious 
from the test itself. The obvious conclusion, 
however, may not always be the correct one. 
For example, the first subtest is supposed to 
measure understanding of social concepts, yet 
it contains a number of items like: 

Where are the largest cities in the United 
States located? 


1) On the Gulf of Mexico and the southern 
part of the Atlantic coast. 
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2) On the Pacific coast. 

3) On the northern part of the Atlantic coast 
and the Great Lakes. 

4) On the Mississippi and Ohio Rivers. 


There certainly are a number of social, soci- 
ological, economic, historical, and political fac- 
tors which determine the sites and sizes of our 
cities; nevertheless, these need not even be 
known, let alone be understood, to yield a cor- 
rect response to the above test item. 

In so far as validity is concerned, this re- 
viewer would like to have both factorial and 
practical evidence. What are the factor load- 
ings in common and meaningful reference fac- 
tors? What is the correlation with practical 
criteria of achievement? To what degree do the 
subtests intercorrelate? Data of this kind are 
conspicuous by their absence. 

What about reliability? The length of the 
subtests individually, and the many evidences 
of meticulous attention to the details of test 
construction, would seem to insure high accu- 
racy of measurement ; yet the reviewer is puz- 
zled by the careless manner in which reliability 
coefficients are mentioned. In the manual it is 
stated: "The reliability of each of the Iowa 
Tests of Educational Development is approx- 
imately .91 for a representative sample of 
pupils in a single grade." The publisher's ad- 
vertising states “....all tests have the same re- 
liability (.91 to .95 for the sample drawn from 
grades 9-12).” A reliability coefficient of .9t 
for a sample drawn from a single grade is quite 
different from one of .91 (or even .95) for a 
sample drawn from a range of talent covering 
four grades. This reviewer would also like to 
know the correlation between Forms X—1 and 
Y-1. 


Lavone A. Hanna, Associate Professor of 
Education, San Francisco State College, San 
Francisco, California. These tests differ from 
the usual comprehensive achievement tests in 
that they are designed to measure the growth of 
the pupil’s general educational development as 
well as to acquaint the teacher with the general 
background which the pupil brings to each of 
the major fields of instruction, The authors 
state that the purposes of the tests are twofold : 
to provide teachers, administrators, and coun- 
selors with data on each pupil so that instruc- 
tion and guidance may be better adapted to his 
peculiar and changing interests, needs, and 
abilities and to give administrators a more de- 


pendable and objective basis for the over-all 
evaluation of the total educational offering of 
the school in order to determine curriculum and 
supervisory emphasis. The nine areas measured 
are: understanding of basic social concepts, 
background in the natural sciences, correctness 
in writing, quantitative thinking, interpretation 
of reading material in the social studies, inter- 
pretation of reading material in the natural 
sciences, interpretation of literary materials, 
general vocabulary, and use of sources of in- 
formation. Each test contains from 50 to 170 
items of the multiple-choice type. The coverage 
of these tests and the fact that they are in- 
tended to measure the relatively permanent 
changes effected in certain broad aspects of the 
pupil’s educational development rather than 
the specifics of any particular course recom- 
mend them over most achievement tests now 
available. Two equivalent forms make it pos- 
sible to alternate the tests so that no pupil is 
measured on the same problems two years in 
succession. The battery requires a total of eight 
hours for administration and should be given 
in three or four half-day periods. 

Schools with limited research facilities will 
also welcome the services given by the Science 
Research Associates. Schools giving the tests 
are required to register with the Science Re- 
search Associates and to agree to give the entire 
battery to every pupil in the high school or, 
during the first year, to every pupil in one full 
grade and to return all test materials not later 
than the day following the administration of 
the tests. The SRA Testing Department scores, 
tabulates, and reports the results. This usually 
requires two weeks. The schools are then pro- 
vided with four profile cards for each pupil 
and with two copies of a confidential summary 
report on which the school scores are compared 
with the national norms. Extra copies of the 
profiles and report may be obtained for an ad- 
ditional sum. However, the cost of the tests and 
service is not only prohibitive for many schools 
but an unnecessary expense for schools which 
maintain their own research staff and test- 
scoring machines. 

The reliability coefficient of .91 for each of the 
tests was obtained for a representative sample 
of pupils in a single grade. The authors state 
that the tests were designed to be of a length 
which would yield this reliability. Raw scores 
for each test are translated into standard 
scores. These have the same meaning for all 
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tests and make it possible to draw a meaningful 
profile of the test performance of each pupil. 
On the 30-unit scale used, the probable error 
of measurement is +1. In addition to standard 
Scores, the percentile rank of the standard 
score is given for each grade. These were es- 
tablished on a population of more than 50,000 
pupils in 290 Midwestern schools. Norms are 
reported separately for the beginning and mid- 
dle of the school year, and the beginning-of- 
the-year norms may also be used as the end- 
of-the-year norms for the preceding grade. 
The Science Research Associates publish an 
elaborate prospectus explaining the test pro- 
gram. It contains three interpretive booklets, 
copies of the tests and answer sheets, and sam- 
ple profiles with suggestions for their inter- 
pretation. 
[13] 
Metropolitan Achievement Tests [1947 Edi- 
tion]. Grades 1, 2, 3-4, 5-6, 7-9.5; 1931-48; 5 levels; 
Forms R, S; complete and partial batteries available 
for grades 5-6 and 7-9.5; the tests in arithmetic, 
reading, and handwriting are available as separates ; 
per copy of Manual for Interpreting; 20¢ per 
copy of Primary Handwriting Scale; 35¢ per specimen 
set, postpaid, of any one level; Gertrude H. Hildreth 
with the collaboration of Frederick B, Graham, Rich- 
ard D. Allen, Harold H. Bixler, and William L. 
Connor; World Book Co. 
4) PRIMARY I BATTERY. Grade 1; 1931-47; $1.60 per 
ru 45(60) minutes, 
PRIMARY II BATTERY. Grade 2; 1932-47; $1.75 per 
25; 85(100) minutes. 
C) ELEMENTARY BATTERY. Grades 3-4; 1932-47 ; $2.30 
per 25; 135(150) minutes. 
d) INTERMEDIATE BATTERY: COMPLETE, Grades 5-6; 
1932-47; $2.70 per 25; 200(240) minutes, 
e) INTERMEDIATE BATTERY: PARTIAL, Grades 5-6; 
1932-47; $2.25 per 25; 155(180) minutes. 
) ADVANCED BATTERY : COMPLETE. Grades 7-9.5; 1932- 
47; $2.70 per 25; 220(240) minutes, 
9) ADVANCED BATTERY: PARTIAL. Grades 7-9.5; 1932- 
47; $2.20 per 25; 165(180) minutes. 
REFERENCES 
CRGA MN Leroy. A Comparative Study of th 
Ne I Ld ly of the 
Beer Fui of Base Bille epulis ath qun 
sis, University of Iowa, 1938. Pp. 34. 
$, Paso, Anian A. The Predictive Value of the Metropolitan 
hiew lished 
Korale of 
Tests and 


r  Unpub- 
jon Veg thesis, Clark University, 1941. (Abstracts € of 


litan Achievement Reading Test.” J Ed Psychol 32: 
Ei i E (fA 16:1207), E a ARES 
» Mites, Eunice C. Predicted Value for High School Suc- 


Primary Grades,” pp. E In 194. 
lent Sci 
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For reviews by J. W. Dunlap, R. Ledger- 
wood, C. W. Odell, E. V. Pullias, and Hugh 
B. Wood, of an earlier edition, see 38:874 and 
40: 1189. For reviews by Foster E. Grossnickle, 
Peter L. Spencer, Harry Grove Wheat, and 
Guy M. Wilson of an earlier edition of the 
arithmetic test, see 38:892 and 40:1458.1. For 
reviews by Ivan A. Booker, Joseph C. Dewey, 
and D. A. Worcester of an earlier edition of 
the reading test, see 38:1105 and 40:1551. 


[14] 
Ohio General Scholarship Test for High School 
Seniors. Grade 12; 1930-47; IBM; new form pub- 
lished annually; Ohio schools: 50 per test including 
scoring, tabulating, and reporting of results ; 150(180) 
minutes; Ohio Scholarship Tests, Ohio State Depart- 
ment of Education. 


REFERENCES 


1. Woop, Ray GromcE. The Ohio General Scholarship Test 
for High School Seniors, 1930-33, Inclusive: Its Diagnostic, 
Motivating, and Predictive Functions, Unpublished doctor's 
thesis, Ohio State University, 1935. (Abstracts of Disserta- 
tions . . . Spring Quarter 1935, 1936. pp. 397-410. 


C. C. Ross, Late Professor of Educational 
Psychology, University of Kentucky, Lexing- 
ton, Kentucky. [Review of the 1947 edition.] 
Each year since 1930 the Ohio State Depart- 
ment of Education has included in its state- 
wide testing program a general scholarship test 
designed specifically for high school seniors in 
the highest 40 per cent of their graduating 
classes. Strictly speaking, these tests are not 
standardized since the only norms available are 
based on the results obtained in the state-wide 
survey itself. Nor are they equated experimen- 
tally from year to year, although they are al- 
ways carefully constructed with the same prin- 
ciples and objectives in mind, are validated 
against the state curriculum and textbooks, and 
are administered under the same directions. 
Such a procedure is likely to result in tests 
similar in type but not necessarily in quality. 
However, the best assumption to make regard- 
ing the 1947 tests is that they possess the same 
merits and demerits as have their predecessors. 
_ Since there is no such thing as general valid- 
ity, any test must be judged in relation to the 
function it is designed to serve. According to 
a statement by the Director, the Ohio General 
Scholarship Tests are designed “to stimulate a 
better attitude toward scholarship especially, 
and education in general, and to find out stu- 
dents of college promise.” These are worthy 
objectives, but the experimental evidence avail- 
able relates to the last objective. 

Follow-up studies in past years have showed 
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correlations of ,404 to .832 between point stand- 
ing in college and scores on the tests as com- 
pared with correlations of .358 to .692 between 
point standing and scores on psychology tests 
for the same students. It is evident that these 
achievement tests are reasonably successful in 
their efforts "to find out students of college 
promise." 

"Tests of general scholarship have certain ad- 
vantages over tests of general intelligence as a 
basis for educational guidance. The scores on 
achievement tests made by individual students 
are more diagnostic and can be discussed with 
greater frankness with students and their par- 
ents than is often possible with scores on intel- 
ligence tests. School tests are also likely to 
serve as a greater motivation to students to 
apply themselves to the school program. But 
this is not an unmixed blessing. At times com- 
petitive examinations on a state-wide basis 
tempt both the students and the teachers to 
point toward these examinations rather than 
direct their efforts toward more defensible ob- 
Jectives, 

Unfortunately no objective evidence is avail- 
able regarding the reliability of the tests. While 
test makers in the past have undoubtedly over- 
emphasized reliability, it can scarcely be con- 
sidered a desirable practice to omit any men- 
tion of it altogether. However, it is a fairly 
safe assumption that a carefully constructed 
and highly objective test of 285 items will have 
no difficulty in meeting the minimum standards 
usually set. 

The test is easy to administer and is designed 
for machine scoring. Tabulations of scores are 
made by the Ohio Department of Education, 
and reports in the form of percentiles are sent 
to each participating school through the chair- 
man of the local committee. The names and 
test scores of high-ranking students are sent 
with a letter of recommendation to all colleges 
and universities in Ohio. The personnel record 
cards of all other contestants are sent to the 
institutions they desire to attend. It would also 
appear that an item analysis made year by year 
would shed considerable light on trends and 
needed changes in teaching emphases. 


[15] 
Progressive Achievement Tests, 1943 Edition. 
Grades 1-3, 4-6, 7-9, 9-13; 1933-43; 4 levels; the tests in 
reading, arithmetic, and language are available as ee: 
arates in both hand-scoring a machine-scorable edi- 
tions ; 35¢ per specimen set of any one level, postpaid; 


Ernest W. Tiegs and Willis W. Clark; California Test 
Bureau. 
a) PRIMARY BATTERY. Grades 1-3; 1933-43; Forms A, 
B, C; $175 per 25; 95(110) minutes. 
b) ELEMENTARY BATTERY. Grades 4-6; 1933-43; Forms 
A, B, C; $1.90 per 25; 120(135) minutes. 
C) INTERMEDIATE BATTERY. Grades 7-9; 1933-43; Forms 
A, B, C; $1.90 per 25; 150(165) minutes. 
d) ADVANCED BATTERY. Grades 0-13; 1034-43; Forms 
A, B; $1.90 per 25; 150(165) minutes. 
REFERENCES 
1, See 40:1193. 


2, McDowarp, Davin. The Relation Between Test Intelli- 
gence and Test Achievement in Grade Six. Unpublished doc- 
tor's thesis, University of Oregon, 1938. (Graduate Theses 


ve 1932742, 6, pp. 29:31. 
E ax. itis Winvieto. A Study of Factors Related to 


Mastery of Sills im Jada, a High Schools Unpablihed 
doctor's thesis, University of Southern California, 1941. (Abe 
stracts of Dissertations iste imation With Standardized 
Tests.” El Sch J 46:5748 Je "a6. * 1 

Paul A. Witty, Professor of Education, 
Northwestern University, Evanston, Illinois. 
PRIMARY BATTERY. The authors aimed to de- 
velop a diagnostic group test to determine pupils’ 
attainment of typical subject matter stressed in 
grades 1 to 3. Five major areas (reading vo- 
cabulary, reading comprehension, arithmetic 
reasoning, arithmetic fundamentals, and lan- 
guage) are divided into 19 subtests for diag- 
nostic purposes. 

The battery includes subject matter repre- 
senting the curricula of many schools; it offers 
suggestions for making profiles and includes 
data for grade-placement comparisons; and it 
makes possible an analysis of sixty elements 
associated with learning difficulty. Grade place- 
ment and percentile norms are provided as well 
as two sets of directions for administering the 
tests: one for hand scoring; another for ma- 
chine scoring. 

A simple scoring key and a class record 
sheet are included as separate items. The class 
record sheet gives essential information to en- 
able the teacher to tabulate, comprehend, and 
interpret test results. 

The test aims to measure “some of the most” 
tangible and easily identified objectives of the 
curriculum.” The selection of these items is, on 
the whole, satisfactory, although some schools 
will stress different outcomes. Reliability coeffi- 
cients are adequate for survey needs. Sugges- 
tions for diagnosis are helpful to the classroom 
teacher. 

These tests are carefully constructed and 
satisfy the requirements for effective survey 
tests. They are also useful in studying group 
attainments and in revealing needs in separate 
subjects. The diagnostic features of the tests 
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are distinctly superior; moreover, the tests can 
be readily used and interpreted by the classroom 
teacher. These tests are undoubtedly among 
the most superior and practical of the newer 
achievement batteries for primary grade chil- 
dren. 

ELEMENTARY BATTERY. These tests are simi- 
lar to the primary battery in mechanical fea- 
tures; reliability and validity measures are 
similarly presented. Two hours (instead of one 
hour and thirty-five minutes at the primary 
level) are required to complete these tests. The 
longer period of time permits a further break- 
down of skills. 

This battery is especially well adapted for 
survey purposes as well as for diagnostic en- 
deavor in the intermediate grades. The time 
limit—two hours working time—is adequate; 
the directions for giving and scoring are clear 
and concise; and the results can be readily 
comprehended, interpreted, and applied by class- 
room teachers. 

INTERMEDIATE BATTERY. This series carries 
forward testing in the five areas found in the 
primary and elementary batteries. The tests are 
expanded to cover appropriate subject matter 
to parallel the curriculum of the junior high 
school. For example, arithmetic reasoning in- 
cludes: number concepts, symbols and rules, 
numbers and equations, and problems. Items 
such as punctuation and parts of speech receive 
emphasis in the language test in the interme- 
diate battery. 

These tests, like the preceding batteries, base 
their validity on their inclusion of items found 
in typical curricula. They appear to satisfy this 
criterion of validity. Reliability, too, is ade- 
quate, and the diagnostic features of the tests 
are excellent. In fact, these tests conform with 
the best practices in test construction and 
standardization. 

ADVANCED BATTERY. This battery continues 
to employ the five subject headings, but in- 
cludes content in 21 subtests to reveal students' 
specific strengths and weaknesses at higher 
levels of learning. These tests are designed to 
serve as a basis for remedial work and con- 
structive educational guidance. 

Although the tests are designed for grade 9 
to the adult level, they seem to the reviewer to 
include material most appropriate for testing 
and guidance in grades 9 and ro. The test is 
also of value in disclosing the range of abilities 
of high school pupils and shows clearly certain 


strengths and weaknesses in fundamental skills. 
Reliability coefficients are adequate; scoring 
keys and materials are simple and easy to use; 
and the time limits are sufficiently long to ob- 
tain reliable diagnostic data. 

The tests for grade 9 to the adult level are 
of great practical value. It is clear that the 
selection of basal abilities at the higher levels 
is a difficult and controversial issue. Yet the 
selection of items for this test seems justified 
by the criterion of helpfulness in diagnosing 
the general needs of high school pupils. At 
every level, the Diagnostic Profile is proving of 
value. But like every other test profile with 
which the writer is familiar, this profile has 
serious limitations, To understand a child's 
needs, a much more penetrating study is re- 
quired. However, test results, when supple- 
mented by other data, offer the teacher and the 
clinician material of unquestionable worth. 

One important feature of the Progressive 
Achievement Tests is the emphasis given to 
the essential continuity of effective learning by 
including five areas of testing at all levels 
(from grade 1 to adult). Appropriate altera- 
tion and expansion in subtests adapt the bat- 
teries to the curriculum at the various levels. 
These batteries are admirably constructed and 
well standardized. At the present time these 
tests have no equal as practical survey and 
diagnostic instruments ; they can be understood 
and used by the classroom teacher. Publication 
of editions of the tests for machine scoring is 
another practical feature which increases the 
value of the tests. 

In conclusion, the reviewer recommends 
strongly the Progressive Achievement Tests 
for survey uses and endorses them also, when 
they are accompanied by supporting data, for 
diagnostic work. 


For reviews by D. Welty Lefever, Charles 
Watters Odell, and Hugh B. Wood of an earlier 
edition of the complete battery, see 38:876 and 
40:1193. For reviews by Harry A. Greene and 
J. Paul Leonard of an earlier edition of the lan- 
guage test, see 40:1292. For reviews by William 
Arthur Brownell, C. L. Thiele, and Harry Grove 
Wheat of an earlier edition of the arithmetic 
test, see 38:893 and 40:1459. For reviews by 
Ivan Albert Booker, Frederick Barton Davis, 
and Joseph Clarence Dewey of an earlier edition 
of the reading test, see 38:1110 and 40:1563. 
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X*Progressive Tests in Social and Related Sci- 
ences. Grades 4-8; 1947; IBM; 3 parts; Forms A, B; 
5¢ per copy of the machine-scorable edition of any one 
part; 3¢ per machine-scorable answer sheet of any one 
part; 60¢ per set of stencils for scoring answer shcets 
for any one part; 35¢ per specimen set of any one part; 
nonin noe mipit A minutes for machine- 
CO ion; Georgia Sachs 
Sexson; California Test Bureau. ergo ds 
a) PART I, SOCIAL STUDIES I. $1.25 per 25, booklet- 
marked edition; nontimed (50) minutcs, (62) minutes 
for machine-scoring edition. 
b) PART II, SOCIAL STUDIES II. $1.25 per 25, booklet- 
marked edition; nontimed (60) minutes, (75) minutes 
for machine-scoring edition. 
C) PART II, RELATED SCIENCES. $I per 25, booklet- 
marked edition; nontimed (40) minutes, (50) minutes 
for machine-scoring edition, 

[17] 
Public School Attainment Tests for High School 
Entrance: Examinations of Abilities in Reading, 
English, and Mathematics. Grades 8-10; 1935; à 
battery assembled from previously published tests; 1 
form; $4.80 per 100; 30¢ per specimen set; 90(100) 
minutes; Arthur E. Traxler, J. C. Tressler, Henry D. 
Rinsland, Roland L. Beck, and J. Murray Lee; Public 
School Publishing Co. 


REFERENCES 
1, See 40:t195. 


Benjamin S. Bloom, College Examiner, 
Board of Examinations, The University of 
Chicago, Chicago, Illinois. This battery booklet 
consists of portions taken from four standard 
tests and is organized to measure abilities in 
reading, English, and mathematics. It was evi- 
dently constructed very carefully and appar- 
ently has made good use of investigations on 
word difficulty, frequency of error, and fre- 
quency of use. 

The relatively high correlations between 
parts of the test and grades in high school in- 
dicate that the tests are of value in predicting 
academic achievement and that the scores can 
be used as a basis for advising students. One 
might question whether the authors' recom- 
mendations for use of the tests in eliminating 
algebra students or in sectioning classes are 
justified. In part, these recommendations are 
qualified by the suggestion that teachers con- 
sider not only the scores, but also the responses 
on each exercise, as well as the students' other 
achievements and traits. It is hoped that these 
suggestions are not so general as to be dis- 
missed by most of the teachers using the test 
results. 

The authors also recommend that the tests 
be used to determine pupils of outstanding 
ability who might be given supplementary or 
creative work and to serve as a basis for an early 
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diagnosis of the individual student as a means 
to the reduction of failures. This use of the 
tests is probably the most valid. A careful 
analysis of the results should give each teacher 
greater insight into the characteristics of the 
individual students. Such insight would make 
it possible for the teacher to aid each student 
to secure an optimal achievement. 

Since the tests are intended for high school 
entrants, it is unfortunate that the reliability 
coefficients were computed on a much more 
variable group—grades 7 to 12 of a selected 
high school. However, it is apparent from other 
data that the reliability coefficients are suffi- 
ciently high to permit use of the tests for mak- 
ing decisions about individual cases. 


For reviews by Harold Gulliksen and C. C. 
Ross, see 40:1195. 


[ 18 ] 
Stanford Achievement Test. Grades 2-3, 4-6, 7-9; 
1923-40; revision of New Stanford Achievement Test; 
3 levels; Forms D, E, F, G, H; the tests in arithmetic 
and reading are available as separates in regular hand- 
scoring edition for grades 4-6 and 7-9; tests having 
basically the same material in arithmetic, language 
arts, elementary science, literature, and social studies 
are available in machine-scorable editions; Truman L. 
Kelley, Giles M. Ruch, and Lewis M. Terman; World 
Book Co. 
a) PRIMARY BATTERY. Grades 2-3; $1.40 per 25; 35¢ per 
specimen set; 65(80) minutes. 
b) INTERMEDIATE BATTERY. Grades 4-6; Complete: $2.70 
per 25; 35¢ per specimen sct; 150(170) minutes; Par- 
tial: $2.20 per 25; 35¢ per specimen set; 110(125) min- 
utes. 
C) ADVANCED BATTERY. Grades 7-9; Complete: $2.70 per. 
25; 35€ per specimen set; 150(170) minutes ; Partial: 
$2.10 per 25; 45¢ per specimen set ; 110(125) minutes. 


REFERENCES 
1. Happock, Nrru P. “The Use of the Stanford Achieve- 
nan Test to Tigica Hin Sraa Success." Sch & Soc 25: 
114-1 a 22 27. 1:95! 
4-6 da 22 '27, © MO "The Purpose of a Standardized Test 
in Spelling.” J Ed Res 20:319-26 D '29, * (PA 4:902). 
Kerey, Truman L. "A Communication Concerning the 


3. 
Dite of ‘Achievement Test Items.” J Ed Res 22:309-14 N 


Trenon L, “Predicting Success in Algebra,” 
t a '3o. * (PA 411362) 4 
ee "A Reply to Kelley's Communication.” 

30. z 

6. Fonn. T. G., AND Loves, M. Epmuno. "The Relative 
Difficulty of Three Achievement Examinations." J Ed Psychol 
26:218-22 Mr '35. * (PA 9:4801) ^ 

7-8. Swicart, J. Farman. A Study of the Prognostic Value 
of the Stanford Achievement Test Scores, When Compared 
With Scores Received on the Sones-Harry Achievement Test 
and High ene Grades. Unpublished master’s thesis, Uni- 
versity of Oregon, 1935- 

9. Woorr, 5 AND Lino, CHRISTINE. “A Study of 
Some Practical Considerations Involved in the Use of Two 
Educational Test Batteries.” J Ed Psychol 26:629-34 N '35. * 


A 10: 
EIE Euzaser Lov, The Agreement Between 


10, PARKS, n 
Achievement Tests and Teachers’ Marks in Junior-High 
School. Unpublished master's thesis, University of Oklahoma, 
1937, o] P. 


1937. Pp. iv, 44. (Abstracts of Theses... 
Me PULLS. Far V. "Commercial Standardize 
pp. 65-80. In Variability in „Results From New Type 
ment Tests. Duke University Studies in Education, 
Durham, N. C.: Duke University Press, 1937. Pp. 100. Paper. 
$1.00 (London: Cambridge University Press. 4s. 6d.) * PA 


11:5966) 
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12. Barner, Dixie. Comparisons of the New-South Achieves 
ment Test Scores and the New Stanford Achievement Test 
Scores. Unpublished master's thesis, Southwest Texas State 
Teachers College, 1938. Pp. 39. 

13. BrtLHARTZ, WiLLIAM H., E The Derivation of Instru- 
ments for the Prediction of Co eie Success on the Basis of 
Measurements Made in Junior High School Years. Unpub- 
lished doctor's thesis, University of Pittsburgh, 1938. (Ab- 
stracts of Theses... 1938, RP 48-52.) : f 

14. FLOTTMAN, EDWARD ALBERT. The Prediction of High 

School Success From Achicvement Tests. Unpublished mas- 
ter’s thesis, University of Washington, 1938. (Abstracts of 
Theses... , 1938, pp. 79-81.) 
X 1$. Jorpan, WiLLIAM Leroy. The Relation Between Success 
in High School and Stanford Achievement Test Scores and 
Otn Factors, Unpublished master’s thesis, Baylor University, 
1938. 

16. Moore, Crarence Ler. A Study of the Norms on the 
New Stanford Achievement Tests and the Iowa Ever -Pupil 
Tests of Basic Skills. Unpublished master's thesis, University 
of Towa, 1938. Pp. 41. 

17. Power, Lrovp. The Relation of the Educational Age of 
the New Stanford Achievement Test to the Various Subject 
ie in an Oklahoma School. Unpublished master's thesis, Ok- 
lahoma Agricultural and Mechanical College, 1938. 

18. Osrwarr, Jay H. The Effect of Time Limits on Selected 
Sections of the New Stanford Achievement Test. Unpublished 
master’s thesis, Duke University, 1939. Pp. 56. 

19. Peake, NELLIE L, “Relation eset Spelling Ability 
and ranm Ability.” J Exp Ed 9:192-3 D '4o. * (PA 15: 


2377 

20. Fernnerc, Henry, "IO Correlated with EQ.” J Ed Psy- 
chol 32:617-23 N "41. * (PA 16:2607) 

21. Hayes, SAMUEL P. "Stanford. Achievement Tests for 
the Blind: New and Old." Teachers Forum 14:254 S “qin 
(PA 16:1177, title only) 

22. Hazar, Lucite. Comparisons of the New Stanford 
Achievement Tests and the Iowa Ev -Pupil Test of Basic 
Skills, Hipubliahod master's thesis, University of Iowa, 1941. 

MZEK, 


23. Ni 

offAcadend $ ; 

ment Test” J Seca! E Measured by the Stanford Achieve. 
24. SERITAN, KASPER THOMAS. Comparison of the Results of 


the New Stanford and Metropolitan Achievement Tests and 
the Use of f T d 


dition and the Older Edition of the Stanford Achieve- 
a” El Sch J 42:616-20 Ap '42. * (PA 
27; TRAXLER, Artur E, “A Study of the Revised Edition 


qe. fraxter, ARTHUR E., AND SELOVER, MARGARET S. “R 
lationship of Elementary-School Achievement Test te A hiec 
ment Taken in 


LU Convent, Erne rane Deus and Application 
e: 

Achievement” d Exp Ei 1a!207-26 Mr 4. DEPT S 
1. Lavos, Gro liabili ii 
Achievement Test Administered te te p oun, Educational 

sos. Ramen A A Ge tin wi 

Testa T El T d Jean vL c valuation With Standardized 
3. BOX. WiLtttAM H. Spelling Profici is ij 

Schools in Indiana: Analysts of the. Shell Prog ipe b 

101,747 Pupils in Grades 2 to 8 in 7,681 Teaching Units. | i 

letin of the School of Educa Indiana University, Vol. 23. 

No. 1. Bloomington, Ind.: sion of Research and Field 


Eyes January 1947. Pp. viii, 31. Paper. $0.50. * (PA 22: 


Walter W. Cook, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. These tests constitute the second exten- 
sive revision of the pioneer among standardized 
achievement test batteries. When Forms A and 
B were published in 1923, achievement testing 


was in a state of chaos. Although a few of the 
earlier tests had been of high quality, the rush 
to publish and profit had resulted in a flood of 
tests, hastily constructed, inadequately stand- 
ardized, and full of technical imperfections. 
Tests in the various subjects had been con- 
structed independently of each other by differ- 
ent persons at different times and in different 
places. Norms for each test had been estab- 
lished with different groups of schools at dif- 
ferent periods of the school year by different 
procedures on population samples inadequate 
for reliable results. Rarely were adequate de- 
scriptions provided of the types and geographic 
distribution of the schools used. It was impos- 
sible to measure progress over a period of time 
because only one or two forms of a test were 
available. It was impossible to construct a pro- 
file of relative achievement in different subjects 
because the norms provided with the different 
tests were not comparable. When the first ex- 
tensive revision (Forms V, W, X, Y, Z) 
was published in 1929, it became possible not 
only to construct a profile of relative achieve- 
ment in the different school subjects but also to 
make growth studies. 

The second revision (Forms D, E, F, G, and 
H) may more accurately be described as five 
new forms. Approximately 80 per cent of the 
items are new, Some changes have been made 
to facilitate administration and scoring, but the 
nature of the tests is not essentially changed. 
Six of the tests, Word Meaning, Language 
Usage, Literature (acquaintance), Social Stud- 
ies I (History), Social Studies II (Geography), 
and Elementary Science, are completely objec- 
tive and scored with a perforated key. Four of 
the tests, Paragraph Meaning, Spelling, Arith- 
metic Computation, and Arithmetic Reasoning, 
require the writing of words or numbers, are 
not completely objective, and are scored with a 
panel key. The intermediate and advanced bat- 
teries could easily have been made completely 
objective and, by using an answer sheet, non- 
consumable. No doubt a machine-scored edition 
is contemplated. S 

No information is given regarding the sources 
of the material other than that the Library of 
Congress and the Library of the Office of Edu- 
cation were utilized in making an analysis of 
curricula, courses of study, and textbooks and 
that the items were evaluated by subject-matter 
specialists. 

The difficulty and discriminating power of 
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the items selected for the tests were determined 
by administering the original tryout items to 
over 200 pupils in each of grades 2 through 
10. Increase in per cent passing from grade to 
grade was the criterion. This procedure re- 
sulted in highly reliable tests in the skill areas 
(mostly above .90), but in the content areas 
(literature, social studies, and science) the re- 
liabilities are low, ranging from .71 to .84. It 
is doubtful whether tests in content areas, de- 
signed to be used on a national scale, should be 
given serious consideration when the curricula 
of the schools vary as much as they do. It is 
difficult to build reliable tests under such con- 
ditions. Reliabilities and the probable errors of 
measurement are reported for grades 3, 5, and 
8 with adequate cautions regarding the inter- 
pretations of individual profiles. 

National age and grade norms based on an 
adequate sampling of schools are provided but 
with an innovation which should receive atten- 
tion. The norms are based on pupils who are 
making normal progress through school, that 
is, at grade for age. They are particularly suited 
for those schools in which regular promotion 
is the rule. However, age and grade norms of 
the traditional type are also furnished for com- 
parative purposes. The striking fact is that the 
new norms are higher. For example, an equated 
score of 70 on any of the tests corresponds to 
an age score of 14 and a grade score of 9 on 
the new norms; on the traditional norms it 
corresponds to an age score of 15 years and 7 
months and a grade score of 9.6. This indicates 
that the practice of hoarding dull pupils 
through nonpromotion tends to lower the 
achievement averages of schools for both age 
and grade groups. 

The section of the manual dealing with the 
interpretation of test scores is well done. The 
authors have never made extravagant claims 
regarding the use of the tests. Survey func- 
tions have been emphasized: the comparing of 
the achievement of the individual with the 
class, the class with the school grade, the school 
with the school system, and the school system 
with national norms. When homogeneous 
grouping is practiced, the warning is given that 
the variability of the so-called homogeneous 
group in each test area should be studied. The 
profile chart furnished is ideal for recording 
progress through school, making case studies, 
and filing with the permanent record folder. 
However, the paper is poor, and bond paper 


profile charts, available from the publisher, 
should be used for the permanent record. The 
interpretation section of the manual would 
have been strengthened by preparing the user 
of the tests for the finding that the range of 
ability in a class will be from four to ten years 
and that no reasonable procedure of grouping 
or promotion will reduce this substantially. 


Ralph C. Preston, Associate Professor of 
Education, University of Pennsylvania, Phila- 
delphia, Pennsylvania, The care with which the 
Stanford Achievement Tests have been con- 
structed places them among the very best of 
comparable tests. Most of the items in this 
second revision are new and have. survived a 
rigorous scrutiny. Age and grade equivalents 
and a profile chart appear on the cover of the 
primary forms and inside the front cover of 
the intermediate and advanced forms, facilitat- 
ing the recording and interpretation of scores. 
Those who administer the tests will appreciate 
the brevity, simplicity, and clarity of the in- 
structions to the pupils. The uniformity in the 
arrangement and form of items within any one 
subtest will be appreciated especially by those 
who use them with younger children. The 
primer-size type on the first two pages of the 
primary forms is a commendable innovation 
and could appropriately have been used for the 
initial group of items in each of the subtests. 

The tests contain several doubtful features. 
For example, the word mastery subtest in the 
primary forms might better have provided for 
the spreading of each item along one horizontal 
line (as in the same subtest in the intermediate 
and advanced forms where simplicity and con- 
tinuity of arrangement are of far less moment) 
instead of distributing the responses on two 
lines and arranging the items in two columns. 
The most severe criticism of the batteries must 
be directed at those subtests covering the con- 
tent fields. In the first place, certain items are 
unsatisfactory. “The male tanager is easily 
seen because of its: bright colors, black crest, 
long tail.” While common sense and the process 
of elimination would lead the informed child 
to the correct response, the item would scarcely 
win the respect of the pupil who knows that 
the bird is quite drab for the greater part of the 
year. “The steam engine was greatly developed 
by: John Fitch, James Watt, Charles Duryea.” 
This seems to place undue importance upon the 
meaning of “greatly developed.” The President 
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during the World War was: Hoover, Coolidge, 
Wilson." This item would no doubt confuse a 
modern child who has grounds for believing 
that the war of 1939-1945 was the World War. 
Obviously, it is important that tests receive the 
confidence and respect of pupils and teachers 
using them. 

The subtests covering the content fields 
clearly are not abreast of curricular changes. 
To be sure, items on history and American 
government are labeled “Social Studies I" and 
items on geography are labeled “Social Studies 
II,” but these tests ignore the strong trend to- 
ward fusion and toward greater emphasis upon 
community study. A disproportionate number 
of history items deal with political, economic, 
and military events, and the history sections as 
a whole are inconsistent with the tendency for 
elementary school history to focus upon “How 
People Live”—a tendency which received for- 
mal endorsement by the Committee on Ameri- 
can History in Schools and Colleges in 1944. 
The tests of elementary science are heavily 
weighted with the content of health and nature 
study and are poorly adapted to any school 
offering a modern program of elementary sci- 
ence in which the physical and biological sci- 
ences are given approximately equal treatment. 
The titles dealt with in the literature tests re- 
flect an arbitrary, stereotyped tradition from 
which schools are struggling to emancipate 
themselves. One of the reasons for these diffi- 
culties, of course, is the lack of curricular uni- 
formity in the content areas. Tt suggests the 
possible desirability of confining nationally 
standardized achievement tests to the skill areas, 
Schools that contemplate the use of the Stan- 
ford Achievement Test at the intermediate or 
junior high school levels are strongly urged to 
Procure the Partial Batteries. These have not 
been surpassed as tests of the skill subjects and 
do not include the content subtests with their 
questionable clarity, emphasis, and scope, 


For review by Margaret G. McKim of the 
reading test, see 503. For a review by Ray G. 
Wood of the social studies test, see 505. 


[19] 
XTests for High School Entrants. 1945-47; Form 
j $2.50 per 25; rof per manual; 15¢ per specimen 
set; 40(50) minutes; Lester D, Crow and Alice Crow: 
Acorn Publishing Co. v 
Benjamin S. Bloom, College Examiner, 
Board of Examinations, The University of 
Chicago, Chicago, Illinois. This test consists of 


four parts: English, reading comprehension, 
arithmetic, and general information. It is de- 
signed to serve three main purposes: "to pre- 
dict probable success in high school, to assist 
in planning the program of the incoming high 
school pupil in terms of his demonstrated 
achievement, and to aid in the diagnosis of in- 
dividual difficulties or weaknesses." Since the 
authors are so explicit in stating the purposes 
the test is designed to serve, it seems only fit- 
ting and proper that the review should indicate 
the extent to which each of these purposes ap- 
pears to be met. 

To predict probable success im high school. 
Tests such as this have been found to have pre- 
dictive value for scholastic achievement. How- 
ever, no evidence is given of the actual relation- 
ships between the performance on this test and 
scholastic achievement. Instead, the authors cite 
two coefficients of correlation, one with a stand- 
ardized reading test and the other with a test 
of mental ability. Both correlations are exceed- 
ingly high, but this is evidence only that this 
test has many things in common with the older 
tests, One wonders whether there is any value 
in a creative effort which results in a new test 
having high correlations with a well-established 
test—unless it is intended to be a parallel form. 

To assist in planning the program of the 
incoming high school pupil in terms of his 
demonstrated achievement. Here, again, there 
is no evidence that students with one score on 
the test will do any different academic work 
from students with another score. Until such 
evidence is produced, it would seem very dan- 
gerous to follow the authors’ recommendation 
that a student in the lowest decile on this test 
“probably is unready for promotion to high 
school,” or that students in the highest fourth 
on this test “probably are ready for an en- 
riched curriculum or for special honors classes.” 

To aid in the diagnosis of individual dificul- 
ties or weakness. Here the authors are on firmer 
ground. Having constructed the test on the 
basis of material and skills represented in ele- 
mentary courses of studies, they may properly 
claim that students who are especially weak on 
any part or who have missed specific test ques- 
tions should be considered for remedial instruc- 
tion. One might question whether the authors 
have any basis for suggesting that pupils whose 
Scores on any part of the test fall in the lowest 
25 per cent should be scheduled for simple or 
remedial work. Undoubtedly, some of the low- 
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est students could profit from special aid and 
instruction. 

The following are some general comments on 
the test. This is essentially a speed rather than 
a power test since the time limits are too short 
to permit the majority of students to complete 
each of the sections. One searches in vain for 
some plan to the general information test with 
its almost random collection of names, dates, 
and other assorted facts. The rapid alteration of 
questions on definitions, spelling, and grammar 
presents an irritating task to the student. The 
authors neglect to give vital information on the 
population used in computing reliability coeffi- 
cients. 


J Consult Psychol 11:158 My-Je '47. * no 
data are given on the test's validity for these 
purposes. * The percentile norms, on over 5,000 
cases are for "high school entrants." Since this 
might mean seventh, ninth, or tenth grade stu- 
dents, the applicability of the norms is ques- 
tionable. 


[20] 

xTests of General Educational Development. 
High school, college; 1943-45; IBM; Form B; 2 
levels; separate answer sheets must be used; 40£ per 
25 machine-scorable answer sheets for any one test; 
15¢ per scoring key for any one test; 25¢ per specimen 
set of any one test, postpaid ; work limit (120) minutes 
per test; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25 of any one high school 
test; $3 per 25 of any one college test; 65¢ per 25 
machine-scorable answer sheets for any one test; 50¢ 
per key for any one test; 50¢ per specimen set of any 
one test.) 
à) HIGH SCHOOL LEVEL. 1943-45; 5 tests; $2 per 25 of 
any one of Tests 1-4; $1.75 per 25 of Test 5. 

1) Test 1, Correctness and Effectiveness of Expres- 

ston, 

2) Test 2, Interpretation of Reading Materials in the 

Social Studies. 

3) Test 3, Interpretation of Reading Materials in the 

Natural Sciences. 

4) Test 4, Interpretation of Literary Materials. 

5) Test 5, General Mathematical Ability. 
b) COLLEGE LEVEL. 1944-45; 4 tests; $2 per 25 of either 
Test 1 or 4; $3 per 25 of either Test 2 or 3. 

1) Test z, Correctness and Effectiveness of Expres- 

ston. 

2) Test 2, Interpretation of Reading Materials in the 

Social Studies. 

3) Test 3, Interpretation of Reading Materials in the 

Natural Sciences. 

4) Test 4, Interpretation of Literary Materials. 
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Herbert S. Conrad, Technical Consultant, 
College Entrance Examination Board, Prince- 
ton, New Jersey. I. THE Test SERIES AS A 
WHOLE. A. DESCRIPTION OF TESTS. The battery 
consists of five high school level tests and four 
college level tests. The five tests at the high 
school level are intended to be used as a single 
unit or team, rather than as separate, independ- 
ent tests. The four tests at the college level are 
intended for independent use. (1) The test on 
Correctness and Effectiveness of Expression 
for the high school level includes (a) 20 five- 
choice multiple-choice spelling items; and (b) 
80 multiple-choice items (from two to five 
choices per item) in which various corrections 
are made to specified punctuation marks, words, 
or phrases in a connected text. The college level 
test is similar, except that at this level somewhat 
greater emphasis is placed on “effectiveness of 
expression” than in the high school level test. 
(2) The test on Interpretation of Reading Mate- 
rials in the Social Studies presents reasonably 
extensive passages (two to four paragraphs 
long) “taken from textbooks and other writings 
in the social studies” ; multiple-choice questions 
are asked about each passage. The questions, 
however, are such that they serve not only as a 
measure of comprehension of reading, but also, 
in part, as a test of the subject’s “background 
of general knowledge in the social studies.” The 
college level test is similar, except that in this 
test there is a special category of questions which 
can be answered directly from the student’s 
general knowledge, without reference to the 
passage; these questions (interspersed among 
the other items of the test) have been marked 
by an asterisk. (3) The test on Interpretation 
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of Reading Materials in the Natural Sciences 
(at both the high school and the college level) 
"consists of a number of short reading passages 
taken from textbooks and other materials in 
the natural sciences." The passages are longer 
than in most other such tests (from two to eight 
paragraphs). (4) The test on Interpretation of 
Literary Materials (at both the high school and 
college levels) includes passages of prose and 
of poetry; most of the passages are compara- 
tively short (well below a half-page). Multiple- 
choice items follow each passage, measuring 
mainly comprehension ; a few items are devoted 
to literary criticism, identification of literary 
forms, and recognition of literary devices. 
(5) The test on General Mathematical Abil- 
ity is issued at the high school level only. 
This test consists mainly of arithmetical reason- 
ing problems in multiple-choice form; in addi- 
tion, the test includes four very simple problems 
from high school algebra, and two very simple 
problems from high school geometry. 

B. PURPOSE OF THE TESTS, The principal pur- 
pose of the tests at the present time is “to help 
the schools [including the colleges] determine 
the amount of academic credit which should be 
granted students for their educational experi- 
ences in military service.” “The high school 
level tests are intended to be used primarily to 
determine whether or not the individual has had 
the equivalent of a general high school educa- 
tion,” or whether the individual is "as able to 
carry on in a program of general education at 
the college level as a student who has formally 
completed high school.” The college level tests 
“are intended for use primarily to determine 
whether or not the individual tested is as capable 
of carrying on advanced college work as the 
student who has taken certain broad introduc- 
tory or survey courses generally offered in the 
first two years of the liberal arts college, or has 
reached the same level of general educational 
development as the student who has had such 
survey courses.” The underlying purpose is to 
“provide the basis for a much more rational 
solution of the accreditation problem than the 
‘blanket-credit’ solution of World War I.” 

Everyone will grant that “blanket credit” is 
well-nigh indefensible. There are, however, 
fundamental questions regarding accreditation 
by examination which the authors of the Tests 
of General Educational Development have ap- 
parently failed to face. First, written examina- 
tions fail to cover adequately the laboratory or 


field experiences which practically all good 
courses provide. Second, written examinations 
fail to cover such intangibles as the social and 
other benefits from class discussion, the favor- 
able emotional orientation toward a subject- 
matter field created by an alert, stimulating 
instructor, and the moral and ethical values flow- 
ing from a qualified teacher successfully leading 
an interested classroom of students. To deny 
such intangibles is to deny most of the useful- 
ness of face-to-face teaching. Finally, there is 
the question whether it is desirable to permit 
an A-student to “get by" with a barely passing 
performance on an accreditation examination, 
when he might well have done distinguished 
work, had he taken the course. 

While there is certainly room for difference 
of opinion in regard to the questions above, it 
is doubtful whether—in ordinary circumstances 
—the majority of the questions can be answered 
favorably to the assumptions underlying the 
Tests of General Educational Development, 
However, the war has created special circum- 
stances; and we may be entitled to give the 
veterans “the benefit of the doubt.” This seems 
especially true in regard to high school accredi- 
tation. Failure to possess the high school diploma 
is commonly a serious vocational (not to say 
educational) handicap. There are probably 
many young men whose education was cut short 
by the war, who are just as good (vocationally 
and educationally) as many others with a 
diploma. It seems an undue penalty on such 
individuals to deny them the diploma if, by 
the evidence of five tests, they demonstrate their 
general capacity and accomplishment. As a mat- 
ter of fact, about a dozen states have adopted 
the Tests of General Educational Development 
for peacetime use, in order better to meet the 
needs of special, individual cases. On a priori 
grounds, however, the justification for using 
the Tests of General Educational Development 
for the accrediting of college courses seems more 
doubtful; and the empirical evidence that exists 
(2, 3, 4) fails to support the granting of ad- 
vanced standing in college courses on the basis 
of scores on the Tests of General Educational 
Development. The authors of the tests may 
object that the evidence to date is not particu- 
larly germane or conclusive: for this, however, 
they may blame themselves—since it is one of 
the duties of the authors of educational tests to 
provide exactly such evidence themselves. (See 
the section on validity, below.) 
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Even if one grants the validity of the Tests 
of General Educational Development for the 
purpose of granting both high school and col- 
lege credits, one must still criticize the all-or- 
none attitude exposed in the Examiner's Man- 
ual. Here it is recommended that a student 
either be given a high school diploma or mot 
be given the diploma; in the latter case, the 
student presumably starts where he left off— 
certainly there are no suggestions as to what 
performance on the Tests of General Educa- 
tional Development justifies placement in the 
middle of grade 12, or at the beginning of 
grade 12, etc. Similarly, in the case of course 
credits in college, the Examiner's Manual rec- 
ommends either granting a full six units of 
credit (for a year's work in English composi- 
tion, or in a survey course in social studies, etc.) 
or none. Half-year credits would seem to be 
both a possible and desirable alternative. 

„C. GENERAL EVALUATION. In the present re- 
viewer's opinion, the general quality of most 
of the items in the Tests of General Educational 
Development (both at the high school and the 
college levels) is high or superior. A similarly 
favorable evaluation is made of the test direc- 
tions, which aim to acquaint all examinees with 
the purposes of the tests and the best procedures 
to use in answering the various questions. The 
directions to test administrators for scoring the 
tests are also adequate. 

On the basis of detailed scrutiny, the re- 
viewer judges that the scores on the Tests of 
General Educational Development reflect a mix- 
ture of intelligence (mainly verbal), general 
subject-matter background, and, occasionally, 
specific knowledge. Obviously, such composite 
or “hodge-podge” scores are resistant to clean- 
cut interpretation or research. The important 
question is whether the tests measure intelli- 
gence at the expense of subject matter. This 
question might have been answered by the appli- 
cation of a suitable one- or two-hour intelligence 
test to veterans at the time that they took the 
Tests of General Educational Development. 
Crawford and Burnham (2), working with a 
sample of 135 civilian freshmen at Yale, report 
a median correlation of .64 between the Scholas- 
tic Aptitude Test (Verbal) of the College En- 
trance Examination Board and the college level 
tests. This correlation would probably be higher 
for the more nearly unselected sample likely to 
take the high school level tests. Whether the 
correlations with intelligence would be higher 


or lower in a group of veterans several years 
out of school is difficult to say. 

The authors do not claim that the tests have 
had the benefit of pretrial or item analysis. Only 
suggestive standards or norms are provided; 
institutions are urged to prepare their own 
standards or norms. No data are presented 
concerning the reliability of the tests, the inter- 
correlations among the tests, validity, the com- 
parability of Form A and Form B, or the com- 
parability of the similarly named high school 
and college level tests in each field. Either a 
great deal of what is ordinarily regarded as part 
of professional test construction was not under- 
taken or it has not been reported. The complete 
absence of validity data is especially deplorable. 

Although the present reviewer is somewhat 
indiscriminately inclined to applaud Jong tests, 
many others will doubtless feel that the high 
school battery (five tests, with a total time 
allowance of ten hours) is too long. The danger 
in a long battery, of course, is that users will 
“skip” two or three of the tests. It will be re- 
called that the high school battery is intended 
for use as a single unit, so that—if the direc- 
tions are followed—all five of the tests must 
be administered to each examinee. 

The Examiner’s Manual refers rather dis- 
paragingly to learning acquired by “vicarious 
experiences or contacts with textbooks and for- 
mal pedagogical procedures." Warm approval 
is given to veterans’ educational growth which 
is “due in a much larger measure than normally 
to their own deliberations, to first-hand observa- 
tion, to direct experience, to self-directed read- 
ing and study, to conversation and informal 
group discussions, and to a process of matura- 
tion intensified and accelerated by the circum- 
stances of war." All this is apparently in defense 
of the nature of the tests, which are not closely 
related to “topical outlines of content in specific 
school courses." In the reviewer's opinion, ob- 
jective validity data would be much more to the 
point. The manual indicates clearly enough the 
“desirable general characteristics” which were 
to be built into the tests, but fails to supply evi- 
dence concerning the success of the attempt. 

D. FORMAT. The format of the tests appears 
to this reviewer exceptionally good. Especially 
commendable is the arrangement of the reading 
passages in such a manner that it is unnecessary 
for the examinee to turn over a page in order 
to refer back tà the text. This was accomplished 
at the cost of “wasted” paper—a courageous 
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action during the period of wartime shortages. 
On the negative side, it may be said that the 
high school level answer sheets fail to group 
items in the manner of the college level answer 
sheets; such grouping seems to facilitate scor- 
ing and to help prevent errors. 

E. DIRECTIONS. As indicated above, the test 
directions are superior with respect to inform- 
ing examinees concerning the purposes of the 
tests and the best procedures to use in answering 
the questions, Some aspects in which the direc- 
tions could be improved are as follows: (a) The 
directions on each test booklet state that the 
examinee will be allowed two hours; but the 
Examiner's Manual states that "each student 
should be given as much time as he needs to 
complete the tests." Although few examinees 
will require more than two hours, it is desirable 
that the two sets of directions be identical. (b) 
The preliminary directions to the examinee 
should tell him explicitly to go from section to 
section or page to page without waiting for 
further directions. This might be reinforced by 
placing the direction *Go on to the next page," 
below the end of the printing on each page. (c) 
The test administrator is nof provided with ver- 
batim directions throughout. This means that 
different test administrators will phrase their 
directions somewhat differently, with uncon- 
trolled variations of emphasis. Very likely it 
would be better to supply verbatim directions 
throughout. 

F. TIME LIMIT. According to the Examiner's 
Manual, the time allowance “is adequate for 
nearly all persons” to whom the tests may be 
given. Apparently, then, the speed factor has 
been eliminated, in the interest of a superior 
measure of “level” and superior convenience of 
test administration, It is a moot question, how- 
ever, whether the virtually complete elimination 
of speed results in the most valid possible test, 
and whether the application of completely un- 
speeded tests permits the most efficient use of 
the available testing time. 

G. DIFFICULTY LEVEL. The difficulty level of a 
test should be appropriate to the ability of the 
group for whom the test was devised, Appar- 
ently any veteran who cared to do so might take 
the Tests of General Educational Development; 
there was nothing to lose except time, and 
everything to gain. It is impossible to make a 
safe judgment concerning the difficulty levels 
of the various tests in the veteran group, be- 
cause no data of any kind are given for veterans 
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who took the test. All the data in the Examiner’s 
Manuals refer to civilian, nonveteran groups of 
students. If the reviewer were forced to guess, 
he would say that, for the veteran group, the 
tests are probably more difficult than they should 
be, from the point of view of most efficient 
measurement. 

H. DISTRIBUTION OF SCORES. The frequency 
distributions of raw scores for the civilian 
standardization groups are not presented, nor 
are the essential descriptive statistics of the dis- 
tributions (mean, standard deviation, and meas- 
ure of skewness). If a person is sufficiently 
interested to do so, the distributions of raw 
scores for the civilian groups can be reasonably 
well reconstructed by the joint use of the tables 
of percentile norms in the Examiner’s Manuals 
and the tables of standard-score equivalents of 
raw scores given on the scoring stencils. 

I. EQUALITY OF UNITS. If we assume that the 
standard scores are expressed in at least roughly 
equal units, then it appears that the raw scores, 
in general, fail to express equal units. Thus, in 
the test on Interpretation of Reading Materials 
in the Social Studies (High School Level), the 
five-point range from 40 to 45 covers a 6-point 
range of raw scores (24-30) ; whereas the five- 
point range from 50 to 55 covers a 12-point 
range of raw scores (36-48). The greater fine- 
ness of raw-score units in the higher region, 
50—55, serves no useful purpose, since the region 
of the recommended "cutting score"—where the 
raw-score units might well be finest—is at 45 
or below (in terms of standard scores). While 
it is not essential that raw scores should express 
equal units, it is generally considered preferable 
that the units be at least roughly equal (except 
perhaps for increasing fineness in the region 
of a "cutting score"). 

T. ALTERNATE FORMS. Special precautions are 
taken to protect the security of Form A of the 
tests; Form B is distributed more freely. The 
existence of only one secure form does not 
constitute a great disadvantage, since it is not 
likely that many individuals will be retested. 
The reviewer is not aware, however, that any 
precautions have been taken to prevent a veteran 
from being tested on Form A at one college, 
again on Form A at another college, and still 
again at a third college. The veteran might 
choose to enter the college where he made the 
best score on the tests and got the best “deal” 
with respect to advanced credit. 

K. NORMS AND STANDARDS. The high school 
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level norms "were established for a sample of 
35,432 public high school seniors tested in 
April-June 1943, just before graduation from 
a general (non-technical or non-trade) high 
school curriculum.” Norms for each of the 
college level tests “were established for a repre- 
sentative sample of college students (freshmen 
and sophomores only) just completing a related 
comprehensive introductory or survey course.” 
A total of from 2,290 to 3,285 usable papers were 
obtained for the various college level tests. Raw 
scores were converted to “standard scores” by 
the McCall T-scale technique (this involves 
converting the original score distribution for 
each test into a normal curve). The civilian edu- 
cational groups provide the basis for the evalua- 
tion of the veterans’ scores. 

The norms in the Examiner's Manual are ap- 
parently based on Form B of the tests. Form A 
is the form recommended for application to 
veterans, Although the assumption is clearly 
made that the norms for Form A are identical 
with those for Form B, no supporting evidence 
for this assumption is given in either the Exam- 
iner's Manuals or in the Guide to the Evaluation 
of Educational Experiences in the Armed 
Services (1). 

The norms for the high school level tests are 
presented for the "total U.S." and for six 
regional groups: New England, Middle Atlan- 
tic, South (white only ?), North Central, North- 
western, and California. This regionalization of 
norms is unusual and worthy of emulation. 
However, it may be noted that homogeneity does 
not prevail within a region ; thus, the educational 
status of Maine is much lower than that of 
Massachusetts, and that of Delaware is much 
lower than that of the District of Columbia. A 
superior basis of classification, for educational 
purposes, would include not only region (and 
state), but also the population of the town in 
which the student received his education (pri- 
vate school students excluded) ; or, still better, 
the average teacher's salary within the school 
system (data available, for nearly all school 
systems, from the U.S. Office of Education). 
In the tables of regional norms, a few curious 
shifts of strength and weakness may be ob- 
served, For example, the New England region, 
while generally strong, falls down on the Test 
of Interpretation of Reading Materials in the 
Natural Sciences. No explanation is suggested 
in the manual for such a shift. 

For the college level tests, norms are pre- 


sented for three "types" of college, as follows: 
Type 1, colleges whose entering freshmen made 
a mean score above 113 on the 1941 American 
Council on Education Psychological Examina- 
tion; Type 2, colleges with mean A.C.E. scores 
between 113 and 95; Type 3, colleges with mean 
A.C.E. scores below 95. This is a useful basis 
for the fixing of differential norms or standards. 
Two suggestions: first, it would be helpful, for 
the local determination of standards, to know 
not only the lower limit, but the mean A.C.E. 
score, of the institutions of Type 1; similarly, 
it would be helpful to know the mean A.C.E. 
score of institutions of Type 3. Second, since 
not all colleges use the A.C.E., it would be 
helpful if the Examiner’s Manual contained 
equivalent scores onthe Scholastic Aptitude 
Test of the College Entrance Examination 
Board and on the Ohio State University Psy- 
chological Test. 

A statement in the Guide to the Evaluation 
of Educational Experiences in the Armed 
Forces concerning a possible basis for granting 
a high school diploma has puzzled the reviewer. 
The Guide suggests that if “the examinee has 
made an average standard score of 45, or above, 
on the five tests in the [high school level] bat- 
tery,” he might be considered eligible for a 
diploma; and then it is further stated that, “in 
general, approximately 80 to 85 per cent of the 
students who are graduating from public high 
schools throughout the country would meet" 
this requirement. A standard score of 45 is one- 
half standard deviation below the mean in the 
normalized distribution which the authors em- 
ployed; and the percentage corresponding to the 
point, — 46 in a normal curve, is not “80 to 
85," but approximately 70. 

As already mentioned above, no standards 
are recommended for accreditation at any point 
below the full granting of the high school 
diploma; nor are any standards suggested for 
the granting of half-year credit for college 
courses. 

An interesting feature of the norms is that 
the college level standard scores are expressed 
on the same scale as the high school standard 
scores. This undoubtedly has both advantages 
and disadvantages. The reviewer feels that the 
college level standard scores would be more 
readily understandable by most persons if ex- 
pressed in terms of a mean of 50 anda standard 
deviation of 10 in the college group. 

L. RELIABILITY. No evidence is presented 
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concerning the reliability or error of measure- 
ment of any of the tests at either the high school 
or the college level. There appears to be no 
reason, however, to doubt the adequate reliabil- 
ity of the tests. 

M. INTERCORRELATIONS. The intercorrelations 
among the various tests are not reported. In 
view of the great length of the high school level 
battery (a maximum of 10 hours), information 
concerning the intercorrelations might be use- 
ful for the selection of an abbreviated battery. 
It appears likely that the test on Interpretation 
of Literary Materials might be omitted without 
serious damage; whether the battery could be 
still further reduced cannot be safely conjec- 
tured without the correlational data. 

N. VALIDITY. No evidence is presented con- 
cerning the validity of the tests for the purposes 
for which they were devised. Possibly this is 
due to the various pressures and irregularities 
affecting wartime research. However, no lengthy 
cumulative study was necessary to collect use- 
ful validity data. One chief question is whether 
the Tests of General Educational Development 
measure academic achievement independent of 
general intelligence or general reading ability. 
This could be answered by determining the 
semipartial correlation, regia), where e = 
scores on the Tests of General Educational De- 
velopment, g= grades in a particular subject 
(such as English composition, or social studies), 
i= intelligence test scores, and d — general- 
reading test scores. An objection to this tech- 
nique is that (for quick results) civilian high 
school and college students would have to be 
employed: and veterans differ from such cases 
in having been out of school for as long as two, 
three, or four years. To meet this problem, a 
different approach might have been employed, 
as follows: test a large sample of armed forces 
personnel on the Tests of General Educational 
Development, and analyze the results according 
to (a) educational level (last grade completed), 
(b) intelligence (as measured by the Army 
General Classification. Test, or by the Navy 
General Classification Test and the Navy tests 
on reading and arithmetical reasoning), and (c) 
years since leaving school. Suppose, for exam- 
ple, that, in groups alike in educational level 
and general intelligence but differing in num- 
ber of years since leaving school, no reliable 
differences were found in scores on the Tests 
of General Educational Development: such a 
finding would be consistent with the hypothesis 
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that the tests measure the "ultimate objectives 
of the whole program of general education, 
and....minimize as much as possible the more 
immediate and temporary content objectives of 
special school subjects." Further analysis should 
show an appropriate superiority of the college- 
trained over the high-school-trained group (in- 
telligence being held constant). The complete 
absence of such easily obtainable validity data 
as described above is rather hard to understand. 

0. NAME OF THE TESTS. The generic name, 
Tests of General Educational Development, will 
annoy those who envisage "education" as in- 
cluding the development not only of "learning 
and thinking," but also of interests, attitudes, 
skills, tastes, appreciations, and habits. The pres- 
ent tests obviously do not attempt to measure 
all the “ultimate objectives of the whole pro- 
gram of general education." 

II. GENERAL COMMENTS ON THE TESTS OF 
INTERPRETATION OF READING Marertats. In 
general, the passages used in the tests on Inter- 
pretation of Reading Materials in the Social 
Studies, Interpretation of Reading Materials in 
the Natural Sciences, and Interpretation of Lit- 
erary Materials seem to the reviewer sufficiently 
long and exceptionally well chosen. These 
reading tests are intended to measure (a) the 
student’s understanding of “basic terms and 
phrases,” (b) his “background of substantial 
knowledge in the field tested,” and (c) his 
ability “to interpret and to evaluate a number of 
reading selections representative of those he 
will have to read and study in subsequent school 
work.” These are certainly three commendable 
objectives : but is a reading test the most efficient 
vehicle for the measurement of the understand- 
ing of terms and phrases, and the measure- 
ment of “substantial knowledge”? The reviewer 
thinks not, and would prefer to see these aspects 
of ability sampled separately. Separate scores 
for these abilities might also be advisable, even 
if only a single composite score is finally used. 
Composite or “hodgepodge” scores are fre- 
quently acceptable—provided we know what 
they are a “hodgepodge” of. 

A serious question concerns the independence 
of the three specialized reading tests from in- 
telligence and general reading ability. This 
question could be answered by comparing the 
scores of those who have had the subject perti- 
nent to the test with the scores of those who 
have not had the subject (general reading ability 
and intelligence being held constant). Other 
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needed data are partial standard deviations of 
the type, Oe.ia, and semipartial correlations of 
the type, feçg.ia) (the symbols e, g, i, and d 
have been defined in a preceding section). 

The authors have apparently not run through 
the experiment of asking examinees to answer 
the reading questions without reading the pas- 
sages. Especially for the tests in the social 
studies and the natural sciences, it would be 
interesting to know to what extent the reading 
of the passages may be unnecessary. 

III. Comments on Inprvipuat Tests, Al- 
though each item of the tests was scrutinized, 
lack of space forbids detailed comment. The 
general quality of the items appears high. A few 
items look to the reviewer as though empirical 
item analysis might reveal ambiguity, or exces- 
sive “closeness” between the correct answer and 
a “decoy.” 

TEST I, CORRECTNESS AND EFFECTIVENESS OF 
EXPRESSION. The use of well-chosen, interest- 
ing, connected passages, instead of isolated sen- 
tences, makes this test more interesting to take 
than most; in addition, a larger field becomes 
available for the testing of punctuation, the 
placement of phrases, and numerous other as- 
pects of composition. Unfortunately, nearly 40 
per cent of the items in the high school level 
test relate to punctuation (with a heavy con- 
centration on the use of quotation marks for 
indicating conversation). The college level test 
shares the excellence of the high school level 
examination, with considerably less emphasis on 
punctuation. A peculiarity of the college level 
test is that the choice of “omit” (the question 
being whether to retain, shift, or omit a phrase) 
is correct nine times out of twelve. In neither 
the high school nor the college test do there seem 
to be sufficient items relating to coherence and 
unity. The reviewer would like to see separate 
scores for “correctness” and for “effectiveness” 
—even if such scores are later combined into a 
single, composite measure. 

A fundamental issue in connection with the 
test on Correctness and Effectiveness of Ex- 
pression is the question whether a test composed 
of recognition-type items is as valid as the 
free-answer or essay-type test. After all, the 
student’s chief task in English expression is not 
to recognize correct and effective English, but 
to produce it. Quite possibly, correctness of 
expression can be more validly measured by 
objective, recognition-type items than can effec- 
tiveness of expression. Crawford and Burnham 


(2), in their study of civilian Yale freshmen 
(N = 135), found a correlation of only .32 be- 
tween the English composition test of the Col- 
lege Entrance Examination Board and the test 
on Correctness and Effectiveness of Expression. 
This figure can hardly be regarded as reassur- 
ing, especially if it is considered that correctness 
of expression enters definitely into the scores 
on the College Board test. It may be added, how- 
ever, that a recent (unpublished) study at the 
College Board has obtained a raw correlation 
of .64 between an hour’s composition test and 
a 50-minute objective test on correctness of 
expression (N = 359). 

TEST 2, INTERPRETATION OF READING MATE- 
RIALS IN THE SOCIAL STUDIES, The questions in 
this test appear thoughtful and, on the whole, 
excellent. The reviewer objects strenuously, 
however, to the notion that “excessive protec- 
tionism” benefits “manufacturing concerns” 
(high school level test, Item 70). Obviously, an 
excessive tariff on, for example, article X would 
reduce purchasing power for all other articles 
with a lower consumption-priority than X, and 
all the manufacturers of these other articles 
would suffer thereby—unless, of course, all 
manufacturers were protected exactly equally. 
The error exemplified in this test item seems 
to derive in part from the custom, in elementary 
social science, of talking in generalities. It would 
probably be too much to ask for reading-test 
passages which depart from this custom; but it 
is not too much to ask that test items be free 
from unnecessary error due to overgeneraliza- 
tion. 

It seems noteworthy that none of the reading 
passages in the test relate to war, geography, or 
military strategy—the immediate bases (whether 
we like it or not) of our national existence. 

The reviewer would like to air two criticisms 
which should be charged against the typical 
social studies curriculum, rather than directly 
against the tests which must largely follow that 
curriculum: (a) Most texts (and hence also 
most tests) place a heavy emphasis on move- 
ments and trends and events to the comparative 
neglect of personalities, leadership, political 
partisanship, and personal intrigue. The view 
thus conveyed is that trends and events make 
men—and not vice versa. Both views, however, 
are correct; and we might have a more respon- 
sive public opinion if texts and tests gave suit- 
able emphasis to the fact that, to an important 
degree, men have controlled events and can 
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continue to control them. (b) Since many of 
the applications of history and economics to 
current affairs are dubious, there would seem 
to be room for more “pro and con” discussions 
in texts and more test items in which “cannot 
tell" is one of the choices ; another possible type 
of item is “cannot tell because” followed by 
four or five choices. (It is admitted that such 
items might be very difficult to prepare.) Ele- 
mentary social science tends to stick to safely 
factual and strictly historical material. Even at 
the elementary level, however, and certainly at 
the college level, it seems desirable to consider 
some of the controversial applications, if his- 
tory and economics are to be alive and make 
their proper contribution to current thinking. 

TEST 3, INTERPRETATION OF READING MATE- 
RIALS IN THE NATURAL SCIENCES. Here, again, 
there are excellent reading passages and excel- 
lent questions. In the reviewer’s judgment, how- 
ever, too little attention is given to the knowl- 
edge and skills developed in the laboratory; 
there are too few diagrams in the reading pas- 
sages; and there is an underrepresentation of 
computational problems and the handling of 
equations. The passage on blood groups, in the 
college level test, seems extremely poorly writ- 
ten, because a rule stated in the second para- 
graph is given a crucial qualification in a more 
or less incidental sentence toward the end of 
the fourth paragraph. However, such a reading 
passage may be typical of some of the writing 
in textbooks, and on this account defensible in 
a reading test. 

TEST 4, INTERPRETATION OF LITERARY MATE- 
RIALS. About one-third of the items in the high 
school level test, and one-half of those in the 
college test, are based on poetry. At each level, 
the test measures mainly comprehension ; but 
literary criticism, the identification of literary 
forms, and the recognition of literary devices 
are also included (more in the college level test 
than in the other). There are more items in this 
test for which the answers seem debatable than 
in either of the other two reading tests. 

Outstanding among the goals of instruction 
in English literature is the development of liter- 
ary interests, tastes, appreciations, and habits. 
An examination of the sort under review sup- 
plies valid evidence concerning the intellectual 
Prerequisites to such goals; but fails to provide 
evidence concerning the attainment of the goals 
themselves. Given capable instruction, it would 
appear safer to make course credit contingent 


upon both examinations and classroom attend- 
ance—reserving accreditation by examination 
alone for use only in especially favorable in- 
stances. 

TEST 5, GENERAL MATHEMATICAL ABILITY. 
This test appears only in the high school level 
form. Of the 50 items, only six are based on 
alegebra or geometry ; the remaining items are 
of the arithmetical-reasoning type. Some of the 
items require performing a rather large number 
of steps. Such items seem to the reviewer realis- 
tic and desirable, though multiple-step problems 
obviously reduce the number of “‘scorable units" 
and increase the risk of excessive penalty for 
minor errors. 

IV. Concrusion. The Tests of General Edu- 
cational Development appear to be superior tests 
of their type. This judgment must, however, be 
qualified by the fact that no data are presented 
on such basic matters as pretrial, item analysis, 
comparability of forms, reliability, intercorrela- 
tions, and validity. Since most of the tests meas- 
ure more than one recognizable type of ability, 
a more liberal use of part scores would appear 
desirable. The chief question regarding the tests, 
in the mind of the reviewer, concerns the extent 
to which the tests contribute information not 
already obtainable from tests of intelligence and 
of general reading ability. 


Warren G. Findley, Chief, Evaluation 
Branch, Educational Advisory Staff, Air Uni- 
versity, Maxwell Field, Alabama. These widely 
sponsored and widely used educational tests 
require criticism from two distinct points of 
view. First, they must be evaluated as test in- 
struments in terms of the canons of validity, 
reliability, standardization, administrability, etc. 
regularly applied to achievement tests. Second, 
they require evaluation as instruments designed 
to serve an important social purpose not ordi- 
narily required of achievement tests, the ap- 
praisal of the educational status and potenti- 
alities of veterans seeking further education 
after various periods of interruption due to 
war service. 

To quote from the Examiner’s Manual: 

The major purposes of the United States Armed 
Forces Institute Tests of General Educational Devel- 
opment are: 

I. to provide a more adequate basis for the educa- 

tional and vocational guidance of those who have 

served in the armed forces. 

2. to assist the schools in the appropriate placement 


in a program of general education of the students 
returning from military service. 
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3. to help the schools determine the amount of aca- 
demic credit which should be granted students for 
their educational experiences in military service. 
Among the many other possible uses of these tests, 
one in particular deserves special mention. With war 
service persons having no plans for further formal 
educational training, these tests can indicate to their 
prospective employers the extent to which they have 
attained the equivalent of a high school or college edu- 
cation, In consideration of the increasing frequency 
with which employers set definite educational require- 
ments for specific positions, as well as of the tendency 
to make increasing use of test results in selectine em- 
ployees, this latter use of the Tests of General Educa- 
tional Development may well prove of first importance 
to a large proportion of war service persons tested. 
The producers and sponsors of these tests 
»ave relied very largely on the face validity of 
these instruments for their purposes. The ur- 
gency of the need for educational placement 
of veterans demanded speed in the development 
of suitable tests. War conditions further ham- 
ered the conduct of normal preliminary ex- 
perimentation and validation of the tests and 
even of publication of findings. 5 
More basic factors, however, are responsible 
for the dependence on. face validity in the 
lanning and promotion of these tests. There is 
current a widespread lack of confidence in the 
content, organization, and method of academic 
high schools and traditional curricular offer- 
ings at the junior college level. The Eight Year 
Study of the Progressive Education Associa- 
tion was both a symptom and, in turn, a cause 
of this dissatisfaction. This loss of faith in the 
old educational patterns has not issued in dras- 
tic curricular reform at these levels in many 
places, but it has given pause to many educa- 
tional leaders and particularly to specialists in 
educational measurement and evaluation who 
look for objectives of instruction as the criteria 
against which to validate the general nature 
and specific content of their tests. Rejection of 
the composite of a series of course grades in 
an academic curriculum as a criterion was ex- 
pressed in Lindquist’s* insistence on apprais- 
ing the "ultimate objectives of instruction" 
which he called "general educational develop- 
ment" in launching in 1942 the Iowa Tests of 
- Educational Development at the high school 
level. The USAFI Tests of General Educa- 
tional Development at both the high school and 
the college levels reflect not merely structural 
similarity to these tests, after which they are 
patterned, but also a dependence on the same 
point of view, that the ultimate outcomes of 


E. F. Fall Testi ‘igh 
o. 


* Lindquist, Program for Towa Hig 
Schools. State University of Iowa Publications, New Series 
1242. Iowa City, Iowa: the University, 1942. Pp. 36. Paper. 
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instruction are to be found in “the acquisition 
of broad but definite generalizations, concepts, 
and ideas, and particularly upon the abilities to 
comprehend exactly, to evaluate critically, and 
to think clearly in terms of such concepts and 
ideas, rather than upon the detailed facts from 
which the ideas and generalizations were origi- 
nally derived” (1947 catalog of the Coopera- 
tive Test Service, p. 56). 

Probably the most crucial factor in the de- 
cision of the producers and sponsors of these 
tests to rely on their face validity, as distin- 
guished from their correlation with a criterion 
of ordinary academic success, is the inappro- 
priateness of such a criterion to validation of 
tests intended for appraisal of the educational 
status and educability of war veterans, Re- 
moved by several years of life and several de- 
grees of maturity from ordinary academic 
study, veterans would have forgotten much of 
the detail that a student would ordinarily be 
able to draw upon at the completion of a course. 
On the other hand, an active intellect would 
have seized upon opportunities available during 
military service to acquire content and habits 
of thinking which are the normal residue of 
academic study, 

The several tests, then, must be appraised 
for their success in meeting standards of face 
validity and, secondarily, for their correlation 
with such criteria of general educational devel- 
opment as college grade-point averages or grade 
status. 

A special merit of these tests for use with 
veterans is that they are work-limit, not time- 
limit tests. Anyone with experience in testing 
adults at any educational level will subscribe to 
the opinion that many adults are more delib- 
erate than school children in their test behavior. 
This same deliberateness characterizes many 
veterans away from ordinary study for the pe- 
riod of service. 

So far as one can judge, the procedures fol- 
lowed in standardizing these tests were wholly 
sound. In addition to employing representative 
sampling techniques in high schools and col- 
leges, the authors gave the tests of the high 
school and college batteries to comparable sam- 
ples of college students, so that results on tests 
at both levels may be referred to a common 
T-scale (mean 50, standard deviation 10) based 
on the group of 35,432 high school seniors. 

One can only deplore the omission of data 
on reliability of scores on these tests. Inter- 
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correlations of civilian and military forms of 
each test should have been computed and re- 
ported. Certainly, internal reliability of eaċh test 
might have been computed by split halves and 
the Spearman-Brown formula. In addition, real 
insight into the relations among these tests would 
have been provided by a table of intercorrelations 
of the tests of the high school and college batteries 
given the same students. Such intercorrelations 
could have been studied to determine whether 
correlations between corresponding tests in the 
different batteries were greater than correla- 
tions between different tests within the same 
battery, a condition basic to establishing the 
soundness of reporting scores on the separate 
tests as distinct measures of achievement. 

The basic innovation in these test batteries 
is the set of three go-item tests on the inter- 
pretation of reading selections in the social 
studies (Test 2), the natural sciences (Test 3), 
and literature (Test 4). The level of subject 
matter in these selections of one to five para- 
graphs and the broad significance of what is 
read make the taking of these tests a worth- 
while learning experience as well as a good 
basis for evaluating important abilities. 

The emphasis in these tests on ability to han- 
dle written materials of a field of study in the 
general light of one’s background of under- 
standing in these fields comes as a breath of 
fresh air to those who have sought by less 
drastic measures to make the testing of mas- 
_tery of the academic subjects more functional. 
Too much of current achievement testing is 
still based on the widely discredited notion that 
tests of important information in a field are 
sufficiently indicative of ability to apply that 
information in new settings to make unneces- 
sary the serious efforts required to construct 
questions compelling application of informa- 
tion. 

The critic may well question whether these 
tests enforce the use of background under- 
standing and information in sufficient measure. 
To this reviewer, the college tests do this more 
effectively than the high school tests. At the 
college level, too, the military forms contain a 
greater proportion of items requiring use of 
background than do the civilian forms. As one 
who has employed these types of items at the 
high school level, this reviewer can testify to 
the difficulty of preparing questions requiring 
both application of background information 
and interpretation of reading materials before 


the individual. This twin requirement should, 
however, be the goal for 9o per cent rather 
than 20 to 50 per cent of the questions if a. 
test is to have optimum validity for the meas- 
urement of background for advanced college 
study in a field. 

All criticism of these tests for failure to re- 
quire use of background information, expressed 
in this review and elsewhere, must be read in 
the light of the greater deficiencies of most 
achievement tests in requiring chiefly the abil- 
ity merely to recall information. If the USAFI 
Tests of General Educational Development 
would be better achievement tests were they 
more dependent on background information, 
the typical achievement test falls farther short 
of the ideal of requiring application of infor- 
mation. It is this reviewer's opinion that the 
ideal achievement test falls much closer to 
these USAFI Tests of General Educational 
Development than to the typical achievement 
test. 

Test 1 of each battery, on correctness and 
effectiveness of expression, contains a new and 
highly efficient form for testing objectively 
these abilities in written composition. Multiple- 
choice questions on usage in a continuous 
piece of writing are printed in a parallel col- 
umn beside the writing. The proofreading situ- 
ation thus produced is a much more valid test 
of the examinee's natural writing habits than 
other multiple-choice forms of testing which 
involve grouping questions by type of fault 
tested (punctuation, capitalization, grammar, 
sentence structure, usage, etc.) or the offering 
of sets of sentences for the selection of the one 
involving error. 

A special weakness of Test r in both bat- 
teries merits mention. Correctness and effec- 
tiveness of expression are difficult to measure 
by any means. Correctness of expression is at 
least partly a matter of taste, varying from 
time to time, from place to place, from person 
to person. Effectiveness of expression is even 
more definitely a matter of taste, but not closely 
allied to correctness. An item analysis of the | 
high school test, based on 400 air force offi- 
cers, revealed that 4 of the 100 items had neg- 
ligible or negative validity, largely due to pref- 
erence in the use of commas, The effect of 
these 4 items on total score is negligible but a 
source of unreliability. A similar analysis of 
the college test on 160 academic and adminis- 
trative military personnel revealed that 17 of 
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the 125 items had negligible or negative valid- 
ity, largely due to differences in taste on usage 
items. 

Test 5, General Mathematical Ability, High 
School Level, presents a cross section of diffi- 
cult arithmetic problems and elementary alge- 
bra and geometry items. The farm boy and the 
city boy will compete evenly because of the 
areas sampled. Some duplication of items is 
involved (e.g., Items 15 and 24 on percentage 
loss at resale). On the whole, this 50-item test 
compares favorably with achievement tests in 
general mathematics covering the content of 
arithmetic, elementary algebra, and plane ge- 
ometry, although it quite properly gives more 
emphasis to arithmetic than do many such tests 
that are based on a course in general mathe- 
matics in which instruction in each of the three 
areas has been given approximately equal 
weight. 

The claim of guidance value for the tests 
may be accepted at its "face" to the extent that 
reading tests have quite generally proved good 
predictors of ability to benefit from the general 
education offered in academic high schools and 
liberal arts colleges, while omnibus mathemat- 
ics tests, like the mathematical portion of the 
College Entrance Examination Board's Scho- 
lastic Aptitude Test, have as regularly proved 
good predictors of ability to benefit from study 
of quantitative subject matter of the sort en- 
countered in higher mathematics and science. 


The claim to differential guidance value among 


the three major areas of academic instruction— 
natural science, social science, humanities—is 
more difficult to accept. For instance, many years 
ago this reviewer found a very high correlation 
(.86) at the college entrance level between two 
parallel tests of reading, based on scientific and 
literary materials, respectively. 

Going beyond face evidence, Crawford and 
Burnham (2) found the battery of college level 
tests as predictive of general college achieve- 
ment at Yale as the tests of the College En- 
trance Examination Board. Callis and Wrenn 
(7) found a correlation of .72 with first semester 
grades of 60 veterans admitted to freshman 
standing in the General College at the University 
of Minnesota on the basis of these tests. It is also 
true that Crawford and Burnham found only 
moderate intercorrelations (.52 to .63) among 
the three reading tests of the college battery, thus 
indicating the probability of substantial differ- 
ences among the scores made by some individuals 


on these:three tests, the reliabilities of which in 
the Yale group may be assumed to be .90 or 
higher. Moreover, Dyer (3) found reliable dif- 
ferences in mean scores between scientific and 
nonscientific students at Harvard, in favor of 
the scientific students on the natural science 
reading tests and in favor of the nonscientific 
students on the social studies and literature 
tests. 

Probably the most serious criticism of the 
tests, in so far as guidance value is placed first 
among their purposes, i$ the omission of a 
mathematics test from the college level battery. 
Both the articles just cited point to the need 
for a supplementary test like the College En- 
trance Examination Board's Mathematical Ap- 
titude Test. The USAFI high school level test 
in mathematics should serve this purpose, 
though not those of placement or accreditation. 

The claim of general educational placement 
value is acceptable on its face. The standardi- 
zation of the high school tests by regions cor- 
responding to accreditation bodies and of the 
college tests by levels of college populations 
as reflected in achievement on the American 
Council on Education Psychological Examina- 
tion is a first step in the direction of providing 
reference values. The provision of parallel ci- 
vilian and military forms is a further step that 
permits each school or college to find its own 
norms and set its own standards accordingly. 
Data for the University of Michigan have been 
published by Donahue (5) as a guide to similar 
institutions. 

The possibility of using these tests for pre- 
cise placement of students in courses in the 
separate fields of natural science, social science, 
and humanities may be questioned in the light 
of the substantial intercorrelations previously 
quoted from Crawford and Burnham (2) and 
the data of Dyer (3), who found great overlap 
between groups of scientific and nonscientific 
students on all four tests of the college battery, 
in spite of reliable differences in mean achieve- 
ment between the groups on the tests. Both 
Dyer and Bradley (4) found negligible corre- 
lation between test scores and numbers of se- 
mester hours completed in a field. - 

The validity of these tests for accreditation 
of students is debatable. The most generous, 
and perhaps soundest, view to take of the use 
of these tests for accreditation is that this rep- 
resents a significant step forward over the prac- 
tice followed after World War I of granting 
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blanket credit for military experience. It should 
be added that these tests, on their face, are 
fairer to veterans than customary achievement 
tests that have been designed for use immedi- 
ately following regular courses of study. It is 
doubtful, however, that test scores containing 
_so large an "aptitude" ingredient should be 
translated directly and immediately into aca- 
demic credit of so basic a character as a di- 
ploma. Less than 50 per cent of the college test 
items on iriterpretation of reading materials re- 
quire background understanding in addition to 
general reading ability. The high school tests 
include even a smaller proportion of items not 
answered directly or by inference in the read- 
ing material. Placement may properly be made 
and credit granted within an academic curricu- 
lum with the knowledge that the recipient will 
have to validate this placement and these cred- 
its by further successful study in the institu- 
tion from which he will receive his credential 
of graduation. This should suffice. The veteran 
might well be given as advanced placement as 
there is reason to believe he can successfully 
carry through advanced work, but be given 
fully accredited status only after a year's suc- 
cessful study in the institution of his choice. 
The veteran who has not graduated from high 
school but who is ready for college as evidenced 
by his GED test scores and wishes to under- 
take academic study, might be placed in college 
and be granted his high school diploma on 
completion of a successful year in college. 

The final proposal, that the tests be used to 
indicate high school or college equivalence to 
employers who set educational standards for 
employment, is quite acceptable in the same 
sense that use of the tests for educational place- 
ment is recommended. Employment is vocá- 
tional placement and not accreditation. The va- 
lidity of this use of the test is not to be 
questioned. 

In conclusion, it may be said that validation 
of these tests, by their authors and others, has 
just begun. The merit of the tests outran 
proof. Interrelations among the tests them- 
selves and correlation of the tests with college 
achievement of veterans should produce a vol- 
ume of evidence of interest to all who must 
now judge these tests almost wholly in terms 
of their face validity. 


For a review by Charlotte W. Croon of Test 
I, College Level, see 122. For reviews by W. E. 


Hall and C. Robert Pace of Test 2, College 
Level, see 528. 
[21] z 

* Wide Range Achievement Test: Reading, Spell- 
ing, Arithmetic From Kindergarten to College, 
1946 Edition. Ages 5 and over; 1940-46; 1 form; $3 
per 50; 40¢ per manual; 45¢ per specimen set; (20-45) 
minutes; Joseph Jastak and Sidney Bijou; published 
by Joseph Jastak, Delaware State Hospital, Farnhurst, 
Del.; distributed by Psychological Corporation. 

Paul Douglas Courtney, Psycho-Educational 
Clinic, Harvard University, Cambridge, Mas- 
sachusetts. The Wide Range Achievement Test 
appears to have adequate face validity as a 
coarse screening measure in three restricted 
areas: spelling, word pronunciation, and arith- 
metic computation. With a range from kinder- 
garten to college, it is obvious that the number 
of items per grade is limited, and the number 
of pupils per grade on which it has been stand- 
ardized is likewise relatively small, Its "indi- 
vidual" feature is limited to the word-pronun- 
ciation section, and in general its use would not 
appear justified while we still have available 
tests at desired levels with adequate standardi- 
zation, such as the Metropolitan Achievement 
Tests, Stanford Achievement Tests, Coopera- 
tive Test Service tests, etc. 

Although the test is apparently adequate, 
subject to the restrictions presented above, the 
manual which accompanies the test is of such 
a nature that the use of the test itself ought to 
be definitely discouraged unless the manual is 


rewritten and carefully limited to statements 


which would not lead to a serious misuse of 
the Wide Range Achievement Test. This man- 
ual would appear to be the resultant of a seri- 
ous lack of sophistication on the part of the 
author in the field of tests and measurements 
and a complex of half-formulated and vaguely 
expressed clinical observations of the correla- 
tions existing between this test and fundamen- 
tal personality and intelligence factors. Two 
unsupported half-truths selected from the man- 
ual are presented below: 

a) After determining his level of comprehension by 
means of intelligence tests, the examiner is not inter- 
ested in how well the child can comprehend but in how 
well he can read. 

b) Quotients thus obtained ("test age" over chrono- 
logical age) are fully comparable with other intelli- 
gence ratings. 

These are but two of many and speak clearly 
for themselves. Since the author suggests the 
use of a number of other tests to supplement 
his test and to complete the diagnostic picture, 
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this reviewer feels that the disadvantages of 
the Wide Range Achievement Test and its 
manual óvershadow its advantages and that the 
first approximation of a trained educational 
clinician would fill the need filled by this test, 
i.e, a rough grade placement of a patient prior 
to more detailed diagnostic testing at specific 
grade levels. 


Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 
Designed for the use of clinical workers, the 
Wide Range Achievement Test purports to 
measure the school achievement of any indi- 
vidual (from kindergarten to college) in a to- 
tal of not more than 43 minutes of testing time. 
Success in such an undertaking obviously in- 
volves an extremely difficult job of sampling 
from the total of school learnings. In this test, 
the sampling is done, first, by limiting the meas- 
urement to certain mechanical aspects of three 
tool subjects (reading, arithmetic, and spell- 
ing) ; and, secondly, by selecting items, in each 
of these three areas, which showed a regularly 
increasing percentage of successes at suéceed- 
ing grade levels. 

The aspects of the three subjects tested are: 
(a) skill in pronouncing printed words (rang- 
ing from letters of the alphabet to such words 
as remiges, heinous, and conduit); (b) skill 
in mathematical computation (ranging from 
counting dots to simple algebra and logs) ; and 
(c) skill in spelling words dictated (ranging 
from letters of the alphabet to such unfamiliar 
words as pusillanimous, iridescence, and elee- 
mosynary). 

In the absence of any evidence that ability in 
these three rather narrow skills does reflect 
general school achievement and in the face of 
knowledge that achievement in school is a 
highly complex matter which is not likely to 
be reflected in measures of three simple skills, 
it seems safe to assume that the test is mis- 
named, that as a measure of school achievement 
it has doubtful value. 

The fact is, the author does not actually 
claim it measures school achievement except 
by name and through inference. He does pro- 
pose it, however, as a valid measure of ability 
in the fields of reading, arithmetic, and spell- 
ing. In view of the evidence at hand, the re- 
viewer is not willing to accept even this assump- 
tion; certainly not in the cases of arithmetic 
and reading. In his opinion, use of the test 


should be confined to occasions where informa- 
tion is desired concerning ability in the three 
specific skills named above; and even here, as 
is true with any wide-range test, the measure- 
ment is bound to be relatively crude. 

So far as clinicians are concerned, perhaps 
a more practical question would be whether or 
not information relating to these three skills is 
of unique worth in understanding and dealing 
with cases which are referred to them, The au- 
thor of the test is convinced that it does have 
value in remedial and vocational studies of 
children. The manual accompanying the test 
contains a considerable amount of "evidence," 
from analyses of errors and from a study of 
patterns of scores on the three tests, to the ef- 
fect that these tests when used along with in- 
telligence tests do aid in understanding the 
mental make-up of “problem” children. 

This evidence is such a confusing mixture of 
personal opinion and inferences drawn from 
clinical experience as well as from statistical 
analyses that the reviewer has difficulty in eval- 
uating it. Consider, for example, the following 
statements, quoted from the manual, none of 
which are supported by any facts: “Reading 
disability is practically always a deficiency in 
the mastery of mechanics and not of compre- 
hension" (p. 3) ; “There is little doubt that the 
adverse effects of personality aberrations upon 
school achievement are much more marked in 
arithmetic than in any other school subject" 
(p. 20) ; "Written spelling is a good test of 
somatic efficiency and muscular skill" (p. 24). 
Such statements seem to be in sharp contrast 
to the following: “Roughly 50 per cent of the 
total [Delaware State Mental Hygiene] clinic 
population of school age obtain the A-R-S pat- 


'tern [that is, are best in arithmetic and poorest 


in spelling]" (p. 21). Unfortunately, too much 
of the “evidence” presented in support of the 
test's usefulness is of the former rather than 
the latter type. 

Perhaps the best advice to give the Aiea] 
psychologist would be that he might try the 
test. There are, however, a number of other 
questions which, being unanswered, but add to 
the reviewer's conviction that it has limited 
usefulness: The reliability of the test has not 
been adequately established, particularly when 
it is designed for individual use. The objectiv- 
ity of the proposed error analysis was not 
tested, yet it is certainly not obvious. Evidence 
in support of the method of setting up norms 
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and of the use of test quotients as norms is 
lacking, yet both procedures need justification. 


Louis P. Thorpe, Professor of Education 
and Psychology, The University of Southern 
California, Los Angeles, California. The Wide 
Range Achievement Test is distinctive in that 
it measures reading, spelling, and arithmetic 
skills from kindergarten to college. It may thus 
be said to employ the principle of the power 
scale. It is designed primarily as an individual 
clinical test, but the spelling and arithmetic sec- 
tions can also be administered as group sub- 
tests. A valuable feature of this instrument is 
the provision made for analyzing personality 
adjustment from uneven scores in the three 
parts of the test. This feature has apparently 
been made possible by checking test results 
with clinical case studies in the Delaware State 
Mental Hygiene Clinic and elsewhere. 

The author makes the defensible claim that 
a word-pronunciation test is superior to both 
paragraph and picture-reading tests as a meas- 
ure of how well a child can read. Paragraph 
tests introduce context clues, thus encouraging 
guessing, and picture tests involve distraction 
as well as clues. The author also contends that 
the most accurate test of arithmetical ability is 
one which does not depend on reading. This 
point is debatable since most arithmetical com- 
putations are made in a patterned problem in- 
volving reading ability. However, the scaled 
computation items, ranging from simple addi- 
tion to college mathematics, appear to serve the 
clinical purpose of this test well. 

A time-saving, but nevertheless doubtful, 
practice in the construction of this test was the 
substituting of correlations with various sec- 
tions of the Stanford Achievement Test for di- 
rect standardizing on a national scale. These 
correlations range from .81 to .93 at the seventh 
and eighth grade levels, Just why the Stanford 
test should be lauded and used as a criterion 
for establishing grade norms in a different type 
of test is not clear to the reviewer. Such a pro- 
cedure certainly increases the chances of max- 
imum errors of measurement. Every testmaker 
should establish his own norms. 

Item analysis was made by the method of 
computing the percentage of successes at each 
grade level. Reliability coefficients, which were 
determined by repeated testing, ran from .88 
(120 cases) for spelling to .95 (110 cases) for 
reading. One might question the use of clinic 
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cases (280) in the determination of internal 
consistency because of the uneven scores made 
by many emotionally maladjusted individuals. 

Since it is customary, in the interest of 
standardization of test administration, to give 
exact (boldface) instructions to examinces in 
the manual, the author should have done so in 
the case of spelling and arithmetic as well as of 
reading. Perhaps he regards the reading test as 
being more crucial than the others. At least, he 
makes the claim, for which no evidence is pre- 
sented, that his word recognition test correlates 
to a greater extent than other tests with future 
achievement. 

The manual of directions of the Wide Range 
Achievement Test has provided a very helpful 
analysis of (a) strephic, (b) phonic, (c) idea- 
tional, and (d) expressional reading errors, 
especially as these relate to slow introverted 
and fluctuating extroverted types of individ- 
uals. It also shows how varying achievement 
patterns may be suggestive of specific per- 
sonality anomalies, 

An outstanding and promising feature of the 
test is, thus, its possibilities for the diagnosis 
of personality maladjustments and deviations 
through the avenue of uneven scholastic 
achievement scores. Such an analysis has ap- 
parently been made possible by the fact that 
the author of the instrument has access to an 
adequate number of clinic cases and, what is 
more important, has sensed the fact that a 
third dimension—the emotional component— 
must be added to the more traditional heredi- 
tary and gross environmental factors in the de- 
termination of scholastic achievement. 


a [22] 
*Wisconsin Composite Achievement Test. High 
school seniors and college entrants; 1944; Form A; 
5€ per test; 1o¢ per key; 25¢ per manual; 101 (110) 
minutes; arranged by Gustav J. Froehlich from mate- 
rials prepared for the Wisconsin Achievement Testing 
Program; manual by Gustav J. Froehlich and Bjarne 
R. Ullsvik; E. M. Hale & Co. 
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The Wisconsin Composite Achievement Test 
is designed to yield both an evaluation of sec- 
ondary school achievement and a prediction of 
scholastic success in college in terms of the 
chances in one hundred of getting an unsatis- 
factory, a satisfactory or better, or honors aca- 
demic average for work of the first semester 
in the university. It is a survey test which 
covers language usage, American history, gen- 
eral science, plane geometry, and algebra on 
the secondary school level. 

This instrument is distinctly a Wisconsin test. 
All test items originated within, and were tried 
out in, Wisconsin classrooms. The “end of sec- 
ondary school” norms are based upon the scores 
made by approximately 2,000 freshmen at the 
University of Wisconsin. 

Each of the five sections as well as the total 
test possesses high reliability as computed by 
the corrected split-halves method. The reliabili- 
ties of the subtests range from .826 to .966, and 
the reliability for the total test is .973. 

A study of validity data in the manual indi- 
cates that a careful and workmanlike job has 
been done. The items were selected on the basis 
of two criteria, adequate discrimination between 
good and poor secondary school seniors and 
satisfactory subtest correlation with first se- 
mester grades at the University of Wisconsin. 
Allitems were selected from thirty-three survey 
tests previously prepared for the Wisconsin 
Achievement Testing Program, in which the 
curricular validity of items had already been 
established. 

The manual accompanying the test is excel- 
lent. An unusually complete report of statistical 
work done in connection with the validation of 
the test is presented. Formulas used for predict- 
ing college success are explained and illustrated. 
Finally, in view of the fact that so many who 
use this test will be unable to follow or use the 
statistical techniques which are suggested, the 
authors have prepared a series of tables which 
serve as short cuts. 

The first three of these tables reveal the ap- 
proximate chances in one hundred of securing 
an unsatisfactory, satisfactory or better, or 
honors academic-grade average at the end of 
the first semester of college work. These three 
tables have been derived for freshman who 
enter college in the fall, immediately following 

t secondary school graduation. The chances of 
securing the above mentioned academic-grade 
averages are based on a knowledge of two fac- 


p= 
tors: the raw score on the Wisconsin Composite 
Achicvement Test, Form A, and the percentile 
rank of the student in his secondary school grad- 
uating class. Similar tables are prepared to pre- 
dict success in mathematics and chemistry using 
some other factors in prediction. 

The reviewer has had no opportunity to try 
out this test or to check any of the statistical 
work; however, an examination of the instru- 
ment itself appears to reveal well selected and 
carefully stated test items. The reviewer would 
have preferred more emphasis on social studies 
and less on mathematics for secondary school 
valuation purposes. However, this is no criti- 
cism of the test since the proportion of items 
was probably dictated by the nature of the cur- 
riculum. The test seems very well adapted for 
use in Wisconsin schools. 


Maurice E. Troyer, Director, Evaluation 
Service Center, Syracuse University, Syracuse, 
New York. Two uses are proposed for this 
test: (a) to evaluate secondary school achieve- 
ment by means of end-of-school norms and (b) 
to predict scholastic success at the University 
of Wisconsin in terms of the chances in 100 of 
getting an “unsatisfactory,” a “satisfactory or 
better," or an “honors” average for scholarship 
in the first semester in the university. Test scores 
and combination of subscores are similarly used 
to show chances of performing at these three 
levels of scholarship in certain freshman courses, 
These prediction tables are unique and poten- 
tially very useful. 4 

The test has five parts: language—54 lines of 
proofreading; American history—40 multiple- 
choice items; general science—66 matching 
items; plane geometry—24 items measuring 
ability to follow steps in proof and give sound 
reasons supporting these Steps; and algebra— 
40 items appraising thinking in algebraic terms 
and processes. 

The test derives its validity from two bases; 
(a) satisfactory subtest correlations with first 
semester grades at the University of Wisconsin 
and (b) adequate item discrimination between 
good and poor graduating secondary school 
seniors. Data from the case records of 6,000 
University of Wisconsin freshman were cor- 
related with Wisconsin Composite Achievement 
Test subscores made in the secondary schools 
and with the first semester averages. The sub- 
tests of the original experimental edition cor- 
related from .43 to .54 with the criterion, Item 
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analyses were then made of the test and all 
items were eliminated that did not discriminate 
positively between good and poor students in 
the criterion groups. Two hundred sixty-nine 
items remained, of which 44 were eliminated on 
a subsequent run of the second edition. The 
corrected reliabilities obtained by the split- 
halves method were .973 for the entire test and 
from .83 to .97 for each of the subtests. 

Directions for administering and scoring are 

adequate. The test, however, cannot be machine 
scored, Two sets of percentile norms for each 
of the five tests have been prepared: the first, 
for use when the test is administered within 
three months of the date of graduation from 
the secondary school, and the second, when the 
test is administered one or more years after 
graduation from the secondary school. There 
are similar tables of T-score norms. 
. While the test manual claims the total score 
to be a useful predictive index of first semester 
scholastic success at the University of Wiscon- 
sin, the predictive value is further increased by 
combining the test score with the percentile rank 
in the secondary school graduating class. The 
addition of an intelligence test score made for 
no increase in predictive value, 

By means of a not-too-cumbersome but aev- 
ertheless time-consuming formula, it is possible 
to predict first-semester credit-hour ratio from 
à combination of raw test score and rank in 
secondary school graduating class. The reviewer 
does not believe the use of this formula prac- 
ticable. Much more useful for purposes of ad- 
mission, classification, and counselling are the 


charts from which one can read directly the 
chances in 100 a student has of achieving hon- 
ors, satisfactory or better, or unsatisfactory 
grades when the raw score and the percentile 
rank in the high school graduating class are 
given. It must be emphasized again, that this 
test was developed for Wisconsin high school 
graduates and the University of Wisconsin 
freshmen. While the test would no doubt be 
useful in other institutions, experience with the 
Ohio State University Psychological Test and 
other tests for purposes of predicting academic 
success has shown that tests vary in their pre- 
dictive value from university to university and 
from college to college within universities. It 
would seem doubtful that this test would serve 
other institutions as well as a test standardized 
on a broader sampling. This is no criticism of 
the test or its makers. It is probable that in 
the near future, as each university develops a 
staff of competent test technicians, that the 
larger institutions will develop their own test 
in terms of their objectives and idiosyncrasies. 
Improved predictive value is likely to justify 
the adventure. 

With respect to the Wisconsin Composite 
Achievement Test, one is compelled to wonder 
if the predictive value would not have been im- 
proved had the test included items calling for 
use of knowledge in complex problem situa- 
tions. Certainly the University of Wisconsin 
has objectives that run along these lines, and 
evidence about these objectives is used in grad- 
ing students. Hence, it would seem that the 
predictive value of a test would be improved by 
including items that call for use of knowledge. 
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CHARACTER AND PERSONALITY 


Reviews nv Harold H. Abelson, Ralph C. Bedell, Arthur L. Benton, E. J. G. Bradford, R. A. 
Brotemarkle, Harold D. Carter, Glen U. Cleeton, William E. Coffman, Lee J. Cronbach, Ed- 
win W. Davis, Parker Davis, Jr, Susan K. Deri, Raleigh M. Drake, Stanley G. Dulsky, 
Jack W. Dunlap, Philip Eisenberg, M. H. Elliott, Albert Ellis, H. J. Eysenck, Paul R. Farns- 
worth, John C. Flanagan, John W. French, Henry E. Garrett, Edward B. Greene, Nelson G. 


Hanawalt, Starke R. Hathaway, Thelma Hunt, 
garet Ives, E. Lowell Kelly, Kate Levine Kogan, Morris Krugman, W. 


illiam A. Hunt, Ludwig Immergluck, Mar- 
C. Kvaraceus, Theo- 


dore F. Lentz, Louis Long, C. M. Louttit, James Maxwell, Paul E. Meehl, H. Meltzer, Lorenz 

Misbach, L. S. Penrose, Gilbert J. Rich, John W. M. Rothney, Julian B. Rotter, Laurance F, 

Shaffer, Benjamin Shimberg, William U. Snyder, Douglas Spencer, C. R. Strother, Percival 

M» Symonds, Howard R. Taylor, Edith 1. M. Thomson, Simon H. Tulchin, Goodwin Watson, 
Katherine W. Wilcox, J. R. Wittenborn, O. L. Zangwill. 


[23] 
A-S Reaction Study: A Scale for Measuring 
Ascendance-Submission in Personality. College 
and adults; 1928-39; manual revised 1939; separate 
forms for men ‘and women; $2.25 per 25; I5¢ per 
specimen set; nontimed (20) minutes; Gordon W. All- 
port and Floyd H. Allport; Houghton Mifflin Co. 
@) FORM FOR MEN. 1928, 
b) FORM FOR WOMEN, REVISED. 1928-39. 
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William U. Snyder, Associate Professor of 
Psychology and Associate Director, Psycho- 
logical Clinic, The Pennsylvania State College, 
State College, Pennsylvania. This well-known 
test was designed to discover the disposition 
of an individual to dominate his fellows or to 
be dominated by them in face-to-face relation- 
ships. It thus tries to measure an individual's 
ascendance or submission, traits which have 


been placed at opposite poles in a continuum, 
although the authors are careful to indicate that 
they are not necessarily opposites, but really 
independent traits. 

The test is widely used by clinical psycholo- 
gists and also by college teachers and guidance 
counselors; it is frequently employed in indus- 
trial situations, for which there is a separate 
revision (see 40:1199). There are separate 
forms for men and women, independently 
standardized and not interchangeable. They 
consist of 41 and 43 items for which multiple- 
choice responses varying from two to four an- 
swers are provided. The items seem familiar, 
and that is largely because they have been cop- 
ied on many later personality tests. They are 
written with a certain carefulness and qualifi- 
cation which makes answering simpler, because 
it reduces ambiguity. Scoring weights have 
been assigned to each response, and this makes 
scoring laborious if it occurs in any great 
quantity. 

Norms for the test are given in deciles, based 
on scores of 2,578 men and 600 women. These 
subjects were college students, widely distrib- 
uted geographically, Reliability quotients are 
reported as .85 for men and .9o for women. 
Numerous studies of validity of the test have 
been made in the nineteen years since its first 
publication. Using ratings and self-ratings as 
criteria of validity, the correlations have varied 
from .29 to .79. The authors state that using 
some of the better studies as typical, a suitable 
validity index would be .45. Thus, they do not 
make extensive claims of validity, holding that 
ultimate validity must depend on results ob- 
tained in a wide variety of clinical situations. 
Some external validation has been made in dif- 
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cantly between delinquent and nondelinquent 
boys, the internal consistency indexes of the 
items, and the validation coefficients for most of 
the items. Four hundred items were originally 
culled from the literature on delinquency, from 
case histories of delinquents, and from Hay- 
ward's Family Inventory. The final 150 items, 
. for the most part, have met a reasonable crite- 
rion of acceptance before being incorporated in 
the questionnaire. The average validity coeffi- 
cients of the items, using case study information 
as criterion data on items that the author was 
able to validate, are reported as .68 and .72 for 
two groups of male delinquents, 5o in each 
group. A similar study for 50 girl delinquents 
yielded a validity coefficient of .52. Although the 
validation and normative data are available on 
very small samples and although results are not 
available on a contrasting nondelinquent female 
group, sufficient information is presented by 
the author to suggest that this instrument is 
worthy of further study and use in clinical 
situations in which the individual follow-up 
technique will yield the best validation possible. 

While the nature of the items is such as to 
render the questionnaire most allergic to incon- 
sistencies in responses, the reported reliabilities 
of .92, .94, .83, and .85 are surprisingly high for 
this type of instrument. 

Considering the multiplicity of problems pre- 
sented by children popularly and perhaps too 
frequently mislabeled as delinquent, this type 
of instrument, when further refined, will prove 
a valuable supplement to the achievement, intel- 
ligence, and aptitude tests now commonly used 
in schools and clinics. 


Simon H. Tulchin, Consulting Psychologist, 
30 East óoth St., New York 22, New York. 
The test consists of 150 cards on which are 
printed questions relating to various areas of 
delinquent behavior and maladjustment. The 
subject is provided with two boxes and he is 
told to place those cards which describe him in 
the box marked YES and those cards which 
do not describe him in the box marked NO. 
All those who make fourth grade scores on 
reading tests require no assistance in sorting 
the cards. Others may have the items read to 
them. The test has been given to 200 delin- 
quent boys who ranged in age from ro to 18 
years; 50 delinquent girls ages 12 to 18; 5o 
School boys ages 11 to 17; and 25 boy scouts 
ages 12 to 17. The average ages of the groups 
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ranged between 13.6 and 14.8 years. The score 
on the test is the number of cards placed in the 
YES box. Average scores are given for the 
four groups and percentile scores for the three 
larger groups. 

The author of the test feels that the card 
sorting method has the advantage over the 
questionnaire method in that the child can con- 
sider each question separately and more objec- 
tively since no written record is made while he 
is in the room. A score of 50 is considered as 
suggestive of a disturbing conflict situation. A 
score of below 15 in a boy with a history of 
serious and repeated maladjustment is taken to 
be suggestive of schizophrenic tendencies. On 
the other hand, scores of below 20 in individ- 
uals who have previously shown good adjust- 
ment but who are brought into court on serious 
delinquency charges are given good prognosis 
for a successful adjustment on probation. As 
proof for the above conclusions, which should 
require a great deal of evidence, the author 
offers the following single sentence: “Fol- 
low-up studies indicate that this is a well 
founded observation." 

Correlations of test score and IQ are zero 
while correlations of test score and age are 
negative, showing that younger delinquents 
make higher scores than older delinquents. Re- 
liability is fairly high. Correlations of each 
question with information available in the his- 
tories of the groups of delinquent boys and 
girls show a wide range of coefficients. 

While it may well be that for some children 
and adolescents answering direct personal ques- 
tions by means of sorting cards is a less threat- 
ening experience than replying to questions in 
an interview, it is also true that sympathetic 
questioning makes possible an evaluation of re- 
sponses which affords much greater insight 
into the personality of the subject and the na- 
ture of the maladjustment. The author recog- 
nizes, for example, that the delinquent girls 
responded less frankly than the boys. It is quite 
likely that this is in part due to the nature of 
their delinquency. No data are presented on a 
control group of girls. 

For the delinquent boys fairly high agree- 
ment was found between the known behavior 
as reported in the case histories and the replies 
to the questions. High agreement is probably 
to be expected since all of these boys were com- 
mitted to an agency for study and observation 
following their delinquencies and behavior mal- 
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adjustment and recognized that certain facts 
about them were well known. No histories were 
available for the relatively small group of con- 


trol boys, and no attempt was made to check 
their replies, 

An examination of some of the questions 
reveals a lack of consistency in the replies. 
Thus to the question: “Do you tell lies some- 
times ?" 86 per cent of the delinquent boys, 56 
per cent of the control boys, and 60 per cent 
of the delinquent girls answer in the affirma- 
tive as against 37, 23, and 16 per cent respec- 
tively to the question: “Do you say things that 
aren’t true?” Are these differences due to the 
word “sometimes” or to the fact that eighty- 
three other questions intervene between these 
two? Similar discrepancies occur with other 
questions. 


Apparently a great deal of further research 
on this test remains to be done. 


[ 26 ] 
California Test of Personality: A Profile of Per- 
sonal and Social Adjustment. Grades kgn-3, 4-8, 7- 
10, 9-16, adults; 1939-43; IBM ; 5 levels; Forms A, B; 
$1.25 per 25; sé per copy of machine-scoring edition 
(grades 4 and over) ; 2¢ per copy of machine-scorable 
answer sheet; 25 per specimen set of any one level; 
nontimed (45) minutes; Louis P. Thorpe, Willis W. 
Clark, and Ernest W. Tiegs; California Test Bureau. 
a) PRIMARY SERIES, Grades kgn-3; 1940. 
b) ELEMENTARY SERIES. Grades 4-8; 1939. 
C) INTERMEDIATE SERIES. Grades 7-10; 1939-40. 
d) SECONDARY SERIES. Grades 9-16; 1942. 
€) ADULT SERIES. 1042-43. 
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Laurance F. Shaffer, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. The California Test of 
Personality consists of a series of five ques- 
tionnaires for successive developmental levels, 
purporting to measure a number of components 
of personal and social adjustment. The com- 
ponents are uniform throughout the five series. 
The two principal components are Self Adjust- 
ment and Social Adjustment. Self Adjustment 
is further subdivided into six subtests called 
self-reliance, sense of personal worth, sense of 
personal freedom, feeling of belonging, with- 
drawing tendencies, and nervous symptoms. 
The score on Social Adjustment is based on 
six parts designated social standards, social 
skills, antisocial tendencies, family relations, 
school relations, and community relations, At 
the primary level there are 8 questions in each 
subtest; at the elementary level, 12; and at the 
intermediate, secondary, and adult levels, 15. 
The questionnaires may be given to groups, 
except that individual oral administration is 
recommended for the primary scale. Manuals 
of from 8 to 16 pages are provided for each 
series. All except the primary series are 
adapted to machine scoring. 


‘Interests and Personality Traits 
J Social Psychol 26:225-33 N 
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Subjective evaluation of the questions gives 
the impression that they are skillfully worded 
and are well adapted to the developmental levels 
for which they are intended. Considerable use 
is made of indirectly worded questions, that 
might almost be designated as “projective” in 
character, in order to increase the likelihood 
of truthful and revealing responses. Thus, in- 
stead of asking “Do you cheat?” the authors 
ask "Are your tests so hard or unfair that it 
is all right to cheat?” Instead of asking “Do 
you quarrel?” the question is worded “Do your 
classmates quarrel with you?” Although the 
well-phrased questions give a presumption that 
misrepresentation will be held to a minimum, 
the questionnaires do not provide a means for 
estimating the truthfulness of an individual’s 
responses, corresponding, for example, to the 
“L” score of the Minnesota Multiphasic Per- 
sonality Inventory. 

The reliabilities of the total scores on the 
tests range from .918 to .933, based on the 
split-half method, corrected by the Spearman- 
Brown formula, as determined for N's of from 
237 to 792 for the various forms. The reliabili- 
ties of the principal component of Self Adjust- 
ment range from .888 to .904 and of Social 
Adjustment from .867 to .908. Subtest relia- 
bilities are not given for two of the series; for 
the other three they are stated to be from .60 
to .87. Although the latter reliabilities are as 
high as can be expected from 15-item subtests 
of this type, they are far too low for satisfac- 
tory individual diagnosis. 

The low reliabilities of subtests are ignored 
in the instructions and illustrations on the in- 
terpretation of the test profile. Readers of the 
manual are encouraged to interpret subtest 
scores by descriptive phrases, such as that 
“Joan A. Doe . . . is low in self-reliance . . . 
is free from nervous symptoms . . . community 
relations are somewhat low." Thus, a person- 
ality picture is drawn from indicators based on 
scores whose reliabilities are as low as .60. The 
statistically unsophisticated user is almost cer- 

“tain to succumb to this attractive pitfall. 

The manuals give no data on the validity of 
the California Test of Personality, other than 
to appeal to the face values of the items, and 
to make vague references to the use of teacher 
judgments, student reactions, and biserial 7’s. 
What was correlated with what in establishing 
validity is not stated; The manuals do not refer 
to any experiments! studies of validity, not 
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even to such an obvious criterion as to whether 
selected groups of maladjusted children do or 
do not make scores significantly different from 
those of children in general. It must be con- 
cluded that the validity of these questionnaires 
is entirely unestablished. 

Percentile norms are given for total scores, 
for the two principal components, and for each 
of the twelve subtests. These norms are said 
to be based on “over 1,000" cases for the pri- 
mary and elementary tests, but no numbers are 
given for the higher series. All normative cases 
are from Los Angeles or its vicinity, which 
raises serious doubts concerning the applica- 
bility of the norms to persons in other subcul- 
tural areas. Because of the short range of the 
subtests, their percentile scales are unduly 
sharp. For example, on the "community rela- 
tions" subtest of the Elementary Series, a raw 
score of 10 is equivalent to a percentile rank 
of 35, of tt to one of 55, and of 12 to one of 
8s. Thus, a probably unreliable difference of 
two raw-score points may lead to an interpreta- 
tive difference of from "very low" to "very 
high." 

The manuals give extensive suggestions for 
the treatment of children who show deviations 
on test scores. Some of these are trite but un- 
objectionable, as illustrated by a technique for 
helping a child who lacks self-reliance to recite 
in class. Others are clearly contrary to good 
modern practice in counseling and therapy. One 
trembles at a suggested cure for a child who 
underestimates his own capacities: “explain his 
difficulties and show him his errors" ! 

On the whole, the faults of the California 
Test of Personality are those of personality 
questionnaires in general. Such devices vainly 
seek the pot of gold at the end of the rainbow: 
a simple, cheap, foolproof method for studying 
human personality. Teachers, administrators, 
and school counselors who are tempted to con- 
sider the use of such devices would be bene- 
fited by a psychological insight into the fact 
that their own great need to do something 
about personality problems leads them to the 
delusion of accepting instruments of very low 
objective value. 


Douglas Spencer, Assistant Professor of 
Psychology, Queens College, Flushing, New 
York. This series of self-report, YES-NO re- 
sponse, group questionnaires—described in the 
publisher's catalogue as “designed to reveal the 
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status of certain highly important factors in 
personality and social adjustment usually desig- 
nated as intangible"—has now been extended 
to include two forms at each level from kin- 
dergarten to adulthood. 

Unfortunately little has been done, it seems, 
to correct in these tests the serious weaknesses 
pointed out by reviewers (see 40:1213) of the 
earlier Elementary and Secondary forms. The 
elaborate manuals still offer no evidence of 
validity and but scant information as to con- 
struction and standardization. The stepped-up 
reliability coefficients, although fairly high, are 
of doubtful value. Factor analyses are again 
reported as "in progress," but no proof is 
offered that the two parts of the tests entitled 
Self Adjustment and Social Adjustment—with 
six alluringly labeled subsections under each— 
really measure personality functions that are 
separate and meaningful. 

Like many older self-inventories, these tests 
involve the arbitrary scoring of highly subjec- 
tive responses. Many items will prove ambigu- 
ous to even well-intentioned subjects of con- 
siderable insight. No less than earlier attempts 
of this sort, these tests appear to be invitations 
to overrating, haloing, and easy deception. This 
reviewer cannot agree with an earlier critic 
(40:1213) that the efforts to disguise the items 
have been successful. The authors point out 
that they do not ask, for instance, “Do you tell 
lies?” Instead, they ask, “Have you found that 
telling falsehoods is one of the easiest ways for 
people to get out of trouble?” Instead of ask- 
ing “Do you play truant?” the test asks “Are 
things frequently so bad at school that you just 
naturally stay away?” The chance that such 
items, on a printed form administered by teach- 
ers, will obtain confessions by hitting upon the 
subject’s personal rationalizations would seem 
to be rather tenuous. This reviewer finds few 
items in the entire series that an average sub- 
ject at each age level could not “see through.” 
Frankness may also be discouraged by the 
names of the twelve types of adjustment tested 
printed on the face of the test where the sub- 
ject signs his name. These appear on both 
forms for three of the series. On the Adult and 
Secondary tests these labels have been abbrevi- 
ated (for example, Soc. Adj. for Social Ad- 
justment). But these older subjects should 
have little difficulty figuring them out, espe- 
cially with the undisguised title of the test and 
the subtitle, “A Profile of Personal and Social 


Adjustment,” clearly printed above. It seems 
amazing that psychologists or business execu- 
tives would expect job applicants to risk an 
unfavorable response to items in the Adult 
forms like the following: “Do you feel that 
most employers keep in mind the welfare of 
their workers?”; “Do you feel that many em- 
ployers are unfair in their methods of making 
promotions?"; “Is it often your belief that it 
is difficult to gain promotions on the basis of 
merit?"; “Do you sometimes wonder whether 
people approve of your work?” Yet these forms 
are offered as instruments for selecting em- 
ployees, for making job assignments, for sal- 
vaging unsatisfactory employees and for im- 
proving employee-management relations. In 
school use, truthful responses could be expected 
only if careful rapport were established with 
each student, and even then they would be diffi- 
cult to interpret on the basis of the norms re- 
ported since we know nothing about the rap- 
port conditions during standardization. 
Discussions of validity in the manuals are 
vague. It is claimed that “many of these items 
had previously been validated by other work- 
ers.” Items finally included were selected on 
the basis of judgments by teachers and prin- 
cipals, reactions by students, a study of the 
agreement between student responses and 
teacher appraisals, and a study of the relative 
significance of items by means of biserial r 
technique. No evidence is reported, so one still 
"looks in vain for some substantial research.” 
It is a doubtful service to clinical psychology 
to have given so much care and space in the 
manuals and in accompanying promotional bul- 
letins to making the sales appeals of these tests 
so attractive, while omitting essential infor- 
mation about standardization and basic data 
needed to show what these tests are good for. 
The Adult series manual states vaguely that 
the "test is based on a study of several hun- 
dred ways adults respond. . . ." Manuals for 
the other four series state that they are based 
on a study of 1,000 specific adjustment patterns 
or modes of response. . 
“Percentile norms based on several thot 
sand students for each series are given in the 
manual of directions," according to the sales 
catalogue. Actually, the number of cases is not 
given for the percentile norms in the manuals 
of the Adult, Secondary, and Intermediate 
tests; the Elementary and Primary norms are 
reported as based on "over a thousand pupils." 
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The groups on which the high split-halves, 
Spearman-Brown reliabilities are based are: 
-Adult, 250 subjects; Secondary, 558 cases; In- 
termediate, 792; Elementary, 334; and Pri- 
mary, 237 pupils. It is not stated which of the 
two forms for each series these data refer to 
nor how the forms were equated. Inspection 
of scoring keys reveals varying ratios of re- 
quired YES to NO responses in the alternate 
forms and in the subtests of each, despite the 
fact that this has been found to influence val- 
idity of such schedules.* 

Nothing is reported about the conditions of 
standardization or the age, sex, and IQ dis- 
tribution or the social, racial, and economic 
background of the subjects, except that they 
were all in or near Los Angeles. Thus even 
the “California” in the title is open to question, 

The worst features of the tests, in the opin- 
ion of this reviewer, are the elaborate sugges- 
tions to teachers for the treatment of conditions 
claimed to be revealed by the scores, profiles, 
and even individual item responses. When not 
clearly dangerous, these procedures are stereo- 
typed, superficial, and lacking in clinical sense. 


Clearing House 16:188 N "41. E(arl) R. 
G(abler). * Precautions were taken to insure 
validity, and the tryout reveals reasonable reli- 
ability. The format promises ease of scoring and 
interpretation—which is something we urgently 
need in tests of this sort, The individual and 
class profile chart speaks aloud and gives a 


diagnostic story of each person and class taking 
the test. 


For reviews by Raymond B. C attell, Perci- 
val M. Symonds, and P. E. Vernon of the 
Elementary and S econdary Series, see 40:1213. 
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Concept Formation Test. Normal and Schizophrenic 
adults; 1940; 1 form ; $12.00 per testing outfit; (10-60) 
minutes; Jacob Kasanin and Eugenia Hanfmann; 
C. H. Stoelting Co. 
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O. L. Zangwill, Assistant Director, Institute 
of Experimental Psychology, University of 
Oxford, Oxford, England. This test procedure 
serves to define, in an experimental setting, the 
peculiar disorder of thought and judgment 
characteristic of schizophrenia, It also provides 
a test, admittedly a somewhat difficult one, for 
the intellectual deficit states which may be as- 
sociated with organic lesions of the brain cortex. 
It does not, however, enable a differential diag- 
nosis to be made between organic and func- 
tional conditions or provide a quantitative 
measure of the degree of thought disturbance 
present in any given case. 


The main value of the Hanfmann-Kasanin 
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technique lies in the opportunity it provides for 
the display and qualitative elucidation of con- 
ceptual disturbances. It should, at its present 
stage at least, be treated not as a strict psy- 
chometric test but as a flexible tool in research 
investigations. Thus, it should have real use in 
studying the vexed question of whether the 
thought disturbance in schizophrenia is primary 
or secondary to the affective disorganization. It 
should also prove of value in analyzing the 
conceptual disturbances associated with lesions 
of the frontal lobes and in examining patients 
treated by prefrontal leucotomy. 

It would be an advantage if this test could 
be fully standardized for the different age and 
intelligence levels, At present, the clinical in- 
vestigator is obliged to base his assessment 
very largely on crude empirical standards. Itis 
therefore doubtful whether there is much to be 
gained by including it in a test battery for rou- 
tine neuropsychiatric examination. 


For related reviews, see 28. 
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*[Re Concept Formation Test.] HANFMANN, EU- 
GENIA, AND Kasanın, Jacos. Conceptual Thinking 
in Schizophrenia. Nervous and Mental Disease Mon- 
ographs, No. 67. New York: Nervous and Mental Dis- 
ease Monographs, 1942. Pp. viii, 115. Out of print. * 
(PA 16:2289) 


Am J Psychiatry 100:299-300 S ’43. David 
G. Wright. * The authors....have done a praise- 
worthy thing in attempting an objective ap- 
proach to this problem. However, they have 
fallen considerably short of this objective. They 
have dealt with a very small number of individ- 
uals, each group being an extremely heteroge- 
neous one. Too many variables have been in- 
troduced, or have not been excluded, to allow 
for the validity of any of the conclusions. It is 
somewhat disappointing to find in this study 
yet another instance of psychiatric writing 
which purports to be objective, but must be 
said to draw conclusions that are not so justi- 
fied. 

Am J Psychol 60:146-8 Ja '47. George W. 
Hartmann. * The authors....were particularly 
concerned to check Vigotsky’s contention that 
the essence of the schizophrenic disorder lies 
in a loss of ability to think in abstract concepts 
with regression to a more primitive level where 
objects are not viewed under general cate- 
gories but are seen merely as individuals. This 
thesis appears to represent essentially an ex- 
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tension of Goldstein’s position reached from 
his study of brain injuries. * The most appar- 
ent contribution of this study is that a reduc- 
tion in the level of conceptual thinking occurs 
in some schizophrenics and is lacking in.others. 
It is possible that patients so classified and ex- 
hibiting differing functional disorders have 
been grouped under a misleading common label, 
or conceivably the range of variation within 
the category schizophrenia is greater than ordi- 
narily recognized. One wonders therefore why 
the authors declare they have confirmed Vigot- 
sky's thesis. True, they propose three correc- 
tions, viz., (1) a trichotomy instead of a di- 
chotomy of thought levels, (2) not normals but 
only college-educated adults typically reach the 
highest level, and (3) the presence of concep- 
tual defect is rigorously demonstrable in no 
more than one-half of the schizophrenic cases. 
Moreover, in opposition to Cameron's position, 
they found their patients wmable rather than 
umwilling to conceive of other possibilities of 
grouping than the one momentarily suggested 
to them by the material. A stimulating idea is 
the suggestion that the similarity in “concep- 
tual loss” between schizophrenics and organics 
is sufficiently striking to point to the possibility 
of somatic origin for the disintegration often 
seen in the former. While the authors are 
plainly familiar with the basic experimental 
literature on ‘the thought processes’ since the 
pioneer work of Ach, it is a bit puzzling to see 
them, like the early behaviorists, disregard al- 
most completely the interest or affect factor in 
their interpretations of problem-solving be- 
havior, especially since they plainly state (p. 
101) that “impairment of conceptual thinking 
is not the only, and certainly not the most, con- 
spicuous disturbance found in the schizophrenic 
mentality." Emotional, attitudinal and valua- 
tional elements presumably must exist in effec- 
tive combination with certain cognitive forces 
before the behavior on any of the three levels 
can occur. Excessive reliance upon a single 
test and insufficient use of genetically compre- 
hensible ‘frames of reference’ constitute the 
chief limitations of this otherwise intensive 
analysis. 

Arch Neurol & Psychiatry 49:321 F '43. * 
a close relationship was noted between per- 
formance at the highest conceptual level and 
high educational level. To this reviewer, the 
authors seem, at this point, to overlook the op- 
portunity for clarifying the basic question as 


. to whether this relationship is not actually one 


- 


between. intelligence level and high perform- 
ance. * Unimpaired conceptual thinking was 
found most frequently in the group of patients 


«characterized by prevalence of neurotic symp- 


toms, while impairment was most evident in 
the groups showing (1) disturbance of intellec- 
tual function expressed in a tendency to inco- 
herence and irrelevance, (2) marked dissocia- 
tion with extensive fantastic elaboration of 
ideas and (3) a paranoid-hebephrenic trend 
with dull affectivity. While these observations 
seem to indicate that impairment in conceptual 
thinking is not, as Vigotsky claimed, the cen- 
tral disturbance in schizophrenic thinking, al- 
though it may be found in cases in which the 
personality is grossly affected, the authors for- 
mulate their conclusions rather confusingly by 
stating that their study "confirms" Vigotsky's 
thesis that conceptual thinking suffers impair- 
ment in schizophrenia. They take issue with 
Vigotsky's dichotomy of conceptual and “primi- 
tive" thinking, which they regard as inade- 
quate, but do not comment on the factor of 
"regression," which he stresses. * This study is 
provocative and stimulating and represents an 
interesting contribution to the literature of 
schizophrenic thinking. * 

Brit J Psychol 33:187-8 Ja '43. A. S. 
P(ickford) and R. W. P(ickford). * a valuable 
account of the use of Vigotsky's Concept For- 
mation Test, originally due to Ach and modi- 
fied by Saharov, in an investigation of the 
degree of impairment of thinking in schizo- 
phrenia * It would be of interest if the authors 
could find out exactly how performance of nor- 
mals in the test is correlated with intelligence, 
and if possible what part emotional and other 
factors such as personal taste and prejudice 
play in hindering or assisting their perform- 
ance, The method of scoring described by the 
authors is found to have many difficulties in 
practice by the reviewers. 

Int J Psycho-Analysis 23:183 pts 3& 4'42. 
W. H, Gillespie. * presents the results of a 
careful and well-controlled piece of research * 
The authors....find that it is necessary to dis- 
tinguish not less than three grades of thinking, 
as opposed to Vigotsky’s simple division into 
conceptual and concrete (‘complex’) thinking. 
They find, in fact, that thinking in true abstract 
concepts is practically confined to the college- 
trained subjects; other normal subjects gen- 
erally reach only the intermediate grade of 
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partially conceptual thinking. It is therefore 
essential to make allowance for a person’s edu- 
cational level * There is also a wide variation 
among schizophrenics of the same educational 
level, and further investigation led to the dis- 
covery that this variation is related to the differ- 
ent clinical forms of the disorder. * the results 
are not nearly so clear-cut as Vigotsky sug- 
gested, and they appear to rule out the possi- 
bility that this particular kind of thought dis- 
order (which can reasonably be regarded as 
regressive, for a similar disability is found in 
children before puberty) is the essential change 
underlying all cases clinically diagnosed as 
schizophrenia. This result is scarcely surpris- 
ing in view of the growing modern opinion that 
‘schizophrenia’ is not a nosological entity, Re- 
searches such as the present should help to 
break it into its component parts, an important 
step towards a more satisfactory approach to 
the problem of schizophrenia. 

J Am Med Assn 119:600 Je 13 ’42. * For 
any one wishing to learn the fundamentals of 
the test the book is highly recommended. 

J Nerv & Mental Dis 101:302-3 Mr '45. K. 
Goldstein. 'The book of Hanfmann and Kasanin 
comprises the well-known studies by these au- 
thors of the thinking processes in schizo- 
phrenics. There was much theoretical specula- 
tion in this field until the Russian psychologist, 
L. Vigotsky, began to analyze the behavior of 
schizophrenics by means of special perform- 
ance tests. He came to the conclusion that the 
schizophrenic thought disorder is due to a “loss 
of ability to think in abstract concepts and a 
regression to a more primitive level of thinking 
in complexes." He used a concept formation 
test devised by Ach and later modified by 
Saharoff. The present authors in appreciation 
of the fruitfulness of Vigotsky's conception 
examined, by means of the Vigotsky test, a 
great number of different kinds of schizo- 
phrenic patients and compared the results with 
findings in patients with organic brain disease 
and normal subjects of different educational 
levels. * This book is excellently written and 
certainly cannot be overlooked by anyone who 
is interested in research in schizophrenics. 

Mental Hyg 27:318-20 Ap ’43. Wendell 
Muncie. * There are several important points 
to note about the findings: A shift from con- 
ceptual thinking to more concrete thinking 
often occurred under the stress of emotional 
frustration, which Presupposes a certain grasp 
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of the situation, Yet the authors deny Cameron's 
insistence on the emotional basis for the char- 
acteristic thinking disorders he discovered by 
use of the same and other tests. To the reviewer 
—who did not sit in on either set of experi- 
ments, but who has had acquaintance with 
many schizophrenics—it appears that accurate 
judgment on the emotional state of these sub- 
jects might be well-nigh impossible, certainly if 
based exclusively on their verbal comments. 
Can the authors assert that emotional factors 
have been excluded simply because those pa- 
tients who showed the ordinary emotional ex- 
pression were eliminated ? The authors suggest 
that the performance of schizophrenics is quali- 
tatively in many respects not unlike that of 
paretics and arteriosclerotics (“irreversible’’ 
cases), and this observation warrants the sug- 
gestion that deterioration in the thinking proc- 
esses may proceed along a final common path- 
way, even when initiated by diverse factors. 
The authors mention the reviewer's clinical 
observations (among others) relative to the 
schizophrenic's misuse of metaphors. They 
omit the really pertinent item from this con- 
tribution: that such misuse is spontaneous, 
and always related directly in an emotional way 
to the dynamic factors in the illness. This was 
noted in the complete absence of similar dis- 
tortions in the experimental situations of ex- 
plaining metaphor, proverbs, and so forth. The 
reviewer finds it somewhat difficult to believe 
that an “impersonal” disease such as paresis or 
arteriosclerosis—if any disease is really im- 
personal—could bring about disturbance along 
such "lines of cleavage" of the personality. My 
view would fit in with certain of the authors' 
findings and with Cameron's—that the thinking 
disorder proceeds under the impetus of transi- 
tive emotions. Furthermore, I would not hold 
the view that experimental situations, in con- 
trast to clinical observation, necessarily connote 
“more controlled observations.” This is a good 
book, full of interesting detail, and providing 
a most helpful index to the use and interpreta- 
tion of the Vigotsky test. The literature is 
covered in a welcome fashion. 

Psychiatry 6:248-9 My '43. Meyer Maskin. 
* the authors conclude that conceptual thinking 
is significantly impaired in Schizophrenia, save 
in borderline states. * There will be little clini- 
cal denial that what is ordinarily termed con- 
ceptual thinking is altered in Schizophrenia. As 
in all experiments of this type, however, there 
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are inevitable and basic conundra : first, it is in- 
variably difficult to evaluate precisely what part ~ 
or parts of the mental apparatus actually oper- 
ate in a given artificial test performance, Sec- 
ondly, it may be quite incorrect to conclude 
that conceptual thinking is necessarily impaired 
as evidenced in test responses by schizophrenics. 
There are too many variables to permit this 
easy, happy conclusion. Since the schizophrenic 
is caught in a mighty web of feeling and think- 
ing detached from prevailing reality, his re- 
sponse to such realities will vary. To establish 
definitively a lessened capacity to think ab- 
stractly, it should be demonstrated that such 
impairment exists within the domain of the 
schizophrenic’s chief concerns. And clinically 
there is much reason to believe that the schizo- 
phrenic is fluently abstract in his symbolic for- 
mulations, neologisms and other-worldiness. 
The monograph is provocative, however, in sug- 
gesting continued research into the nature of 
thinking. There is a bibliography of 61 refer- 
ences. - 
Psychoanalytic Q 12:257-9 Ap '43. Thomas 
M. French. * an exceedingly painstaking and 
well thought out experimental study of thought 
disorders in schizophrenia. * The procedure is 
based upon experiments of the Russian psy- 
chologist, Vigotsky, whose methods are fol- 
lowed with slight modifications. The subjects 
are given twenty-two wooden blocks, varying 
in color, shape, height, and size. On the under- 
side of each is written one of four nonsense 
words. The subject is told that there are four 
different kinds of blocks before him, that each 
kind has a name and that his task is to find and 
to separate these four kinds. The examiner 
then shows him the name of one block and asks 
him to pick out all blocks which he thinks 
might'belong to the same kind. After he has 
attempted to do so the examiner turns up one 
of the wrongly selected blocks showing that 
this is a block of a different kind and then en- 
courages the subject to continue trying. This 
procedure is repeated until it becomes clear 
whether he is able to complete the task of sep- 
arating the four kinds, The subject is encour- 
aged to think aloud and all comments are 
recorded in detail throughout the experiment. 
This test was given to three groups of sub- 
jects: schizophrenic patients, healthy adults 
and patients with an irreversible organic brain 
disease. * the authors offer the very tentative 
hypothesis that the similarities between the 
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thinking of schizophrenics and that of organic 
patients 'are sufficiently striking to suggest the 
possibility of somatic origin of schizophrenic 
disorder or rather of some of its variants.’ 
However, one of their findings is not given 
sufficient importance. In some cases the authors 
report that the performance of the subject de- 
teriorated sharply upon the development of an 
emotional disturbance during the test. This fact 
is consistent with that view of schizophrenia 
which regards the disturbance of thought as a 
secondary result of disturbances in the emo- 
tional sphere rather than with the view which 
the authors seem cautiously to favor, that the 
disturbance of thought is primary. 

Psychoanalytic R 33:383-4 Jl '46. Clara 
Thompson. * 'The test performance used is 
fully and clearly described and the formulation 
of the results is presented in a well-organized 
and simple manner. * The authors conclude 
that while their findings in general confirm 
Vigotsky, they cannot agree that conceptual 
thinking is a possession of every normal adult, 
and they have shown that some cases of schizo- 
phrenia are free from defect. The book is well 
organized and well written. The research seems 
carefully planned and executed and the con- 
clusions are convincing. The only criticism is 
the one which can be made concerning the ap- 
plication of laboratory methods to human be- 
havior generally. For example, on what basis 
was it determined that a subject was normal. 
Also is it not possible that the same schizo- 
phrenics at another stage of their illness would 
have shown different results! 

Psychosomatic Med 5:202 Ap '43. Oskar 
Diethelm. The authors present....the results of 
their investigation of conceptual thinking by 
means of the Vigotsky test. After a brief re- 
view of the pertinent work of the Russian in- 
vestigators, the test performance is analyzed 
under the headings of interpretation of the in- 
structions by the subject, attempts at solution, 
and finding and mastering the correct solution. 
This chapter should be of interest to anyone 
wishing to use the Vigotsky test or to study 
conceptual thinking by any other method. * 
The discussion of the findings of other investi- 
gators in the field of conceptual’ thinking is 
highly informative. The authors accept educa- 
tional standards as true indicators of intelli- 
gence. The results would be more valid if in- 
telligence tests had been used. The normal 
findings in some schizophrenic patients bring 
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up the question whether a study of schizo- 
phrenic patients in general can give adequate 
results or whether these studies should not be 
related to carefully observed psychopathologi- 
cal reactions in general instead of to an in- 
volved and disputed “schizophrenia.” The re- 
viewer, however, agrees that “these studies 
and the possibilities of new approaches that 
they have uncovered are encouraging.” 


[ 29 ] 
xControlled Projection (1944): A Standard Ex- 
perimental Procedure. Ages 6 and over, adults above 
average intelligence; 1945; J. C. Raven; H. K. Lewis 
& Co., Ltd. (The manual and a set of sketches are ob- 
tainable in the United States for $4, postpaid, from 
the Psychological Corporation.) ADM 
a) [INDIVIDUAL TEsT.] Age 6 and over ; individual. 
b) [GROUP OR SELF-ADMINISTERED TEST.] Adults above 
average intelligence; separate forms for men and 
women, 

Arthur L. Benton, Associate Professor of 
Psychology, University of Louisville School of 
Medicine, Louisville, Kentucky. An adult sub- 
ject is shown a drawing of a person of the 
same sex as himself sitting at a table and writ- 
ing on paper. The subject is asked to draw 
whatever he wishes and at the same time to 
answer questions concerning the interests, likes, 
dislikes, fears, etc., of the person in the pic- 
ture. Children are also requested to draw what- 
ever they wish but, instead of answering ques- 
tions, are asked to complete items in a standard 
story presented to them. 

The fundamental assumption underlying in- 
trepretation of the subject’s performance is 
that the statements of the subject about the 
picture refer primarily to himself and are there- 
fore a projection and an unwitting self-evalua- 
tion. The test is still in an experimental stage 
in that its essential validity and clinical useful- 
ness have not yet been established. 


Percival M. Symonds, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. This unpaged book de- 
scribes a projective technique which combines 
drawing with storytelling. Raven calls his 
method “Controlled Projection,” using the de- 
vice of “projection pursuit.” In brief, the 
method consists of asking the subject to draw 
a picture while plying him with questions as he 
is drawing. The author states that the method 
can be used with “almost everybody” over six 
years of age and provides simultaneously “two 
distinct and yet correlated forms of self-expres- 
sion without disconcerting apparatus.” He 
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states the aim of projection pursuit to be “to 
discover not facts, but how people interpret the 
events of life to themselves; how they organise 
their thoughts and actions to their own satis- 
faction; how they regard other people, and 
imagine other people regard them; the motives 
they attribute to people's actions ; and to reveal, 
as far as possible, the personal preferences, dis- 
likes, desires and judgments by which their ex- 
perience becomes organised and character is 
formed." 

When the test is given to a child, he is told 
to draw anything he wants to. A standardized 
list of eleven questions are asked the child as 
he draws. The pursuit comes by "inducing a 
person to develop his ideas to his own satisfac- 
tion," and by following up answers with such 
questions as “Why ?”, “How?”, “Yes?”, “And 
then?”, “What then?", and “What or who 
else?” 

With adults a skeleton sketch of a road and 
of a box is given to the subject, and he is asked 
to sketch in what he might see walking along 
the road or what he imagines might be inside 
the box. A standard list of questions is pro- 
vided to ask the subject while he sketches. In 
addition, for adults the materials include the 
picture of a person drawing. The subject is 
asked to sketch on paper what he thinks the 
person in the picture might be drawing or writ- 
ing and a list of questions is asked the subject 
simultaneously, 

The book contains a number of samples of 
children’s and adults’ drawings together with a 
“verbatim projection record” which consists of 
the answers given to the standardized questions. 
No interpretation of the pictures or of the 
answers is given. 

The idea of distracting the subject by requir- 
ing him to do two things at the same time is 
novel, but no data or evidence are provided 
which indicate the value or strength of the 
method. There is considerable doubt as to 
whether a uniform set of questions can be 
made applicable to every case. The book pro- 
vides nothing in the way of expectation or 
norms. The author believes that telling a per- 
son that his imagination is being tested is detri- 
mental to the results. He believes that the 
method can be used as a self-administered or 
group test. 

Undoubtedly within the next few years many 
different kinds of projective techniques will ap- 
pear, but there is no way of knowing ahead of 


time what value each variation may have. Cer- 
tainly much can be learned from stories and 
drawings told by a person no matter under 
what conditions they are obtained. 


Am J Orthopsychiatry 17:548-9 Jl’47. * Al- 
though the author mentions H. M. Murray’s 
Explorations in Personality, he leans very heav- 
ily on Spearman’s The Nature of Intelligence 
and Principles of Cognition. * The book ccn- 
tains numerous records and drawings. Unfor- 
tunately, Dr. Raven does not go into the psy- 
chodynamic implications of these records, but 
keeps the reader hanging in mid-air after he 
has looked at the various pictures and read the 
stories which are not related to the drawings. 
The technique in itself is a very interesting 
one and warrants further experimental inves- 
tigation. 

Brit J Ed Psychol 16:48 F ’46. * Unfortu- 
nately, although some detailed examples of cases 
are published, the author refrains from even a 
tentative evaluation which would have added 
immeasurably to the value of the book as a 
stimulus to further experimental work in a 
field where reliable techniques are urgently 
necessary. It is unfortunate too that there is 
neither the briefest table of contents nor even 
page numbering. 

Brit J Psychol 36:177 My ’46. * A series of 
illustrative case histories and the correspond- 
ing projected material is given. But neither 
these cases nor the general results of inquiry 
made by the techniques are discussed with 
sufficient clarity or detail to demonstrate how 
the projected material is used and interpreted. 
The answers may express the individual’s con- 
scious wishes and fears more clearly than 
would his answers to direct questioning, as at a 
psychiatric interview. But is it possible to use 
this technique to relate such wishes and fears 
to the personality as a whole, and to the in- 
dividual’s unconscious mental contents? Are 
the latter perhaps expressed in the drawings 
and ‘doodlings’? If so, in what way? The 
method and its results need a more systematic 
exposition before it is possible to judge their 
value. 

J Mental Sci 92:197-8 Ja '46. M. B. Brody. 
* Many records are given in full. The problem 
is real and important, and Raven's method 
promises to be richly productive. Time will tell 
if this promise will be fulfilled. Raven also 
makes some interesting general observations on 
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the nature of projection, and on the factors 
which modify the drawings and replies to 
questions. The beauty of the reproductions 
cannot be passed without mention. 

Nature 156:701—2 D '45. John Cohen. Recent 
developments in projective technique offer a 
new approach to the mind, the exploration of 
which is just beginning. The underlying con- 
cept of projection (attributing our own ‘qual- 
ities’ to others) owes its origin to Freud. It 
was postulated to explain the origin of delu- 
sions of persecution and ideas of reference in 
paranoia. Freud thus identified a distinctive 
mental process of which we had only been 
vaguely aware. Many proverbs embody the be- 
lief that projection enters into daily intercourse. 
Folk-knowledge recognizes that in judging 
others we judge ourselyes, that in condemning 
or condoning the actions of others we betray 
our own private inclinations. In the narrower 
sense in which Freud understood it, however, 
projection is a mechanism of defence employed, 
as an alternative to repression, with the aim of 
ridding the mind of an intolerable burden of 
aggression or guilt. Some recent experiments 
have thrown further light on these alternative 
extra- or intro-punitive processes. Projection 
helps to shape our relationships with the outer 
world. We ‘infer’ from the dim or clear aware- 
ness of our own state of mind the state of 
mind of another person. It is not inference in 
the logical sense of inductive reasoning, but 
arguing by analogy from one particular to an- 
other. In Spearman’s terminology, projection 
means educing correlates of our own social at- 
titudes. Taken jointly, projection and correlate 
education are rich in explanatory value. Thus, 
to take one example, the late Prof. Aveling 
showed that ideas of cause and effect are gen- 
erated (educed as correlates) by projecting on 
to external objects the experience of the self 
acting upon the organs and limbs of the body. 
The recognition that projection plays a major 
part in the organization of mind has led to its 
being exploited as a method of penetrating be- 
neath the surface of personality. Indeed, the 
importance of projective techniques to the psy- 
chologist has been likened to that of the X-ray 
apparatus to the physician. A person’s projec- 
tions can be evoked so as to uncover his inner- 
most impulses, wishes, fears and values. He 
will interpret an ambiguous social situation in 
a way which, unknown to himself, will disclose 
some of the hidden secrets of his mind. In 


choosing his invective he will most likely dip 
into his personal inventory of self-condemna- 
tion. In forecasting an event on inadequate 
data he will project himself into his predictions 
thus revealing what he wants to happen or 
what he privately fears might happen. He may 
perhaps project his own ignorance into the out- 
side world. This last possibility was well exem- 
plified by the Press in 1941 when ignorance of 
Japan’s plans in regard to entering the war was 
projected on to the Japanese rulers, who were 
believed to be utterly uncertain of their own 
intentions. Projective techniques are methods 
of indirectly locating crucial foci of value and 
conflict. Just as a searchlight, in projecting its 
beams on to an object, discloses its own loca- 
tion, so the mind by projecting its own wishes, 
hostility, fears and values, reveals its key cen- 
ters of activity. Projection is something more 
than a subjective mental process. It has a so- 
cial quality inasmuch as it is the main avenue 
for the mind’s interpretation of its environ- 
ment, human and material. Hence its signifi- 
cance for the study of interpersonal relations. 
In England, projective techniques appear to 
have been first used by Wedeck in 1933 and de- 
scribed by him in a thesis deposited in the Uni- 
versity of London library. His object was to 
inquire whether any ‘group factor’ existed 
which implied a special ability for understand- 
ing human character and relationships, whether, 
in other words, the interpretation of facial ex- 
pression, gestures, social situations, proverbs 
and the like, required an ability statistically in- 
dependent of general intelligence (Spearman’s 
g’ factor). Wedeck did, in fact, claim to have 
detected a special ability of estimating charac- 
ter uncorrelated with ‘g.’ The potentialities of 
projective techniques in the study of person- 
ality were first fully appreciated, however, in 
the United States. H. A. Murray seems to have 
been early in the field. He set out the methods 
in his “Explorations in Personality,” which at- 
tracted curiously little attention in England 
when it was published in 1937. He had a differ- 
ent purpose in mind; not to find out whether 
the subjects were good judges of character or 
not (as measured by some agreed criterion), 
but to discover their general mental and social 
orientation. There is variety in projective tech- 
niques. The basic procedure requires that the 
subject should freely manipulate experimental 
materials presented to him. Alternatively he 
may be required to interpret the materials or 
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situation, or he may be asked to carry out cer- 
tain prescribed acts under observation. The 
underlying assumption is that the subject will 
employ the material or situation so as to ex- 
press his inner values, his mental conflicts, his 
tendencies, aspirations and imaginary roles. He 
throws or projects himself into the medium of 
expression. Such techniques include the Ror- 
schach Test, the Thematic Apperception Tests 
of Murray (briefly known as TAT), psycho- 
dramatic situations, and, in the case of chil- 
dren, play activities. Closely allied procedures 
are observations of expressive movement, pos- 
ture, gait and physical activity of almost any 
kind. The fantasies projected by the subject 
or patient are not regarded as final indications 
of the trends in his mind but simply used as 
working hypotheses to be checked by other 
criteria, Murray’s tests, which are the most im- 
portant of these devices, were developed at the 
Harvard Psychological Clinic. In principle 
similar to the Rorschach Test, the material 
consists of reproductions of paintings instead 
of ‘ink-blots.’ The latest revision of the the- 
matic apperception tests has just been pub- 
lished. Though originally devised for adults of 
normal intelligence, the tests have also been 
applied to mental defectives. The exigencies of 
war soon determined that, in the interests of 
military efficiency, methods should be adopted 
for the assessment of personality in selecting 
personnel for higher duties. As is now perhaps 
well known, the important step was taken in 
1942 of introducing projective techniques into 
War Office Selection Board procedure as an 
aid in selecting candidates on grounds of ‘offi- 
cer quality’ or leadership. Here the current 
tests of intellectual abilities were not appropri- 
ate and the new methods were of great help in 
detecting weakness and strength in the per- 
sonality structure of candidates, and in the 
adequacy of their social adjustment. Similar 
methods are now being employed in selecting 
candidates for the higher grades of the Civil 
Service. Raven's work, stimulated by Murray's 
efforts and by the 'completion test' of a French 
investigator, M. Thomas, introduces new steps 
in testing procedure, part of which is common 
both to children and to adults. The subject is 
asked to draw, and while he is drawing, to 
imagine and describe a series of events. Since 
he is occupied with two tasks there is little 
residual energy left for self-observation or 
self-criticism. There are supplementary tests 


suitable for adults only. Although best results 
are obtained with individual administration, an 
adaptation of the test material is also provided 
for administration to adult groups. Raven uses 
two kinds of material, verbal and spatial simul- 
taneously, and he 'pursues' the subject's fan- 
tasies in a number of prescribed directions. 
There is thus a twofold pursuit of the subject's 
projections. In the result two distinct yet par- 
allel forms of expression emerge side by side, 
one oral, the other pictorial; each is a check on 
the other. Both the verbal utterances and the 
drawings can be systematically followed, step 
by step, by the experimenter until he is satis- 
fied that his explorations of the subject’s per- 
sonality are as complete as necessary. The 
materials used are simple and lack any struc- 
ture of their own. The ambitious aims of 
‘projection pursuit’ are clearly set out by Raven. 
It is not facts about the subject’s past which 
are sought, but his interpretation of the events 
of life, how he organizes his thoughts and ac- 
tions, how he regards other people and how he 


‘imagines other people regard him, what mo- 


tives he attributes to the actions of others, what 
are his likes and dislikes, his wishes and judg- 
ments, and how, on the basis of these, his char- 
acter is formed. The sample records given by 
Raven illustrate (they are not norms) what re- 
sults are to be expected of children at different 
age- and ability-levels within the normal range 
of intelligence. Some records are also given of 
the projections of adults differing in age, sex 
and ability. Several criticisms of the present 
stage of Raven’s techniques come to mind. The 
methods do not, as is claimed, appear to be ap- 
preciably subject to controlled variation. Nor 
do the results lend themselves at all easily to 
statistical treatment. Above all, to interpret the 
records is exceedingly difficult, and practically 
no guidance is offered on the subject. Raven 
admits that much has still to be learnt about the 
most informative questions to be put, about the 
best methods of pursuit, and abovt ways of 
recording the responses. These limitations the 
reader will readily concede. The value of the 
present arrangement of test material lies less 
in what it achieves than in what it foreshadows. 
It provides not so much a procedure ready for 
clinical application as a “method of enquiry 
suitable for experimental work”; it is there- 
fore as a pioneer effort that the work is to be 
judged, and the reader must not be too critical, 
though there is much to criticize. 
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Cowan Adolescent Adjustment Analyzer: An In- 
strument of Clinical Psychology. Ages 12-18; 1935- 
46; revision of Cowan Adolescent Personality Sched- 
ule (see 40:1217) ; 1 form; $2.50 per 25; 50¢ per speci- 
men set; nontimed (10-30) minutes; Edwina A. 
Cowan, Wilbert J. Mueller, and Edra Weathers; 
Cowan Research Project, Salina, Kansas. 


REFERENCES 
1-3. See 4011217. 


Harold H. Abelson, Associate Professor of 
Education and Director, Educational Clinic, 
The City College of New York, New York, 
New York. Despite its striking title, this in- 
strument is simply a self-marking personality 
questionnaire with convenient arrangement for 
obtaining separate scores in nine categories: 
fear, family emotion, family authority, feeling 
of inadequacy, nonfamily authority, maturity, 
escape, neurotic, and compensation. Its value 
will depend largely on its use or misuse, While 
the authors do not intend that the “Analyzer” 
be used in lieu of a full array of clinical pro- 
cedures, their ready use of illustrative clinical 
interpretations may encourage users to expect 
too much of the instrument, which is simply a 
source of data and not a diagnostic program. 

Skeptical also must be one’s reaction to the 
use of the “objective” validation procedure em- 
ployed. Avoiding evidence from clinical sources 
as being too subjective, the authors resort to 
graphic comparisons of several deviate groups to 
show that the instrument is valid. A nonprecise 
inspection technique is used to ascertain whether 
there are differences in adjustment scores among 
the several groups. Moreover, while presumably 
validating the test, the authors slip into the fal- 
lacy of using the test findings as if they were 
already known to be valid. Thus, if one group is 
found to have lower “maturity” scores than an- 
other, the first group is said to be more mature; 
if no difference is found, the groups are said to 
be the same, Instead of producing a rigid valida- 
tion of the tests, the authors simply show that 
the test results of the several groups conform 
in general to what one may plausibly expect of 
these groups. On the basis of such observa- 
tions, slight group trends are employed as sup- 
posedly reliable indications of individual char- 
acteristics, 

Although the length of the questionnaire has 
been reduced to go items as compared with the 
201 items of the original Cowan Adolescent 
Personality Schedule, and the score in each 
category is based on only 1o questions, the au- 


thors lay claim to the reliability of the obtained 
scores. The method of establishing reliability 
is a simple and specious one, namely, that of 
showing that the distribution graphs of groups 
from three towns in widely separated parts of 
Kansas “do not differ from each other in any 
category as much as they differ in some cate- 
gories from distributions obtained with selected 
groups of children living under conditions pre- 
sumed to foster maladjustment.” No retest or 
split-half coefficients are quoted. The reliability 
of an individual's score on the test as a whole 
or in any of the nine categories clearly has not 
been established. 

In making this revision, the authors have 
followed an elaborate procedure for selecting 
the questions to be retained. Simply put, an 
item was retained when it correlated with scores 


-on the test as a whole but did not correlate too 


highly with scores in categories other than the 
one in which it fell. Apparently the original 
placement of an item in a category was not 
modified, such placement having the same sub- 
jective basis as on the original Cowan schedule. 
None of the objections previously voiced to 
possibly inadequate wordings of questions have 
been met in the revision. Although the subject 
is asked to indicate whether each parent lives 
at home and the number, age, and sex of siblings, 
no provision is made for correcting scoring 
when items relative to these are not applicable 
to a particular case. Another minor defect is 
found in the fact that while there are 77 XES's 
among the "maladjusted" answers, there are 
only 13 NO's. The effect of the uneven dis- 
tribution is not clear. Norms are based on 500 
high school pupils in one state and are not 
differentiated for age. 

Despite its limitations, the new Cowan ques- 
tionnaire, like its older counterpart, may well 
be the best available inventory for the adolescent 
range. Particularly significant is its recognition 
of two basic principles: (a) that admitting to 
few symptoms is likely to reveal maladjust- 
ment and (b) that the internal comparison of 
an individual's scores in the several categories 
may (in the hands of a clinically oriented per- 
son) lead to a preliminary understanding of 
certain aspects of the individual's adjustive 
processes. In its new form it certainly is a con- 
venient and interesting device. Let us hope that 
those who use the instrument—and it warrants 
widespread use—will employ it with a full un- 
derstanding of its merits and its limitations. 


67] 


William U. Snyder, Associate Professor of 
Psychology and Associate Director, Psycho- 
logical Clinic, The Pennsylvania State Col- 
lege, State College, Pennsylvania. This per- 
sonality test is designed to assist the clinical 
psychologist to understand the fields of emo- 
tional and social maladjustment of the adoles- 
cent and to show the adjustive devices upon 
which the subject is depending. More specifi- 
cally, it is a questionnaire which obtains scores 
for a given adolescent that are comparable to 
standard scores obtained on 500 Kansas high 
school children in response to 90 questions di- 
vided into groups of 10 each for the following 
nine traits: fear, family emotions, family au- 
thority, feeling of inadequacy, nonfamily au- 
thority, maturity, escape, neuroticism, and com- 
pensation. 

The make-up of the test is in the form of 
three alternate answers to each question, viz., 
“YES,” “NO,” and “?.” The test is self-scoring, 
like the Henmon-Nelson Mental Ability Test. 
Questions are so arranged that upon opening 
the folder the scorer finds parallel lines sep- 
arating the nine sections and has only to count 
the number of encircled dots in each section 
and record the scores on a table which auto- 
matically provides standard scores and a pro- 
file of the subject's status in the nine traits. 
The questions themselves are also summarized 
inside the folder for easy reference by the 
clinician using the analyzer as an interview aid. 

The wording of the questions is simple and 
appropriate for adolescents. The questions 
themselves show considerable care in selection 
and are not ambiguous. They are the items that 
proved most satisfactory of the 201 which were 
used in Cowan’s former test, the Cowan Ado- 
lescent Personality Schedule, The reviewer 
found that test to have considerable merit in its 
field, 

The selection of items was based on the re- 
sults obtained from 1,150 questionnaires admin- 
istered to unselected high school students. Two 
commendable techniques were utilized in this 
selection. The authors used the ten questions 
for each trait which best distinguished between 
the average score and the score of tlie upper 
quartile in maladjustment of a random sample 
of the total group. No question was included 
which showed less than 10 per cent difference, 
and most of them showed a much greater per- 
centage of difference. The second technique 
was the avoidance of overlap between the nine 
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traits. Tables have been prepared to show that 
there is not a great deal of such overlap. 

Reliability was based on 300 cases, used in 
the final norms, and while no reliability index 
appears in the literature for this test, the curves 
obtained for the three different populations of 
100 look very similar on all traits but matu- 
rity. It would have been well if a test of good- 
ness of fit had been used. 

The test has some shortcomings. Most seri- 
ous, as usual, is the question of validity. The 
authors reject the usual criteria of validity, clin- 
ical judgments of raters, but it is doubtful 
whether they have improved on that method. 
They have assumed that if the test is valid it 
should show a difference between an unselected 
population and adolescents living under special 
circumstances such as institutions, foster homes, 
or correctional schools. They compared their 
500 unselected adolescents with 94 “broken 
home" and 145 delinquent children, classified 
into different subgroups. But the comparison 
is by inspection of frequency curves only, no 
predictive device being used. The authors seem 
occasionally to be reasoning after the facts, 
i.e., because a “difference” appears in two fre- 
quency curves an explanation is found to ac- 
count for it. 

Like all authors of questionnaires, the pres- 
ent authors have difficulty rationalizing the 
problem of subjective error. It has been sug- 
gested by Cowan and others that the true facts 
about a child are no more significant than the 
child's believing he is a deviate. But they do 
not answer the fact that by proper "lying" a 
child can obtain a normal answer although he 
is really abnormal. However, by allowing for 
deviation at both ends of the scale, they do 
greatly decrease this possibility. 

Despite the care used in selecting the items, 
some question might be raised about the ade- 
quacy of the trait names. For example, it 
seemed to the reviewer that five of the ten 
questions on fear really pertain to inadequacy. 
The scoring of several items seems dubious, 
i.e., is talking to oneself about one's troubles a 
sign of escape or is it neuroticism ? 

One difficulty in administering the test is 
that some of the questions pertain to parents 
and siblings. The authors instruct the subject 
without parents to omit these items; but, if he 
does, he omits six items out of the ten relating 
to family emotion and all ten questions relating 
to family authority. Orphaned children might 
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have good reason to skip these 16 questions, 
and children in foster homes might not know 
which of their different sets of foster parents, 
or real ones, they should refer to. 

Sex differences are lightly treated; one hun- 
dred children of each sex from the same school 
were compared. Differences were found on at 
least two traits (fear and inadequacy feelings 
occurred more frequently for the girls), but 
the authors suggest that the clinician can allow 
for this fact and that the number of children 
compared was quite small, anyway. 

Nothing is said about age of subjects, except 
that they were from high schools and were ado- 
lescents. The reviewer is also somewhat con- 
cerned about the use of children from only one 
state in the establishing of norms. And he won- 
ders about the wisdom of using only ten ques- 
tions to measure the incidence of any trait; a 
single error in the child's judgment would 
change his scores about half a standard devia- 
tion. 

Despite the weaknesses mentioned it is the 
reviewer's opinion that this test is one of the 
better personality tests available for use with 
adolescents. Especially if used by a qualified 
child psychologist in the clinical situation, it 
“should be a valuable tool, 


For reviews by Harold E. Jones and Good- 
win Watson of an earlier edition, see 38:918 
and 40:1217. 


[31] 
XDetroit Adjustment Inventory: Alpha Form of 
“Telling What I Do." Grades 7-12; 1942; 1 form; 
$2.10 per 25; 42¢ per specimen set; nontimed (30-40) 
Sr Harry J. Baker; Public School Publishing 


Albert Ellis, Consulting Psychologist, 2505 
University Avenue, New York, New York. 
The distinguishing characteristics of the De- 
troit Adjustment Inventory are these: (a) Five 
choices of answers are provided instead of the 
common Yes-No scoring; and these are ran- 
domly assorted rather than being printed in 
Systematic sequence. (b) The inventory is self- 
administering and can be given to one or more 
individuals. (c) The 120 items are divided into 
24 major topics representing different diagnos- 
tic categories. (d) Each question is specific, 
and is worded in simple, popular style, making 
frequent use of the pronoun “I.” (e) Twenty- 
four sets of remedial suggestions, one to cover 


each of the main topics of the test, are option- 
ally provided for school use. 

It will be noted that none of these distinguish- 
ing features of the inventory, except possibly 
the last one, is a radical departure írom 
present-day practice in personality test con- 
struction. While the author must be commended 
at least for giving some careful consideration 
and discussion to the matter of remedial treat- 
ment as well as diagnosis, his questionnaire still 
suffers from most of the usual limitations of 
this kind of personality test. Specifically : 

a) There is no indication that the test items 
were selected by methods other than the all too 
usual armchair technique. No preliminary item 
analyses are mentioned, nor any validating cri- 
teria for the individual items. 

b) Validation is most inadequate. Although 
some 3,000 children are said to have taken the 
test, only 61 behavior problem and 27 non- 
behavior problem cases are used for validating 
purposes. The groups chosen are very malad- 
justed children on the one hand and very well- 
adjusted ones (according to teachers’ ratings) 
on the other hand—with the great majority of 
presumably normally adjusted subjects ex- 
cluded. The difference between the average 
Scores of the two groups is said to be 6.1 times 
the standard error of the difference; and on 
the basis of this rather meaningless group dif- 
ference both individual diagnosis and remedial 
treatment value are claimed for the test. 

c) No reliability data are given for the in- 
ventory. 

d) It is claimed that the respondents often 
give more objective answers to the question- 
naire than they give in interviews and that hon- 
est answers are generally given to the test 
questions; but no adequate proof of this con- 
tention is provided. It is naively suggested that 
there is a large amount of inherent honesty in 
the great majority of people, despite over- 
whelming evidence to the contrary as far as 
personality questionnaire responses are con- 
cerned. 

€) Diagnosis is made according to no less 
than 24 major categories. This kind of specifi- 
cally detailed diagnosis is recommended in spite 
of the facts that only five questions are em- 
ployed in each category and that, according to 
the author's own admission, several of the 
questions—particularly those about home and 
parents—"have been selected with great care so 
as not to emphasize bad features.” This is akin 
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to diagnosing an. individual's marital adjust- 
ment by asking him a few innocuous questions 
about how he likes the hats his wife wears. 

f) The remedial suggestions optionally pro- 
vided with the test are of the pat-on-the-back 
exhortation type that put the entire remedial 
burden on the child—thus pleasantly relieving 
the school authorities from the necessity of 
finding out why the child possesses certain un- 
favorable symptoms and what may effectually 
be done to get at their roots. 

g) The Teacher's Handbook accompanying 
the inventory is an odd mixture of its admitted 
limitations on the one hand and cagey circum- 
locutions of these on the other. 

In sum: Baker's Detroit Adjustment Inven- 
tory, being little better or worse than the aver- 
age personality questionnaire of its kind, makes 
up for none of the serious limitations still in- 
herent in these instruments. 


[32] 
Detroit Scale of Behavior Factors. Grades 1-12; 
1035-36 ; $3.85 per manual (The Diagnosis and Treat- 
ment of Behavior-Problem Children) ; 20¢ per case 
record form; Harry J. Baker and Virginia Trap- 
hagen; Macmillan Co. 
REFERENCES 
1, Baker, Harry J., AND TRAPHAGEN, Vi ic 

nosis and Treatment of Behavior-Problem Children. New York: 
Macmillan Co., 1935. Pp. xiv, 393. $3.85. (London: Macmil- 
lan & Co. Ltd., 1936. 205.) * (PA 10:2743) 


For related reviews, see 33. 


[ 33 ] 
[Re Detroit Scale of Behavior Factors.] BAKER, 
Harry J., AND TRAPHAGEN, ViRGINIA. Diagnosis and 
Treatment of Behavior-Problem Children. New 
York: Macmillan Co., 1935. Pp. xiv, 393. $3.85. (Lon- 
don : Macmillan & Co. Ltd., 1936. 20s.) * (PA 10:2743) 


Fed Probation 3:38-9 My 739. Edw. H. 
Stullken. * gives recognition to the complexity 
of the problem of delinquency. The very length 
of the scale should make it more valid than 
most scales for discriminating between delin- 
quent and non-delinquent types. The scale on a 
whole is well balanced and no particular school 
of thought predominates. The scale should act 
as a check on the user in overcoming any ten- 
dency to overemphasize any particular factor. 
The book gives publicity to some of the White 
House Conferences studies made in the White 
House Conference on Child Health and Pro- 
tection, There may be a question whether the 
authors, who found personality and social fac- 
tors more important than material and physical 
factors, give enough weight in the scale in this 
direction. The authors might have emphasized 
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more the attitude toward the school and toward 
satisfaction in school life and the importance 
of the influence of other members of the prob- 
lem child’s family. The statistical studies re- 
ported in the book of the constellation of 
factors in behavior problem cases are interest- 
ing and important. The book makes generalized 
suggestions for treatment, but does not give 
any specific results of what such treatment will 
do. Students of child nature and of behavior 
problems will find much of value in the volume 
concerning the intricacies of human motiva- 
tion. The book is of value to clinical workers, 
teachers and administrators of special classes 
and special schools and to social workers and 
teachers in general. It is hoped that this volume, 
will be followed by another giving more results 
of the use of the scale. 


* 

For additional reviews, see 36:B24, 38: 
B306, and 40:B835. 

[34] 
Diagnosis and Treatment of Pupil Maladjust- 
ment. Grades kgn-12; 1938; Part A, Pam Adjust- 
ment Inventory, 50¢ per 40; Part B, tudy Rec- 
ord, 156; Part C, Symptoms, Causes, and Remedies: 
A Handbook for Teachers, $1; Part D, Abbreviated 
Case Study Record, $4 per 100; Part E, Cumulative 
Pupil Adjustment Record, $4 per 100; Class Summary, 
free; $1.75 per specimen set; T. L. Torgerson; E. M. 
Hale & Co. f 
REFERENCES 

1. Raren, H. Tuosurn. “The Measurement and Improve, 
ment of Pupil Adjustment.” Nat El Sch Prin 16:508-13 eT Ae 

2. Wauistrom, LAWRENCE F. A Statistical Analysis of the 
Torgerson Pupil Adjustment Inventory. Unpublished master's 
thesis, University of Wisconsin, 1937- 

Laurance F. Shaffer, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. Torgerson’s materials 
for studying and treating maladjusted children 
consist of six items. The Pupil Adjustment In- 
ventory is a single sheet on which each of ten 
supposed traits is rated on a four-point scale. 
The choice of traits and their description leave 
much to be desired. For example, the four 
rated degrees of “nervousness” are stated as 
"(x) calm and self-controlled; (2) uneasy, 
bites fingernails, etc.; (3) stammers, nervous, 
hyperactive; (4) afflicted with nervous dis- 
ease.” It is intended that the Pupil Adjustment 
Inventory be used to survey a class, the teacher 
rating each pupil, to identify the ones needing 
further study. 

The Case Study Record is a sixteen-page 
form for the more detailed study of a child. It 
includes check lists for rating school history, 
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study habits, physical efficiency and health 
status, interests and attitudes, and home envi- 
ronment. Space is provided for test results, and 
entirely blank pages are provided for "case his- 
tory,’ "remedial treatment" (11 lines), and 
"results." Other forms include the Abbreviated 
Case Study Record and the Cumulative Pupil 
Adjustment Record, which are four-page ma- 
nila folders, and a class summary form. 

The heart of Torgerson's system is found in 
his handbook Symptoms, Causes and Reme- 
dies. The "symptoms," obtained from the Pupil 
Adjustment Inventory, are related by a chart 
to the "causes," which follow the outline of the 
Case Study Record. Thus, one has only to read 
a chart to locate the exact possible causes of 
any symptom, To give only a small sample, the 
chart shows that “laziness” may be caused by 
fatigue, malnutrition, poor posture, unsuitable 
curriculum, unwholesome motivation, over- 
indulgent parents, or bad moral surroundings, 
among the twenty-six listed “causes.” 

Once one determines the “causes,” treatment 
is almost automatic. Turning to page 19 of the 
handbook, one finds that for "over-indulgent 
parents,” treatments 35, 60, 67, and 68 are in- 
dicated. These are found, in the section on 
remedial procedures, to be: "create a desire in 
the pupil to take part in extra-curricular activi- 
ties,” “suggest how family relationships can be 
improved,” “urge the parents to give the child 
regular home duties,” and “help the parents to 
encourage independence.” 

With the perspective attained in the years 
since its publication in 1938, one may view 
Torgerson’s arrant nonsense with tolerant 
amusement. This system for handling the diag- 
nosis and treatment of human maladjustments 
by a series of mechanical maneuvers is the 
reductio ad absurdum of all routinized guid- 
ance procedures. Those who have real profes- 
sional training will not need a system. Those 
who lack psychological knowledge will help 
pupils more effectively by using simple human 
warmth and interest than by thumbing a hand- 
book of oversimplified recipes. 


[35] 
*Dunlap Academic Preference Blank. Grades 7-9; 
1937-39; IBM; Forms A, B; $1 per 25; 70 per set of 
scoring keys for one form; 35¢ per specimen set; non- 
amie (10-15) minutes; Jack W. Dunlap; World Book 
o. 


REFERENCES 
1. DUNLAP, Jack W. “Preferences as Indicators of Specific 
Academic Achievement.” Abstract. Psychol B 31:706- "34. 
* (PA 9:1396, title only) Pascoe ICD 


2. Duntap, Jack W. “Preferences as Indicators of Specific 
Academic oer Mos J Ed Psychol 26:411-5 S 'as. * (PA 
10:596) 


né Ae ae i Various School Subjects J Appl 
Psychol 19:53-8 F '35. * (PA 9:4800) 

4. DUNLAP, Jack W. “Relationships Between Constancy of 
Expressed Pre! erences and Certain Other Factors." J Ed Psy- 
cup IUE PEPpUMEMM 
sponses to the Items of the Dunlap Academic Preference 

lank Over an Interval of One Year. Unpublished master's 
thesis, Fordham University, 1938. (Dissertations... , 1939, PP. 
[T Vincent J., AND Dunar, Jack W. “Study of 
the Reliability and Validity of the Academic Preference 
Blank.” J Ed Psychol 31:103-10 F ’40, * (PA 14:3785) 

Lee J. Cronbach, Assistant Professor of Ed- 
ucation, The University of Chicago, Chicago, 
Illinois. This blank has been constructed with 
unusual care and technical skill. This is its fifth 
revision; in each previous revision each item 
has been validated empirically. 

The purpose of the blank is strikingly differ- 
ent from that of other interest tests. It does 
not attempt to describe the pupil’s interests but 
rather predicts from them his academic achieve- 
ment. Each paper may be scored on ten keys: 
paragraph meaning, word meaning, history, 
language usage, geography, literature, arithme- 
tic, general achievement, mental age, and IQ. 
These scores predict the pupil's future achieve- 
ment on the corresponding subtests of the 
Metropolitan Achievement Tests and the New 
Stanford Achievement Test and his score on 
the Terman Group Test of Mental Ability. 
That this is an effective predictor is evidenced 
by validity coefficients ranging well above .50. 
Mental ages estimated from interests correlate 
49 to .70 with tested mental ages. 

The uses of the test have not yet been ade- 
quately investigated. One can visualize situa- 
tions where it might be profitable to estimate 
intelligence while seeming to test interest. 
Whether this test is useful as a predictor of 
achievement depends on whether it predicts 
better than an intelligence test or enhances a 
multiple correlation based on mental age, Un- 
fortunately it predicts less well than the group 
mental test and enhances multiple 7’s by such 
negligible amounts as .02. Whether predicting 
scores on traditional achievement tests is im- 
portant should also be examined further. These 
tests rarely measure all significant outcomes, 
and good performance on them may not be the 
major goal of many teachers, 

The test is probably not usable for describing 
individual interests. Only 9o items are pre- 
sented, and these do not sample interests 
broadly. In part, scores represent achievement, 
since the response "unknown" is usually penal- 
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ized. The test may obtain spurious attitudes, 
since pupils may feel that they should like 
learning about Valley Forge or hyphens. Re- 
sponse sets may influence scores. The pupil 
who indiscriminately responds "like" to all 
these rather schoolish items will tend to get a 
better score than one who reports liking only 
for his strong interests. In this respect, the 
K uder Preference Record uses a superior tech- 
nique. Detchen has recently shown that the 
Kuder blank may be so adapted that it also 
predicts achievement, 


W. C. Kvaraceus, Associate Professor of 
Education, Boston University, Boston, Massa- 
chusetts, This instrument is composed of 90 
LIDU (Like, Indifferent, Dislike, Unknown) 
response items—ro were lopped off after the 
fourth revision as a concession to machine scor- 
ing. Predictive interest scores in terms of per- 
centiles are available for each grade in the 
eight subject-matter areas patterned after the 
Stanford and the Metropolitan achievement 
tests. In addition, mental age and "intelligence 
alertness" may be estimated from the preference 
scores but not with any high degree of accuracy. 

The author suggests the following general 
uses to which the instrument may be put: guid- 
ance, classification, and individual measure- 
ment. The author states, “the Blank gives valid 
estimates of the relative scholastic aptitude of 
students. Thus, measures of achievement in 
specific school subjects, general achievement, 
and intelligence or scholastic aptitude can be 
secured from a single test.” This is a remark- 
able feat for a go-item, 10-minute question- 
naire. The potential test buyer should note that 
the author only claims that estimates of rela- 
tive scholastic aptitude are yielded. He might 
have added that these estimates are rough and 
lack precision, as indicated by the validity 
coefficients which range from .28 to .73, using 
as criteria test results on the Metropolitan 
Achievement Tests and the New Stanford 
Achievement Test, and the Terman Group Test 
of Mental Ability. 

An excellent description is provided of the 
item analysis starting with 435 items from 
which two nonequivalent blanks of 100 items 
each were derived in the fourth edition. The 
reported validity and reliability data appear to 
have been gathered on the fourth edition before 
the machine-scored edition of 90 items was 
issued, What happened to the validity and re- 


liability of the instrument after 10 per cent of 
the questionnaire was discarded is not indicated. 

For the guidance worker in the junior high 
school there is only limited value in the author's 
division of the questionnaire into the selected 
subject-matter elements since the choices to be 
made and the problems to be met do not corre- 
spond closely with his breakdown of school 
experiences. The results of achievement tests 
themselves, school marks in the different sub- 
ject-matter areas, and intelligence test scores, 
which cannot be obtained, to be sure, in ten 
or fifteen minutes, individually and in com- 
posite, will yield the more precise data and 
information upon which classification, guid- 
ance, and individual measurement should be 
based. The naive guidance worker who has 
little training but does have ten or fifteen min- 
utes available for "testing" will find in.this in- 
strument a dangerous booby trap if he attempts 
to lean upon it in the learning, teaching, or 
guidance processes. 


Edith I. M. Thomson, Lecturer in Educa- 
lion, University of Edinburgh, Edinburgh, 
Scotland. This blank examines "interests" at 
the junior high school stage by obtaining judg- 
ments of liking, indifference, dislike, or igno- 
rance on go words, chosen as representing 
many fields of knowledge. The pupil spends 
about fifteen minutes in recording his re- 
sponses, after which it is claimed the counselor 
can discover in which subjects he will do best, 
the principal can classify him, and “valid esti- 
mates of the relative scholastic aptitude of stu- 
dents" may be made. “Thus measures of 
achievement in specific school subjects, general 
achievement, and intelligence or scholastic apti- 
tude can be secured from a single test.” These 
are considerable claims for a 90-word test last- 
ing 15 minutes. 

The author certainly presents evidence of 
immense diligence in test preparation. The 
stimulus words were chosen from courses of 
study in New York and New Jersey, and for 
scoring purposes they are classified under the 
headings used in the New Stanford and Metro- 
politan Achievement Tests. The subjects in the 
standardising groups took those two tests, also 
the Terman Group Test of Mental Ability, and 
an elaborate item analysis was undertaken by 
means of biserial correlations between academic 
success and judgments on each item in the 
blank. Thus, the final choice of items was made, 


36 THE THIRD MENTAL MEASUREMENTS YEARBOOK [72 


and a simple weighting system of scores was 
devised. Data on validity and reliability are 
given; it may be observed that median relia- 
bility coefficients between alternative versions 
are comparatively low, ranging from .83 to .70, 
though the self-correlations of the blanks, es- 
pecially Form B are higher. Validity measured 
by correlation with subtests of the Stanford 
and Metropolitan tests is mostly in the neigh- 
bourhood of .5 and .6, with a tendency to drop 
in the higher grades. This decrease of correla- 
tion with age is also noticeable in the reliability 
scores. Thus, the “valid estimates” must be 
accepted with some caution. 

Scoring is rapid and simple by means of per- 
forated key sheets or, if special pencils are 
used, by a mechanical scoring device. 

It is the actual content of the test and its 
scoring which leaves the critic sceptical. For 
example, let us look up certain items using the 
“General Achievement” key. “Liking” for cer- 
tain arithmetical terms is not a favourable sign; 
to score points you must dislike “Multiplicands,” 
“Drill in Addition,” “Product in Multiplying” ; 
to be “indifferent” gives you no mark; to “like” 
“Checking Problems” causes you to have a 
mark deducted; if you "like" London you get 
no mark, you ought to be “indifferent”; if you 
"dislike" “Expletives,” you have a mark de- 
ducted—you should not know the meaning of 
the word. It will be said that the scoring has 
been experimentally determined; but after do- 
ing the test, one is left with the feeling that 
too much depends on how one happens to in- 
terpret "like" and that, while the score no 
doubt throws some light on “interests,” more 
moderate claims should be made regarding its 
prognostic value. 


Teach Col J 13:24 S "41. E. L. Abell. * Cor- 
relations were computed with intelligence scores 
and with subtests of the Metropolitan Achieve- 
ment Test and the Stanford Achievement Test 
using the biserial-r technique. Correlations 
range from .52 to .72 and are thus high enough 
to suggest a strong relationship but not high 
enough to give accurate individual predictions. 
The blank should prove very useful in furnish- 
ing clues as to the subjects in which the pupil 
is likely to do best. 
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XEuphorimeter. Adults; 1940; $2 per manual (Chart 
for Happiness) ; Hornell Hart; Macmillan Co. 


REFERENCES EE. 
. Hart, HomwELL. Chart for Happiness. New York: Mac- 
ctm can 1940. Pp. xi, 198. $2.00. ? S 15:928) ad he 
2. Hart, HORNELL, AND nm n MM E Se 
i i iti . 59-63. Bo: .: D.C. 
Ur DI Editon DP; $974 1526, title only) 


For related reviews, see 37. 


[37] 
*[Re Euphorimeter.] Hart, Horner. Chart for 
Happiness. New York: Macmillan Co., 1940. Pp. xi, 
198. $2.00. * (PA 15:925) 


Am J Psychol 56:158 Ja '43. J. W. Mac- 
millan. * The fine disregard of the most ele- 
mentary statistical precautions, the broad a 
priori statements, the positive examples cited 
as proof, the blatant appeal to the untrained 
layman and the obviously uncritical attitude of 
the author combine to make this a book which 
might better have been left unwritten, although 
considerable amusement may be derived from 
it by those familiar with the techniques used 
and the problems involved in the measurement 
of personality. 

Am J Sociol 47:652-3 Ja’42. J. L. Moreno. * 
a book which should be of interest to sociolo- 
gists, ministers, educators, and psychiatrists. * 
The book has a religious tinge. Professor Hart 
not only measures happiness but also tells the 
reader how to become happy. He measures 
happiness, diagnoses it, and builds it. The book 
contains tables for scoring “at-the-moment 
euphorimeters,” a bibliography and an index. 
It seems to me that Professor Hart has pub- 
lished his findings prematurely. He might well 
have approached the complex problem of meas- 
uring a state of mind by means of other meth- 
ods as well as that which he has chosen: by a 
more thorough analysis of the subjective aspect, 
for instance, by more intensive case studies, 
and by a larger number of them. This proce- 
dure might have avoided the suspicion of sur- 
face research and surface therapy. 

J Abn & Social Psychol 36:449-50 Jl "4r. 
Howard Davis Spoerl. The concept of happiness 
is rare in psychology, at least as a formal cate- 
gory. Thus it is with some degree of interest 
that one turns to a semi-technical study of the 
subject, such as the work under consideration. 
What is offered in this case is a means of test- 
ing happiness, either as a transitory state or as 
a protracted condition. The device employed, 
which was conceived and developed by the | 
author, is called the “euphorimeter,” and the 
quantitative index of happiness is a sum of 
weightings known as “euphor-units.” The 
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euphorimeters consist of a list of 48 adjectives 
to be selected as describing the subject's af- 
fective consciousness, some rating scales, and 
questions of the sort usually contained in adjust- 
ment inventories. * In the preface the euphorim- 
eter is modestly classed with intelligence tests, 
the clinical thermometer, and the electrocardio- 
graph, as a revolutionary diagnostic instrument. 
One chapter is devoted to its possible users; no 
field, from medical and ministerial activity to 
personnel management, appears to have been 
overlooked. * The author's definition of the 
quality which he hopes he is measuring is a 
semibehavioristic statement of hedonism * In 
simpler, and "operational" terms, happiness is 
"the state in which people are when they say 
sincerely, ‘I am happy.’” No attempt is made 
to employ a standard psychological concept of 
pleasantness, and the definition seems, there- 
fore, to lose distinctiveness. Moreover, since 
euphoria ordinarily refers either to bodily or to 
psycho-pathological states, the designations 
“euphor-unit” and “euphorimeter,” apart from 
the linguistic shudders they induce, are not apt 
denotations, The general carelessness in char- 
acterizing delicate affective discriminations 
stigmatizes the crude superficiality of the book 
as a whole, It is divided into two parts entitled 
“The Pursuit of Happiness," and “How to Use 
the Euphorimeter Tests." The reader is invited, 
if he wishes "anything approaching scientific 
insight" into his adjustments and apparent satis- 
factions, to study the tests themselves before 
turning back to pursue happiness through play 
and work, love and home, vitality and success, 
sociability and religion. After determining 
whether or not he is happy, from a test based 
on his ability to state sincerely “I am happy,” 
our scientifically reassured reader may thus pro- 
ceed to absorb the array of pep-talks, barber- 
shop diagnoses and remedies, temperance ser- 
mons, and advice to the lovelorn of which 
Chapters 3 to to consist. Later he may make 
liberal use of the “euphorimeter advice form” 
presented in Chapter 13, to derive therefrom 
all manner of suggestions such as “learn to meet 
life courageously” and “get out of debt—by 
heroic measures if necessary.” Although he is 
long on advice couched in generalities of this 
kind, the author is short on concrete methods 
sufficiently profound to alter the motivation of 
the maladjusted. Indeed, the quest for happi- 
ness seems to consist in rearranging one’s sur- 
face outlook while causing a minimum of dis- 
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turbance to actual mechanisms of adjustment. * 
It is something of a shock, however, to encoun- 
ter the results of the studies of Terman and of 
Burgess and Cottrell stated baldly in terms of 
euphor-units, nor is it made clear how the trans- 
lation was achieved. The same dubious honor 
is paid to Shailer and to Goodwin Watson. The 
lay réader is almost certain to adopt the sugges- 
tion that these various authorities have already 
taken the euphorimeter to their collective bosom. 
* One further feature of the work should be 
mentioned—its potentiality for damage or at 
least its inevitable nuisance value. On the basis 
of the meager statistical account in the last 
chapter concerning the reliability of the tests, 
their widespread utilization is recommended, 
and the tests are commercially advertised. In 
the hands of untrained persons, or of zealous 
members of professions like the ministry, a 
considerable amount of harm might result from 
indiscriminate application and exaggerated pre- 
tensions, especially when one takes into account 
the present inordinate respect on the part of the 
digest-reading public for quasi-psychological 
mumbo-jumbo. The fact that the author fre- 
quently takes time out to caution the reader to 
seek only expert advice in sundry situations, 
seems more likely to silence by anticipation the 
protests of psychiatrists, than to inspire ade- 
quate caution in those who would voyage to 
salvation by this chart for happiness. Professor 
Hart has not succeeded in justifying one’s inter- 
est in the psychological nature and possibilities 
of the concept. 

Fed Probation 5:49 Ja-Mr '41. Dortha Wil- 
liams Osborn. * The chief limitation of the... 
[euphorimeter] appears to be its finite useful- 
ness to those with less than two years' high 
school training. The author states that “per- 
sons who have quit school in the grades, or who 
have found high school work difficult, are likely 
to misunderstand the directions or to fail to 
know the meaning of important words." * this- 
limitation is a serious one for social workers, 
particularly in the correctional field, as it will 
prevent its use as a formal test except in se- 
lected cases. However, the careful analysis of 
the fields of happiness and unhappiness, and the 
many suggestions for remedial work, cannot 
fail to be helpful to the correctional worker. 

Mental Hyg 26:144 Ja ^42. H. Neumann. * 
The book is careful to warn against self-treat- 
ment where expert consultation is needed; nor 
does it pretend to offer the results of new re- 
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Search. It serves a highly useful purpose in 
making available much that most readers would 
never come upon. For example, the score aver- 
ages for couples who disagree chronically about 
friends, play, philosophy of life, are much lower 
than for cases in which either partner feels sex- 
ually starved. So, too, while extremely inade- 
quate incomes are a handicap to happiness, 
more than enough scores indicate that amount 
of income is not the all-important factor some 
schools of philosophy would make it. * 

Sci Ed 25:354 N "41. C(larence) M. P(ruitt). 

Sociol & Social Res 25:579-80 Jl-Ag "41. 
E(mory) S. B(ogardus). It is in Part II that 
the author explains his euphorimeter tests and 
demonstrates that he is working upon a real 
human problem and is making headway. Part I 
is devoted largely to advice to people who for 
one reason or another are unhappy and are dis- 
turbed by their lack of happiness. A “long-run” 
test and an “at-the-moment” test of happiness 
are described, both to be used by the subject 
himself, who reports his own subjective reac- 
tions to life according to his feelings. The “long- 
run" test includes answers to a brief and selected 
list of questions, and the "'at-the-moment" test 
relies on underlining and crossing out words in 
a series of forty-eight adjectives. The scoring 
methods are simple enough to be used by the 
subject. The ways in which the methods of 
scoring were derived are not fully explained or 
justified. Next the author develops a diagnostic 
test, whereby the subject may be expected to 
discover why he is unhappy if either the long- 
run or at-the-moment test indicates that he is 
below normal in the happiness that he enjoys. 
The diagnostic test is somewhat elaborate. It 
seeks to locate the subject's areas of happiness 
or unhappiness in such phases of life as recrea- 
tion, work, love life, home, health, sense of suc- 
cess, mental harmony, economic status, friend- 
ships, loyalty to groups, religious adjustment, 
and altruism. The results of these subtests may 
be brought together, and the result is labeled the 
subject's "happiness spectrum." While many de- 
tails will need to be perfected and vital changes 
made, yet the cause of scientific measurement 
and of prediction is furthered by the unique 
ways in which Dr. Hart has taken previous 
techniques and developed them and produced his 
euphorimeter tests. 
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XEvery-Day Life: A Scale for the Measurement 
of Three Varieties of Self-Reliance. Grades 9-12; 


1941; IBM; $175 per 25; machine-scorable answer 
sheets also available, 2¢ each ;,$3.50 per set of machine- 
scoring stencils; 40¢ per specimen set; nontimed (30) 
minutes; Leland H. Stott; Sheridan Supply Co. 
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Prosperity in Relation to the 
arm Folks." Rural Sociol 10: 


Albert Ellis, Consulting Psychologist, 2505 
University Avenue, New York, New York. This 
questionnaire purports to measure three varie- 
ties of self-reliance: I, Independence in Per- 
sonal Matters; II, Resourcefulness in Group 
Situations; and III, Personal Responsibility. 
The individual items of the inventory are of the 
conventional type but are more specific than the 
usual Woodworth-Bernreuter-Thurstone kind 
of question, Five categories of answers are pro- 
vided for each question: YES (always), Yes 
(usually), ?, No (usually not), and NO 
(never). The advantages and disadvantages of 
the inventory are these: 

a) The construction of the questionnaire was 
apparently empirically approached and carefully 
done, with adequate consideration being given 
to the statistical selection of the individual test 
items, 

b) The split-half reliabilities of the three 
scales—ranging from .84 to .94—are satisfac- 
torily high. 

c) It is stated that “the validity of the scores 
as indicators of the ‘factors’ measured was in- 
sured by the methods of item selection em- 
ployed. Every item scored for a particular factor 
was selected, first, because it was found to be 
very significantly correlated with a criterion 
score based upon the items highly loaded with 
that factor in the original analysis, and second, 
because it bore a logical relationship with the 
factor." At best, this statement means only that 
the test was shown to be valid for the groups 
on which it was standardized. Since no other 
validation attempts are reported for the inven- 
tory, the validity of this questionnaire has still 
not been established, 
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d) The correlation coefficients of .60 between 
factors I and II and of .64 between factors I 
and III raise the question whether this inven- 
tory is really measuring three independent traits 
as is claimed by the author. 

In sum: Stott’s Every-Day Life appears to 
be a well-constructed questionnaire, but its prac- 
tical value still remains unproven. 
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*Examining for Aphasia and Related Disturb- 
ances. Ages 8 and over ; 1946; individual; $5 per man- 
ual and 50 record forms, postpaid; $4 per so record 
forms, postpaid; $2 per manual, postpaid; (30-120) 
minutes; Jon Eisenson; Psychological Corporation. 

C. R. Strother, Associate Professor of Clini- 
cal Psychology and Speech Pathology, The 
State University of Iowa, Iowa City, Iowa. 
Examining for Aphasia is designed primarily 
for the use of therapists who are concerned with 
planning and evaluating rehabilitation programs 
for aphasic patients. It does not purport to be 
a manual for comprehensive psychological ex- 
amination. 

The manual is divided into two sections. The 
first contains tests for receptive disturbances, 
and the second, tests to assess impairment of 
expression. Following Nielson, Eisenson in- 
cludes tests for visual, auditory, and tactile 
agnosia and for apraxia. These tests should be 
useful in assisting the clinician to evaluate dis- 
abilities which present special problems in re- 
habilitation. 

The tests for receptive disturbances include 
measures of auditory verbal comprehension and 
reading comprehension. The range of difficulty 
is limited. Since the items are multitple choice, 
they can be presented in such a manner as to 
avoid expressive difficulties. The tests for ex- 
pressive difficulties include automatic speech, 
spelling, writing from dictation, calculations, 
oral reading, and clock setting. The brevity of 
the tests precludes much of a range of difficulty 
in the items and also constitutes a serious limita- 
tion on their use as a measure of improvement. 
The spelling test, for example, includes only six 
words, The method of administration is flex- 
ible, and no norms are provided, it being the 
author's intention that the tests be used as flex- 
ible clinical instruments rather than as rigid 
measuring devices. 

The clinician should find the manual useful 
in obtaining an indication of the areas of diffi- 
culty presented by the patient and, perhaps, as a 
very gross measure of improvement. As Eisen- 


son indicates, more extensive tests will be found 
necessary. 


Arch Neurol & Psychiatry 56:244 Ag’46. * 
intended for the use of speech pathologists in 
planning a program of language rehabilitation 
for individual aphasic patients. The manual pre- 
sents a comprehensive language examination in 
a clear form and should be useful for recording 
the progress in language function. The instruc- 
tions for the various test items are specific and 
easily followed, and much of the test material 
is included in the manual, The author states that 
his interest is in speech therapy, and not in 
neurologic and neurosurgical implications of 
language dysfunctions ; consequently, it is not a 
criticism of his work to point out that the exam- 
ination is not entirely satisfactory for use by the 
clinical neurologist. wv 


[40] 
*Furbay-Schrammel Social Comprehension Test. 
Grades 9-16; 1941; $1.70 per 25, postpaid; 15¢ per 
specimen set, postpaid; 80(85) minutes; John H. Fur- 
bay and H. E. Schrammel; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. 
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*Goldstein-Scheerer Tests of Abstract and Con- 
crete Ds Adults ; 1945; individual; $25.50 per 
testing outfit; Kurt Goldstein and Martin Scheerer; 
Psychological Corporation. 

d) GOLDSTEIN-SCHEERER CUBE TEST. $3.25 per 50 record 
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forms, postpaid. 
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al Tests Used for the Evaluation of Brain Injury. Be 


Kate Levine Kogan, Clinical Psychologist, 154 
Chesterfield Road, Pittsburgh 13, Pennsylvania. 
These techniques, together with two other meth- 
ods, are described in a monograph entitled Ab- 
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tract and Concrete Behavior: An Experimental 
Study With Special Tests (9). A clear and thor- 
ough discussion of the concepts of abstract and 
concrete behavior is provided, and the test pro- 
cedures are depicted in great detail. Thus, the 
purpose of the examinations is outlined ex- 
plicitly, and a wealth of illustrative responses is 
analyzed and discussed. The material presents 
an aspect of behavior which has received little 
attention elsewhere in the literature; further- 
more, the concepts involved have been afforded 
such clear delineation that they are readily appli- 
cable to, and recognizable in, the behavior of 
patients in reaction to many other test items and 
situations which are more commonly included 
in clinical examinations, 

The three procedures for which complete test 
material can be purchased constitute only part 
of a battery which the authors recommend be 
used in its entirety, Since this recommendation 
must be ignored because of the unavailability 
of standard equipment, the methods will be 
considered separately for brief evaluation of 
their efficacy. 1 

The Cube Test is of especial interest since it 
is based on the block-design technique which 
occurs in several of the intelligence scales most 
widely used with both adults and children. 
Hence, the occurrence of a marked deviation of 
score on the block-design items as compared 
with the remainder of the scores on the scale 
would make an opportunity for more careful 
analysis of the patient's approach to this task 
both pertinent and desirable. The Cube Test, 
therefore, has been proved useful in conjunc- 
tion with other test procedures for obtaining 
further, more qualitative, elaboration. The ex- 
actitude required by the two-pamphlet record 
form, with a separate page for each test design, 
is possibly somewhat out of proportion to the 
broad, subjective evaluation of the data which 
must be made. 

One of the greatest values of the Color Form 
Sorting Test lies in its brevity and simplicity. 
Success or failure is clearly and objectively de- 
fined. Under most circumstances, less than five 
minutes will suffice for administration. Intellec- 
tual level plays a small part, since most normal 
children above the ages of seven or eight years 
can achieve a solution. Motor disabilities, which 
so often accompany cerebral pathology, inter- 
fere with effective response very little. Hence, 
the absence of prompt solution in an adult 
patient is likely to be significant, For this reason, 
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however, the technique is less likely to be dis- 
criminative in cases of mild or early brain 
pathology. 

The Stick Test has been, in the writer's expe- 
rience, the least effective of the three methods 
under discussion. Usually, only patients with 
fairly advanced and extensive cerebral pathol- 
ogy experience sufficient difficulty to make their 
productions significant. The first part of the 
test requires the patient to copy the structure. 
Other available techniques using paper and pen- 
cil offer less cumbersome administration and 
scoring and permit simpler permanent record of 
the performance; they also make use of more 
complex perceptual units and hence, are appli- 
cable to less severely impaired patients. The 
second portion of the test requires reproduction 
of designs from memory and has no discernible 
advantage over other more commonly used and 
better standardized techniques. 

The entire battery used in this experimental 
study has achieved particular prominence as a 
method of determining and diagnosing organic 
brain disorder. This view of its use is a mis- 
conception in terms of actual experience and, 
apparently, in terms of the authors’ intention. 
In the writer’s opinion the authors offer a sys- 
tematized description, analysis, and explanation 
of the test responses of a selected group of 
patients known to have cerebral pathology. 
Actually, many patients with various forms of 
cortical defect exhibit no “impairment of the 
abstract attitude” on these tests. For example, 
in an extensive research study of patients suffer- 
ing from the late effects of mild concussive head 
injury, without focal lesion, no distinction could 
be made between the responses of the patients 
and members of normal and neurotic control 
series, Furthermore, it has been pointed out that 
some schizophrenics exhibit impairment anala- 
gous to that described for organics (3). Rapa- 
port has made use of a modification of one of 
the techniques discussed in the study, Object 
Sorting, in his large-scale diagnostic study of 
functional disorders (26). It is regrettable, in 
terms of our interest here, that patients with 
organic disorders were not also included for 
comparative purposes. 

It should be noted that this “experimental 
study” presents only qualitative results. There is 
no statement of the number or type of patient 
in whom the technique elicited the crucial be- 
havior indicative of impairment. Not only are 
all data pertaining to standardization lacking, 


but any definition of the extent to, or frequency, 
with, which normal variability might introduce 
concrete elements into the responses of essen- 
tially healthy persons is omitted. There is no 
systematic method of scoring or evaluating test 
performance, and in this respect the methods 
fall slightly short even of the authors’ statement 
that the tests enable the examiner “to determine 
whether a patient can or cannot assume the ab- 
stract attitude.” For they offer incontrovertible 
evidence only in the very rare cases in which 
the subject experiences difficulty with all items 
presented. If one should obtain a record in 
which partial successes are present, no means 
has been offered whereby one may profit by the 
experience of the test authors in reaching a de- 
cision or interpretation; the material must be 
interpreted in the light of the more restricted 
past clinical experience of the particular exam- 
iner. Within these limits the techniques offer an 
opportunity for careful observation, clarifica- 
tion, and analysis of the patient’s psychological 
approach as it is exhibited in a variety of situa- 
tions. 

Thus, in summary, these tests offer a means - 
of evaluating aspects of behavior which are not 
treated adequately elsewhere; but they must be 
subjected to careful and well-planned standard- 
ization before they can make their fullest con- 
tribution to the studies of the clinician engaged 
in individual casework. 


C. R. Strother, Associate Professor of Clini- 
cal Psychology and Speech Pathology; and 
Ludwig Immergluck, Research Assistant in 
Psychiatry; The State University of Iowa, Towa 
City, Iowa. [Review of the monograph, Ab- 
stract and Concrete Behavior, which serves as 
a manual for the Goldstein-Scheerer Tests.) 
This 151-page monograph represents the re- 
sults of extensive investigation of organic brain 
diseases and is intended to present a series of 
tests designed to evaluate these disorders. 

The first part of the monograph consists of 
a theoretical discussion of the nature of think- 
ing disorders. The authors base most of their 
discussion upon their extensive clinical expe- 
rience. However, in spite of their impressive 
backgrounds as practicing clinicians, the reader , 
cannot help but be struck by the lack of demon- 
strable evidence for many of the statements 
made. The authors conceive of behavior changes 
in the brain diseased, schizophrenic, ament, and 
dement in terms of an impairment of "abstract 
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behavior.” These findings are based on a series 
of investigations originally undertaken by Gelb 
and Goldstein. In their present monograph, 
Goldstein and Scheerer claim that the normal 
individual may assume two distinct types of 
behavior, the concrete and the abstract. These 
two types of behavior, in turn, are dependent 
upon two corresponding basic attitudes. The 
authors recognize that other investigators have 
made similar distinctions and have come to 
comparable conclusions. However, in defining 
the concrete and abstract attitude as basic 
“capacity levels of the total personality” they 
hope to distinguish themselves from previous 
investigators. Unfortunately, the authors take 
the opportunity to raise, on the basis of their 
own concepts, theoretical and systematic argu- 
ments against those psychologists who believe 
that concrete and abstract behavior is learned 
‘or acquired. Against this notion, Goldstein and 
Scheerer offer their own and, incidentally, 
vaguely defined concept of "capacity level." 
"These reviewers, however, are at a loss to under- 
stand why the term "capacity level" necessarily 
precludes the concept of "learned" or "ac- 

“quired.” Furthermore, the authors imply a 
qualitative distinction between the abstract and 
concrete levels rather than, as many other in- 
vestigators hold, conceiving of them as lying on 
a continuum. 

Goldstein and Scheerer cite the following 
eight characteristics of the abstract attitude: 
(a) to detach our ego from the outer world or 
from inner experiences; (b) to assume a men- 
tal set; (c) to account for acts to oneself ; to 
verbalize the account; (d) to shift reflectively 
Írom one aspect of the situation to another; 
(e) to hold in mind simultaneously various 
aspects of a situation; (f) to grasp the essen- 
tials of a given whole; to break up a given whole 
into parts; to isolate and synthesize them; (g) 
to abstract common properties reflectively; to 
form hierarchic concepts; (A) to plan ahead 
ideationally ; to assume an attitude towards the 
“mere possible" ; and to think or perform sym- 
bolically. 

The concrete attitude, on the other hand, 
“does not imply conscious activity in the sense 
of reasoning, awareness or self-account of one's 
doing. We surrender to the experiences of an 
unreflective character: we are confined to the 
immediate apprehension of the given thing or 
situation in its particular uniqueness.” Further- 
more, the authors claim that concrete behavior 


has none of the above-mentioned eight charac- 
teristics of the abstract attitude. Most investiga- 
tors, however, would agree that even the most 
deteriorated organic or schizophrenic patient 
exhibits behavior which could not be character- 
ized adequately in terms of the authors’ strict 
definition of concreteness. 

Following the introduction and discussion of 
basic concepts, the remainder of the monograph 
is devoted to a presentation and discussion of 
five tests designed to determine the extent of 
impairment of abstract thinking. The tests of- 
fered are: The Goldstein-Scheerer Cube Tesi— 
This is a modification of the Kohs Block Design 
Test. The modification consists essentially in 
presenting the subject who has failed under the 
usual procedure with card designs of the same 
size as the blocks, with designs with super- 
imposed lines helping the subject perceptually 
to ‘break down’ the design, and, finally, with an 
actual block model of the design to be repro- 
duced. The Gelb-Goldstein Color Sorting Test 
—The subject is presented with woolen skeins 
of different hues and shades and asked to sort 
them into groups. The Gelb-Goldstein-W eigl- 
Scheerer Object Sorting Test—This test con- 
sists of a set of objects which can be grouped 
according to material (e.g., metal, wood, porce- 
lain, etc.), color, or form. The subject is asked 
to sort them into groups. Various groupings 
are possible and ability to shift is noted. The 
Goldstein-Scheerer Stick Test—By means of a 
series of sticks (3.5 inches and 5.5 inches long); 
the subject is required to reproduce from mem- 
ory a series of figure designs. 

Each test is carefully described and preceded 
by a discussion of its psychological rationale. 
The instructions to be used are presented ver- 
batim, aud each step is outlined in detail. How- 
ever, no norms or standardization data are 
given, and these reviewers looked in vain for 
a description of the population on which these 
tests were developed. Instead of presenting 
norms, the authors content themselves with 
citing examples of typical failures of “patients.” 
No evidence is presented which allows the reader 
to infer that there is any consistent relationship 
between test performance and degree of the 
patient’s clinical impairment. Certainly, such 
information is of utmost importance for deter- 
mining the usefulness of the test. 

The seasoned practitioner may find these 
tests extremely useful as a qualitative tool and 
as an adjunct to his clinical judgment. The 
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novice and student, on the other hand, are, in 
the absence of statistical data, deprived of the 
opportunity to evaluate the tests on their own 
merits. The authors’ advice that “qualitative has 
to precede statistical analysis” is very well taken. 
Unfortunately, they have taken only the first 
of these steps. 


O. L. Zangwill, Assistant Director, Institute 
of Experimental Psychology, University of 
Oxford, Oxford, England. These tests are well- 
adapted to elicit defects of abstract or “categori- 
cal" thinking of the type commonly, although 
by no means invariably, associated with injury 
or disease of the brain cortex. In using them 
with neuropsychiatric cases, however, it is im- 
portant to bear the following points in mind : 

4) The tests are essentially qualitative and 
standards of normal performance are somewhat 
problematical. As in the closely parallel case of 
the Rorschach test, it is most essential to gain 
experience in administering the tests before the 
results can be evaluated with any confidence. 

b) The location of the cerebral lesion has 
some influence on test performance. Thus, gross 
defects on the Cube and Stick Tests with little 
or no impairment on the Color Form Sorting 
Test are quite common in patients with circum- 
scribed lesions of the posterior parts of the 
brain cortex. In cases of this kind, the impair- 
ment is almost certainly due to specific agnostic 
or apraxic defects rather than to loss of “ab- 
stract attitude" in Goldstein's sense of the term. 
In patients with lesions of the frontal lobes, on 
the other hand, impairment in the categorical 
sphere is very common and may be predomi- 
nantly displayed on the Sorting Test. 

c) Some patients with ascertained cerebral 
lesions perform in an essentially normal fashion 
on these tests. This is by no means unusual in 
cases of traumatic brain injury and may be due 
to high previous intelligence level, to restitution 
of cortical function, or to a combination of these 
factors. In view of the possible influence of 
previous intelligence level-on test performance, 
it. would be most useful to have a suitably stand- 
ardized series of abstraction tests of graded 
difficulty. 

d) Certain cases of primary psychosis, in 
particular schizophrenia, show an impairment 
of conceptual understanding and thought some- 
what similar to that of the organic.case. Tt is 
also possible that psychogenic disturbances in 
patients of low intelligence may lead to patterns 


of test performance easily confused with the 
reactions of organic patients. " 

€) The significance of abstraction defects 
from the point of view of occupational disability 
and problems of resettlement of the organic case 
is most difficult to assess. It is not uncommon 
to find patients with marked conceptual defects 
achieving adequate occupational readjustments. 
On the other hand, patients who perform well 
on the tests may display a marked impairment 
of practical judgment in daily life. Research on 
the significance of conceptual defects in rela- 
tion to resettlement is needed before any but the 
most guarded opinion can be based on the test 
findings. 


For related reviews, see 42. 


[42] 

*[Re Goldstein-Scheerer Tests of Abstract and 
Concrete Thinking.] GOLDSTEIN, Kurt, AND ScHEER- 
ER, Martin, Abstract and Concrete Behavior: An 
Experimental Study With Special Tests. Psycho- 
logical Monographs, Vol. 53, No. 2, Whole No. 239. 
Washington, D. C.: American Psychological Associa- 
tion, Inc., 1941. Pp. vi, 151. Paper. $225. * (PA 16: 
1004) 

Loyola Ed Digest 17:7 Je '42. Austin G. 
Schmidt. * Kurt Goldstein and his collaborators 
began as early as the first World War to de- 
velop certain sorting tests by means of which 
a subject's tendency to think in concrete terms 
when he might think in abstract terms could be 
measured. The present study, a most interesting 
one, is an analysis of the responses of abnormal 
subjects to the Goldstein-Scheerer Cube Test, 
the Gelb-Goldstein Color Sorting Test, the 
Gelb-Goldstein-Weigl-Scheerer Object Sort- 
ing Test, the Weigl-Goldstein-Scheerer Color 
Form Sorting Test, and the Goldstein-Scheerer 
Stick Test. The study is descriptive and qualita- 
tive rather than quantitative. It throws much 
light on how abnormal subjects think and on 
why they think as they do. The description 
(pp. 4-7) of the eight conscious and volitional 
modes of behavior of which the abstract attitude 
is the basis will be of interest to the teacher and 
the test-maker, 

" [43] a 
*Guilford-Martin Inventory of Factors GAMIN, 
Abridged Edition. College and adults; 1943-45; su- 
persedes original edition; IBM; 1 form; $2.50 per 25; 
machine-scorable answer sheets also available, 2¢ 
each; $1.25 per set of plastic hand-scoring stencils 
for test booklet; $3.50 per set of machine-scoring sten- 
cils for answer sheet; $2.50 per set of hand-scoring 
stencils for answer sheet; 40¢ per specimen set; non- 
timed (30) minutes; J. P. Guilford and H. G. Martin; 
Sheridan Supply Co. 
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H. J: Eysenck, Senior Psychologist, Mauds- 
ley Hospital, London, England. This test con- 
sists of 186 questions of the Yes-?-No type 
and purports to measure five independent vari- 
ables of temperament, viz., general pressure for 
overt activity, ascendancy in social situations, 
masculinity of attitudes and interests, lack of 
inferiority feelings, lack of nervous tenseness 
and irritability. Isolation of these variables was 
achieved by factorial analysis and subsequent 
item analysis. The original scale contained 270 
items and used a more complicated scoring sys- 
tem; the abridged scale is no less reliable than 
the old. Norms are based on 80 male and 8o 
female university students. 

The inventory is open to criticism on the fol- 
lowing grounds: (a) There is no evidence of 
validity for any of the scales, excepting the 
masculine interests scale, which correlates .84 
(phi coefficient) with sex of the subject filling 
in the inventory. The measurement is based on 
the assumption that the person who is suffering 
from nervous tenseness and irritability, for in- 
stance, will endorse the items grouped together 
in the factor analysis; there is no proof that 
there is any such correspondence. Clinical vali- 
dation would appear to be a sine qua non for a 
scale of this type. (b) The factors measured by 
the inventory are not by any means independ- 
ent; correlations between them range as high 
as .70. Thus, the factors themselves define a 
second-order factor which, judging from the 
figures given by Guilford, is responsible for a 
greater portion of the total variance than any 
of the factors given by the inventory. Yet no 
score or method of measurement for this “gen- 
eral factor” (presumably one of “general neu- 


roticism") is provided, although this is the one: 


factor for which there is good clinical evidence. 
(c) The suggestion is made in the manual that 


the test might play a part in the selection of% 


industrial personnel. It does not appear likely, 
in the absence of any evidence, that an untried 
test would prove of great value in an industrial 
situation ; here again what is lacking is evidence 
of validity. 

As opposed to these criticisms, it should be 
said that in other respects the inventory has 
certain advantages over other well-known tests: 
(a) The statistical work is of a very high stand- 
ard, and in particular the factorial approach to 
the problem of inventory construction, which is 
largely due to Guilford’s original impetus, 
promises to result in much more analytical tests 
than we have known so far. (b) The inventory 
is not encumbered, as are the Humm-Wads- 
worth Temperament Scale and the Minnesota 
Multiphasic Personality Inventory by reliance 
on obsolescent psychiatric classifications not 
even widely accepted in psychiatric circles. 

As a total summing up, it may be said that 
the test presents a distinct advance in technique 
of construction but lacks validation data; it is 
therefore a test for the experimenter rather 
than for the practician. 


For areview by R. A. Brotemarkle, see 45. 
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XGuilford-Martin Personnel Inventory I. Adults; 
1943; IBM; 1 form; $2.50 per 25; machine-scorable 
answer sheets also available, 2¢ each; $1.25 per set of 
plastic hand-scoring keys for test booklet; $3.50 per 
set of machine-scoring stencils for answer shect ; $2.50 
per set of hand-scoring stencils for answer sheet; 40% 
per specimen set; nontimed (30) minutes; J. P. Guil- 
ford and H. G. Martin; Sheridan Supply Co. 
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Benjamin Shimberg, Research Assistant, 
Division of Educational Reference, Purdue Uni- 
versity, Lafayette, Indiana. This is a carefully 
prepared questionnaire which aims at detecting 
the potential “troublemaker” in business or in- 
dustry. The traits measured are called O (objec- 
tivity), Ag (agreeableness), and Co (coopera- 
tion). The test actually seeks to identify persons 
low in these traits—those who show high tend- 
encies toward personal reference, belligerence, 
and overcriticalness. 

The inventory consists of 150 questions which 
may be answered by circling “Yes,” “?,” or 
"No" in the test booklet or by marking the 
appropriate space on the machine-scored answer 
sheet. The questions are applicable to most in- 
dustrial and business situations and deal with 
experiences fairly common to workers. Since 
the questions are not of a highly personal nature, 
workers should not object to answering them. 

The purpose of the test is thinly disguised 
and the validity of the responses depends on the 
willingness of the person taking the test to co- 
operate and reveal his true attitudes, In the 
initial study, which involved 500 California in- 
dustrial workers of both sexes, and in subse- 
quent validation studies, the test was given to 
persons already employed. No information is 
presented to show that applicants for jobs can 
be expected to cooperate to the same extent. 
If we could be sure that an undesirable job- 
applicant would answer the questions candidly, 
any strong tendency toward "troublemaking" 
would probably show up. Whether such persons 
are clever enough to "beat the test" is a prob- 
lem which has not been investigated. It is pos- 
sible that the use of the forced-choice type of 
item might be effective in reducing the tendency 
of job applicants to give the response which 
they think the examiner wants. 

In constructing the test, items were selected 
by the criterion of internal consistency on a 
sample of 400 workers. Those items were re- 
tained which differentiated significantly be- 
tween the upper and lower quartiles in terms 
of score for each trait on this test. In an effort 
to keep the intercorrelations among the traits 
as low as possible, only four of the items were 
weighted for more than one trait. Nevertheless, 
the scores for the three traits yielded fairly high 
intercorrelations: Co and O, .55; Co and Ag, 
.63; and O and Ag, .64. The weights were 
initially developed on the basis of subjective 
clinical information and later refined by statisti- 


cal techniques devised by Guilford. There have 
been several studies which indicate that test 
scores for the three traits agree fairly well with 
employee ratings on these traits. Details of these 
studies may be obtained from the publisher. 

Published validation reports on this test give 
some indication of promise, although they do 
not present any convincing evidence that the 
test has validity when used for employment pur- 
poses. The groups studied, thus far, have been 
employed persons whom management had rated 
as being either well-adjusted or maladjusted 
workers. In two of the studies, results showed 
that the test identifies approximately 80 per 
cent of those rated as "troublemakers," while 
at the same time erroneously classifying about 
35 per cent of the satisfactory workers by plac- 
ing them in the undesirable group. The authors 
recognize the fallibility of using the opinions 
of management as a sole criterion. It would be 
helpful if more information were given as to 
the number of judges used, how the ratings 
were made, and what standards were set to 
classify “troublemakers.” 

The norms which are published in the manual 
are based on the 500 cases used in the original 
study. Since this sample was not seeking em- 
ployment or promotion, it is very likely that 
these norms would not hold for a group tested 
under other circumstances, The authors are 
themselves well aware of the methodological 
weaknesses of their instrument, and they advise 
prospective users to make validity studies to 
determine the usefulness of the test in their 
own employment situation. The publishers of 
the test offer technical assistance to those who 
wish to plan validity studies or to establish 
their own norms. 


For a review by R. A. Brotemarkle, see 45. 


[45] 
xGuilford-Martin Temperament Profile Chart. 
College and adults; 1934-45; a profile chart for the 
following three inventories : Guilford-Martin Inven- 
tory of Factors GAMIN, Abridged Edition (see 43), 
Guilford-Martin Personnel Inventory I (see 44), and 
Inventory of Factors STDCR (see 55) ; 2¢ per chart; 
J. P. Guilford and H. G. Martin; Sheridan Supply Co. 


REFERENCES 
See references for 43, 44, and 55. 


R. A. Brotemarkle, Professor of Psychology 
and Director of The Psychological Laboratory 
and Clinic, The University of Pennsylvania, 
Philadelphia, Pennsylvania. Guilford and Mar- 
tin have developed three inventories for the 
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Measurement of thirteen variables of person- 
ality. Each inventory has been used and recom- 
mended for use in studies as a separate test and 
as one of a group for the construction of a 
temperament profile. We shall present the fac- 
tors under each inventory as they are recorded 
on the profile chart and discuss the construction 
of the chart later, 

Guilford’s Inventory of Factors STRDC was 
developed on the basis of a factor analysis of 
items in personality questionnaires. Taken to- 
gether, the factors probably cover the so-called 
introversion-extraversion syndrome. The fac- 
tors represent dimensions of Personality (with 
opposite poles) as follows: S, social introver- 
sion—shyness and withdrawal tendencies; T, 
thinking introversion—meditative, philosophiz- 
ing, analyzing self and others; R, rhathymia— 
carefree disposition, liveliness, and impulsive- 
ness; D, depression—feeling of unworthiness 
and guilt; C, cycloid—strong emotional reac- 
tions, fluctuation in mood, tending to instabil- 
ity. The 175 items were retained on the basis of 
internal consistency of responses of some 400 
college students, predominantly of sophomore 
level. The Spearman-Brown estimated reliabil- 
ity (200 cases, roo men and roo women of 
original criterion group) are S .92, T .8o, R .89, 
D ot, and C -91. These have been verified by 
other groups and statistical methods. Factoriàl 
studies revealed positive correlations among S, 
T, and D in the neighborhood of -50 and a small 
negative correlation between D and R. Centile 
norms are given, based on 388 University of 
Nebraska students, 

The Guilford-Martin Inventory of Factors 
GAMIN measures five traits with operational 
definitions as follows: G—great pressure for 
overt activity ; A—ascendancy in social situa- 
tions, leadership ; M—masculinity in attitudes 
and interests; I—lack of inferiority feelings, 
self-confidence; N—lack of nervousness and 
irritability. The 270 items were developed from 
300 questionnaire items given to 500 (250 men 
and 250 women) Southern California college 
and university students, sophomores to seniors, 
ages I9 to 30. Items revealing heavy loadings, 
under item and factorial analyses, are used 
under each of the traits, Scoring keys were 
developed by the Guilford abac method for the 
highest and lowest quarters of original cases for 
G, A, I, and N. Item analysis was employed for 
M, using the 100 highest male scores and 100 
lowest female scores, Reliability was tested on 


the remainder of the group with random halves 
of items as follows, when corrected: G, .89; 
A, 88; M, .85; I, .91; N, .89. Intercorrelations 
are generally low enough to assume useful dif- 
ferences between trait scores for the same indi- 
vidual, with some question in the case of the 
land N relationship. Scoring is facilitated by 
a five-column number giving the weight for 
each trait at a single glance on the key. 

Due to the excessive length and difficulty of 
scoring the original Guilford-Martin I nventory 
of Factors GAMIN, it was abridged in 1945 
with the reduction from 270 items to 186. 
Changes in reliability are reported as "merely 
of a random sample order." The scoring of the 
short form is greatly simplified, and norms 
based upon 160 University students, 80 men 
and 80 women, have been presented, and the 
C-scores changed for use on the new forms of 
the Temperament Profile Chart. 

The Guilford—Martin Personnel Inventory I 
measures three traits of the syndrome more 
commonly called “paranoid” for the purpose of 
searching out discontented individuals who are 
likely to become “troublemakers.” The traits, 
using the more favorable end of the two-pole 
concept, are O—objectivity as opposed to sub- 
jectivity ; Ag—agreeableness as opposed to bel- 
ligerence, dominance, and overreadiness to fight 
over trifles; Co—cooperativeness as opposed 
to faultfinding or overcriticalness of people and 
things. From 200 items used with 500 employees 
(250 men and 250 women) representing a 
diverse employment population, ages 20 to 45, 
150 items were retained as significantly diag- 
nostic as evidenced by correlation with the high- 
est and lowest quarters of the sample. Scoring 
weights (Guilford method) were used with 100 
(50 men and 50 women) to obtain split-half 
reliability as follows: O, .83; Ag, .80; Co, .91. 
Intercorrelations ranging from .55 to .64 were 
accepted as low enough to warrant use of sepa- 
rate scores. A validity study on 51 employees, 
using medians as criterion points, identified 
73 per cent of those judged by employers as 
unsatisfactory and misplaced only 34 per cent 
in the satisfactory group. 

Raw scores, for each of the 13 traits, ob- 
tained by keys are readily translated to C-scores 
(scaled scores) and are recorded on the Guil- 
ford-Martin Temperament Profile Chart on the 
following rr-point scale: o represents lowest 
1% of 500 cases used for standardization; 1, 
next 3%; 2, next 7%; 3, next 12%; 4, next 
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1776; 5, middle 20%; 6, next 17%; 7, next 
12% ; 8, next 7%; 9, next 3%; and ro, the 
highest 1%. The reverse of the chart contains 
brief statements of interpretation of high, low, 
and middle-range scores, usually in terms of 
desirability for mental health, emotional adjust- 
ment, and work or social situations. 

Most of the studies reported to date have 
presented the original report of construction of 
tests, scoring techniques, determination of re- 
liability, validity, and interpretations. 

In addition to the presentation articles, the 
authors and publishers have made available 
mimeographed information showing results 
with particular groups : troublemakers, purchas- 
ing and material-control employees, sales and 
Service employees, engineers, and executive and 
administrative employees. In all of these, well- 
guarded statements of interpretation have been 
made. Particular emphasis has been given to the 
uniqueness of the separate traits and to the vari- 
able manner of clustering which necessitates 


` continued study for specific groups before ap- 


plication of results. 

One very significant study by Constance 
Lovell* shows cause for caution in interpreta- 
tion and needed revision in specific traits C and 
D. A factor analysis (Thurstone and Guilford 
methods) based on results of 200 undergradu- 
ates in an elementary psychology class with so- 
called “normal” investigation set reveals four 
significant superfactors or trait clusters: I, 
drive-restraint (high loading on general drive, 
carefreeness, sociability, and social ascend- 
ance) ; IT, realism (high loading on objectivity, 
masculinity, freedom from nervousness, and 
freedom from inferiority feelings) ; III, emo- 
tionality (high loading on stability of emotional 
reactions, freedom from depression, and extra- 
vertive orientation of thinking process); IV, 
social adaptability (high loadings on lack of 
quarrelsomeness and tolerance). Equations for 
predicting the above superfactors from the 13 
C-scores were developed by the Doolittle 
method. 

While this work is in line with recent research 
developments, especially the work of Raymond 
B. Cattell, it is unfortunate that Lovell should 
aid and abet any possible interpretation that the 
total structure of personality, is revealed in the 
thirteen measures or in their clustering in four 


* Lovell, Constance. “A Study of the Factor Structure of 
Thirteen Personality Variables.” Ed & Psychol Meas 5:335-50 
w'4s. 


superfactors by such a statement as "The re- 
sults seem to picture the structure of personality 
as consisting of four general areas." This is true 
even with her protective statement that the study 
is based only on “the subjects used." It is self- 
evident that the complex structure of personal- 
ity is not to be measured by any such brief series 
of trait measures as is found in the Guilford- 
Martin series or any other series at present. 


For reviews by H. J. Eysenck and Benjamin 
Shimberg, see 43, 44, and. 55. 


[ 46 ] 
High School Attitude Scale. Grades 7-16; 1935; 
Forms A, B; mimeographed; 14 per test; 10¢ per 
sheet of directions and key; 156 per specimen set; 
F. H. Gillespie; directed and edited by H. H. Rem- 
mers; Division of Educational Reference, Purdue 


University. 


Lee J. Cronbach, Assistant Professor of Edu- 
cation, The University of Chicago, Chicago, 
Illinois, This scale is one of the many Thurstone- 
type devices prepared by Remmers and his asso- 
ciates. It was constructed by the usual procedure 
and has the advantages and disadvantages to be 
expected in scales of this type. The scale requires 
little time and has adequate reliability for screen- 
ing purposes. Parallel-form correlations are .753 
and .727. Validity, as in all self-report devices, 
is open to question, but there is no doubt that a 
pupil reporting an unfavorable attitude toward 
school should be singled out for study. The scale 
may be said to measure attitude toward the 
value and pleasantness of high school. State- 
ments are general and do not permit diagnosing 
specific causes of low morale. 

The apparent weaknesses of the scale prob- 
ably do not detract greatly from the accuracy 
of scores. Some statements are so extreme that 
it is doubtful if anyone could rationally agree 
with them: “A high school education is worth 
a million dollars to any young person,” or “Any 
old fogy knows more than a high school gradu- 
ate.” The scale does not discriminate finely; in 
the preliminary trials of the test, only 4 pupils 
out of 168 fell in the half of the scale from 5.1 
to 11.0. Statements do not always measure what 
the scale tries to test. Many statements are 
checked by those whose attitudes do not agree 
with the scale value of the statement. Such state- 
ments as the following dilute the validity of the 
test: “I like to go to school to be with other 
people” ; “If one’s parents need him at home he 


> 


ought not to go to high school” ; and “The kind- 
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est and best people I know don't have high 
School educations." 

Schools may find it important to use such a 
test of morale as a means of identifying dis- 
satisfied pupils and groups. The test would have 
no value unless careful follow-up of such cases 
were made. Research uses of the test may be 
important, although the limited validity of self- 
report must be borne in mind. The test is suf- 
ficiently sensitive to measure trends in group 
morale with changing conditions. Such surveys 
have proved useful in industrial personnel stud- 
ies and might well be tried by school adminis- 
trators. For this use norms would be of interest. 


[47] 
XH-T-P: House, Tree, and Person: A Measure of 
Adult Intelligence and a Projective Device: Pre- 
liminary Edition. Adults; 1946-47; 1¢ per drawing 
form ; 2¢ per scoring folder (1946) ; 2¢ per interroga- 
tion folder (1947) ; John N. Buck; Steward's Office, 
Lynchburg State Colony, Colony, Virginia. 
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Morris Krugman, Assistant Superintendent 
in Charge of Guidance, Public Schools, New 
York, New York. The H-T-P (House-Tree- 
Person) purports to be an objective instrument 
for measuring adult intelligence (more precisely, 
conceptual level) as well as a projective technique 
both for appraising personality and for determin- 
ing the existence of various forms of psycho- 
pathology. The subject is asked, successively, to 
draw a house, a tree, and a person; and in each 
instance, spontaneous comments and behavior 
during the performance are noted. An important 
phase of the test is the “post-drawing interroga- 
tion," consisting of 46 specific questions and a 
follow-up of any leads resulting from replies to 
these. 

The test was standardized on 120 adult sub- 
jects (C.A. 15 years and over), selected for in- 
tellectual level, 20 at each level from imbecile to 
“above average." The coefficient of correlation 
(Pearson) between the H-T-P and the full 
Wechsler-Bellevue for 100 cases was found to 
be .746 + .029. The method of scoring the 
H-T-P for intelligence is so complicated, how- 
ever, that it seems doubtful whether anyone but 


the author of the test can achieve so high a 
correlation with a standardized intelligence 
scale. Furthermore, the time required for scor- 
ing the objective part seems prohibitive, al- 
though extensive practice should reduce this 
time. The test can be used either as a group or 
an individual instrument, although, as Buck 
states, it is not as productive when used as a 
group test. 

The more important aspect of the H-T-P is 
its possibilities as a projective technique for 
studying personality. Buck claims high validity 
for the test, using hospital diagnoses, Rorschach 
studies, and opinions of competent observers as 
criteria for 60 cases; but no data are supplied in 
the preliminary manual of 89 pages. Study of 
this manual leads to the conclusion that this 
phase of the test is still extremely subjective and 
will depend largely upon the examiner's psycho- 
logical orientation. The point of view of the 
author of the test is partly Freudian, To this, 
Buck adds much of his own symbol interpreta- 
tion. One using the H-T-P as Buck evidently 
intends it to be used should be well versed in 
Freudian psychology. The type of qualitative 
appraisal, or “inspectional analysis," of the 
drawings and the replies to the questions will 
thus depend upon the competence of the analyst 
and upon his background in "dynamic" psy- 
chology. 

It may be unfair to the author to cite examples 
taken out of context, but in a brief review this is 
unavoidable. Only a few samples are given to in- 
dicate Buck's approach to personality evaluation : 


lt should be further noted that the subject often 
gives expression to his previously suppressed feelings 
by the use of abnormal sequence of detail: for in- 
stance, he may find it impossible to draw the door of 
his house until he had put in almost every other possi- 
ble detail. In brief he does not wish to "open up." A 
patient desirous of flight, however, might first draw 
himself a side porch entrance before putting in any 
other details; thus giving himself a means of escape 
before committing himself further. Another might 
draw a fireplace without a fire in it, but toj ped by a 
chimney from which smoke is pouring forth in great 
volume. This might be interpreted as indicative of a 
RU of recent coldness within the family relation- 
ship. 

The matter of force must always be considered: if 
rain, or wind, or storm is shown, great stress may be 
considered as being present, 

The tree appears most commonly to represent the 
subject’s felt ability to derive satisfaction from his 
environment. It is believed that the trunk of the tree 
represents the basic feeling of power within the self, 
that the limb structure symbolizes the external self in 
relation to the environment. It is believed that the 
tree whose upper branches are “chopped off,” so to 
speak, by the paper’s upper edge, represents a tendency 
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on the part of the individual to find satisfaction at the 
level of irreality. 

The timid, anxious person tends to draw small, con- 
stricted trees. The uninhibited maladjusted child may 
show his lack of inhibition by drawing an enormous, 
expansive sort of tree. 

The dead or dying tree must always be regarded as 
pathological unless proven otherwise. 

Quite frequently the tree appears to represent the 
subject's psychosexual level. The child before puberty 
may draw for example, two trees, identifying one as 
the father, one as the mother. The juvenile young 
adult may indicate mother fixation by drawing a most 
maternal sheltering tree. Since the tree lends itself 
peculiarly well to expressions of masculinity or fem- 
ininity and since the tree is a living object, subjects 
frequently project into it attitudes toward sex which 
they have previously been unable to verbalize. 

The tree may also represent the subject's temporal 
relationship with his environment. When considering 
the aspect of time, the groundline is assumed to repre- 
sent infancy; the growth upward from the groundline, 
the person’s temporal progress through life, with the 
newest branches representing latest contact and growth. 
Occasionally it is possible to uncover psychic traumatic 
episodes of early life by close questioning concerning 
scars on the trunk or limbs that have obviously been 
chopped off, Withered or dead limbs frequently repre- 
sent unpleasant episodes in the past. 


A third aspect of the test is the rating of P, 
or pathological factors. These are of three levels 
of intensity and are ostensibly objective, but 
here, too, subjectivity is high. Buck believes that 
less experience is required for the rating of P 
factors than for the inspectional analysis, but 
this seems doubtful. P factors are intended to 
lead to a clinical diagnosis, and this should cer- 
tainly not be left to the inexperienced. 

The H-T-P has great possibilities as a pro- 
jective technique, but, as Buck implies in his 
manual, the method needs much more work be- 
fore it can be placed alongside the Rorschach 
or the TAT as a proven clinical instrument. For 
those well versed in the dynamics of behavior, 
the H-T-P should be useful even in its present 
state. 


Katherine W. Wilcox, Chief Psychologist, 
Traverse City State Hospital, Traverse City, 
Michigan, The H-T-P is a nonstructured verbal 
and nonverbal projective test for clinical use in 
the evaluation of personality including intelli- 
gence. It consists of drawings by the subject of 
a House, a Tree, and a Person, followed by an 
interrogation period. 

The quantitative scoring needed for the in- 
telligence classification is based on the responses 
of a selected sample of population who, after 
a battery of other tests plus observation, were 
clinically judged to represent degrees of intelli- 
gence ranging from imbecile to superior with- 


out marked personality deviation. Each item 
(analyzed characteristic), whether of detail, pro- 
portion, or perspective, receives a score on an 
eight-point scale of factor symbols: D3, very 
inferior; D2, imbecile; Dr, moron; Ar, border- 
line; A2, low average; A3, average; S1, above 
average; S2, superior. For example, Dı means 
that a particular item occurred in the drawings 
of at least 50 per cent of the persons of moron 
intelligence but in less than 50 per cent of all 
the combined higher levels. Each specific item 
analyzed in the drawings receives a weighted as 
well as a raw score. All average or superior 
scores are classified as G(Good). The percentage 
Raw G score and the weighted Good / weighted 
Flaw (ratio) score are obtained. These show the 
intelligence classification according to the au- 
thor's norms. The analysis of the score pattern 
is often revealing in showing up some particu- 
larly good or poor responses in some fields. A 
certain amount of scatter is expected. A superior 
record containing several seriously low Flaw re- 
sponses (D2 or D3) may be indicative of some 
malfunctioning. A difference of more than one 
intelligence classification level between the sum 
of the weighted Good and the sum of the 
weighted Flaw scores may be regarded as sig- 
nificant. Great scatter suggests some factor of 
repression or deterioration. 

For the sake of qualitative analysis of the per- 
sonality a complete record is kept of the subject's 
verbal responses, emotional manifestations, items 
of difficulty or blockage, and inter-item latencies 
together with when they occurred in relation to 
the drawing. In the interrogation an effort is 
made to get the subject’s attitudes toward a 
dwelling place, a living thing, and a human being 
and his identification of others or himself with 
the respective wholes. For example, in the re- 
viewer's series, a psychoneurotic with a reactive 
depression identified herself with a weeping wil- 
low tree completely bowed down by the weight 
of circumstances. 

The qualitative analysis of the H-T-P draw- 
ings is based on selected samples of 60 adult 
patients at the University of Virginia Hospital 
plus 90 from the Lynchburg State Colony and 
its clinics. The P factors were postulated to be 
items which in most instances did not differen- 
tiate as to intelligence level and other items of 
concept, time, comments (spontaneous and in- 
duced), line quality, self-criticism, attitude, and 
drive which might serve to differentiate between 
drawings produced by persons of well-integrated 
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and mature personalities and those produced by 
persons who were, on the other hand, malad- 
justed, psychopathic, psychoneurotic, or frankly 
psychotic. The P factors are graded Pr, P2, 
and P3, according to whether they are judged 
to be a first-degree deviation from average, 
pathoformic, or pathological. There is no simple 
table by which such P items can be scored be- 
cause each must be evaluated in its relationship 
to the entire configuration presented. Thus, the 
drawing of a person with the legs protruding 
from the head would merit a P3 rating if the 
subject were known to be or to have been re- 
cently of average intelligence ; on the other hand, 
it might not even merit a Pr rating in the draw- 
ing of a congenital imbecile. 

In favor of the employment of the H-T-P 
appear to be the following: (a) the relatively 
primitive method of expression, drawing, which 
aids the examiner in obtaining information from 
subjects who are withdrawn or who are of less 
than average intelligence; (b) its almost com- 
plete unstructuring during the nonverbal portion 
(the only constant stimulus is the printed word 
House, Tree, or Person at the top of the form 
sheet) which compels projection; (c) the fact 
that previously suppressed or repressed material 
can often be verbalized after the intensely emo- 
tion-producing drawings are completed; (d) the 
fact that the approach to the total personality is 
both nonverbal and verbal. 

As the author himself points out, the major 
disadvantages at present appear to be: (a) the 
lack of objectivity of the qualitative scoring sys- 
tem and method of interpretation ; hence the ne- 
cessity for extensive clinical experience on the 
part of the examiner ; (b) the absence of specific 
score patterns and response patterns identified 
as pathognomonic for specific syndromes; (c) 
the absence of quantitative and qualitative norms 
for children. 

The H-T-P test is still in process of develop- 
ment and is now in use in selected hospitals and 
clinics by experienced clinicians who are inter- 
ested in working with the author in its further 
development and refinement. When it becomes 
available to the profession in published form, it 
will undoubtedly prove to be a valuable projec- 
tive technique complementary to the Rorschach 
and the Thematic Apperception Test. 


[48] 
Humm-Wadsworth Temperament Scale, 1940 Edi- 
tion. Adults; 1934-43; 1 form; manual revised, 1942; 
supplementáry manual, 1943; employee evaluation re- 


port form, 1939; qualitative evaluation sheet, 1942; 
quick-scoring sheet, 1943; work sheet, 1939; addenda 
1-3, not dated; distributed on a restricted basis to 
authorized technicians only; nontimed (30-90) min- 
utes; Doncaster G. Humm and Guy W. Wadsworth, 
Jr., Humm Personnel Service, 1219 West Twelfth St., 
Los Angeles, California. 
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H. J. Eysenck, Senior Psychologist, Mauds- 
ley Hospital, London, England. The Humm- 
Wadsworth scale was one of the first analytic 
scales; instead of assigning the individual a 
hodgepodge rating of “neuroticism” or “intro- 
yersion” on the basis of his answers to ques- 
tions which had practically never been properly 
validated, an attempt is made to investigate the 
temperamental constitution of the subject by 
means of a “profile,” made up of seven com- 
ponents (normal, hysteroid, manic, depressive, 
autistic, paranoid, and epileptoid). These com- 
ponents are rated on a seven-degree scale, and 
the over-all picture considered. 

The components were chosen on the basis of 
Rosanoff's psychiatric theory of personality 
structure. Scoring was derived from the fol- 
lowing groups: hysteroids (habitual criminals 
in state prisons) ; cycloids, schizoids, and epi- 
leptoids from state hospitals for the insane. 
Four hundred and thirty-six subjects were 
studied. Reliabilities of the several components 
have been reported for various groups to be 
in the .80's; validity studies claim correlations 
of .94 and .98 between inventory scores and 
case history. 

A special feature of the test is the "no- 
count"; records are rejected as probably in- 
valid if the number of “no” answers varies far 
from the normal in either direction. As many 
as 30 per cent of normal subjects may invali- 
date their tests in this fashion. Recently intro- 
duced supplementary techniques have reduced 
the percentage of rejections, but the effect of 


these changes on the validity of the test is as yet 
unknown. 

The scale is meant for use in industry, not 
in the clinic, although most of the validation 
data are culled from the clinic, not from indus- 
try. Scoring is complicated, although many aids 
are provided which reduce time of scoring and 
chances of error. 

The scale would seem to embody a number 
of desirable features: (a) It is based on a defi- 
nite theory of personality which may be em- 
pirically tested. (b) The scoring method is 
empirically validated on groups whose life 
histories show them to differ profoundly from 
each other, in the directions required by the 
theory. (c) Statistical treatment of the data is 
more adequate than is usual with scales of this 
type. (d) Widespread use of the scale in indus- 
try appears to have shown it to have a certain 
amount of validity. (e) The provision of the 
"no-count" counters one of the most usual 
criticisms of self-rating scales, viz., that people 
tend to suffer from "halo" effects as far as 
their personal qualities are concerned ; the "no- 
count" excludes persons who too consistently 
give themselves favourable or unfavourable 
scores. 

The scale is also subject to a number of 
criticisms: (a) The theory on which it is based 
is not widely held among psychiatrists or psy- 
chologists, and there appears to be little evi- 
dence in its favour. (b) For normal groups, 
such as those for whom the test is intended, 
the seven components correlate rather highly 
together (correlations of as high as .8 having 
been reported); this indicates that the com- 
ponents are not factorially pure and that a 
simpler scoring system might be devised by 
using factors derived from the intercorrela- 
tions between components rather than the com- 
ponents themselves. (c) While the "no-count" 
is useful, the number of blanks rejected on this 
account is so high that it may seriously restrict 
the usefulness of the whole test. (d) The evi- 
dence on which one may judge the test is not 
adequate. The reports in the literature are 
hardly ever full enough to make it possible for 
the critical student to evaluate them. Popula- 
tions are seldom described in sufficient detail 
to identify them; numbers of cases are stated 
in such a way that considerable doubt remains 
as to the breakdown of the total number into 
the numbers concerned with each component ; 
possible selective factors are not considered or 
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discussed. (A validity coefficient o£ .98 is evi- 
dence of a very high degree of selectivity in the 
Writer's experience and has little relevance to 
populations such as one encounters in normal 
life.) 

Tn spite of these criticisms, it is doubtful if 
a better test exists for the particular purpose 
for which the Humm—Wadsworth scale was 
constructed. The proposed publication of fur- 
ther experimental detail in book form by the 
authors of the test will perhaps serve to allay 
such doubts as the reviewer still experiences. 


H, Meltzer, Director, Psychological Service 
Center, St. Louis, Missouri. “I make mistakes 
all right but you are the one who makes the 
enemies.” This is the response of a 19-year-old 
machine shop operator to a 35-year-old inspec- 
tor. It illustrates the fact that temperamental 
differences do make their appearance in indus- 
trial situations and can play an important role 
in determining morale and productivity. It fol- 
lows, therefore, that an instrument which 
measures temperament can be helpful in deal- 
ing with the human factor in industry. One 
outstanding fact about the Humm- Wadsworth 
is that it has been developed in industrial situ- 
ations. Another is that, unlike other inventories, 
it is based on a definite theory of personality 
in the light of which items were selected, vali- 
dated, and organized. 

The test itself is composed of 318 questions 
to be answered Yes or No. Of these only 165 
were discovered to be differentially worth 
while. The remaining 153, though not scored, 
were kept for context on the impression that 
they improved the test atmosphere. The ques- 
tions included were picked from two thousand 
originally considered by Rosanoff. Items se- 
lected for each component and their weightings 
were standardized on fourteen groups of con- 
trol subjects, two for each component. The 
Seven components and the behavior tendencies 
associated with them are outlined as follows: 
Normal—integration, adjustment, and well- 
rounded development of the subject; H ysteroid 
—evidence of self-concern, self-interest, and 
possessiveness; Manic—evidence of cheerful- 
ness, drive toward activity, alertness, and ex- 
citability; Depressive—evidence of depression, 
retardation, and indecision ; Autistic—evidence 
of imaginativeness, retiring disposition, and 
tendency to flinch from social situations; Para- 
noid—evidence of tenacity of opinion and de- 


fense of systematized ideas; and Epileptoid— 
tendencies toward projects and inspiration 
toward achievement. 

To obtain a temperamental profile three sys- 
tems of scores are used. First are the raw 
scores themselves, referred to as the "log 
scores." These are determined merely by add- 
ing the values of scored questions as indicated 
on scoring keys. Profile scores, obtained from 
a nomograph available to users of the test, 
provide profile points corrected to compensate 
for the no-response tendency. And there are 
regression scores provided for cases where the 
no-count is so extreme as not to provide an 
acceptable profile score. This at times is neces- 
Sary because some people tend to overreport, 
others to underreport, and in standardizing the 
test it was discovered that the no-count is of 
significance in determining the acceptability of 
the scores. The degree of acceptability and 
validity of the scores can be obtained from a 
range of no-counts provided. Norms derived 
from the subjects on the original standardiza- 
tion and norms established from administering 
the scale to a large group of employed workers 
are furnished. A high no-score is interpreted 
as indicating defensiveness; a very low no- 
Score, a tendency to overreport or suggestibility. 
Detailed instructions for determining the ac- 
ceptability of the profile are given. By the use 
of supplementary techniques Humm has in- 
creased the number of temperamental scales 
found to be acceptable from 7o per cent re- 
ported in earlier studies to 9o or 95 per cent. 

An abundant number of temperamental pat- 
terns to illustrate those which go with success 
and failure in certain positions are available in 
the literature furnished by the publishers of the 
test. Reports on the frequency of appearance 
of each component are also given. Emphasis is 
put on the need for considering temperamental 
integration from the profile, and warning is 
given to consider the total profile rather than 
the measures of each component in a segmented 
manner. Interpreted realistically, the tempera- 
mental pattern can furnish relevant informa- 
tion about the emotional load an individual can 
or cannot comfortably hold. 

Reliability is reported to be higher on cases 
where there are acceptable no-counts. In cases 
of acceptable no-counts a correlation of -94 is 
reported, whereas for all cases the correlation 
given is .85. Studies of validity include the 
reports on a comparison of 548 employees se- 
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lected by interview alone as against 550 selected 
by the use of the Humm and intelligence tests. 
Comparisons of the interview versus the use of 
the Humm range as follows: 29 per cent of the 
problems for the first group as against 5.58 per 
cent for the Humm group, 49 per cent satis- 
factory as against 61.5 per cent, and 22 per cent 
outstanding as against 33 per cent. This means 
five times as many problems were accepted by 
the use of the interview alone. Also given as 
measures of validity are the facts that many 
companies have had long continuous service 
histories through the use of the test and that 
some psychologists and psychiatrists have used 
it successfully over a long period of years. 

As in any inventory, one questions the frank- 
ness of the subjects. Tiffin reports that when 65 
college students were given the test with two 
sets of instructions, one to be as frank as pos- 
sible and another to assume they were filling it 
out in an employment office when seeking a 
job, large changes occurred; the shift from 
clinical to employment ranged from +42 in 
Normal to —63 in Hysteroid. College students 
picked and instigated to mark tests in a certain 
way cannot be considered a very lifelike situa- 
tion. For counseling and evaluating purposes in 
personnel evaluation programs of a staff al- 
ready hired, the reviewer has generally found 
answers as frank as they are in clinical situa- 
tions. Even in hiring, deviations are surpris- 
ingly small and infrequent; that is, the 
tendency to polish answers does not show up 
nearly so often as it does on shorter forms of 
inventories. 

"There is still room for more studies on the 
reliability and validity of this temperament 
scale. The scale has been criticized for using 
categories that are abnormal, which sound 
pathological, but they are less so than others 
and are susceptible to interpretation with the 
use of a more normal vocabulary load, One can 
question whether Rosanoff’s categories will 
hold up in the long run. Rosanoff himself con- 
sidered these components more or less as con- 
stitutional, inheritable traits. Humm definitely 
does not so consider them. 

All in all, the reviewer would be willing to 
say from his years of experience with this 
instrument that he has found it to be very use- 
ful in industrial situations, particularly on a 
supervisory level; also at times he has found it 
useful in industrial counseling—he has found 
it advisable to use it more often in industrial 


settings than in clinical work. Used well in 
known situations for known and well-defined 
purposes in connection with other relevant tests 
of skills and abilities and interpreted in the light 
of life stories, clinically evaluated, it has been 
and can be a worth-while aid to a realistic per- 
sonnel evaluation program. So conceived, it has 
functional relevance and significance. It can 
profit from further research and knowledge of 
the relationship of temperament to definite situ- 
ations ; for example, the organization pattern of 
a company considered in terms of personalities 
and the organization of channels of communica- 
tion implied. 


Lorenz Misbach, Professor of Psychology, 
The University of Kansas City, Kansas City, 
Missouri. Peculiar merits of this scale appear 
to derive from the comprehensiveness of its 
theoretical grounds, its origin within an indus- 
trial situation, scoring techniques with refer- 
ence to variations in frankness of subjects, and 
accumulation of statistical and qualitative evi- 
dence from extensive industrial use. 

The dynamic character of Rosanoff's theory 
of temperament, which is basic to the scale, 
requires interpretation of profiles in terms of 
interrelations between forces, or components, 
of personality. This is an analytic rather than a 
typological enterprise. Familiarity with psychi- 
atric concepts and considerable experience with 
the scale and related materials are necessary 
for use of it as intended by the authors. 

Original standardization was sponsored by 
the Southern Counties Gas Company of Cali- 
fornia, For each of the seven preselected com- 
ponents—normal, hysteroid, manic, depressive, 
autistic, paranoid, and epileptoid—two groups 
of subjects were selected, from study of over 
4,000 life histories, one group showing extreme 
manifestation of the subsumed traits and the 
other showing extreme deficiency in respect to 
these traits. Original work sheets show that the 
fourteen groups ranged in size from 23 to 62, 
totaling 436. Of these, 73 were from institu- 
tional populations and 383 were industrial 
employees. 

* From reinterviews with subjects concerning 
their attitude in taking the.test, it was found 
that over- or under-frankness is reflected in the 
number of “no” answers (2, 35) and in the 
shape of the profile. In 1939 (12) a method 
of correcting scores with reference to frank- 
ness was described. Later research has resulted 
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in a method of correction more practicable 
clerically. This newer method is derived from 
two equations for regression of no-count and 
profile-count, separately, with each of the seven 
components, The method of correction and the 
research from which it was developed are de- 
scribed in the 1943 manual. 

Studies of validity, beyond initial standard- 
ization procedure, as with all tests, involve the 
difficulty of establishing trustworthy criteria. 
The highest coefficient of validity, .85, is given 
by Humm and Humm (35), with case studies 
as criterion. Seagoe (41) and Dorcus (32) re- 
Port correlations of .65 and .55 respectively, 
with success ratings as criteria. These results 
seem to the reviewer to be surprisingly indica- 
tive of validity, in view of the known unrelia- 
bility of ratings. In Dorcus' study, despite 
incompletenesses of method mentioned by Dor- 
cus and by Humm (33), if correction is made 
on the assumptions that the validity of pooled 
ratings does not exceed .40 and that the best 
Obtainable correlation of the scale with any 
criterion would be .85 (35), multiple correla- 
tion, with the standard multiple regression for- 
mula, would give a correction to .78 as the real 
validity. Reports by Wadsworth (16) and 
Fulton (36) concerning effectiveness of the 
scale in appraisal programs are still more im- 
pressive. 

The authors of the scale urge clients to 
employ an essentially clinical approach to ap- 
praisal with all available indices of fitness 
taken into consideration. The reviewer under- 
stands their position to be that although the 
final test of effectiveness of an instrument is 
experience with it, there are grave hazards in 
making predictions beyond statistically justified 
estimates of probability. 

Office records of the Humm Personnel Serv- 
ice, which were made freely available to the 
reviewer, show that the scale has remained in 
continuous use by the corporation originally 
sponsoring its development and that over two 
million scorings have been made for employee 
selection and appraisal in a wide variety of in- 
dustries. Major industrial users have insisted 
upon restrictions as to sale and use of the 
scale. However unfortunate the effect of these 
restrictions may be in the area of professional 
relations, a beneficial result for research pur- 
poses is the opportunity provided for accumu- 
lation of data in a central office, 

The reviewer has used the scale for several 


years in clinical and guidance work and is well 
acquainted with the experience of a Midwest- 
ern industrial engineering firm which has used 
the scale, in selective and appraisal programs, 
for four years. In the reviewer's opinion, 
which he believes has been formed with due 
awareness of the limitations inherent in any 
questionnaire technique, the Humm-—Wads- 
worth Temperament Scale is, if used within its 
proper limits, a remarkably effective instru- 
ment, 


For reviews by Forrest A. Kingsbury, Dan- 
tel A. Prescott, and P. E. Vernon, see 38:920 
and 40:1223. 
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*Hunt-Minnesota Test for Organic Brain Dam- 
age. Ages 16 and over; 1943; individual; $5 per set of 
testing materials and 25 record sheets; $1.25 per 25 
record sheets; (15 or 30) minutes; Howard F. Hunt; 
University of Minnesota Press. 
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Margaret Ives, Psychologist, St. Elizabeths 
Hospital, Washington, D. C. This test, in con- 
trast to others devised to measure intellectual 
deficit, is “specifically designed for routine clin- 
ical detection of organic brain damage” which 
has occurred after the subject has reached in- 
tellectual maturity. It consists of three parts— 
the 1937 Stanford-Binet vocabulary, “rela- 
tively insensitive to brain damage’%s’a series of 
verbal and nonverbal learning and recall tests, 
especially “sensitive to brain damage”; and a 
group of interpolated tests which supposedly 
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"serve as validity indicators" by picking out 
those with emotional and motivational disturb- 
ances serious enough to invalidate the entire 
test. 

Dr, Hunt makes allowance for normal retro- 
gressive changes with age and includes a 
weighted age score subtracted from the vocab- 
ulary rating in calculating the half of the score 
to be compared with new learning efficiency. 
However, recently (3) he has reduced the 
range for which the test is maximally valid to 
persons between 20 and 55 years with Binet 
vocabularies from 12 to 32 words, although 
beyond these limits the test may be used cau- 
tiously by competent clinicians. At one end it is 
mathematically impossible for older persons 
with low vocabularies to have pathological 
scores. Conversely, above a certain level in- 
creased vocabulary does not accompany in- 
creased efficiency of new learning but only 
penalizes the subject. Shipley, in his Shipley- 
Institute of Living Scale, has circumvented this 
difficulty by granting no additional credit toward 
the “conceptual quotient” for vocabulary over 
32 words. 

The test is standardized and statistically 
evaluated so that the optimal level of learning 
can be predicted from the age and vocabulary 
scores. This predicted value is represented by a 
T-score of 5o, and one standard error of esti- 
mate below this results in a T-score of 60. Dr. 
Hunt claims in a supplementary note (3) that 
T-scores over 66 indicate organic brain damage 
and that over 60 they justify suspicion of 
pathology. 

In the attempt to make the test extremely 
delicate in detecting minimal defect, speed of 
recognition and response, timed to the half 
second, is all important on the crucial learning 
tests. It seems to this reviewer impossible, even 
with a stop watch, to score accurately such 
split-second intervals; but that difficulty is 
minor compared to the unfortuate fact that 
although 21 neuropsychiatric patients were in- 
cluded in the standardization group, there was 
only one depressed psychotic and no catatonic 
schizophrenics, Yet these are the groups most 
likely to show retarded thinking even when 
superficially nearly recovered. Subsequent work 
with depressives at Minnesota shows that this 
group achieves a mean rating within the range 
considered indicative of organic brain damage. 
Thus, the test.is not valid for differentiating 
organic damage from functional impairment 
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involving slowed intellectual processes, perhaps 
temporary and reversible. 

The test should nevertheless prove useful in 
detecting intellectual loss in persons within the 
age and vocabulary limits given, where there is 
no question of functional psychosis or neurotic 
depression. It seems probable that any test 
which will adequately differentiate functional 
from organic groups must be subject to quali- 
tative as well as quantitative evaluation. 


O. L. Zangwill, Assistant Director, Institute 
of Experimental Psychology, University of 
Oxford, Oxford, England. This test is based on 
the well-recognized clinical observation ee 
formation and retention of new associations is 
very generally impaired in organic affections of 
the brain. It comprises three major divisions: 
the Stanford-Binet vocabulary test, which is 
relatively insensitive to brain damage (except 
in cases of dysphasia) ; the learning and recall 
tests, utilizing both visual and auditory mate- 
rials; and a group of interpolated tests, which 
serve inter alia as a means of filling the time 
intervals between the tests of recall in a uni- 
form manner. Hunt has evaluated the discrim- 
inatory power of his tests by selecting 33 cases 
from a group of brain-damaged patients and a 
similar number of non-brain-damaged cases 
equated with the former on the basis of vocab- 
ulary and age. By applying the chi-squared test 
to check the significance of the differences in 
scores, it was found that the organic cases 
showed significant deterioration. The degree - 
was assessed by obtaining each subject's pre- 
dicted score on the deterioration tests by the 
use of regression weights determined from the 
statistics of the normal sample. The difference 
between this and the actual score furnishes an 
index of deterioration. 

The test method is open to criticism on sev- 
eral counts: In the first place, more extensive 
control studies are required; R. F. Malamud 
(8) has shown that some normal persons give 
results indistinguishable from those of an or- 
ganic case. In the second place, it has been 
shown by Meehl and Jeffery (9) that the test 
is not entirely specific to brain damage. Cases 
of affective disorder without known structural 
lesions are likely to give results strongly sug- 
gestive of organic damage. In the third place, 
all the patients in Hunt's original brain- 
damaged group were (with one exception) 
cases of diffuse cerebral disease, Preliminary 
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research undertaken by the present reviewer 
suggests that the findings may be very different 
in cases of focal cerebral lesion, Thus cases 
with defects of visual or verbal learning asso- 
ciated with appropriately localized focal lesions 
but not with general cognitive deterioration 
show an impairment on the tests out of all pro- 
portion to the patient's mental status as as- 
sessed clinically. It is therefore essential that 
the test should be used with some care in the 
examination of cases with localized traumatic 
brain injuries. And in the fourth place, the 
prognostic significance of the milder grades of 
learning defect is difficult to assess in the ab- 
sence of data bearing on the occupational com- 
petence of the patients tested, 

From the diagnostic point of view, on the 
other hand, the test should be of some value 
in experienced hands, especially in detecting 
milder degrees of intellectual deficit associated 
with widespread cortical dysfunction. But it 
should not, at the present stage at least, be 
treated as a self-sufficient method for the de- 
tection and measurement of organic intellectual 
deterioration. 


[50] 
Information Blank EA: A Questionnaire on Emo- 
tional Adjustment, Provisional Form. Grades 10- 
16; 1938; IBM; 1 form; $1 per 50 machine-scorable 
test-answer sheets; 25¢ per specimen set; nontimed 
(10-20) minutes; H. T. Manuel, F., J. Adams, and Paul 
White; Steck Co. 


Percival M. Symonds, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. This 30-item psycho- 
neurotic inventory prepared for use in the 
1938-1939 testing program of the Texas Com- 
mission on Coordination in Education has been 
given the innocuous title Information Blank 
EA. It is designed not only to locate students 
with emotional difficulties but also to provide a 
counselor with some information concerning 
the specific nature of these difficulties. The 

` blank is printed on an IBM machine-scoring 
- sheet so that it is adapted for automatic scor- 
ing. A high score indicates favorable psycho- 
neurotic adjustment. The reliability by the 
adjusted split-half method on a group of one 


. » hundred college students is .768. 


—— As one peruses the items, he finds a judicious 


selection of questions representing neurotic 
Symptoms. There are three psychosomatic 
. items which were found in Army experience 


- to be So significant in detecting neurotic poten- 
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tiality. However, questions with regard to 
headaches and dizziness, not included in the 
scale, may be of more significance than a feel- 
ing of tenseness in the forehead or neck, or 
sweaty hands. There would seem to be an over- 
emphasis on questions relating to mood, and 
such questions as "Do you enjoy introducing 
someone to a person of high position?" or 
"Does the thought of your own sins or mis- 
takes disturb you very much?" may not apply 
to all individuals. On the whole, however, it 
would appear that this brief collection of ques- 
tions with regard to psychoneurotic and psy- 
chosomatic condition is effective for the purpose 
for which it was constructed. 


For review by Stanley G. Dulsky, see 40: 

1224. 
[51] 

*Interest Analysis, 1942 Revision. Ages 16 and 
over; 1938-42; $3 per introductory set including ro 
tests, postpaid; 5¢ per extra test, postpaid; introduc- 
tory sets may be ordered on approval; nontimed (10- 
B minutes; Eugene J. Benge; Management Service 

Edward B. Greene, Chief, Personnel Train- 
ing, Michigan Unemployment Compensation 
Commission, Detroit, Michigan; and Lecturer 
in Psychology, University of Michigan, Ann 
Arbor, Michigan. This form is a single sheet, 
printed on both sides, with five sections en- 
titled: (a) School Interests (37 items), (5) 
Discussion of Topics of Interest to You (42 
items), (c) Personal Activities (34 items), (d) 
Occupational Preferences (40 items), and (e) 
Personal Traits (40 items). In the first three 
sections you are asked to indicate your interest 
by a "plus" mark, a "minus" mark, or no mark 
at all if you neither disliked or liked an item. 
In the last two sections you are asked to indi- 
cate a choice between pairs of items by putting 
a "plus" mark in front of your preference. 
There is no apparent logical arrangement of 
items within sections, but rather they seem to 
have been arranged on a chance basis. It is also 
apparent that none of these sections attempt to 
be complete in covering a wide variety of items. 
The reason for this is that the test is not de- 
signed to be a measure of interest at all, but to 
indicate two personality traits, extraversion 
and dominance. For each of these.traits a sep- 
arate key is provided. 

Dominance is indicated by such school in- 
terests as taking part in athletics, debating, 
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playing hookey, dates with the opposite sex, 
dramatics. A negative aspect of dominance, or 
submission, is indicated by interest in algebra, 
English composition, civics, mechanical draw- 
ing, arithmetic, etc. 

Extraversion is supposed to be indicated by 
taking part in athletics, debating, football games, 
choral singing, playing hookey, economics, civics, 
American history, amateur magic. Nearly all of 
the items which indicate dominance also indicate 
extraversion, but in addition a fairly large num- 
ber of items are scored “positive” for extra- 
version. 

The method of selecting these items and 
scoring them is not indicated, nor is there any 
definition of extraversion attempted. 

In a short paragraph the author advises in- 
dustrial concerns to establish standards for 
satisfactory workers in each distinct group, and 
then use these standards as passing grades for 
new employees coming into the group. “The 
interviewer should realize that many factors 
beside extraversion enter into success. Hence, 
the test should largely be used for rejective 
purposes—applicants who failed badly to meet 
the standard set should be excluded quickly. 
Those who pass should be subjected to further 
interview and examination.” This last quota- 
tion indicates that the instrument is designed to 
be a quick screening procedure. 

With regard to dominance, Mr. Benge writes, 
“Dominance is the characteristic of an individual 
whereby he desires to be ascendant in many situ- 
ations. Most individuals are dominant in some 
situations, but the characteristic here referred 
to indicates a desire to be ascendant over other 
people, or dominant in most human relation situ- 
ations.” The author again advises company of- 
ficials to set their own standards by selecting 
ten individuals who are dominant and ten who 
are submissive. If the blanks show that the ten 
selected for dominance average higher domi- 
nance scores than the ten submissive, then the 
scale may be used. If not, it should not be used 
for indicating dominance in the local situation. 
No figures are given for reliability, validity, or 
any indication of the sample of workers upon 
which the norms were secured. 

The phrasing of individual items leaves con- 
siderable to be desired. Most of them are single 
words, although Section 5 (Personal Traits) 
uses equally vague, short phrases, such as “I like 
exciting amusements,” or “Quiet amusements 
interest me most.” Many persons are baffled by 


the fact that they like both exciting and quiet 
amusements in a reasonable but unknown mix- 
ture. ` 
[ 52] 

*Interest Inventory for Elementary Grades: 
George Washington University Series. Grades 4-6; 
1941; Form A; sé per test; 15¢ per manual; 25¢ per 
specimen set; nontimed (30-40) minutes; Mitchell 
Dreese and Elizabeth Mooney; Center for Psycho- 
logical Service, George Washington University. 
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Harold D. Carter, Associate Professor of 
Education, University of California, Berkeley, 
California. This inventory is intended for meas- 
urement of general interest patterns of boys and 
girls. It contains 250 items which elicit reactions 
to such things as movies, radio, things to read, 
games and toys, hobbies, things to own, school 
subjects, people, occupations, and activities. The 
items appear to have been well selected, on the 
basis of survey of children’s activities. 

The test is scored to measure normalcy of in- 
terest according to sex and grade. The scoring 
is such as to indicate the nature and extent of 
agreement between the interests of the individ- 
ual and those of the group. It is intended that 
the test shall be useful in measurement and diag- 
nosis, to aid the school in developing desirable 
interests among the children. 

The test, containing 250 items, is probably 
fairly reliable. The evidence of reliability is in- 
adequate; it consists of evidence that the item 
responses show marked consistency. There is no 
correlational evidence that the scores yielded by 
the test have satisfactory reliability. The valid- 
ity of the test has apparently been taken for 
granted; the manual presents no evidence of 
validity. 

The test has been carefully constructed and 
arranged for simple administration and scoring. 
Significant facts about a child’s interest pattern 
are to be found through examination of the in- ` 
ventory. The test seems promising. Its guiding 
philosophy is constructive. One can imagine that 
research might show that the test can be used 
to measure various aspects of interests, such as 
ignorance, indecision, maturity, and concentra- 
tion of interests. 

As it now stands, the consumers must find 
out for themselves to what extent the test is 
useful. The manual does not provide the neces- 
sary evidence. 3 
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‘Lee J. Cronbach, Assistant Professor of Edu- 
cation, The University of Chicago, Chicago, 
Illinois. Elementary school teachers frequently 

„wish to analyze the interests of pupils for cur- 

“riculum planning and guidance. Most teachers 
use interviews and other informal methods to 
obtain this information. The Interest Inventory 
permits a more rapid but less flexible survey of 
interests in a group. It requires each pupil to 
respond Like, Indifferent, Dislike, or Unknown 
to 250 activities. The list is varied and probably 
as satisfactory as any standard checklist. Most 
items deal with interests significant to educa- 
tion, but the list may overlook interests impor- 
tant in a given child unless it is supplemented 
by an individual interview. 

The only reported evidence of reliability is 
that fifty children retested after a few days 
changed only 15 per cent of their answers. Diag- 
nosis on the basis of checked interests is likely 
to be correspondingly imperfect. There is no evi- 
dence that checked interests correspond to inter- 
ests shown by behavior. The authors tend to 
regard responses of the child as unquestionably 
valid. Thus many likes “will show whether he is 
in general interested in most things.” Such a pat- 
tern may as easily represent merely an acquies- 
cent set toward suggestions, a desire to make a 
favorable impression on the teacher, or a tend- 
ency to say that he likes all activities that might 
appeal to him momentarily. 

The authors recommend a scoring plan to rate 
each pupil on normality of interest. The pur- 
pose is to screen pupils having interests unusual 
for their grade and sex for special attention. 
This is equivalent to implying that interests of 
normal children do not require individual study, 
an idea which is thoroughly false. Scores are 
based on an empirically determined system of 
weights. There is no justification for such a 
time-consuming system as opposed to a mere 
count of “normal” responses. The weights are 
based, for any grade and sex, on an inadequate 
sample—an average of seventy cases, mostly 
suburban. There is no evidence of the reliabil- 
ity or empirical validity of scores. 

The authors appear to have the false picture 
that the “normal child” has all the interests of 
his grade-sex group and no others. It is more 
likely that the “normal child” is advanced in 
some areas of interests and relatively undevel- 
oped in others. The present scoring system penal- 
izes such a pattern. The test would be improved 
if the maturity of the pupil in significant areas 
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rather than his normality were described. A 
concept of “interest age” could be profitably 
applied. 

Minor faults also appear, such as a short and 
somewhat irrelevant set of preference items 
added at the end and a system of part scores 
based on brief and heterogeneous subtests. In 
general, the test appears useful as a checklist 
for item-by-item study and nearly worthless as 
a scored measuring instrument. The older Pres- 
sey Interest-Attitude Tests (see 40:1243.1) for 
grade 6 and over isa far sounder test. 


[53] 
Interest-Values Inventory. Grades 9-16 and adulte; 
1939; I form; $9.35 per 100; 25¢ per specimen set; 
nontimed (30) minutes; J. B. Maller and Edward M. 
Glaser; Bureau of Publications, Teachers College, Co- 
lumbia University. 
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E. Lowell Kelly, Professor of Psychology, 
University of Michigan, Ann Arbor, Michigan. 
This inventory was designed to measure the 
relative dominance of four of the six major in- 
terests or values proposed by Spranger and 
measured by the Allport-Vernon Study of Val- 
ues: Theoretical, Aesthetic, Social, and Eco- 
nomic. It does not provide scores for religious 
or political values as does the Allport- Vernon 
scale. 

The Interest-Values Inventory per se consists 
of four parts. Part I consists of ro items, each 
composed of four words. The subject is asked 
to indicate which of the four words he likes 
best. Part II is composed of ro items of the 
controlled association variety with four alterna- 
tive response words, Part III consists of 12 
questions or problems each of which is followed 
by four answers or comments. The subject is 
asked to rate each of the four alternatives on 
a 3-point (like-neutral-dislike) scale. Part IV 
consists of a list of 48 personality trait names. 


The subject is asked to rate himself on each of 


ers 9:67-81 S "40. * 
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these traits by noting whether he is strong, aver- 
age, or weak. 

Each question in the inventory provides for 
the expression of any of the four types of in- 
terest which the scale purports to measure. Items 
which appear in the scale were retained on the 
basis of significantly different responses by four 
groups composed of 50 students, each chosen 
because of occupation or field of study to repre- 
sent one of the four types of interests. Scoring 
is accomplished with the aid of a transparent 
stencil, and positive unit weights are counted 
successively to obtain each of the four scores. 

Retest reliabilities of the resulting scores 
range from .87 to .93 for a ten-day interval and 
.70 to .87 for a three-month interval (N = 100 
college students). Separate percentile norms are 
provided for high school students (N = 350) 
and adult graduate students (N not given). 

As compared with Allport and Vernon’s Study 
of Values, this instrument appears to require 
about the same time to administer, to be more 
easily scored, and to yield significantly higher 
reliabilities. Strangely, however, even though 
acknowledging their indebtedness to the previ- 
ous work of Allport and Vernon, Maller and 
Glaser fail to report the correlation between the 
sets of four supposedly similar measures pro- 
vided by the two instruments. And again, al- 
though the authors state that their classification 
of interests is based in part on Thurstone’s factor 
analysis of Strong’s Vocational Interest Blank, 
they fail to report a single correlation with any 
kind of vocational interest score. This omission 
is all the more pertinent when it is remembered 
that the four groups used for item validations 
were chosen on the basis of vocational or pre- 
vocational college interest. 

To this reviewer, it would seem that persons 
who introduce a new test have a professional 
obligation to report correlations of scores on it 
with those on all widely used and related tests. 
Otherwise it is extremely difficult for the pro- 
spective user of the new test to judge its unique 
value (if any) over and above the instruments 
with which he is already familiar. 

Part V of the test is in no sense a measure 
of interests but rather a supplementary inven- 
tory consisting of questions designed to distin- 
guish between happy and unhappy persons. This 
part of the test is scored with apparently a priori 
weights to yield an “Emotional Satisfaction 
Score.” No reliability is reported for this score, 
nor any indication of its validity. Just why this 


unvalidated instrument is printed in the same 
booklet is not clear. One can only surmise that it 
was felt its inclusion would make thetest a more 
attractive instrument for student counseling! 


+ 


Paul E. Meehl, Assistant Professor of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota. This test is designed to measure rel- 
ative dominance of theoretic, aesthetic, social, 
and economic interests or values. It was devel- 
oped to be an improvement over Allport and 
Vernon’s Study of Values, eliminating the “po- 
litical” and “religious” scoring in the latter. An 
effort was made to improve the reliability of 
the "social" value score, which was highly un- 
stable in the Allport-Vernon. The present test 
aims at somewhat greater subtlety by making 
use of the testee's preference for certain words 
over others and of his associations between 
words, in addition to his statement of preferred 
solution to a problem situation involving value 
conflict. The new items do not seem particularly 
subtle, but no evidence is available on this point. 

The authors state that the items were selected 
empirically, on the basis of differentiation among 
college criterion groups defined by their chosen 
majors. It is not stated how large an initial item 
pool was involved, which makes it difficult to 
evaluate the critical ratios reported. It is to be 
presumed that differentiation would be reduced 
if the test were applied to a new “test” group 
other than those on which the items were se- 
lected. However, within the reported sample, 
there seems to be pretty good evidence for va- 
lidity. The means of each of the four criterion 
groups run between 1.5 and somewhat over 2 
standard deviations above the other three groups 
on the appropriate scale. On any given value, it 
would be possible to separate each major group 
from the other three from 25 to 34 per cent 
better than by chance, using the appropriate 
value score alone. Considering the numerous 
extra-value factors which influence a student's 
decision to major in one area or another, this 
seems fairly good evidence of validity. There are 
a number of important exceptions to the differ- 
entiation which the authors do not stress but 
which appear in their published material. For 
example, male students in the practical, economic 
area are not appreciably different in “social” 
value-interest from those majoring in the social- 
service area. There is also a numerical contra- 
diction in both the manual and their published 
article (Table I, mean social value score for total 
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economic criterion group) which tends to make 
the results look better than they are. Their vali- 
dation data are given in terms of contingency 
coefficients, based upon the dominant value clas- 
sification of students. 

Normsare given for high school students only, 
except that the theoretical value norms are based 
upon adult graduate students, who are almost 
certainly atypical with respect to this particular 
value. It would seem desirable to have given sex 
norms separately, but this was not done. The 
description of the four value types is fairly ade- 
quate, although the lumping together of artistic 
and musical under Thurstone's “linguistic” in- 
terest factor would seem to be rather precari- 
ous. Scoring is medium in ease. There is the 
usual reference to "adults" together with high 
School and college students, with no evidence 
that the ordinary adult is validly testable. The 
directions and vocabulary make it at least ques- 
tionable whether the run-of-mine adult is ade- 
quate to the task. However, the nature of the 
variables is such that this is not a serious draw- 
back, since only rarely would this sort of measure 
become clinically relevant at the general adult 
level. 

[54] 
X*Inventory of Affective Tolerance. College and 
adults; 1942; 1 form; $1.75 per 25; 40€ per specimen 


set; nontimed (25) minutes; Robert I, Watson and 
V. E. Fisher; Sheridan Supply Co. 
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3. Watson, Ronert I. “The Relationship of the Affective 
Tolerance Inventory to Other Personality Inventories.” Ed & 
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17:31 

7$. Warsow, Ronens I. “Clinical Validity of the Inventory of 
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Paul R. Farnsworth, Professor of Psychology, 
Stanford University, Stanford, California. This 
personality inventory of 61 items (six-answer 
multiple-choice) purports to measure "sheer 
capacity to withstand or ‘take’ emotional disturb- 
ances." Tt also attempts to tap "the individual's 
ability to vent or discharge his affective tension, 
and his ability to give a subjectively appropriate 
form and direction to his release of tension." In 
the senior author's opinion this inventory has 30 
items which are similar, or at least partially 
similar, to those of Bell's Adjustment Inventory 
and 29 quite like those of the Bernreuter Per- 
sonality Inventory. Hence, it is not surprising 
that scores on this newer test correlate —.56 


with total scores on the Bell and from —.56 to 
—.66 with the Bernreuter F1C (self-confidence) 
scores. 

The taking of the Inventory of Affective Tol- 
erance requires no more than 25 minutes at the 
most, and the hand scoring from two to three 
minutes. The weights of the items were figured 
by the t-ratio method. The test possesses a split- 
half reliability somewhere in the nineties. Cen- 
tile norms for college students are offered which 
show that men, on the average, achieve slightly 
higher scores than do women. Validity data have 
been assembled through the matched pairing of 
normals and neurotics and by the study of cases 
with minor behavior difficulties. 

Scrutiny of the available material on this test 
forces the conclusion that this work of R. I. Wat- 
son and V. E. Fisher is as good as, and perhaps 
a trifle better than, the general run of pencil- 
and-paper adjustment inventories. Tt will select 
out many individuals who should have clinical 
attention. But it will just as certainly fail to 
screen out many others who may have an equal 
need for mental therapy. 


E. Lowell Kelly, Professor of Psychology, 
University of Michigan, Ann Arbor, Michigan. 
This instrument purports to measure "affective 
tolerance," defined by the authors in the Manual 
as "the capacity to deal with affective tensions 
or emotional excitements. The sheer capacity to 
withstand or ‘take’ emotional disturbances is the 
most basic aspect.” 

The inventory consists of 61 statements, typi- 
cal of which are, “I do things in spite of myself 
and against my better judgment,” “I have peri- 
ods of anxiety,” and “I experience feelings of 
hate,” to which the testee responds by checking 
one of the adverbs which define a six-point scale: 
e.g., continuously, very frequently, frequently, 
occasionally, rarely, never. 

The 61 items composing the inventory were 
selected from 150 original items on the basis of 
“F” and "f?" values of total scores associated 
with individual item responses. Scoring weights 
ranging from o to +6 were assigned to each of 
the alternative responses of each item on the 
basis of t-ratios. 

Reliability coefficients (split-half) are reported 
as .93, .94, and .93 for three different groups 
of college students. Because of significant sex 
differences in resulting scores, separate centile 
norms are provided, based on 270 men and 270 
women students selected from four western state 
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universities, one midwestern, and one eastern 
private college. 

just what this inventory measures is not easy 
to judge. Women score significantly lower (i.e., 
less tolerance) than men, and neurotics score 
lower than normals, Watson-Fisher scores cor- 
relate about —.60 with Bernreuter FIC scores 
and —.70 with scores on the Thurstone Person- 
ality Schedule: Clark Revision. Watson has also 
shown that college students who seek psycho- 
logical counseling for emotional problems make 
significantly lower scores than those seeking as- 
sistance for nonemotional problems. 

Since it is difficult to know what the instru- 
ment is measuring, it is also difficult to know 
where and how it might profitably be used. The 
manual suggests that it may be used “to select 
students in need of personal counseling, as a tool 
of guidance choice [sic] where such a measure 
may be significant and for purposes of research 
in the fields of personality and emotion." For 
the first and last of these suggested uses, this 
instrument would appear to be no better (or 
worse!) than several other so-called neurotic in- 
ventories. But just how it could be used as “a 
tool of guidance choice" this reviewer does not 
pretend to understand. 

In summary, the Inventory of Affective Tol- 
erance appears to be simply another personality 
test which yields fairly reliable measures of 
some as yet unidentified variable or variables. 
Thus it promises to provide uncritical investi- 
gators with one more set of relatively meaning- 
less scores. Like similar inventories, responses 
to individual items may be of some use to the 
busy counselor or clinician in providing leads 
for the counseling interview. 


William U. Snyder, Associate Professor of 
Psychology and Associate Director, Psycho- 
logical Clinic, The Pennsylvania State College, 
State College, Pennsylvania. This test was de- 
signed to measure affective tolerance, or the 
ability of an individual to deal with emotional 
tensions. This appears to be the same as what 
Rosenzweig has described as frustration toler- 
ance and is closely related to emotional adjust- 
ment. 

The test consists of a questionnaire of 61 
items, each of which may be answered by one 
of six possible choices, such as “never,” “rarely,” 
“occasionally,” etc. Many of these items are the 
same ones that have been appearing in personal- 
ity tests for thirty years. The publishers claim 


that the test serves in much the same way as 
tests formerly called tests of neurotic tendency 
but with a new approach. About the only new 
aspect would seem to be the name of the test 
and possibly some of the items. 

A favorable point regarding this test is the 
fact that only one aspect of personality is meas- 
ured by the 6r items. This allows for some cor- 
rection of subjective errors. But the publisher’s 
dismissal of this problem is cavalier: “The in- 
ventory calls for the person’s own evaluation of 
his affective tolerance. Hence, cooperation, hon- 
esty, and insight are necessary for valid results 
to be obtained.” Assuming that the first two fac- 
tors could be assured, no easy dismissal of the 
problem of insight is justified. On such ques- 
tions as, “I possess self-confidence in my associ- 
ations with members of the opposite sex” and 
“T am in fear of violating my own standard of 
morals,” the aspect of opinion error must be 
quite high. Furthermore the discerning subject 
will quickly realize that the choices become in- 
creasingly favorable from “a” to "f" in every 
case; and even if he wants to be honest, he can- 
not then eliminate the subjective factor. 

There is little doubt about the reliability of 
the test; it is in the nineties. But validity is a 
different matter. The publishers report that the 
discriminatory power of the test was measured 
by submitting it to a check with matched nor- 
mals and neurotic adults. They do not report the 
numbers or the selection criteria of the groups 
mentioned, and this failure is indefensible. The 
trait of neuroticism is not adequately enough 
defined as yet to be taken for granted. Did the 
authors use only someone else’s as-yet-unvali- 
dated test of neuroticism as a criterion? This is 
answered in a published article. There were 25 
neurotics and 25 normals matched for sex, age, 
marital status, rural-urban status, occupation, 
education, and income. The neurotics were so 
judged by Dr. Fisher in extensive interviews 
during clinical practice. There was virtually no 
overlap between neurotic and normal, i.e. college 
students, scores. A group of 93 psychotics scored 
between the normals and the neurotics and the 
authors attribute this to lack of insight. 

A more recent study by the authors adds 40 
more neurotic persons, of whom 16 were diag- 
nosed after the scores on the inventory were 
available. There was no overlap with the nor- 
mals’ scores. 

The norms for the test are based on scores ob- 
tained from 540 college students, equally divided 
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as to sex, and selected from different colleges, 
three-quarters of which are in the northwest. 
The test is therefore significant for use only with 
college students. 

Scoring of the test proves tedious with any 
sizable group. Each response is weighted, and 
the 61 weights must be summed. This sum may 
be interpreted on a table of centile norms. It is 
unfortunate that the authors of this relatively 
new test did not follow the request of the Amer- 
ican Psychological Association regarding the use 
of standard scores instead of percentiles. 

In general, it is the reviewer’s opinion that the 
clinical psychologist need not be too concerned 
if he has not been using this test. Other tests, 
we believe, measure the same thing with equal or 
better validity and possibly with more originality. 


[55] 

Inventory of Factors STDCR. Grades 9-16 and 
adults; 1934-40; IBM; $2 per 25; machine-scorable 
answer sheets also available, 2¢ each; $1.25 per set of 
plastic hand-scoring keys for test booklet; $3.50 per 
set of machine-scoring stencils for answer sheet; $2.50 
per set of hand-scoring stencils for answer sheet; 40¢ 
per specimen set; nontimed (30) minutes; J. P. Guil- 
ford; Sheridan Supply Co. 
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H. J. Eysenck, Senior Psychologist, Maudsley 
Hospital, London, England, This test consistsof 
175 questions of the Yes-?-No type and pur- 
ports to measure five independent variables of 
temperament, viz., social introversion, thinking 
introversion, depression, cycloid disposition, and 
rathymia. Scores are given in terms of C-scores, 


the norms being based on 800 senior high school 
students, 388 university students, and 80 adults 
(thus giving three sets of norms, not differen- 
tiated as to sex). The reliability of each of 
the separate five variables is high, both by the 
Spearman-Brown and by the Kuder-Richard- 
son formulas. 

The inventory is open to criticism on the fol- 
lowing grounds: Evidence of validity is rela- 
tively weak, particularly for factors T and C 
(thinking introversion and cycloid disposition). 
In particular, little has been published regarding 
the practical use of the inventory in high school 
and college counseling services, in penal insti- 
tutions, in family relations, and in hospitals, 
speech clinics, industrial plants, and so forth; 
while the inventory is known to have been widely 
used by these various agencies, the results still 
await publication. In the absence of such evi- 
dence, judgment must be withheld. The method 
of factorial analysis used, although orthodox, is 
open to criticism. The resulting rotated factors 
are not uncorrelated; correlations between 
scores of as high as .85 being reported. A fac- 
torial analysis of these correlations between 
scores has given rise to a second-order factor 
which accounts for a greater proportion of the 
variance than any of Guilford’s five factors. In 
addition, clinical evidence for the validity of this 
“general factor of neuroticism” was presented 
(9) ; yet no score or method of measurement is 
given by Guilford for this second-order factor. 

As opposed to these criticisms, it should be 
pointed out that Guilford’s inventory has cer- 
tain advantages over other well-known tests: (a) 
The statistical work is of the expected high 
standard, and in particular the factorial approach 
to the problem of inventory construction, which 
is largely due to Guilford’s original impetus, 
promises to result in much more analytical tests 
than we have known so far. (b) The inventory is 
not encumbered, as are the Humm-Wadsworth 
Temperament Scale and the Minnesota Multi- 
phasic Personality Inventory, by reliance on 
obsolescent psychiatric classifications not even 
widely accepted in psychiatric circles. 

As a total summing up, it may be said that 
the published evidence regarding this test shows 
it to mark a distinct advance in test construc- 
tion but to lack in validation data; it is there- 
fore a test for the experimenter rather than for 
the practical tester in clinic or factory. 


For a review by R. A. Brotemarkle, see 45. 


——— MN 
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X*JNB Psychograph. Patients in mental hospitals; 
1941; 2¢ per form; for manual, see (1) below; John 
N. Buck; Steward's Office, Lynchburg State Colony, 
Colony, Virginia. 

REFERENCES 
1, Bucx, x N. "The JNB Psych A A 

sie om bu ade Ie 'sychograph. Appl Psychol 

[57] 
x Johnson Temperament Analysis. Grades 9-16 and 
adults; 1941-44; manual revised, 1944; IBM; $1.75 
per 25; 2¢ per response record sheet; I per analysis 
profile; 6o per set of unweighted scoring stencils; 
$1.75 per set of weighted scoring stencils; 35¢ per spec- 
imen set; 7¢ per copy of machine-scoring edition; 2¢ 
per machine-scorable answer sheet; nontimed (40-50) 
minutes; Roswell H. Johnson; California Test Bu- 


reau. 
REFERENCES 
1, Sutnert, Eart W. “An Evaluation of the Johnson Tem- 


perament Analysis.” J Clin Psychol 1:193-5 "45. * (PA 
1913411) 

H. Meltzer, Director, Psychological Service 
Center, St. Louis, Missouri. An inventory for 
measuring temperament, to a large extent in- 
fluenced by and to some extent patterned after 
the Hwumm-Wadsworth Temperament Scale, 
is the Johnson Temperament Analysis. It does 
have some differentiating features. For exam- 
ple, unlike the Humm-Wadsworth, it is not 
based on any distinct theory of personality but 
is, instead, organized around nine behavior 
tendencies considered important in the light of 
Dr. Johnson's family relations experience. 
Traits used by him are arranged in opposites 
and include: nervous-composed; depressive- 
gay-hearted ; active-quiet ; cordial-cold ; sympa- 
thetic-"hard boiled” ; subjective-objective; ag- 
gressive-submissive; critical-appreciative; and 
self-mastery-impulsive. Instead of 318 questions, 
there are 182. To avoid the error introduced by 
forcing a client to answer yes or no, he is ad- 
vised to rate the subject in the highest 40 per 
cent, the middle 20 per cent, or the lowest 40 
per cent. Another feature is that no single ques- 
tion is used to measure two traits. 

The analysis profile resembles the Humm- 
Wadsworth profile except that it is shaded so 
that the user is protected from the grave dan- 
ger of assuming that either an average or a 
maximum is necessarily the best condition. The 
traits selected show relationships to the Humm- 
Wadsworth list of components based on Rosa- 
noff's personality analysis. This inventory has 
the disadvantage that it does not represent a 
distinct theory, but it does have the advantage 
of resembling more normal types of behavior 
patterns. There is very little published about 
the test, its reliability, validity, or usefulness. 


In the manual there is a consideration of reli- 
ability, which is none too specific, The statisti- 
cal reliability given for 300 cases for the vari- 
ous traits ranges from .57 to .77. Weighted 
norms are reported for high school girls and 
boys and for men and women. Most of the 
adult norms are given from cases reported at 
the American Institute of Family Relations. 
The high school weighted norms are based on 
200 high school students, mostly seniors. Va- 
lidity is reported for name agreement, and the 
use-validity is based on 100 spouses (50 men 
and so women) from the Institute as compared 
to 100 club and class people. Tetrachoric cor- 
relations for 100 men and 100 women are re- 
ported. 

In the literature the reviewer has found only 
one article reporting the use of the test. This 
was at the Green Mountain Junior College for 
the freshman group of 1944-45. There is 
nothing in this article that would add to the 
literature on either the reliability or validity, 
or even the usefulness of the test. The results 
are reported in vague terms; the assumption 
is made that the test is valid because the nine 
traits were at or near the soth percentile for 
the college freshman group. No more specific 
norms are reported, and the conclusion is made 
that the first evaluation of the test warrants its 
further use and further research as to its 
validity. 

The crisscross method’ of scoring is advised 
for use in marriage cases. The most frequent 
use of the test to date has apparently been in 
marital counseling. The reviewer has not used 
this test and his reactions, therefore, are neces- 
sarily based solely on a review of written mate- 
rials furnished by the author. On this basis it 
is the impression of the reviewer that the test 
is still too much in need of improved construc- 
tion, improved description, and improved stand- 
ardization in real situations before much can 
be said about its possibilities and limitations. 
Evidence to date is not too convincing. 

[ 58 ] 

*Jurgensen Classification Inventory. Adults ; 1947; 
1 form; separate answer sheets must be used; $5 per 
25; 125 or more tests, $4.50 per 25; $1.90 per 100 an- 
swer sheets; 500 or more answer sheets, $1.71 per 100; 
single copies, 2726; 50¢ per specimen set, postpaid ; 
test materials will be sold only to members of the 
American Psychological Association; score keys and 
norms are not provided as each user is expected to 
develop keys and norms to fit his own situation; (20- 
40) minutes; Clifford E. Jurgensen; the Author, Min- 
neapolis Gas Light Co. 
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*Mental Health Analysis. Grades 4-8, 7-10, 9-16, 
adults; 1946; IBM; Form A; $1.75 per 25; 35¢ per 
specimen set, any one level; 7¢ per copy, machine- 
scoring edition; 2¢ per answer sheet; nontimed (50) 
minutes; Louis P. Thorpe and Willis W. Clark; Er- 
nest W. Tiegs, consultant; California Test Bureau. 
Q) ELEMENTARY SERIES. Grades 4-8. 
b) INTERMEDIATE SERIES. Grades 7-10. 
¢) SECONDARY SERIES. Grades 9-16. 
d) ADULT SERIES. Adults. 


William E. Coffman, Assistant in Psycho- 
logical Foundations and Educational Research, 
Teachers College, Columbia University, New 
York, New York. With the publication of this 
inventory, the authors offer an instrument which 
avoids some of the more obvious defects of 
their California Test of Personality. The list of 
questions has been enlarged and refined. The 
categories under which responses are classified 
have been reduced from 12 to 10, and the num- 
ber of questions in each has been increased 
from 12 to 20. The questions in each category 
are scattered throughout the inventory so that 
a pupil is less likely to perceive a pattern of 
“correct” responses. A coding system prevents 
him from discovering the exact nature of the 
analysis. The reliability coefficients (Kuder- 
Richardson) of the total score (.954) and of 
the two section scores, mental health liabilities 
(.924) and mental health assets (.906), are 
unusually high for a personality inventory. No 
evidence of reliability is offered with respect to 
separate category scores. 

The prospective user of the inventory should 
give special attention to the list of limitations 
contained in Part 4 of the manual and to the 
suggestions for checking the validity of a pro- 
file given in Part 10. Concrete illustrations of 
these limitations were secured when the re- 
viewer administered the inventory to a fifth 
grade class. The boy judged most maladjusted 
by the teacher made the highest score on the 
inventory. Some children asked the meaning 
of the words contented, traits, and tense. All 
responded “no” to the question, “Do you spend 
more time than you need to on your school 
work?" Evidently the authors intended this 
question as a disguised form of “Do you try 
to get by with as little school work as possible?” 
Several pupils were puzzled by the question, 
“Do you think that what people do is more im- 
portant than who they are?” There appears to 


be an error in the scoring key for Item 12. Any 
interpretation of an individual profile should 
be tentative and should be followed by a search 
for additional evidence. 

The Mental Health Analysis is offered as an 
aid to “teachers, parents, and advisors in ob- 
taining a better understanding of the subtle 
forces which condition and détermine mental 
health.” It appears to have special value as a 
device for calling the attention of teachers to 
the fact that elementary school children have 
such worries, fears, and feelings of inadequacy 
as are someimes revealed by answers to the 
questions contained in the section labeled Zia- 
bilities. This aspect of adjustment is not neces- 
sarily evident from observations of overt be- 
havior in the classroom. A careful study of the 
responses of a class to these items and an at- 
tempt to interpret each pupil’s response should 
provide a body of concrete material contributing 
meaning to a study of mental hygiene prin- 
ciples, 

The limitations of the suggestions for diag- 
nosis and treatment given in Part 9 of the man- 
ual are not clearly indicated. The classroom 
teacher will need careful guidance and extensive 
study if she is to conduct satisfactory follow-up 
remedial work. 


Henry E. Garrett, Professor of Psychology, 
Columbia University, New York, New York. 
This questionnaire consists of 200 items in- 
tended to assay the degree of mental health 
possessed by the examinee. Separate booklets 
have been prepared for four levels of maturity: 
for elementary grades, intermediate grades, 
secondary school-college, and adults. Each of 
the 200 questions at each level is to be an- 
swered by circling YES or NO. 

Five sorts of personality “liabilities” are 
investigated by 100 of the questions. These 
liabilities are described as (a) behavioral im- 
maturity, (b) emotional instability, (c) feel- 
ings of inadequacy, (d) physical defects, and 
(e) nervous manifestations. The other 100 
questions are designed to estimate the exami- 
nee’s personality “assets.” These are designated 
as (a) close personal relationships, (b) inter- 
personal skills, (c) social participation, (d) 
satisfying work and recreation, and (e) ade- 
quate outlook and goals. The items of the 
questionnaire are so distributed that 20 apply 
to each specific classification. 

High reliability is reported for the question- 
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naire, and the authors properly report the prob- 
able errors of their scores. Validity depends 
largely upon a consensus of experts. While 
many of the items are original, the authors 
have drawn upon the large amount of material 
available in other inventories and question- 
naires. The use of queries designed to bring 
out specific attitudes rather than the use of 
questions dealing directly with general atti- 
tudes is to be commended. 

While the various behavior classifications are 
useful practically, no evidence is given to show 
that these groupings are established upon any 
but an empirical or common sense basis. The 
diagnostic profile should be of some value in 
differential analysis, but I doubt if the total 
liability or asset score has any but the vaguest 
meaning. To know that a child answers many 
questions which indicate that he is socially im- 
mature is certainly valuable. But one cannot 
add such a "score" to other subscores and hope 
to determine the child's total personality lia- 
bilities unless he knows the relations among 
the categories being combined. Personality is a 
pattern of traits, not a sum or average of be- 
havioral odds and ends. 


C. M. Louttit, Professor of Psychology and 
Dean of the Undergraduate Division, Univer- 
sity of Illinois, Galesburg, Illinois. The Mental 
Health Analysis is essentially a standard type 
of personality questionnaire in which the items 
are selected upon a more definitely systematic 
basis than is usually the case. Briefly, the in- 
strument aims to reveal mental health liabilities 
(divided into five categories) which should be 
minimized or corrected and mental health assets 
(also divided into five categories) which should 
be sought or amplified. Each of the ten cate- 
gories is carefully described, and percentile 
norms are provided for them. The perform- 
ance may be delineated on a profile chart, and 
the authors have written independent discus- 
sion of clinical interpretation for each of the 
four levels. The reliabilities, based upon data 
from 1,000 or more subjects for each of the 
four forms, are represented by Kuder-Rich- 
ardson formula correlations ranging from .890 
to .967. In respect to the theory upon which 
the questionnaire is based, the care used in 
item selection and formulation, the mechanics 
of administration and scoring, and the statis- 
tical adequacy of norms and reliabilities, these 
instruments are equal or superior to similar in- 
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struments. However, we find the same faults 
here as are found elsewhere when validity is 
considered. 

The authors rest the case for the validity of 
this instrument on a discussion of the ade- 
quacy of the selection of items, of the meaning- 
fulness of their analysis of mental health cate- 
gories, and of the cleverness in disguise of the 
items. However,-no matter how weighty are 
the a priori arguments for validity, they are no 
proof that the instrument" will serve the func- 
tion which is claimed. There are no data, and 
no suggestion that any effort was made to se- 
cure data, which would show whether this in- 
strument would even crudely select from a class 
those children, who by independent criteria 
would be judged of good or poor mental health. 
The extensive discussion of the use of scores 
presents illustrative case problems, but there is 
no indication that such problems were discov- 
ered by the instrument. The criticism of lack 
of validity studies against independent criteria 
cannot be directed against this questionnaire 
alone, but for a publication of 1946 not to have 
acknowledged the problem is almost inexcus- 
able. For use in a clinic, the Mental Health 
Analysis may be useful in supplementing clini- 
cal study; for use in studying groups, it has 
all of the dangerous faults found in earlier 
questionnaires. 


James Maxwell, Principal Lecturer in Psy- 
chology, Moray House, Edinburgh, Scotland. 
The Analysis is constructed on the proposition 
that mental health is divisible into certain “cate- 
gories,” not entirely discrete, which can be clas- 
sified as either Liabilities or Assets. There are 
ten such categories, each being covered by 20 
questions of the Yes-or-No type. 

The accompanying manual contains rather 
wide claims about the value of the Analysis, 
but no objective evidence is offered in support 
either of its practical value or of its validity. 
The main evidence of validity is contained in 
the authors’ claim that the Analysis is valid in 
virtue of the careful selection of items and that 
“the ten categories . . . represent functionally 
related groups of crucial, specific evidences of 
Mental Health assets or liabilities . . ." There 
is no evidence that the Analysis discriminates 
significantly between those whose mental health 
is good and those whose mental health is not. 

Reliabilities for all four series are of the 
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order y = .9 or over, which is satisfactory for 
this type of test. 


The most serious defect of the Analysis, 
however, is its failure to discriminate clearly. 
In the Adult Series, for example, the standard 
deviation is 31.5 for 200 items. For Liabilities 
and Assets (100 items each) S.D.'s are 16.5 
and 15.0, respectively. These values are rather 
low for a test of this length.. 

A conversion table provides percentile norms 

for the various subdivisions of the test. Exami- 
nation of this table confirms one's suspicions 
about the validity and discrimination of the 
Analysis. Throughout the table, scores of up to 
50 per cent “correct” items fall almost entirely 
within the lowest ten percentile ranks, with 
the remaining scores spread thinly over the re- 
maining 90 percentiles. In the Liabilities, for 
instance, the percentile rank of 1 includes all 
scores from o to 39, a range of nearly 10 P.E. 
of an observed score; at other score levels, on 
the other hand, an increase of 13 points of 
score (less than 4 P.E. of observed score) 
represents a movement from the soth to the 
goth percentile rank. In certain of the individ- 
ual categories, also, a difference of 2 points of 
score may cause a shift from the 3oth to the 
79th percentile position. Over most of the table, 
in fact, a very slight variation in score may be 
reflected as a very considerable change in per- 
centile rank. Nor are the other three series of 
the Analysis any better. 

It is not possible to say whether this Analysis 
would function more satisfactorily as a quali- 
tative clinical aid, but it does not on the surface 
appear to be more suitable than the various 
other inventories specifically devised for that 
purpose. 

The test is well printed and well arranged, 
with adequate accessory material. 


Douglas Spencer, Assistant Professor of 
Psychology, Queens College, Flushing, New 
York. This new test is offered as 


an instrument, prepared for various levels of maturity, 
that provides an analysis of mental health liabilities 
and assets. Five types of liabilities to be minimized or 
corrected where present are included: (1) behavioral 
immaturity, (2) emotional instability, (3) feelings of 
inadequacy, (4) „Physical defects, and (5) nervous 
manifestations. Five types of . . . assets to be sought 
or amplified when not sufficiently present are included : 
(1) close personal relationships, (2) interpersonal 
skills, (3) „social participation, (4) satisfying work 
and recreation, and (5) adequate outlook and goals, 


Coming from the same source as the Cali- 
fornia Tests of Personality, this series reveals 
many of the same weaknesses. It is just an- 
other series of self-report, Yes-No response, 
group questionnaires that, however well in- 
tended, can be used to exploit urgent needs in 
education, With no evidence of validity, one 
may doubt the sales promises—despite relia- 
bility coefficients that are very high for such 
tests. Again, the title clearly reveals to subjects 
the purpose of the tests, and administration 
under signatures in school, college, or employ- 
ment situations would seem to encourage con- 
cealment of liabilities and exaggeration of as- 
sets on many inadequately disguised items. As 
to the selection of the 200 items, there is but 
sketchy information and no data. The language 
employed, however, has been commendably 
evaluated by means of the Lewerenz Vocabu- 
lary formula to avoid reading difficulties at 
each level. 

The tests have certain superiorities. The re- 
liabilities (Kuder-Richardson formula) range 
from .93 to .96 and are based upon groups of 
from 980 to 2,075 subjects. Percentile norms 
derive from groups of adequate size: Elemen- 
tary, 1,000 cases, from 9 school districts in 3 
states (unnamed); Intermediate, 2,000 cases 
from 12 districts in California, Colorado, New 
York, and Oregon; Secondary, 2,000 cases from 
14 schools in the same states ; Adult, 1,200 cases 
from 8 communities (unlocated). Background 
information on these groups is not given, but 
it is stated that age and sex made no significant 
differences. Despite the more carefully derived 
norms and the reliabilities high enough for in- 
dividual prediction, one may question the 
claimed adequacy of the tests for “individual 
diagnosis” in the absence of external validation 
and evidence that the components of the Lia- 
bilities and the Assets sections are separate 
and meaningful functions. Intercorrelations of 
the component scores with each other and with 
the total scores are not mentioned. 

It is unfortunate that the superficial assets- 
liabilities approach, long outmoded in social 
case work, should now be deemed worthy of 
group psychometrics. Judged by the treatment, 
suggestions, and examples elaborated in the 
test manuals, a lot of school children—if un- 
wisely frank on these tests—may well be the 
victims of zealous, stereotyped symptom treat- 
ment through the "talk-talk," “give 'em in- 
sight" methods that horrify experienced case 
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workers, even those not devotees of the non- 
directive cult. 


J Consult Psychol 11:156-7 My-Je '47. * 
The virtues of this blank are its emphasis on 
assets as well as shortcomings, and its well- 
worded items, many of which seem sufficiently 
disguised to penetrate superficial ego-involve- 
ments. The deficiencies of the questionnaire are 
numerous. Although the total scores have fair 
reliability (.89 to .97), almost all of the sug- 
gested interpretations use the subtest categories, 
which are of unknown and probably low relia- 
bility. On some categories, a probably insignifi- 
cant difference of 3 score points may change a 
child from the 5oth to the 9oth, or from the 
10th to the soth, percentile. The intercorrela- 
tions of the ro categories are not given, so that 
there are no data by which to judge their in- 
dependence or relationships as factors. More 
serious than these technical considerations is 
the question of the real contribution of such 
questionnaires to individual mental health. 


[ 60 ] 
*Minnesota Multiphasic Personality Inventory, 
Revised Edition. Ages 16 and over; 1942-47; indi- 
vidual and group forms; 9 scales : hypochondriasis, de- 
pression, hysteria, psychopathic deviate, masculinity 
and femininity, paranoia, psychasthenia, schizophrenia, 
hypomania; 4 additional scores: question, lie, validity, 
and K; all prices postpaid; Starke R. Hathaway and 
J. Charnley McKinley ; Psychological Corporation. 
4) INDIVIDUAL FORM (“THE CARD SET"). $20 per testing 
outfit including 50 recording sheets ; $2.50 per 50 record 
sheets; nontimed (30-90) minutes. 
b) croup FORM (“THE BOOKLET FORM"). IBM; 25 per 
test; $1 per manual; $4 per set of scoring stencils 
(specify hand- or machine-scoring) with manual ; $2.75 
per 50 machine-scorable answer sheets including ac- 
companying summary cards; nontimed (40-90) min- 
utes. 
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Arthur L. Benton, Associate Professor of 
Psychology, University of Louisville School of 
Medicine, Louisville, Kentucky. The inventory 
consists of 550 statements which the subject 
indicates as being true or false with respect to 
himself. Provision is made for placing items in 
a doubtful (?) category if the subject is not 
able to make a judgment concerning them. 
Different areas of life experience are covered 
by the items, e.g., somatic experiences, family 
relations, social-political attitudes, sexual atti- 
tudes, mood-tone, beliefs, etc. On the basis of 
the performance of patients in the various psy- 
chiatric diagnostic groupings, scoring scales 
(based on a total of 351 items) have been con- 
structed for the following personality trends or 
structures: (a) hypochondriasis; (b) depres- 
sion; (c) hysteria; (d) psychopathic deviate ; 
(e) masculinity-femininity ; (f) psychasthenia ; 
(g) paranoia; (h) schizophrenia; (i) hypo- 
mania. Three scales designed to check the va- 
lidity of the results are also part of the inven- 
tory: (a) a “lie” scale designed to show the 
extent to which the subject may be falsifying 
his answers to put himself in a socially favor- 
able light, the essential effect of which is to 
tend to reduce the magnitude of abnormal 
scores; (b) an “F” scale, a high score on 
which raises the question as to whether the 
subject actually understands what he is read- 
ing; (c) a “?” scale, derived from the number 
of items placed in the “?” category, a high 
score on which tends to reduce the magnitude 
of abnormal scores. Both individual and group 
forms of the test have been devised. Since the 
reviewer's experience has been restricted to the 
use of the individual form, the remarks which 
follow should be taken to apply only to that 
form of the inventory. 
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The inventory purports to give an accurate 
measure, in terms of a score, of the strength of 
certain trends or components in the personality 
recognized in current psychiatric nosology. It 
was introduced to the medical profession as 
“an easily applicable measuring device which 
would identify and characterize the psycho- 
neurotic patient with a minimum use of the 
time-consuming interview technique that is 
conventional in the psychiatric approach" (8). 
Cases are reported showing how the scores can 
be used in differential diagnosis, high scores 
definitely indicating significant psychiatric devi- 
ations even within the framework of estab- 
lished organic disease and low scores tending 
to exclude the possibility of an emotional eti- 
ology for somatic complaints with a consequent 
focusing of the clinician's attention on an or- 
ganic basis for the complaints. Thus, Schiele, 
Baker and Hathaway (9) state: "a neurotic 
score indicates the presence of a neurotic tem- 
perament, but does not prove the absence of 
organic disease. Conversely...stable persons, 
even though suffering from widespread or- 
ganic disease, score little higher on hypochon- 
driasis or hysteria than do the normal." "The 
negative or normal profile....is a reassurance to 
the clinician, which relieves him in part from 
disturbing concern with psychological factors.” 

It is somewhat surprising, in view of the ex- 
tensive utilization of the test by clinicians, to 
find how little work has been done to assess its 
validity as an analytic-diagnostic device. In this 
respect, it should be kept in mind that the in- 
ventory has been advanced as "the first inven- 
tory measuring common specific clinical syn- 
dromes, in contrast to the earlier schedules 
designed for either the more general concept 
of ‘neuroticism’ or special states like ‘infe- 
riority’” (9). Consequently, studies which re- 
port that the inventory is effective as a screen- 
ing device for “abnormality in general” in mass 
selection situations cannot be considered as 
true assessments of the analytic-diagnostic va- 
lidity of the inventory, Nor would such studies 
appear to have great clinical import in view of 
the demonstrated practical effectiveness for 
screening purposes of simple, abbreviated ques- 
tionnaires. To utilize as elaborate an instru- 
ment as the inventory for screening purposes 
which are achieved as effectively by a twenty- 
item questionnaire is truly to “send a man to 
do a boy’s work.” 

The studies that have been made on the va- 


lidity and clinical usefulness of the inventory 
as an analytic-diagnostic device can be con- 
veniently considered under two categories: (a) 
Studies assessing the validity of the scores de- 
rived from the separate scales; (b) Studies as- 
sessing the validity of "patterns" or "profiles" 
of scores. 

Studies of the first type, i.e., evaluations of 
the validity of single scale scores by comparing 
such scores with established diagnoses, have 
been reported by Leverenz (7), Benton and 
Probst (17, 34), Michael and Buhler (29), and 
Morris (67). Leverenz found that in a group 
of normal soldiers, the median scores on all 
scales were within normal limits. Of 105 pa- 
tients in surgical wards, only two showed 
"abnormal profiles." In a group of patients 
diagnosed as “psychoneurosis, hypochondriacal 
type," 96 per cent had T-scores greater than 
70 on the hypochondriasis scale. In a group of 
patients clinically judged to be “depressed,” 93 
per cent made T-scores greater than 70 on the 
depression scale. It is stated, without citation 
of data, that similar results were found for 
patients judged to be cases of hysteria, psy- 
chasthenia, paranoid sensitivity, and psycho- 
pathic personality. It is further stated that 
“subjects with abnormal homosexual behavior 
invariably received an elevated score on the 
masculinity-feminity scale.” A high proportion 
of a group of patients in whom the diagnosis 
of peptic ulcer was established made high scores 
on the “neurotic” scales. Leverenz concludes 
that the inventory is a highly valid analytic- 
diagnostic instrument of great clinical useful- 
ness. In contrast to this favorable report of 
Leverenz are the findings of Benton, Probst, 
Michael, Buhler and Morris. Benton, working 
with small groups of selected patients in whom 
the diagnoses were established beyond reason- 
able doubt, secured the following results: (a) 
only about one-half of a group of hysterical 
patients made high scores on the hysteria scale ; 
(b) only about one-half of a group of schizo- 
phrenics made high scores on the schizoprenia 
scale; (c) about four-fifths of a group of psy- 
chopathic delinquents made high scores on the 
psychopathic deviate scale; (d) virtually all of 
a group of confessed male homosexuals made 
high scores on the femininity scale. However, 
when these individuals were instructed to at- 
tempt to conceal the fact of their homosexual- 
ity, a majority of them were able to make nor- 
mal scores on the femininity scale; (e) only a 
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very small proportion of a group of patients 
suffering from diabetes mellitus made high 
scores on the “neurotic” scales; in contrast, a 
majority of a group of patients suffering from 
cardiovascular disease made high scores on the 
“neurotic” scales. Benton concludes that “in its 
present stage of development the Minnesota 
Inventory should not be regarded as a practical 
clinical test, the results of which can be accep- 
ted at face value by the practicing psychiatrist 
and the internist.” Benton and Probst, compar- 
ing the ratings by psychiatrists of their patients 
with respect to the personality trends investi- 
gated by the Minnesota and the test scores them- 
selves, found a significant degree of agreement 
between the psychiatric ratings and the test 
scores in respect to the psychopathic deviate, 
paranoia and schizophrenia scales. On the other 
hand there was no significant degree of agree- 
ment in respect to the other scales. Here, as in 
the earlier study of Benton, the over-all va- 
lidity of inventory scores is called into serious 
question. 

Michael and Buhler (29) found in go cases 
diagnosed in one of three broad categories 
(psychoneurosis, psychopathy, psychosis) that 
the Minnesota results agreed with the clinical 
diagnosis in only 45 per cent of the cases. They 
state that “the main problem which we found 
in the use of the Minnesota Multiphasic was 
that there were many unclear results in which 
the interpretation of the curve seemed doubt- 
ful.” 

Morris, comparing the mean scores of pa- 
tients in various diagnostic categories (psycho- 
neurosis, schizoid personality, constitutional 
psychopathic state-emotional instability) with 
those of a normal control group, found that the 
inventory readily distinguished the normal con- 
trols from the pathological groups. However, 
the mean score profiles of the abnormal groups 
were practically indistinguishable from each 
other, all having high hypochondriasis, depres- 
sion and hysteria scores and all having femi- 
ninity and hypomania scores within broad nor- 
mal limits. The schizoid personalities had high 
schizophrenia scores but so did the emotionally 
unstable psychopaths. The anxious psychoneu- 
rotics had high psychasthenia scores but so did 
the schizoid personalities and the emotionally 
unstable psychopaths. The point is made that 
the emotionally unstable psychopaths scored 
within normal limits on the psychopathic deviate 
scale but it can be justly argued that this group 


is not an adequate validating group for the psy- 
chopathic deviate scale which purports to assess 
delinquent and antisocial trends not “emotional 
instability.” Morris concludes that the inven- 
tory "does differentiate borderline normals 
from serious pathological states but does not 
aid in the differential diagnosis among the 
pathological groups.” im 

Studies of the second type, i.e., examination 
of the clinical validity of “profiles” or “pat- 
terns" of scores on the test have been reported 
by Schmidt (30), Gough (40) and Meehl (47). 
Schmidt's data, presented in terms of group 
means, show clearly that groups of patients in 
various psychiatric categories can be readily 
distinguished from a group of normal individ- 
uals. However, the situation is quite different 
in respect to comparisons between the abnor- 
mal groups (mild psychoneurosis, severe psy- 
choneurosis, constitutional inadequacy, sexual 
psychopathy, psychosis) where relatively few 
statistically significant differences emerge. For 
example, even when dealing with group aver- 
ages, there is not a single statistically significant 
difference in any scale score between the con- 
stitutionally inadequate personalities and the 
psychotics, between the constitutionally inade- 
quate personalities and the severe psychoneu- 
rotics or between the severe psychoneurotics and 
the psychotics. Statistically significant differ- 
ences do appear if the disorders are classified 
according to degree of severity, e.g., between 
mild and severe psychoneurosis and between 
psychosis and mild psychoneurosis. Schmidt de- 
scribes what he feels to be characteristic differ- 
ences in the profiles of the different groups. 
These differences are not statistically significant 
and would seem to have negligible clinical value 
in their application for classification purposes 
to an individual case. 

Gough's paper, which includes a penetrating 
discussion of the sagacious use of the inven- 
tory, presents data purporting to show that 
psychoneurotic, psychopathic and psychotic 
subjects can be distinguished from each other 
on the basis of the relative predominance of 
the scores on the neurotic scales, psychopathic 
deviate scale and the psychotic scales. The re- 
viewer's examination of these data fails to dis- 
close the alleged discriminatory value of the 
score profiles. Particularly disturbing in this 
respect is the fact that the score profiles of 
severe psychoneurotics and psychotics appear 
to be practically identical. Gough's data, like 
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Schmidt’s, do show a correlation between mag- 
nitude of scores in general and severity of dis- 
order in general, but the findings on score 
profiles seem to lack any real qualitative differ- 
ential-diagnostic significance, 

Meehl evaluated the effectiveness of a rapid 
objective analysis of inventory scores by com- 
paring the broad diagnostic category derived 
from such analyses with the established broad 
diagnostic category (psychosis, neurosis, con- 
duct disorder). About two-thirds of the cases 
were placed in the correct category. It is con- 
cluded that “while the discriminations achieved 
are very much better than chance in the statis- 
tical sense....the proportion of false classifica- 
tions is considerable.” 

Tt must be concluded that the claims of the 
authors of the Minnesota Multiphasic Person- 
ality Inventory in respect to the face validity 
of the inventory scores have not stood the test 
of critical examination. Recent publications of 
the Minnesota group contain many critical and 
incisive observations concerning the proper use 
of the inventory and indicate that they are well 
aware of the shortcomings of a naive use of 
the test scores. The place for these observa- 
tions would seem to be in the manual and in 
general articles in medical journals as well as 
in special articles in psychological journals. 

As a detailed self-evaluation by the patient, 
the inventory has considerable potential use- 
fulness both as a research instrument and a 
practical clinical aid. Continued experience with 
it suggests many values to the soundly trained 
clinical psychologist and appears to yield data 
of real clinical import. In utilizing the inven- 
tory in clinical situations, it would seem that 
the psychologist should report his conclusions 
and clinical judgments, supported by citation 
of the empirical data, rather than a mere score 
profile which, one can be sure, will be assessed 
at the face value it does not possess. But this 
potential usefulness of the inventory, it must 
be said, has been overshadowed to date by its 
gross misuse, which has tended to weaken 
clinical psychology in the eyes of critical psy- 
chiatrists and internists (who quickly learn 
that the inventory’s performances fall far short 
of its claims) and which has made a “psycho- 
diagnostician” of anyone who can score the test. 


H. J. Eysenck, Senior Psychologist, Mauds- 
ley Hospital, London, England. This scale rep- 
resents a further development of the old neuroti- 


cism-cum-introversion inventory along the two 
directions indicated first by Humm and Wads- 
worth: In the first place, there is not one 
over-all score, but a number of diagnostic in- 
dices are derived from the subject’s answers, 
and in the second place, attempts are made to 
gain some information on the accuracy with 
which the questions have been answered. 

The 550 questions included in this inventory 
are available either in booklet form or printed 
on cards for sorting into the usual “Yes,” “?,” 
“No” categories. From the answers, a profile 
chart is constructed which shows the deviation 
from the standardization group on the follow- 
ing scales: Hypochondriasis, Depression, Hyste- 
ria, Psychopathic Deviate, Masculine-Feminine 
Interests, Paranoia, Psychasthenia, Schizo- 
phrenia, and Hypomania. Items defining these 
various scales were derived from clinically 
diagnosed samples of patients exemplifying 
these various disorders. 

Additional scores are provided on four scales : 
the Question score, the Lie score, the Validity 
score, and the K score. The Question score 
simply records the number of “?” answers; à 
high score is taken to mean that the true scores 
on the diagnostic categories would in all prob- 
ability be further away from the mean than 
they are. The Lie score is made up from the 
answers to a number of questions which make 
the subject appear in a slightly unfavourable 
light ; subjects claiming the favourable alterna- 
tive in each case are presumed to have falsified 
their diagnostic scales also in the direction of 
greater favourableness. The Validity scale con- 
sists of items, which are infrequently answered 
in the scored direction by either normal or ab- 
normal subjects; high scores, with certain ex- 
ceptions, indicate that great care is necessary 
in interpreting the record. The K scale acts as 
a suppressor variable and is claimed to sharpen 
the discriminatory power of the diagnostic 
scales. 

The good points of this inventory are as 
follows: (a) The standardization has been done 
very carefully and conscientiously, (b) The 
statistical work is of a high quality and leaves 
little to be desired. (c) The four correction 
scores (Question, Lie, Validity, K) are a dis- 
tinct improvement on the corresponding scale 
in the Humm-Wadsworth inventory (the No- 
count). (d) The whole scale is an "open" one; 
ie., new diagnostic scoring categories can be 
added without necessitating a new set of ques- 
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tions. (e) The inventory is analytic and gets 
away from the simplicity of the neuroticism 
questionnaire. (f) Scoring is less laborious 
than on the Humm-Wadsworth scale and can 
be done in a relatively short time. (g) The 
method of having the subject sort cards, in- 
stead of fill in a questionnaire, is a distinct 
improvement with abnormal subjects. 

Drawbacks are as follows: (a) The diag- 
nostic groups are based on psychiatric classifi- 
fication, which is notoriously inaccurate and 
disputable. (b) Nothing is known about the 
intercorrelations of the diagnostic scales. (c) 
Reliability of the scales is not very high. (d) 
The number of items is excessive; the authors 
claim that the time taken for filling in the inven- 
tory is commonly as short as thirty minutes, but 
in our experience normal, and particularly ab- 
normal, subjects take well over one hour, and 
at times as much as five hours. 

Of its kind, this is probably the best inven- 
tory in existence; in the clinical situation, it is 
definitely preferable to the Humm- Wadsworth. 
Tn the industrial situation, however, I think the 
Humm-Wadsworth would probably have the 
stronger claims on the basis of better valida- 
tion. Both these scales should ideally be based 
on factorial analyses, like the Guilford scales, 
but even in the absence of such analysis they 
have a definite contribution to make, 


L. S. Penrose, Galton Professor of Eugenics, 
University College, London, England. The test 
material consists of a set of questions in the 
form of statements, to which the subject is ex- 
posed and required to determine if they are 
true or false. There are in all 566 statements 
and they are designed to give information about 
personality traits. The following examples in- 
dicate the range of reactions investigated : 


I enjoy detective mystery stories. 

I have a cough most of the time. 

Iam worried about sex matters. 

Someone has been trying to influence my mind. 
Policemen are usually honest. 

I have no fear of spiders. 


The questions can be presented either on a 
series of cards or in a booklet. If cards are 
used, the subject is asked to sort them into 
three piles according to whether he considers 
the statements printed on them true or false as 
applied to himself or whether he is unable to 
answer. Blank forms are supplied, on which a 
subject working from the booklet can record 


his decisions. Scoring is made as mechanical as 
possible and suitable for routine testing. Though 
the mechanizations makes the test remarkably 
easy to use, it conceals the force of individual 
items so well that it is difficult for the critical 
observer to appreciate how the result is ob- 
tained and, hence, to evaluate it. 

The construction of the inventory is based 
on principles made well known by the work of 
Woodworth. The method of presentation is 
objective, in that the examiner need not enter 
into the picture as he must in any type of 
standardized interview. On the other hand, the 
fine points of clinical observation, which enable 
an experienced examiner to gain information 
about a subject at a personal interview, are 
eliminated also. The wording of the questions 
in the inventory is, on the whole, straightfor- 
ward and independent of educational back- 
ground but at times perhaps too colloquial. A 
few trivial alterations might have to be made if 
the test were to be widely used in the British 
Isles. 

The inventory has the ambitious object of 
measuring specific clinical syndromes and not 
merely of deciding whether or not the subject 
is neurotic, as has been the case with some 
earlier schedules. Accordingly, the results of 
choices are scored on a number of different 
(though not necessarily independent) scales 
which purport to give quantitative measure- 
ments of hypochondriasis, depression, hysteria, 
psychopathy, paranoia, psychasthenia, mascu- 
linity or femininity, schizophrenia and hypo- 
mania, An ingenious device called ‘validity 
score” tends to show whether the subject is 
taking the test seriously and honestly giving 
his opinion or not. The final result is put in the 
form of a profile with standard scores which 
make the deviation from normality in each 
scale comparable. A standard score of 50 is 
average and every IO points above or below 
represents one unit of standard deviation in 
the general population. Thus, standard scores 
above 70 are considered to indicate presence of 
abnormal amounts of the properties in ques- 
tion though, since the distributions of scores of 
some scales are far from being Gaussian, this 
criterion is questionable. 

Much work has been done by experimenting 
with the test on various types of abnormal sub- 
jects and inevitably the scoring on various 
items has had to be altered from time to time 
as new information became available, The ex- 
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perimental results, however, indicate that some 
scales are much less reliable than others. In all 
cases there is, of course, an overlap between 
scores of known normal and known abnormal 
subjects and when the degree of overlapping is 
great, the test is less useful than when it is 
small, The best results appear to have been ob- 
tained on the scale of depression, where the 
overlap is such that only 12 per cent of cases 
would be misclassified, if the inventory were 
used to diagnose presence or absence of a de- 
pressive state (8). On the other hand, a state 
of acute depression is, from the psychiatric as 
well as from the common sense point of view, 
easy to appreciate and it is not clear whether 
the inventory will detect potential depression in 
a person who, for the time being, is reacting 
normally. The hypochondriacal scale seems to 
be also quite good in differentiating normal 
from abnormal subjects. On the other hand, 
the psychasthenic scale is almost useless, since 
diagnosis by this scale would classify one out 
of three symptomatic cases wrongly (5). Some 
other scales are of doubtful value. It is not to 
be expected that systems of scoring, stand- 
ardized on the basis of a limited number of 
typical cases, will necessarily continue to be 
strongly discriminative when applied to fresh 
unknown populations, 

The inventory will probably be widely used 
because of its attractive methods of presenta- 
tion and scoring. In making "blind" diagnoses, 
its findings must be treated with reserve and 
laymen or psychologists without clinical ex- 
perience might be encouraged by its use to 
make quite unwarranted inferences in individ- 
ual cases. On the other hand it is claimed that, 
since the clinical impression obtained from 
psychiatric investigation is not always corrobo- 
rated by the scores obtained on the inventory, 
this new instrument is capable of bringing to 
light hitherto unsuspected abnormal personal- 
ity changes. 


Julian B. Rotter, Associate Professor of 
Psychology, The Ohio State University, Colum- 
bus, Ohio. The Minnesota Multiphasic Per- 
sonality Inventory is an attempt to provide in a 
single scale an objective appraisal of many of 
the major personality characteristics of interest 
to individuals working in the area of person- 
ality abnormality. As an inventory-type test, it 
has an advantage over other inventories in that 
it attempts to measure the validity of the test 


for the particular individual with three differ- 
ent measures. One of these is the question 
score, which is dependent upon the number of 
items categorized as questionable by the sub- 
ject. A second, the Lie score, attempts to meas- 
ure falsification by the type of answer to crucial 
questions; and the third, the validity score, a 
measure of pertinency, rationality, and consist- 
ency, attempts to get at the subject’s response 
to the whole testing situation. This attack on 
some of the usual criticisms of personality in- 
ventories is all to the good and is not lessened 
in value by the fact that some of these meas- 
ures are related to the specific characteristics 
being studied by the test as a whole. The per- 
sonality scales are Hypochondriasis, Hysteria, 
Psychopathic Deviate, Interest (masculinity- 
femininity), Paranoia, Psychasthenia, Schizo- 
phrenia, Depression, and Hypomania. The 
presence of several scales standardized on simi- 
lar populations allows for studies of profiles 
and relationships rather than of unrelated en- 
tities. 

Test-retest reliabilities for the separate scales 
are reported as ranging from .71 to .83. Va- 
lidity measures are for single scales and are 
usually in terms of overlap between patients of 
a given nosology, unselected patients, and nor- 
mals. Satisfactory differentiation is reported in 
most instances, In selecting subjects the impor- 
tant and extremely difficult factors such as 
severity and duration of illness and length of 
hospitalization are frequently neglected, but 
many controls which are not present in most 
personality-inventory validation groups, such 
as cultural factors, patient status, and socio- 
economic level, are made. The number of cases 
for validation and cross-validation of some of 
the specific scales is still somewhat small, A 
new scale, the K scale, which attempts to act as 
a correction factor to increase the validity of 
several of the personality scales, has been pub- 
lished. 

The test as used individually requires the 
subject to sort cards, The test may be scored 
by machine or by hand. The time for adminis- 
tration is usually from forty-five minutes to 
two hours and does not require supervision ex- 
¢ept as supervision of some psychotic patients 
is always necessary. Five hundred and fifty 
items are included. 

In appraising the value of the test as a clinical 
instrument, stress should be placed on the fact 
that scoring standards were developed by de- 
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For item analysis, 25 high-scoring men and 
women and 25 low-scoring men and women 
were selected on the basis of their scores on 
each of the five parts of the test. Critical ratios 
showing the ability of each item to discriminate 
between the superior and inferior groups were 
computed. In two successive experiments with 
different groups of subjects the items with low 
critical ratios were eliminated, until only very 
few were left that did not have critical ratios 
of at least 3.0 for the men and for the women. 
This standard of internal consistency is high 
for tests of this type, although it seems sur- 
prising that a test of emotional stability should 
contain such items as “Do you have....athlete’s 
foot?" and "Do your teeth seem to need dental 
attention ?" 

Two other characteristics of the test were 
selected on the basis of experimental evidence: 
(a) Five choices instead of the more usual 

“three are used in spite of increasing the diffi- 
culty of scoring because reliability was shown 

-to be appreciably higher with five choices than 
with three. (b) The items were written in the 
second person, like most items of this kind, in- 
stead of in the third person because the second 
person was preferred by 80 per cent of the 
subjects and yields the same reliability, though 
‘it also yields a somewhat more favorable mean 
score than was obtained by using items in the 
third person. 

It must be borne in mind that personality in- 
ventories of this type are generally unsuitable 
for selective purposes when the subjects can 
gain by not answering the items frankly, Little 
effort has been made to disguise the purpose of 
this test. Satisfactory results can be obtained 
only in counseling or experimental situations 
where the subjects either are indifferent to the 
test results or desire an honest diagnosis of 
their personality. 

[ 62 ] 
*Minnesota T-S-E Inventory. Grades 10-16 and 
adults ; 1942; preliminary manual, 1942; $2.15 per 25; 
35¢ per set of scoring stencils; 50¢ per specimen set; 


nontimed (25) minutes; Catharine Evans and T. R. 
McConnell; Science Research Associates. 
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is ree, Enea, Soy renes. Unpublished. mae 
ters thesis, Universi erwe. “Social Adjustment and Interest 
Scores of Introverts and Extroverts." Ed & Psychol Meas 7: 
157-67 sp '47. * (PA 22:626) 

Philip Eisenberg, Research Psychologist, Co- 
lumbia Broadcasting System, New York, New 
York. This test was constructed to measure 
three types of introversion-extroversion : think- 
ing, social, and emotional, which Guilford had 
isolated in a factor analysis of available I-E 
tests. They bear some relationship to, but are 
not the same as, Jung's types. Jung portrayed 
typologies of entire personality structures, 
while these tests measure only overt interests 
and actions. The overt interests may reflect 
deeper, underlying motives and needs, but this 
aspect of personality is not touched upon in the 
paper-and-pencil test items nor in the construc- 
tion and validation of the test. 

The separateness of the three types of I-E 
was demonstrated beyond Guilford's analysis 
by low intercorrelations between the three tests. 
The split-half and repeat reliabilities of each 
test lie within the high .80’s, which according 
to the authors “seem to be sufficiently reliable 
for individual prediction.” It seems to this re- 
viewer, that a little better than 50 per cent 
prediction above chance is adequate for group, 
but not for individual, prediction. 

The validation of the test has been mostly 
that of differentiation between groups with ap- 
parently different interests. Thus, physical 
education majors are more often thinking ex- 
troverts than English, art, mathematics and 
social studies majors. Sorority and fraternity 
members are more likely to be socially extro- 
verted than students who do not belong to such 
groups. For the most part, college women tend 
to be more emotionally extroverted than col- 
lege men. Also life insurance salesmen were 
found to be more extroverted on each of the 
three scales than men in general. There seems 
to be some differences in I-E between success- 
ful and unsuccessful air-line stewardesses. 
Thinking extroverts tend to prefer naturalistic 
pictures, and thinking introverts non-naturalis- 
tic pictorial representation. Some relationships 
have been found between scores on the Minne- 
sota T-S-E Inventory and the Minnesota Per- 
sonality Scale and the Kuder Preference 
Record. 

These relationships suggest the usefulness of 
this test in situations where interests may be 
important as in choice of college majór and in 
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vocational guidance. However, such uses for 
this test have not yet been demonstrated by re- 
search. Beyond measurement of interests, this 
test seems to be limited as a diagnostic instru- 
ment, It is further limited by the fact that it 
can only be used with senior high school and 
college students. 


John W. French, Research Associate, Col- 
lege Entrance Examination Board, Princeton, 
New Jersey. The test was constructed by gath- 
ering items about activities which were thought 
to discriminate introverts from extroverts in 
three fields of measurement: thinking, social, 
and emotional. Classification of the items into 
these three categories was done by judges. 
These three aspects of the introversion-extro- 
version problem are probably well chosen, since 
a factor analysis by Guilford brought out the 
factors “thinking introversion,” “shyness,” and 
“depression” which are similar to the three 
used here, 

The manual gives an impressive table of va- 
lidities, In each of the three fields groups of 
people having contrasting age, sex, or occupa- 
tion are shown to differ significantly on the 
introversion-extroversion scale in the expected 
direction. However, the test uses direct ques- 
tions about the interests and activities of groups 
of people similar to those used in the validity 
studies. For example, the tables show that the 
test will discriminate physical education majors 
from English majors. This is done by such 
questions as “Do you enjoy watching football, 
‘basketball, or baseball games?” and “Do you 
enjoy the plays of Henrik Ibsen?” Since the 
activities of English majors are associated with 
introversion, their activities are keyed for in- 
troversion in the test. Then the test is claimed 
to be valid, because it finds English majors to 
be an introverted group. This reasoning is dan- 
gerous unless it is known that English majors 
actually are an introverted group. 

It must be borne in mind that personality in- 
ventories of this type are generally unsuitable 
for selective purposes when the subjects can 
gain by not answering the items frankly. No 
effort has been made to disguise the purpose of 
the test. What value the test may have can be 
realized only in counseling or experimental sit- 
uations where the subjects either are indiffer- 
ent to the test results or desire an honest 
diagnosis of their personality. 


[63] a 
*Ohio Guidance Tests for Elementary Grades. 
Grades 4-6; 1944-46; 5 parts; 1 form; 25¢ per manual ; 
38€ per specimen set; nontimed; manual prepared 
under the direction of Ohio Scholarship Tests and thc 
Division of Elementary Supervision of the Ohio State 
Department of Education by Wellington G. Fordyce, 
Wilbur A. Yauch, and Louis Raths; tests developed by 
the Euclid, Ohio, elementary teachers in cooperation 
with the College of Education, Ohio State University ; 
Ohio Scholarship Tests, Ohio State Department of 


Education. 

4) OHIO INTEREST INVENTORY FOR THE INTERMEDIATE 
GRADES. Form G-1 ; 2¢ per test; nontimed (60) minutes, 
b) OHIO INDIVIDUAL SUMMARY SHEET FOR COMMITTEE 


SELECTIONS. Form G-2; 2¢ per sheet. 
C) OHIO SOCIAL ACCEPTANCE SCALE FOR THE INTERMEDI- 


ATE GRADES. Form G-3; 2¢ per scale. 

d) OHIO RECOGNITION SCALE FOR INTERMEDIATE GRADES : 
WHO'S WHO IN MY GROUP? Forms G-4-A, G-4-B (both 
forms should be used) ; 4¢ per set; 1¢ per individual 


summary sheet. 
€) OHIO THINKING CHECKUP FOR INTERMEDIATE GRADES. 


Form G-5; 2¢ per test. 


REFERENCES 
1. Foroyce, WzLLINGTON G, "Teachers Can Build a Test." 
Ed Res B 22:62-5 Mr 17 
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3; p Louis, "Ídentifying the Social Acceptance of 
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vp. “The Committee Idea in Test Construc- 
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Li 

6. fares, Louis, AND SCHWEICKART, E. F. “Social Ac- 
ceptance Within Interracial School Groups. Ed Res B 25: 
85-90 Ap 10 '46. * (PA 21:547) 

7. Ratus, Lovis. “Evidence Relating to the Validity of the . 
Social Acceptance Test." Ed Res B 26:141-64- S 17 '47. * 

8. Youxc, L. L, “Sociometric and Related Techniques fo! 
Appraising Social Status in an Elementary School.” Sociometr: 
10:168-77 My '47. * 

M. H. Elliott, Director of Research, Public 
Schools, Oakland, California. The Ohio Guid- 
ance Tests represent a desirable movement in 
the direction of developing instruments dealing 
with social and personal competence at the ele- 
mentary school level. As such they constitute a 
break with the tradition of tests of academic 
skills and information. 

The Ohio Interest Inventory contains 360 
items (“To lead a class, discussion,” “To use a 
hammer and saw”) to which children are to re- 
act by indicating a like or dislike. The items 
are classified into eighteen interest areas, such 
as: sports, school, helping others, music, 
movies, etc. This inventory bears an obvious 
relationship to the various vocational interest 
inventories in use at the secondary school level. 
The interest items and the vocabulary seem to 
be suitable for the ages for which it is intended. 

The Ohio Individual Summary Sheet for 
Committee Selections deals with a sociometric 
technique. The class is asked to vote for mem- 
bers of a committee. The teacher uses the indi- 
vidual summary sheet to record who voted for 


each child and for whom he voted. 
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The Ohio Social Acceptance Scale consists 
of a six-point rating scale ranging as indicated 
by the following abbreviated descriptions from: 
“1. I would like to have this person as one of 
my very best friends,” to “6. I do not like to 
work with this person and would rather not 
talk to this person.” Each child uses this scale 
to rate every other member of the class. 

The Ohio Recognition Scale—W ho's Who 
in My Group contains 18 paragraph descrip- 
tions of behavior and personal characteristics. 
The children are asked to indicate under each 
paragraph the names of any members of the 
class to whom the description applies. 

The Ohio Thinking Checkup, more nearly 
than the others, conforms to the usual concep- 
tion of a “test.” Essentially it is a test of inter- 
pretation of data and indicates tendencies to be 
overly cautious, to go beyond the data in draw- 
ing conclusions, etc. 

The tests described above are developments 
or modifications of various instruments which 
have been rather widely used in research 
studies. Some of the original instruments were 
developed or utilized by Dr. Raths and numer- 
ous collaborators in the Eight-Year Study at 
the secondary level. Others are derived from 
the various developments of the *Guess Who" 
technique, and one stems from the sociometric 
work of Moreno. 

The first four tests (excluding the Thinking 
Checkup for the moment) have their greatest 
usefulness in stimulating teachers to study 
children as individuals. No teacher can use 
instruments such as these without becoming 
more aware of differences. Furthermore, data 
of the kinds obtained will both challenge the 
teacher and suggest practical procedures to be 
followed in the classroom. Teachers are not 
always aware of the ramifications of the “peer 
culture" of the children. The natural tendency 
is to see children in terms of adult standards. 
These instruments reveal in various ways the 
children's judgments of each other. 

The only statistical material furnished to the 
reviewer was a percentile table for scores on 
the Social Acceptance Scale. The number of 
children represented and their grade level was 
not indicated. However, on the first four of 
these tests wide-scale norms seem to the re- 
viewer to be of relatively little importance at 
present. These four tests are of most signifi- 
cance for the training and sensitizing of teach- 
ers to the finer shades of social and personal 


adjustment. For most purposes it is sufficient 
for the teacher to study and understand the 
range of differences within her own room. 
Furthermore, general norms would have little 
meaning. The scores on some of these tests 
would vary tremendously in a particular class- 
room with such factors as size of class, length 
of time which the class had been together as a 
unit, etc. The advent of a new and particularly 
attractive child could affect the scores obtained 
by a number of other children. 

The Thinking Checkup belongs in a some- 
what different category from the other four 
tests although it will in part reflect “personal- 
ity” differences. On this particular test some 
sort of norms for different age groups might 
be interesting. This instrument would also fit 
into the orthodox pattern of determining reli- 
ability and validity more easily than the others. 

These Ohio Guidance Tests should be con- 
sidered as interesting instruments for teacher 
training and in-service training programs. Their 
use in this connection can be very stimulating. 
They are still in the experimental stage. The 
average teacher would not be able to make full 
use of them without much help and guidance. 

One minor and probably unwarranted criti- 
cism of the presentation of this series is the 
possible implication that a teacher interested in 
guidance of her pupils should make use of the 
whole series, An interested teacher might prof- 
itably use the results of any one of the tests. 
However, the reviewer doubts that any of the 
busy elementary school teachers of his acquaint- 
ance could work out, comprehend, and fully 
utilize the results of the whole series of tests 
in one school term unless she had an assistant 
to carry on classroom instruction. 


John W. M. Rothney, Associate Professor of 
Education, The University of Wisconsin, Mad- 
ison, Wisconsin. By requiring children to mark 
as liked or disliked the 360 items of an interest 
inventory; by selecting committee members 
from among their classmates ; by choosing chil- 
dren who fit certain classifications of accept- 
ability; by naming pupils who fit descriptions 
of personality; and by interpreting paragraphs 
which reveal attitudes, the Ohio Guidance 
Tests attempt to provide an appraisal of social 
adjustment of intermediate school children. 
Suggestions for treatment of children whose 
characteristics are thus appraised are presented 
in the manual, 
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The tests provide good coverage of impor- 
tant characteristics. They do not attempt to 
"compare achievement of an individual with an 
established norm," but to provide information 
for his guidance. Percentile norms are, how- 
ever, provided for the social acceptance scale. 
To this reviewer, it is unfortunate that this 
comparative aspect has been included since it 
invites possible misuse of the tests. The instru- 
ments would have been better named if the 
word "test" had been left out of the title. 

'The items of the scales have been carefully 
selected and the selection of vocabulary has 
been well done. Requiring the child to respond 
to 360 items on the interest inventory may re- 
sult in the forcing of statement of preference 
where none really exists. While voting for 
committee members, the child is not given the 
opportunity to use a secret ballot and his choice 
may be influenced accordingly. The authors ex- 
press proper doubt concerning the ability of 
children to make the discriminations required 
by the social acceptance scale, but use of the 
test may help to resolve the doubt. 

The teachers who helped to build this scale 
should be able to use it effectively and those 
who know the field of child psychology will 
appreciate its value. To others, it will seem to 
demand a lot of extra work, and statements 
such as "the teacher will do this during recess 
or noon hour, or after school" will not make it 
seem less forbidding. The construction of these 
tests has shown what can be done by a concen- 
trated and cooperative effort of a group of 
teachers and experts in the field of child study. 
Their work can be used very effectively as an 
example of what other school systems could do 
if they would attempt similar tasks. The proc- 
ess of development of such instruments will do 
more to make a group of teachers pupil-minded 
than the application of ready-made techniques. 
It is hoped that others will follow the example 
set by the Ohio group. 


[64] 

XPersonal Audit. Grades 9-16 and adults; 1941-46; 
Forms SS (short form), LL (long form) ; Clifford R. 
Adams and William M. Lepley; Science Rescarch 
Associates. 

a) FORM ss. $2.65 per 25; 50¢ per specimen set; non- 
timed (35) minutes. 

b) For LL. $3.65 per 25; 60 per specimen set; non- 
timed (45) minutes. 
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Percival M. Symonds, Professor of Educa- 
tion, Teachers College, Columbia University, 
New York, New York. This is a self-adminis- 
tering personality test coming in an eleven-page 
folder. The test consists of nine separate ques- 
tionnaires, each section containing 50 items. 
According to the manual, the Personal Audit 
measures the following personality character- 
istics: Part I, Seriousness-Impulsiveness ; Part 
II, Firmness-Indecision; Part III, Tranquil- 
lity-Irritability; Part IV, Frankness-Evasion ; 
Part V, Stability-Instability; Part VI, Toler- 
ance-Intolerance; Part VII, Steadiness-Emo- 
tionality; Part VIII, Persistence-Fluctuation ; 
Part IX, Contentment-Worry. Reliability for 
any part is stated to be .90 or above, based on 
the scores of 442 college students. Norms are 
given for high school students, college fresh- 
men, general college, business and industrial, 
and general adult subjects. 

It is the impression of this reviewer that 
claims are made for this instrument which go 
beyond the data which the authors present or 
have at their disposal. For instance, in Part T 
high scores are said to indicate “a serious dis- 
position characterized by quietness, ambition, 
and studiousness. Usually interests are special- 
ized and stable resembling those of successful 
draftsmen, engineers, mathematicians, and 
chemists. Cautiousness is often present to some 
degree. Socializing tends to be confined to close 
friends. Agreeableness and cooperation may be 
present.” In other words, one score is claimed 
to measure quietness, ambition, studiousness, 
cautiousness, agreeableness, and cooperation. 
The authors describe patterns of scores for the 


nine parts of individuals who are stubborn, de- 
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linquent, cooperative, agreeable, realistic, im- 
pulsive, happily married, but do not present the 
evidence on which these patterns are based. 

The manual presents extensive validity data 
based on correlations of the Personal Audit 
with other tests and questionnaires. In general 
the correlations are low. Out of all those pre- 
sented, few are .3 or above, indicating that the 
Personal Audit on the basis of the evidence 
presented is not related to other recognized per- 
sonality scales. In order to validate the names 
assigned to the nine parts and the descriptions 
of what high and low scores for each part indi- 
cate. "Thirty psychologists, including twelve 
who had used the instrument, were asked to 
state what each part measured. The consensus 
of their judgments coincides with the descrip- 
tions which have been prepared for the various 
traits." This statement is extremely loose. It is 
impossible to believe that their free statements 
of what each part measures could coincide ex- 
actly with the descriptions provided for the 
scale by the authors. However, the degree of 
discrepancy is not stated in the manual. 

A number of the parts represent new collec- 
tions of items. For instance, Part III is a list 
of annoyances ; Part V, a list of common fears; 
Part VI, a list of persons whom one might dis- 
like; Part VIII, current social phenomena 
towards which the subject is asked to state 
changes in his feelings. 

This reviewer is not convinced that the Per- 
sonal Audit provides measures which corre- 
spond to the titles given to the separate parts. 


[ 65] 
*Personality Index. Adults; 1945; 1 form; $3 per 
25; $1 per specimen set ; nontimed (25) minutes; How- 
ard K. Morgan; La Rue Printing Co., 906 Baltimore 
Ave., Kansas City 6, Missouri. 


Benjamin Shimberg, Research Assistant, 
Division of Educational Reference, Purdue Uni- 
versity, Lafayette, Indiana. This inventory rep- 
resents an effort to “define personality in com- 
mon business terms and not in psychological 
terms.” The traits measured are called (a) job 
interest, (b) social intelligence, (c) leadership, 
(d) planning, (e) drive, and (f) follow through. 
These traits were supposedly educed by job 
analysis and from an extensive study of other 
tests and rating scales. In commenting on the 
table of intercorrelations given in the manual, 
the author states that the “intercorrelations are 
low enough to be satisfactory.” Eight of the 15 
intercorrelations are found to have values be- 


tween .45 and .72, while only three are less 
than .20. Tests with intercorrelations of this 
magnitude seldom yield any noticeable increase 
from zero-order validity coefficients to multiple 
correlation coefficients. 

The Index consists of 84 questions, about 
two-thirds of which deal with topics typical of 
most personality questionnaires. The other 
third are items concerning job interest, work 
habits, work attitudes, and the like. The answer 
to each question is selected from among three 
alternatives. Like the items in most question- 
naires of this type, considerable ambiguity is 
present in the questions, making it difficult to 
give honest answers without knowing more 
about the circumstances involved. 

Scoring the test must be done manually and 
is rather cumbersome. It involves aligning a 
score sheet with each column of the test book- 
let, indicating how the questions were answered 
by crossing out the alternatives which were not 
used, and then adding the appropriate numbers 
for each column to get the raw scores for the 
various traits, These may then be converted to 
percentile scores from a table, and a trait psy- 
chograph plotted on a card which is provided. 

Only limited data are given in the manual as 
to the composition of the groups used in devel- 
oping the weights or establishing norms. The 
author states that “normal industrial groups 
were used to set the test limits and averages” 
and that “persons from routine to important 
supervisory jobs were included,” both sexes be- 
ing represented about equally. The author has 
informed the writer that 75 cases were used in 
the initial study. Norms should be checked be- 
fore use to make sure that the groups being 
tested are comparable to those used in develop- 
ing this test. 

Items for the test were selected by testing 
“known groups” and using only those items 
which differentiated the best from the poorest 
in the group with respect to a given trait. It 
appears, however, that the validation data re- 
ported as having been obtained by the use of 
the “small group method” utilized the same 75 
individuals upon-whom the items were initially 
evaluated. This factor would tend to make the 
validity coefficients spuriously high. Several of 
these coefficients were, in fact, .9o or greater. 
Another group, about twice the size of the first 
group, was used for an independent validation. 
Validity coefficients for the various traits on 
the second group ranged from .47 to .75. How- 
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ever, even these values seem unusually favor- 
able for tests of this sort. It would be desirable 
to have more detailed information regarding 
the procedures and statistical methods em- 
ployed. The small manual which is supplied 
with the test fails to provide this information. 
This writer would recommend caution in using 
this instrument until its worth has been more 
fully demonstrated. 


[ 66 ] 
XPhilo-Phobe. Age 10 and over ; 1943-47 ; individual ; 
3¢ per form sheet; John N, Buck; Steward's Office, 
Lynchburg State Colony, Colony, Virginia. 


„ REFERENCES 
vici Pr S SE at VO 

2. FnENCH, Vera V. “The Structure of Sentiments: IL, A 
Preliminary Study of Sentiments." J Personality 16:78-108 S. 
47. * (PA 22:2267, 

3. LANDISBERG, SELMA. “A Personality Study of Institution- 
alized Epileptics.” Am J Mental Def 52:16-22 Jl '47. * 

Parker Davis, Jr., Assistant Professor of Psy- 
chology, Rutgers University, New Brunswick, 
New Jersey. This instrument is a semiformal- 
ized psychiatric examination which according 
to the author stands “somewhere between the 
‘free interview’ and the stereotyped projective 
techniques—designed to help the examiner se- 
cure valuable information concerning the sub- 
ject’s personality integration and affective reac- 
tions.” It is intended to be a “supplemental 
affair’ to formal history and psychometric 
procedures. Buck states, “The examiner must 
know the relative parts played in the subject’s 
development by his economic, social and cul- 
tural backgrounds; his heredity ; and his physi- 
cal disabilities, if he is to evaluate the responses 
accurately.” 

Administration is individual with examiner 
recording responses verbatim. Two or more 
sessions of one to two hours each are required. 
Considerable clinical experience is necessary 
for administration and analysis. Examination 
consists of a series of direct questions divided 
into four main groupings (“Aspiration,” “Emo- 
tion,” “Judgment and Insight,” and “Ethico- 
moral Development”) which serve as a basis for 
general analysis. Further analysis is made of 
“Response Patterns” from various points of 
view. 

The character of the questions is illustrated 
by the following examples: “What would you 
like most to do?” “What worries you most?” 
“Whom do you like best in all the world?” 
“Do you think anyone is ever justified in tell- 
ing a deliberate lie?” 


According to the author, the instrument has 
been developed empirically with the theoreti- 
cal structure eclectic. The primary purpose is 
the development of an objective-type research 
instrument in the field of personality dynamics 
based upon direct questions. No published re- 
search is available as yet. 

In our opinion, the Philo-Phobe has its 
greatest usefulness as a guide and instrument 
for training in psychiatric interviewing. It 
should be particularly appealing to clinical psy- 
chologists who either are unfamiliar with the 
"free interview" or who have a strong positive 
bias for objective interview methods. As a re- 
search and clinical instrument it has a number of 
rather serious weaknesses: (av) Interpretation 
is essentially subjective and can only be made 
by an experienced clinician; (b) It is supple- 
mentary to, and requires for interpretation a 
vast amount of, other information regarding the 
subject ; that is, the Philo-Phobe does not make 
any unique and independent contribution such 
as the structural diagnosis of the Rorschach ; 
(c) It is highly dubious both in the light of 
previous attempts and in view of our present 
knowledge of psychodynamics whether any in- 
strument whose basic approach consists of a 
standardized series of direct questions can ever 
be successful in depth analysis; for this reason, 
the Philo-Phobe will have slight appeal for 
Rogerian and projective technique psycholo- 
gists. 

[67] 
Problem Check List. Grades 7-9, 9-12, 13-16; 1941— 
475 3 levels; 1 form; for adaptations, see 68 and 60; $1 
per 25; 6o¢ per manual for either the high school or 
college form; nontimed (20-40) minutes; Ross L. 
Mooney; Ohio State University Press. 
a) JUNIOR HIGH SCHOOL FORM. 1942-47. 
b) HIGH SCHOOL FORM. 1941-47. 
€) COLLEGE FORM. 1941-47. 


REFERENCES 


1. Comas, AgTHUR W, The Problems of High School Stu- 
dents in a Typical American Community: A Survey of Major 


. 136, 
ntifying Pupil Needs, Concerns 

T lum. Revision in Stephens- 
mas- 


5. . Worry: A Common Problem of 
Elementary School Children. Unpublished master's thesis, Ohio 
State University, 1942. Pp. D b 

6. Arnot, Dwicut L., anp Mooney, Ross L. “A Students’ 
poem Check List for Junior High School.” Ed Res B 22:42-8 

17 743. 

7. Coxcpon, Nora A. “The Perplexities of College Fresh- 
men.” Ed & Psychol Meas 3:367-78 w '43. * (PA 18:2563) 

8. Mooney, Ross L. “Community Differences in the Problems 
of High-School Students: A Survey of Five Communities by 
Means of a Problem Check List." Ed & Psychol Meas 3:127-42 
su 43. * (PA 18:1890) 
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9. MooNzy, Ross L, “Exploratory Research on Students’ 
Problems.” J Ed Res 37:218-24 N '43. * (PA 18:1240) .—- 

10. Mooney, Koss L. "*Personal Problems of Freshman Girls: 
REM Dormitory Survey by Means of a Problem Check 
List." J Higher Ed 14:84-90 t "43. * us 17:2189) 

11, Prieur, MamjomrE. "Personal 
nomics.” Ed Res B 22:118-22 M: "43. * 

12. Hiner, Francis W., AND Larsen, AnTHUR Horr. “Prob- 
lems of Upperclass Students in a Teachers College.” J Appl 
Psychol gruss Je "44. * (PA 18:3289) 

13. Houston, V. M., anp MamzoLz, SrANLEY S. “Faculty 
Use of the Problem Check List.” J Higher Ed 15:325-8 Je "44. * 
(PA 18:3292) $ ` x 

14. Prieur, MARJORIE. “A Guidance Point of View and Its 
Practical Application.” Pract Home Econ 22:328-9+ S one ig 

15. MamzoLr, STANLEY S., AND Larsen, AmrHum Horr. 
“Statistical Interpretation of Symptoms Illustrated With a 
Factor Analysis of Problem Check List Items.” Ed & Psychol 
Meas 5:285-94 au '4s. * (PA 20:2905 

16. YouNc, Harry A. The Personal-Social Problems of Youth 
in Relation to Curriculum Planning. Unpublished doctor's 
thesis, University of are 1 45. Pp. 150. 

17. Fiscurr, Rosert P, “Si ed Versus nsigned Personal 
IESU DM J Appl Psychol 30:220-5 Je '46. * (PA 20: 
4193; 


Ralph C. Bedell, Professor of Educational 
Psychology and Measurements, The University 
of Nebraska, Lincoln, Nebraska. The Problem 
Check Lists consist of phrases intended to 
make it easy for students to express their 
troublesome problems. The arrangement is 
similar to that of interest inventories except 
that the items are problems rather than inter- 
ests. The student underlines the problems that 
are of concern to him, circles the ones of most 
concern, answers a few general questions de- 
signed to help him organize his feelings about 
his problems and writes a summary in his own 
words. There are no time limits and directions 
are simple. Thirty to forty minutes represent 
the approximate maxlmum time needed. 


uidance in Home Eco- 


All three forms are similar in arrangement" 


with considerable duplication in content but de- 
signed to be most appropriate to the groups 
concerned. The two manuals for the College 
and High School Forms are similar and for 
many users either manual will suffice for both 
forms. Users of the Junior High School Form 
are referred to the High School or College 
manual. The available manuals are unusually 
complete, and all users are urged to make a 
careful study of at least one of them. 

The items contained in the College and High 
School Forms are classified into eleven differ- 
ent general areas, with thirty items in each 
area. The Junior High School Form has seven 
general areas, with thirty items in each area. 
The areas employed in the College and High 
School Forms are indicative of those used in 
the other form: (a) health and physical de- 
velopment; (b) finances, living conditions and 
employment; (c) social and recreational activi- 
ties; (d) social-psychological relations; (e) 
personal-psychological relations; (f) courtship, 
sex and marriage; (g) home and family; (h) 


morals and religion; (i) adjustment to college 
(school) work; (j) the future: vocational and 
educational; and (k) curriculum and teaching 
procedures. 

The construction and use of the Problem 
Check Lists present many problems, most of 
which have been anticipated by the authors. 
Generally, the problems have been met-on the 
basis of pragmatic criteria and much reliance 
has been placed on the judgments of those who 
use the check lists. 

The technical development of these instru- 
ments differs in many respects from traditional 
methods of test construction; consequently 
many of the usual criteria for tests do not 
apply. In many respects Professor Mooney is 
correct when he states that “The Check List is 
not to be thought of as a test." There are no 
set scores to be placed on a continuum; the 
counting (or scoring) indicates only the num- 
ber of problems marked in each general area. 
The lists provide means of indicating the be- 
havior problems that occur in conjunction in 
any one individual or group of individuals; 
there is no attempt to arrive at categorical con- 
clusions. 

The traditional concepts of reliability and 
validity are not appropriately applied to the 
lists. Inconsistency in marking items is neither 
good nor bad, but merely presents new infor- 
mation to be used in following the changes in 
an individual or group. Repetition of markings 
with an interval of one week shows a fairly 
high stability in the number of problems 
marked in each general area (rank order coeffi- 
cient, .90), which supports the author's clinical 
judgment that changes in the data from a check 
list function to add greater clarity to patterns 
and progressions. 

The lists do not purport to represent some 
internal capacity or trait or to make categorical 
predictions. They do purport to give students 
an opportunity to express their problems and 
provide assistance in understanding the prob- 
lems expressed. The evidence indicates that 
students can and do express the problems in- 
cluded. The average number of problems 
marked by students in most groups is thirty; 
one-fourth checked forty or more. A group of 
one hundred students will mark about 95 per 
cent of the different problems in the list. No 
check list can present definitively all problems, 
and users of this as well as other check lists 
should bear that fact in mind. 
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The principal contribution of the check lists is 
to be found in the extent to which they increase 
the understanding of the problems marked by 
students. These markings can best be under- 
stood by clinical evidence. The general areas 
presented in the check lists are primarily de- 
scriptive and the extent to which causal analy- 
ses may result is dependent on the skill of the 
user. It is to be hoped that further research 
will indicate those syndromes which occur as 
“central tendencies” and the variations which 
are to be expected under different conditions. 
Marzolf and Larsen (15) have done a pre- 
liminary study which indicates the probability 
that syndromes may be identified and, by fur- 
ther clinical study, explained. The judgment 
of experts in the classification of problems into 
general areas is to be respected as a valuable 
tool too infrequently used, but it is believed 
that the increased precision which results from 
statistical analysis is essential for maximum in- 
terpretative value. 

The manuals devote much space to the uses 
of the lists, and ingenious personnel workers 
and teachers will find additional uses. The fol- 
lowing are illustrative with suggested limita- 
tions: (a) As a point of departure in the in- 
terview, a list may be used either before or 
during the conference. The list is a means of 
exploring students’ problems and is not diag- 
nostic. (b) As a survey instrument to facilitate 
a personnel program, students may mark a list 
to assist in locating personnel agencies, or agen- 
cies may examine marked lists to assist in lo- 
cating students in need of their special services. 
(c) As an instrument for educational surveys, 
the lists may be used to assist in locating needed 
adjustments in the school offerings, dissatis- 
factions with teaching methods or administra- 
tive procedures and general conditions of stu- 
dent morale. Young (16) has shown how the 
check lists may be used to differentiate cur- 
ricula within a single metropolitan area. (d) As 
an aid to classroom teachers, the lists may be 
used as a basis for revising course content and 
modifying teaching procedures to meet individ- 
ual needs, and as an aid in fusing teaching, 
counseling, and personnel work. (e) As an aid 
in the training of teachers and personnel work- 
ers, the lists provide a means of sensitizing 
trainees to the wide range of problems pre- 
sented by individual students and serve as a 
point of departure in the development of in- 
creased precision in the description of students" 


problems. (f) As an aid to research workers, 
the lists furnish educators an additional means 
of determining the influence of specific educa- 
tional programs on the problems of pupils. 
Psychologists will find the check lists valuable 
in the many types of analyses designed to de- 
termine the extent to which problems appear 
in conjunction and the significance of the vari- 
ous syndromes thus discovered. 

As a paramount precaution to users, it should 
be emphasized that the check lists are not diag- 
nostic instruments; rather they are descriptive 
and to some extent analytical. Their interpreta- 
tion depends upon a pragmatic analysis, the 
accuracy of which will be limited by the train- 
ing of the user. The interviewer or counselor 
probably will use about the same qualities in 
interpreting the lists as he uses in his everyday 
contacts with students. The lists have the ad- 
vantage of filling a gap between the use of the 
more precise diagnostic instruments and the 
skill of a naive counselor. This reviewer takes 
vigorous exception to the author’s statement 
that technical training in testing, psychometrics, 
and complex statistical processes is not required 
for interpretation of results in an individual 
case. The lists will be helpful to teachers and 
personnel workers at all levels of training, but 
they will be of greatest value in the hands of 
those most highly trained, 


Theodore F. Lentz, Director, Attitude Re- 
search Laboratory and Associate Professor of 
Education, Washington University, St. Louis, 
Missouri. [Review of College and High School 
Forms.] coLLece rorm. This check list con- 
sists of 330 items (such as: being overweight, 
too little money for clothes, awkward in meet- 
ing people, losing friends, losing my temper, 
being in love, parents’ drinking, losing faith in 
religion, slow in reading, not interested in any 
vocation, having unfair tests) classifiable in 
eleven areas: (a) health and physical develop- 
ment; (b) finances, living conditions, and em- 
ployment; (c) social and recreational activities ; 
(d) social-psychological relations ; (e) personal- 
psychological relations ; (f) courtship, sex, and 
marriage; (g) home and family; (h) morals 
and religion; (i) adjustment to college work: 
(j) the future: vocational and educational ; and 
(k) curriculum and teaching procedures. 

“The form is similar to that of interest in- 
ventories except that the items are problems 
rather than interests. The student goes through 
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the list, underlines the problems which are of 
concern to him, circles the ones of most con- 
cern, and writes a summary in his own words." 
“The function of the Problem Check List is to 
help students in the expression of their per- 
sonal problems." 

The manual presents an immense amount of 
simple arithmetic data regarding students' re- 
actions—emotional reaction to the exercise, 
relative frequency of problems, sex differences, 
grade progression, etc. 

The author recommends two major subdi- 
visions of function: (a) aid to individual in- 
terview; and (b) group projects to furnish 
guidance for students, personnel agencies, ad- 
ministrators, teachers, and research workers. 
In this second major division, the administered 
list and tabulated results yield scores per item 
and per area regarding the group, class, or 
campus as a whole. There is no set score for 
the individual. The author reports 9o per cent 
of students reacting positively and favorably 
to the exercise. The results aimed at are sug- 
gestive and not definitive. 

The discussions of reliability and validity 
are only partially satisfying to this reviewer. 
As the author implies, the validity of the in- 
strument is difficult to ascertain, since this is 
in part a function of the interpreter. To quote, 
“The interpreter himself is in the midst of the 
process." 

All criticisms that occur to this reviewer in- 
here in the unfinished nature of the instru- 
ment's research. The data presented are best 
conceived as a progress report. 

More data will surely be forthcoming on 
(a) the validity and reliability of the responses 
of the individual, (5) the validity of the scores 
per item, and (c) the cluster factors among 
the items, on which intriguing beginnings are 
reported. Very little data are presented on the 
"practice effect" of the exercise. When full 
data are forthcoming on this point, the most 
valuable feature of the check list may appear. 
The above considerations, and many others 
that could be mentioned, argue inescapably for 
a special “Bureau of Standards” for educational 
psychology, backed by unlimited funds for re- 
search, 

The reviewer knows of no alternative or 
comparable instrument designed for the same 
purpose. It seems to constitute a unique con- 
tribution to our counseling aids and techniques. 
It is an instrument for an area of research 


which must not be overlooked. As it stands, 
the instrument is highly commendable to the 
serious practitioner of guidance or the research 
scientist in social psychology. 

With wider use, further verification, and ex- 
tension, this instrument should have real ef- 
fectiveness in upholding the morale of the 
philosophy of those who entertain the permis- 
sive in guidance and the democratic in school 
design. 

HIGH SCHOOL FORM. Practically everything 
said about the college form can be said about 
this form: number of items, form, administra- 
tion, functions, voluminousness of the manual, 
large quantity of, as well as demand for, more 
data, and significance as a contribution to prac- 
tice and research instrumentation. 

A number of the items are different to suit 
the age difference between high school and 
college students. 


[68] 
XProbiem Check List: Form for Rural Young 
People. 1946-47; an adaptation of Problem Check 
Lists by Ross L. Mooney; $1 per 20; no manual; 
nontimed (20-40) minutes; Ralph E. Bender; Ohio 
State University Press. 
REFERENCES 

1. BENDER, RALPH E, “The Development of a Problem Check 
List and a Demonstration of Its Use in Planning Rural Youth 
Programs." Agric Ed Mag 20:116-7 D '47. * 

2. BENDER, RALPH E. The Development of a Problem Check 
List and a Demonstration of Its Use in Planning Rural Youth 
Programs. Unpublished doctor's thesis, Ohio State University, 


1947. 
[69] 
XProblem Check List: Form for Schools of Nurs- 
ing. Student nurses; 1945-47; an adaptation of Prob- 
lem Check List, College Form by Ross L. Mooney; $1 
per 20; no manual; nontimed (20-40) minutes; Luella 
J. Morison; Ohio State University Press, 
REFERENCES 


1, Morison, LuzLLA J. “A Problem Check List: Its Use in 
Student Guidance.” 4m J Nursing 47:248-51 Ap '47. * 


[70] 
*Recreation Inquiry. Grades 9-16; 1941; 1 form; 
$2.25 per 25; 35¢ per specimen set; nontimed (50) min- 
utes; Richard Wilkinson and Sidney L. Pressey; Pys- 
chological Corporation. 
REFERENCES 

1. CAVANAUGH, JEAN Ocpzx. “The Relation of Recreation to 

Personality em " J Social Psychol 15:63-74 F '42. " 
16:3159 

2. HapLEY, Loren S. A Study of the Leisure-Time Activity 

Patterns of Adults, Unpublished doctor's thesis, Ohio State 


University, 1942. (Abstracts of Dissertations . . . Summer 
Quarter 1941, 1942, pp. 87-92.) 


Theodore F. Lentz, Director, Attitude Re- 
search Laboratory, and Associate Professor of 
Education, Washington University, St. Louis, 
Missouri. This inquiry contains 183 recreational 
items, such as football, bridge, campfire girls, 
leather craft, scholastic team, athletic honor 
societies, dramatics, history, church, orchestra 
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—grouped under nine heads: sports, games, 
social, hobbies, honoraries, literary, subject 
clubs, religious, and music. For each item the 
subject indicates his knowledge, interest, mem- 
bership, and participation. In addition he in- 
dicates frequency of participation in ten activi- 
ties and the hours per month at reading, radio, 
and automobiling. Further he lists activities 
wherein he is a leader and makes a selí-evalu- 
ation of his total recreational life. 

The instrument has two functions: (a) sur- 
vey—valuable for study and appraisal of insti- 
tutions and communities—and (b) individual 
guidance. An exceedingly brief manual in- 
cludes norms for each section of the test for 
high school seniors, college freshmen, and col- 
lege seniors—300 cases each. Along with some 
other instruments of this sort, these norms 
(probably like norms for any other community) 
furnish excellent challenge in the contrast be- 
tween interest on one hand and participation 
on the other. 

The authors claim comprehensiveness for the 
list and state that this has been experimentally 
determined. The data for this are not presented. 
One questions the presence of eight items under 
the classification "religious" and none under 
the classification “civics or politics," with the 
possible exception of such items as "public 
speaking" and “bull sessions." This would seem 
to be either a reflection on the instrument or 
a contradiction of the assumption that man is 
a political as well as a religious animal. In 
common with most studies and instruments for 
study of recreation, the instrument leaves out 
taboo items which would make a study more 
accurate but the instrument nonusable in cer- 
tain situations. In view of the large place of 
recreational reading in our lives, reading has 
probably been given too little space. 

Most observers will not blame the.authors 
as much as our culture for the small amount of 
data presented with the test. When will our 
society give human relations, including research 
on behalf of recreational guidance, adequate 
consideration ? 


Louis Long, Director of Division of Testing 
and Guidance and Assistant Professor, Depart- 
ment of Student Life, The City College of New 
York, New York, New York. The manual 
states that the purpose of the Recreational In- 
quiry "is to provide a ready means for survey- 
ing the recreational and social life of the stu- 


dent and to obtain a broad basis for consideration 
of his recreational and social needs.” Certainly, 
the list of recreational activities is exhaustive, 
and the plan of having the student indicate the 
extent of his participation on a four-point scale 
(knowledge, interest, membership, or now- 
active) is a good one. It would seem that the 
most effective use of the inquiry would be in 
an interview situation, since a rapid scanning 
of the responses would give a good picture of 
the recreational and social life of the person. 

The authors suggest that the form can also 
be used with groups. The interpretation of the 
results in a group situation presents a real 
problem, since the same total score for two in- 
dividuals may be subject to very different inter- 
pretations. The form should be useful in mak- 
ing systematic surveys of local campuses where 
the investigator is interested primarily in deter- 
mining whether the school’s socio-recreational 
program is adequately fulfilling the needs of 
the student body. 

Separate norms (in terms of quartile ratings) 
are presented for men and women, and each 
group is further subdivided into high school 
seniors, college freshmen, and college seniors. 
Whether or not the authors found statistically 
significant differences between the means of 
these groups is not indicated, nor is the exact 
number of subjects stated. The only reference 
to the number of subjects examined is that the 
“tentative norms are based on about 300 cases 
for each class.” No information about the re- 
liability of the form is included. 

[71] 
Revision of the Babcock Examination for Meas- 
uring Efficiency of Mental Functioning. Ages 7 
and over; 1940; a revision of Babcock's Mental Deteri- 
oration Scale (No. 37057.5) ; 1 form; $8.40 per test 
booklet and manual (No. 30758A) ; $2.30 per 25 record 
blanks (No. qosB; 186 per Color Naming Test: 
Woodworth-Wells (No. 12309) ; $1.40 per Cube Imi- 
tation Test: Pintner (No. 33304) ; $1.10 per Memory 
for Designs Test: Army Performance Scale (No. 
32317) ; nontimed (704-) minutes; Harriet Babcock 
and Lydia Levy; C. H. Stoelting Co. 
REFERENCES 
T4. See 40:1248. 


15. Smarr, AGNES A, “A Statistical Analysis of the Validity 
of the Babcock Test of Mental Deterioration,” Abstract, Psy- 
chol B p6:so8 Ji '39. * (PA 13:6290, title only) 

16. YACORZYNSKI, G. K. “An Evaluation of the Postulates 
Underlying the Babcock Deterioration Test." Abstract. Psychol 
B 37:425-6 Jl '40. * (PA 1 apnea be only) 

17. Bascock, Harriet. "The Level-Efficiency Theory of In- 
telligence.” J Psychol 11:261-70 Ap "at. * (PA 15:3305) 

18. Bascock, Harriet. Time and the Mind: Personal Tempo 
—The Key to Normal and Pathological Mental Conditions, 
ç rebar ere Sci-Art Publishers, 1941. Pp. 304. $3.75. * 

15:3651 

19. GrLsERT, JEANNE G. “Memory Loss in Senescence.” J 
Abn & Social Psychol 36:73-86 Ja "4x. * (PA 15:2$41) | 

20, RAUTMAN, AnTHUR L. "The Measurement of Deteriora- 
tion in Mental Deficiency.” Am J Mental Def 46:220-4 O '41. * 
(PA 16:1718) 
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a1. Yacorzynsx1, G. K. “An Evaluation of the Postulates 
Underlying the Babcock Deterioration Test.” Psychol R 48: 
261-7 My ‘41. * (PA 4513879) 3 

22, Broby, M. B. “The Measurement of Dementia.” J Mental 
Sci 88:317-27 T "42. * (PA 16:3324) tee 

HP Bropy, M. B. “A Psychometric Study of Dementia." 
J Mental Sci 88:512-33 O '42. * (PA 17:1178) | T 

24. Hayman, Max. "A Rapid Test for ‘Deterioration’ With 
Comparison of Three Techniques.” J General Psychol 29:313-7 
O "43. * (PA 18:1423) 

25. Knicut, Ropert P.; Gitt, Merton; Lozorr, MILTON; 
AND Raraport, Davin. “Comparison of Clinical Findings and 
Psychological "Tests in Three Cases Bearing Upon Military 
Personnel Selection.” B Menninger Clinic 7:114-28 My ‘43. " 
(PA 17:4241) 3 : ; 

26. GILBERT, JEANNE G. '"Measuring Mental Efficiency in 
Senescence.” Discussion by David Shakow. Am J Orthopsy- 
fiot, 14:267-71, discussion 271-2 Ap '44. * (PA 18:3104, title 
only) 

27. Rapaport, Davip; WITH THE COLLABORATION OF Roy 
ScHAFER AND Menton Girt. "The Bellevue Scale,” pp. 121-52. 
In their Manual of Diagnostic Psychological Testing: I, 
Diagnostic d of Intelligence and Concept Formation. 
Preface by Karl Menninger. Review Series, Vol. 2, No. 2. New 
York: Josiah Macy, Jr. Foundation, 1944. Pp. xiii, 239. Paper, 
lithotyped. Out of print. * (PA 19:3195) 

28. SCHAFER, Roy. “The Significance of Scatter in Research 
and Practice of Clinical Psychology.” J Psychol 18:119-24 Jl 
"44. * (PA 19:147) P 
1 an SsipENFELD, Morton A, “Measuring Mental Competency 
in the Aging." Discussion by F. H. Finch. Am J Orthopsychiatry 
14:273:5, discussion 275-6 Ap '44. * (PA 18:3104, title only) _ 

30. Hunt, Howanp F. “A Note on the Problem of Brain 
Damage in Rehabilitation and Personnel Work.” J App! Psychol 
29:282-8 Ag '45. * (PA 20:154) 

à Mayman, Martin. "Review of the Literature on the 
Babcock Test,” pp. 558-61. In Diagnostic Psychological Testing: 
The Theory, Statistical Evaluation, and Diagnostic Application 
of a Battery of Tests, Vol. I. By David Rapaport with the 
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For related reviews, see 72. 


[72] 
*[Re Revision of the Babcock Examination for 
Measuring Efficiency of Mental Functioning.] 
Bascock, Harriet. Time and the Mind: Personal 
Tempo—The Key to Normal and Pathological 
Mental Conditions. Cambridge, Mass. : Sci-Art Put- 
lishers, 1941. Pp. 304. $3.75. * (PA 15:3651) 


J Abn & Social Psychol 37:284-5 Ap *42. 
G. K. Yacorzynski. * Babcock's level-efficiency 
theory of mental functions has more promise 
and is an indication of more mature thinking 
than the theory which she used to explain the 
results of her first deterioration scale. Formerly 
she held that vocabulary remains unimpaired 
in cases of deterioration since it is an old habit, 
whereas recently acquired material or new as- 
sociations (measured by the tests which are 
now purported to yield efficiency scores) de- 
teriorate. With the author's emphasis on the 
time of mental functioning, and in contrast to 
her former deterioration scale, all the tests 


which are used at present to measure efficiency 
are timed, The author still maintains, however, 
that the primary value of using a vocabulary 
test to measure level is that “it draws on old, 
well-fixed learning which was acquired spon- 
taneously because of natural interest and un- 
derstanding” (p. 75), whereas, “Other reasons 
for its use are the fact that the responses do 
not depend upon time . . .” (p. 75). As a con- 
sequence the author devotes a great deal of 
space to the discussion of the disintegration of 
old and recently acquired materials. The re- 
viewer feels that, as was true of the theoreti- 
cal interpretation published in the first deteri- 
oration scale, Babcock has been interested in 
using many subjects and test items in order to 
standardize the procedure rather than attempt- 
ing to analyze the mechanisms which are af- 
fected in deterioration. The author’s theory is 
an addendum to the standardization of the test 
rather than the central core of her problem. 
This, of course, reflects the author’s viewpoint 
embodied in the statement that, “Work in psy- 
chology . . . must be guided more than much 
of it has been by some utilitarian value . . .” 
(p. 241). The author's exposition suffers from 
the fact that there is little continuity in the 
presentation of the subject-matter and in the 
space of a few pages a number of unrelated 
topics may be discussed. Furthermore, the book 
is replete with analyses and statements which 
the reader will find very irritating. Some of 
these are: (1) Superficial categories such as, 
for example, dividing the "dimensions of the 
mind" into sensory discrimination, memory, 
level of intelligence, and mental tempo. Cate- 
gories of this nature are used throughout the 
book in spite of the author's statement in the 
introductory chapters that the organism should 
be studied as a whole. (2) Use of mentalistic 
concepts, such as "psychic," without proper 
definitions. (3) Describing the cerebral cortex 
as an instinctive organ. (4) Emphasizing self- 
evident statements such as suggesting that psy- 
chology should use “scientific psychological 
analysis” (original italicized, p. 26), as if this 
is not an accepted truism in psychology; or 
that, “The human mind does not function 
without a human body” (p. 213), should be 
accepted as an axiom in psychology. (5) In the 
light of the present knowledge of frustration’ 
exactly the opposite may be held to the state- 
ment that, “When there is no expectation of 
sex gratification until a definite age or untill 
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marriage, sex is not so dominatingly forceful 
except for chance experience, and neuroses and 
other untoward manifestations of behavior are 
not apt to develop” (p. 258). (6) The fact 
that, “Correlations of the total efficiency score 
with mental level were higher than those which 
measure different phases of mental function- 
ing” (p. 148), may mean that the reliability 
of parts of a test are lower than that of the 
whole test, rather than the interpretation given 
by the author that, "From this it is evident 
that the relation of the different phases to each 
other and the way in which different parts of 
mental process function together are of greater 
significance than either the total score or the 
different phases considered separately" (origi- 
nal italicized, p. 148). The above considerations 
vitiate the probability that an important theo- 
retical contribution is made to the understand- 
ing of mental functions. The book also is not 
a manual for the revised form of the Babcock 
deterioration scale. 

J Consult Psychol 6:270-2 S-O 42. Irving 
Lorge. * Operationally, the test, in its clumsy 
way, will yield a variety of patterns, and will 
yield a differential between level and efficiency 
which may be characterized as an efficiency 
index. But, such a function could be achieved 
by better tests better organized. It seems strange 
that with all the criticism of the mental age 
unit as applied to adults that Dr. Babcock has 
clung to it. Since Balinsky has shown that the 
factor pattern of the Bellevue- Wechsler at dif- 
ferent ages lacks consistency, and since Jones 
and Conrad have shown that different subtests 
of the Army Alpha make different contribu- 
tions to the total score at different ages, and 
since Lorge has shown the differential aspect 
of speed of performance in older ages, it seems 
strange that Dr. Babcock naively assumes that 
normal functioning should be appraised in 
terms of the performance of a group younger 
than 25 years. The contributions of speed, and 
of the various abilities measured will be dif- 
ferent, particularly for persons aged 30 to 70. 
Certainly, in terms of the explicit statement 
“The only meaning to be attached to each test 
is what actually is done and what can be logi- 
cally inferred from its relations to other re- 
sults,” Dr, Babcock’s test must be criticized 
explicitly for at least two defects. The first is 
the failure to appreciate the relations of the 
test results to age, and second, the failure to 
show the intercorrelations among each of the 


tests for the normal group at least. While Dr. 
Babcock looks askance on statistics, she does 
use them to make her point. She certainly had 
an obligation to show in how far the tests were 
measuring validly and reliably the perform- 
ances she thinks are diagnostic of mental effi- 
ciency. * All in all, as a test to estimate mental 
efficiency, the Babcock-Levy examination will 
yield an estimate of mental functioning. It does 
so, however, despite the standardization of the 
selected tests, and at the expense of forcing a 
notion of intelligence which by this time might 
be more critically appraised. Vocabulary is not 
necessarily a measure of level or of older learn- 
ing. * Dr. Babcock’s notion that level of men- 
tal ability can be appraised by vocabulary, and 
that level of mental efficiency can be appraised 
by a battery of short, acceptable tests does not 
materially advance knowledge of intelligence 
or of its measurement. It, as a notion, suffers 
from the single score approach, particularly in 
assaying intellectual level, but just as specifi- 
cally in estimating intellectual efficiency inde- 
pendent of age changes. In assaying intellectual 
or behavior deviates, the pattern approach 
might well be utilized, but it must be remem- 
bered that patterns need to be developed from 
very reliable tests, correlating rather low among 
each other. The task of reviewing Dr. Bab- 
cock’s contributions has been made somewhat 
more burdensome by the lack of editorial su- 
pervision given her text. Terms, concepts, and 
definitions tend to get mixed up; table head- 
ings are ambiguous, and some of the tables of 
data do not check out. Dr. Babcock’s contribu- 
tion to psychology may be appraised incorrectly 
because of a lack of clarity in its presentation. 

Psychol B 39:193 Mr’42. Arthur L. Benton. 
This book might well be considered as two 
books. “Book I” consists of the presentation... 
of a revision of her well-known test battery * 
The revised test battery and the author’s dis- 
cussion of its applications will be of consider- 
able interest not only to psychologists who are 
working with neurological and psychiatric ma- 
terial but also to those who are engaged in 
educational work and in vocational and personal 
counseling. “Book II" consists of the presenta- 
tion of the author’s opinions concerning a 
variety of psychological topics. Some of the 
section headings are: “Omissions in the Edu- 
cation of Academic Psychologists”; “Lack of 
Breadth in the Education of Leaders in Patho- 
logical Psychology”; “Sex and Education”; 
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“Gestalt Psychology"; “Obstacles the Suste- 
nance of Developing Character.” As can be 
seen from even this meager sampling, much 
ground is covered. 

Psychiatric Q 15:835-6 O "41. * [The author 
states,] "The praecox defect is usually consti- 
tutional,” i.e., the patient is one of the damned. 
In speaking of borderland conditions, on the 
same page, she states, “It is a general mental 
weakness undoubtedly due to a neurological or 
somatic condition . . .” The use of words like 
“undoubtedly” is hardly permissible in science, 
and is entirely out of place in discussing the 
relationships of the psyche and soma, in the 
present state of our knowledge. If the facts 
are thus and so, an “undoubtedly” will not 
make them any more true. If they are not thus 
and so, an “undoubtedly” is subversive propa- 
ganda. Aside from her gratuitous remarks 
about the basic nature of mental illness, the 
author seems to have written a sound report 
concerning the results of the application of 
various batteries of tests to various groups of 
people, and to have reached sufficiently amia- 
ble conclusions as to their parolability, employ- 
ability, etc., on the basis of these tests. Thus 
far, the report is useful, stimulating, and prac- 
tical, However, having found that mental mal- 
functioning is almost uniformly associated with 
disturbances of mental tempo, which can be 
measured, the author seizes on this attribute 
of maladjustment and makes it the cause of 
maladjustment. That kind of thinking has led 
to great mistakes in the past, one example of 
which was national prohibition. Dr. Babcock 
has probably started something. There is a 
market for the exposition of any method which 
will attempt to settle psychiatric questions nu- 
merically or graphically. It saves thinking and 
shifts responsibility. Furthermore, research in 
mental tempo should serve to keep large num- 
bers of students inoffensively occupied. They 
will not get any wrong ideas from “Time and 
the Mind.” In a book which purports to be 
“The Key to Normal and Pathological Mental 
Conditions,” there are two pages devoted, half- 
heartedly, to sex. On one of these (p. 258), 
there appear some remarkable statements. “The 
importance of sex has-been well recognized 
since the dawn of history as is shown by the 
numerous taboos and customs, which seem to 
have been an intelligent way of handling the 
problem. People adjust to different customs 
because of unconscious acceptance of the right- 


ness of the ideas inculcated by their society. 
These ideas preclude expectation and lessen 
mental stimulation to the sex impulse.” The 
reviewer rests. 


[73] 
Rorschach. Ages 3 and over; 1921 to date; also com- 
monly referred to as Rorschach Method, Rorschach 
Test, Rorschach Ink Blot Test, Rorschach Psychodiag- 
nostics, and Rorschach Method of Psychodiagnosis; 
many variants have appeared such as Harrower-Erick- 
son's Group Rorschach, Harrower-Erickson's Multiple- 
Choice Test, Monroe's Inspection Technique, Grassi's 
Graphic Rorschach, and Marseille Rorschach Mail In- 
terview; many modifications of the variants are in use 
such as the Amplified Multiple-Choice Rorschach and 
the Ranking Rorschach Test; in addition to the Ror- 
schach original Psychodiagnostic Plates, parallel plates 
have been prepared by Zulliger and Harrower and 
Steiner; Hermann Rorschach. 
4) PSYCHODIAGNOSTIC PLATES, FOURTH EDITION. 1021-45 ; 
$6 per set of 10 cards mounted on heavy cardboard ; $3 
per pad of 100 record blanks ; $5.75 per copy of the 1946 
English translation of Rorschach's Psychodiagnostics, 
Second Edition; Hermann Rorschach ; Grune & Strat- 
ton, Inc. 
b) BEHN-RORSCHACH TEST. 1942; a parallel set of ink 
blots ; $6 per set of 10 cards; $4 per manual ; Hans Zul- 
liger; Grune & Stratton, Inc. 
C) PSYCHODIAGNOSTIC INKBLOTS. 1945; a parallel set of 
ink blots; $4.50 per set of 10 cards; $2 per manual; M. 
R. Harrower and M. E. Steiner ; Grune & Stratton, Inc. 
(These ink blots on slides for use with standard pro- 
jectors are distributed by Psychological Corporation at 
$20 per set, postpaid.) 
d) HARROWER-ERICKSON'S GROUP RORSCHACH. Ages 12 
and over ; 1941-45 ; $8.50 per manual, Large Scale Ror- 
schach Technique (published by Charles C. Thomas), 
postpaid ; $20 per set of Rorschach ink blots on slides 
for standard projector, postpaid; $10.50 per set of ink 
blots on Kodaslides, postpaid; $3.75 per 25 record 
blanks, postpaid; (75-90) minutes; M. R. Harrower- 
Erickson and M. E. Steiner; distributed by Psycholog- 
ical Corporation. 
e) HARROWER-ERICKSON’S MULTIPLE CHOICE TEST. Ages 
12 and over; 1943-45; same manual and slides as for 
Harrower-Erickson's Group Rorschach ; $1.75 per 25, 
postpaid ; 30 per specimen set, postpaid; (45) minutes; 
M. R. Harrower-Erickson and M. E. Steiner; distrib- 
uted by Psychological Corporation. 
f) RORSCHACH METHOD OF PERSONALITY DIAGNOSIS. 1030- 
42; $1.50 per 25 record blanks; $3.50 per pad of 100 
picture sheets; Bruno Klopfer and Helen H. David- 
son; World Book Co. 
g) MARSEILLE RORSCHACH MAIL INTERVIEW. Applicants 
for employment ; 1945; $1 per booklet; Walter W. Mar- 
seille; the Author, 2145 California St., San Francisco 
15, California, 
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x07. Zunin, JosePH; Levy, Davin M.; AND Rust, Rarri. 
“Movement Responses in Normals. Schizophrenics and Neu- 
rotics on the Levy Movement Cards.” Abstract, Am Psychol 
2:269 Ag '47. * (PA 21:4401, title only) 

sog, SrEINMETZ, Harry C. "Obfuscating ;'ersonality." Let- 
ter. Scientific Mo 66:87 Ja '48. * 


Morris Krugman, Assistant Superintendent 
in Charge of Guidance, Public Schools, New 
York, New York. The Rorschach method, in use 
since 1921, continues to be the most important 


single psychological instrument for the measure- 
ment of personality. Other projective tech- 
niques, notably the Thematic Apperception Test 
(TAT), have received considerable attention 
in recent years; but, as yet, not one of them 
has seriously challenged the place of the Ror- 
schach in the clinical program. With further 
development, TAT should constitute an excel- 
lent supplement to the Rorschach, since the 
latter provides clues to the personality struc- 
ture, while the former supplies the mental con- 
tent and dynamics. E 

Although there is still considerable criticism 
of the Rorschach as a psychometric tool from 
some quarters, particularly from statisticians 
without clinical orientation, that criticism now 
amounts to a whisper when compared with the 
uproar against the method ten or twelve years 
ago. With the advance of more dynamic meth- 
ods of diagnosis and therapy in clinical psychol- 
ogy, psychologists have come to recognize that 
a complex instrument like the Rorschach cannot 
yield a simple score that may be validated by 
the application of a Pearson or other correla- 
tion, but that the entire configuration must be 
compared with the clinical picture obtained from 
such procedures as the full case study or the 
psychiatric examination. In other words, valida- 
tion must be by clinical rather than by mathe- 
matical processes. The Rorschach has stood the 
clinical test well throughout the years and has 
come out stronger for it; on the other hand, 
attempts at atomistic validation have been un- 
successful and will probzbly continue to be so. 
The test loses its value when broken down into 
its elements. It has meaning only when its 
elements are treated organismically in terms of 
their interrelationships. 

Several schools of thought have arisen among 
Rorschach workers. Among the more important 
is that of Oberholzer, who follows Rorschach 
almost literally. Beck is a disciple of Ober- 
holzer, but he deviates somewhat from the origi- 
nal. Klopfer has a very large following; his 
difference from the Psychodiagnostic lies in 
the designation of new scoring categories for 
some of the concepts which Rorschach originally 
discussed qualitatively. Rapaport claims that 
he follows Rorschach most closely, but actually, 
he deviates most of all. In the main, the differ- 
ences among the various groups are in semantics 
and in symbols for the same concepts rather than 
in basic concepts. 

The Rorschach method, as used in most psy- 
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chological clinics, is an individual method. In 
1941 Harrower-Erickson introduced the Group 
Rorschach and in 1945 the Multiple-Choice 
Rorschach. The Group Rorschach is adminis- 
tered by projecting the ten plates, one at a time, 
on a screen and having the subjects write their 
responses on given forms. In the Multiple- 
Choice Rorschach, the plates are projected in the 
same manner, but the responses are limited to 
choices among a given list for each plate. The 
group method seems to have some validity for 
screening purposes, although it cannot be used 
for individual clinical diagnosis in the way the 
individual test can. The multiple-choice test is 
of doubtful validity even as a screening method. 

An objective method for evaluating factors 
of adjustment or maladjustment on the Ror- 
schach was devised by Munroe and called the 
Inspection Technique. This consists essentially 
of a check list of Rorschach categories from 
which prediction of later adjustment can be 
made. This, too, is not equivalent to the complete 
Rorschach personality study, but it has merit 
when used for the purpose mentioned. 

Two parallel sets of plates have been devised 
for use as an alternate series to the original 
Rorschach. One of these was developed in Eu- 
rope—the Behn-Eschenberg series; the other 
was recently published in this country by Har- 
rower-Erickson and Steiner. Although there is 
need for a duplicate series of plates, they are 
still used only occasionally. 


J. R. Wittenborn, Assistant Professor of Psy- 
chology, Yale University, New Haven, Con- 
necticut. Since the publication of Dr. Hermann 
Rorschach's Psychodiagnostics in 1921, this 
method of personality evaluation has gained 
steadily in popularity. Today graduate instruc- 
tion in the Rorschach method is offered in de- 
partments of psychology where less than a 
decade ago more conservative phases of clini- 
cal psychology were shunned. At present, reser- 
vations concerning the ink blot test are rapidly 
becoming unfashionable. Since from the stand- 
point of relative progress little more is known 
now about the ink blots than was known by 
Rorschach, this rapid acceptance requires com- 
ment. 

The enthusiasm for the Rorschach method 
manifested by almost all who have used it is an 
obvious factor in its growing popularity. The ap- 
pearance of numerous publications suggesting 
new Rorschach practices and supporting old 


ones has also encouraged this growth. The cur- 
rent trend in psychology and psychiatry toward 
replacing static, categorical concepts of person- 
ality with flexible, highly descriptive concepts 
of personality based on a dynamic view of be- 
havior is also favorable to the acceptance of a 
flexible instrument such as the Rorschach. Un- 
derlying all of these developments, however, is 
a growing public consciousness of and tolerance 
for the emotional problems of the individual as 
a topic worthy of thorough study and sympa- 
thetic understanding. With this increase of pub- 
lic interest there arises a lively demand for new 
and powerful means of personality appraisal. 
For the gratification of these demands, no other 
single instrument now known appears to offer 
more promise than the Rorschach. 

Despite the fact that there are many who hope 
that the Rorschach method may have a brilliant 
future, this method has neither the advantage of 
a theory worthy of the name nor, with certain 
exceptions, adequately designed research. What 
passes for research in this field is usually naively 
conceived, inadequately controlled, and only 
rarely subjected to usual standards of experi- 
mental rigor with respect to statistical tests and 
freedom from ambiguity. Despite these limita- 
tions, the test flourishes, its admirers multiply, 
and its claims proliferate. 

This paradox may be partly understood in 
view of the tremendous difficulty of conceiving 
experimental designs appropriate to test the 
properties of an instrument of such remarkable 
power and flexibility. This difficulty is not less- 
ened by the fact that several most eminent 
Rorschach authorities differ from each other 
with respect to details of scoring and interpreta- 
tion (275, 364, 415, 535). Many Rorschach 
authorities have discouraged submitting the test 


` to critical investigation. In some cases this re- 


luctance may have been due to distrust of the 
capacity of usual methods of investigation to 
demonstrate the true quality of this complex 
instrument. In other cases, however, this sus- 
picion of research is born of a convention that 
the Rorschach analysis is an intuitive analysis, 
an art, and not subject to scientific scrutiny. 
Finally, it is true that numerous enthusiastic 
would-be investigators of the Rorschach have 
found their way most difficult because of the 
dilemma inherent in attempting to study an 
instrument of personality appraisal when per- 
sonality itself is as yet so little understood. 

The Rorschach test, it will be remembered, 


74 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 134 


was developed in the atmosphere of a mental 
hospital, and it has won much of its popularity 
in the study of individuals for whom serious 
disturbance of affect, impulse, or character is 
suspected. This background combined with its 
common use in the differential diagnosis of psy- 
chotic groups and patients suspected of suf- 
fering from cerebral lesions causes it to be 
associated in the thoughts of many with the 
neuropsychiatric clinic, the mental hospital, and 
formal psychiatric work. Many workers, how- 
ever, have found it to be extremely useful in 
vocational and educational guidance of college 
Students as well as in primarily therapeutic 
work. In helping the individual plan his future 
itis frequently desirable to know the answer to 
such questions as: How does he control his 
affect? Is he prone to anxiety ? Can he accept his 
needs and impulses? To what degree can he be 
assertive? Is he likely to seek dependent rela- 
tions? Can he make a good sexual adjustment? 
Such questions as these and many similar ones 
may be answered much more readily through 
the aid of the Rorschach than without it. For 
these reasons alone, its wide use seems to be 
amply indicated. 

The administration and interpretation of the 
Rorschach requires special training which many 
clinical psychologists have not had. Clinics or 
guidance bureaus contemplating the use of the 
Rorschach are confronted with several alterna- 
tives: securing the services of clinical psycholo- 
gists experienced with the use of the Rorschach : 
providing for the training of present members 
of the staff in the Rorschach method; or hiring 
a Rorschach technician (a person trained in the 
Rorschach technique but with few if any of the 
other skills and experiences which are an im- 
portant part of the psychologist’s qualifica- 
tions). The last alternative of depending on the 
services of the person trained only in Rorschach 
seems to be favored by two groups: one is the 
group which believes that the Rorschach method 
comprises a psychology in itself and that any 
person well trained in the Rorschach method is 
a competent psychologist; the other group is 
inclined to regard the mental tester as a techni- 
cian and to feel that the Rorschach may be 
handled by a psychometrist at much the same 
level as the American Council on Education 
Psychological Examination, for example, is 

handled by a psychometrist. It is probable that 
a Rorschach worker at a technician level (i.e., 
an individual trained only in Rorschach) may 


have an important place in mental hospital work 
and certain psychiatric clinics. In most guidance 
and minor therapy with adults, however, the 
writer has observed a difference between the 
quality of the Rorschach personality descrip- 
tions by workers who have a clinician's knowl- 
edge of people and those by workers who are 
regarded as good testers but who have had little 
or no responsible clinical experience. 

One deterrent to the wider use of Rorschach 
in educational and vocational guidance is the 
expense of preparing a protocol. An hour is 
ordinarily required for merely eliciting the Te- 
sponses of the subject and conducting an in- 
quiry. The actual scoring and interpretation of 
the responses may require several hours. For 
the administration, full scoring, and interpreta- 
tion of the Rorschach, three hours is a moderate 
time allowance. For most guidance situations 
this is prohibitive. It is fortunate, therefore, 
that many important questions concerning an 
individual may be answered from a mere inspec- 
tion of the responses. Factors usually of great 
importance in guidance and readily discerned 
from an inspection of the Rorschach responses 
are the level of psychosexual development, qual- 
ity of affective control, level of ambition, effi- 
ciency and practicality in work, and others. As 
a logical expression of the increased interest in 
Rorschach, group methods of administration 
have been developed. The group procedures are 
at present subject to conflicting reports and 
justify separate review. 

In general, it may be said that the Rorschach 
is one of the most promising of our methods for 
personality evaluation. When one considers, 
however, the vast cost of training and employing 
Rorschach workers as well as the guidance and 
care given individuals as a result of Rorschach 
evaluations, it is apparent that too little is known 
about this important instrument. 

For related reviews, see 74-91. 
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[Re Rorschach.] Beck, SAMUEL J. Personality 
Structure in Schizophrenia: A Rorschach Investi- 
gation in 81 Patients and 64 Controls. Preface by C. 
Macfie Campbell. Nervous and Mental Disease Mono- 
graphs, No. 63. New York: Nervous and Mental Dis- 
ease Py eats 1938. Pp. 88. Out of print. * (PA 
13:277. 


J Abn & Social Psychol 36:119-20 Ja ^41. 
D. Shakow. * The author continues in this 
volume his pioneer work in the elucidation and 
application of the Rorschach test. The style is 
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lively and conservational—a tendency perhaps 
pushed occasionally a trifle too far. This re- 
viewer finds it difficult to become accustomed 
to the use of "healthy" for the more generally 
accepted "normal"; somehow a picture of an 
Iowa 4-H contest winner persists in obtruding 
itself at each such mention! Too, the reference 
to the Rorschach "experiment" (more persist- 
ently used in others of the author's works) gives 
the impression of a degree of control which the 
test procedures do not ordinarily have. Although 
the distinction between test and experimental 
situations cannot clearly be drawn, the method 
of administration used in the present study obvi- 
ously falls into the former category. These are, 
however, minor points. More important is the 
fact that the author has not taken his own 
recognition of the danger of lumping schizo- 
phrenics together (p. 14) seriously. An essential 
methodological point is involved here. Paranoids 
and hebephrenics (to take the most clearly dis- 
guishable types and to accept a classification sys- 
tem which admittedly has its weaknesses) have 
in a number of studies been shown to fall on 
either side of the normal in certain characteris- 
tics, The combining of groups tends to wipe out 
important differences by pushing towards uni- 
modality tendencies which actually go in the 
direction of bimodality. It is likely that a num- 
ber of the negative results obtained in dealing 
with schizophrenia may be due to such disregard 
of fundamental syndromes. The personality 
structure of schizophrenia which the author de- 
velops is in considerable harmony with clinical, 
test, and experimental findings as well as with 
the work of other Rorschach studies such as 
those of Skalweit, Rickers-Ovsiankina, and 
Rorschach himself. The monograph will un- 
doubtedly take its place as one of the important 
items in the Rorschach bibliography of schizo- 
phrenia and is, of course, essential to the Ror- 
schacher. One must, however, admit that the 
reader lays down the volume with a feeling of 
incomplete satisfaction—due perhaps to the fact 
that acquaintance with the extent of the experi- 
ence and background of the author in psycho- 
pathology had led one to expect a much more 
detailed and extensive consideration of the sub- 
ject than is actually presented. Now that the 
statistical aspects have been considered, cannot 
we expect from him a supplementary essay re- 
lating clinical schizophrenic pictures in detail to 
Rorschach findings? 
For additional reviews, see 40:B841. 


[75] 
k[Re Rorschach.] Becr, SamueL J. Rorschach's 
Test: I, Basic Processes. Foreword by Willard L. 
Valentine. New York: Grune & Stratton, Inc., 1944. 
Pp. xiii, 223. $3.50. * (PA 18:2837) 


Am J Orthopsychiatry 15:178-9 Ja '45." 
L(awson) G. L(owrey). * Beck demonstrates 
thoroughly the processes used in evaluating re- 
sponses in the Rorschach. * An especially inter- 
esting chapter is on “scoring problems,” in 
which the solutions for the questions involved in 
scoring variable responses to Card I are pre- 
sented. There is no interpretation in terms of 
the whole personality structure. The effort 
rather is to establish a stable manual of pro- 
cedure. * For the Rorschach worker of some 
experience this is certainly the most important 
book yet to appear. While it presents a sys- 
tematic analysis of each symbol, some familiar- 
ity with the test is desirable for most effective 
use of the book. It is an entirely adequate suc- 
cessor to Beck’s Introduction, published by our 
own Association in 1937, which is widely used 
as a manual. 

Am J Psychiatry 101:709-10 Mr ’45. F. C. 
Redlich. * The book....does not contain any new 
fundamentals but includes much illustrative 
material, a discussion of the many problems of 
scoring, and consists chiefly of tables containing 
thousands of responses which have been scored 
by the author. This will be helpful to anyone 
endeavoring to score a Rorschach record, 
whether or not he accepts Beck’s own refine- 
ments of the method or belongs to another 
Rorschach “School.” One criticism of these 
tables is a certain lack of system which makes 
it difficult to find the particular response one is 
looking for; an alphabetic arrangement would 
have been very helpful. The discussion advances 
knowledge on some problems of scoring, e.g., 
the P response and the problem of "sequence" 
and “approach.” The problem of the scoring F 
plus and F minus responses, as well as the 
scoring of some M responses (e.g. animal 
movement), are still far from being settled. The 
reviewer agrees with Beck that the original (O) 
response cannot be analysed in a quantitative 
way ; however, original and individual responses 
as scored by Rorschach may still be of great 
clinical significance. The old question arises as 
to whether the Rorschach method can be made 
an exact quantitative test. Attempts in this direc- 
tion should be encouraged but the limitations of 
such a procedure at the present stage of de- 
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velopment of the test ought to be kept constantly 
in mind. For instance, the reviewer is not con- 
vinced that Beck's elaborate method of scoring 
the organization (Z) response is worth the 

_effort, although every Rorschach worker will 
agree as to the qualitative value of the (Z) 
response. The bibliography is short and does not 
do justice to other workers in the field. It is 
difficult to come to a definite opinion about this 
book without seeing its counterpart on interpre- 
tation or, as Beck optimistically says, "the cor- 
relation of clinical and Rorschach behavior." So 
far, our observation of both clinical [and Ror- 
schach] behavior seems far too vague and un- 
certain to warrant such optimism ; but it is stim- 
ulating to keep such endeavor in mind as a 
distant goal. Certainly agreements on scoring 
can be reached more easily than agreements on 
interpretation, just as agreements on grammar 
and syntax can be reached more easily than 
agreenients on meaning; but scoring and inter- 
pretation are interdependent and the various 
elements should only be differentiated in the 
light of clinical meaning. Beck has made a real 
contribution with’ his first book, and much may 
be expected of the second, after the completion 
of the chapters on interpretation and clinical 
application, Generally speaking the present vol- 
ume is a practical, helpful manual and in one 
respect constitutes a step ahead. The current 
books on the Rorschach method indicate the end 
of a period in which this valuable auxiliary tech- 
nique ceases to be a secret cult which could be 
transmitted only to members of the tribe. 

Am J Psychol 58:145-6 Ja’45. M. R. Har- 
rower—Erickson. The Rorschach method can 
never be learned theoretically. Thus no basic 
text concerning it, no matter how clear, sys- 
tematic or thorough it is, can provide a substi- 
tute for the collection of records which every 
experimenter must obtain before he is letter 
perfect in the technique. The chief criterion of 
a good text, therefore, is its ability to answer 
the maximal number of questions which the ex- 
perimenter will inevitably ask once he is faced 
with the actual problems of scoring and inter- 
preting specific records. * Beck’s most outstand- 
ing contribution to the inkblot technique is devel- 
oped in the chapter on Organizational Activity, 
where he makes explicit an idea merely hinted 
at by Rorschach; namely, that the manner and 
degree of the subject’s ability to combine and 
synthesize various areas of the blots is impor- 
tant in the estimation of mental activity. Nu- 


merical values have been given to various com- 
binations of details in each of the ten cards, so 
that quantative scores for this activity can be 
obtained. Chapter vii (Approach and Sequence) 
demonstrates a very simple, but telling, manner 
of recording the sequence of responses, so that 
this aspect of the record can be appraised at a 
glance. * Beck has given us a valuable addition 
to the rapidly growing Rorschach literature. 
For while the detailed examples make for dis- 
connected reading by the casual inquirer who is 
unfamiliar with the blots, the book undoubtedly 
answers many of the questions which arise in- 
evitably for the serious student, and this is the 
main function of such a text. 

Brit J Ed Psychol 14:168 N "44. W. D. 
W (all). For serious workers with the Rorschach 
Test this volume is indispensable. Rorschach's 
original scoring categories, conservatively de- 
veloped in the light of much recent research and 
Dr. Beck's own extensive experience, are here 
very clearly illustrated and in some cases statis- 
tically evaluated. Thus the book here considered 
will do much to eliminate the subjective varia- 
tions in scoring responses which have hindered 
validation experiments with Rorschach findings. 
In many categories the author parts company 
with other writers on the subject ; but, although 
the symbols used differ, in most essentials he 
agrees with Klopfer and Kelley, whose book 
was previously reviewed in this Journal. With 
admirable candour he clearly differentiates those 
categories and scorings which are statistically 
justified from those in which subjective judg- 
ment is still paramount in the absence of reliable 
evidence. His lists of common and uncommon 
details, of F+ and F— responses and of sigma 
scores for organisation of details are of great 
practical use, as too are the discussion of ap- 
proach and sequence and the list of popular 
responses. The complexity of scoring which 
necessitates so exhaustive a manual of illustra- 
tive material does, however, raise the question 
of whether the test itself is of more theoretic 
than practical value, whether the time consumed 
in administration and scoring produces a more 
accurate diagnosis than could be obtained by a 
well-directed clinical interview. The answer is, 
I think, that the test is a very useful clinical ad- 
junct, especially with a suspicious or defensive 
patient. 

Brit J Psychol 36:46 S 45. * This book would 
be of great use to anyone well acquainted with 
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the Rorschach*method, but useless to the begin- 
ner. No interpretations are given. 

J Am Med Assn 126:927 D 2°44. * a lucid, 
specific analysis of the way in which individual 
Rorschach responses are evaluated or “scored” 
* After employing scientific experimental pro- 
cedure Dr. Beck has written a technical manual 
replete with countless examples * One of the 
unique contributions..is the chapter on the 
organization activity, wherein the author places a 
relatively unexplored significant mental process 
on a statistical basis. This manual should en- 
able much wider use of the Rorschach test as it 
can be understood by beginners. It likewise can 
be utilized as a reference book for experienced 
testers and fulfils a concete need. 

J Med Soc N J 41:453-4 D "44. F. W. 
O'Brien. A good deal of the confusion attendant 
to the Rorschach test is due to the fact that 
most of the writings on the subject reflect the 
individual interpretation of the examiner in 
which certain personal ramifications are neces- 
sarily included in order to explain such inter- 
pretations. This book is an important step for- 
ward in the use of the Rorschach test. It merely 
attempts to set up a stable and standard method 
of applying the test and cites verbatim reactions 
subsequently recording the reactions under 
standard Rorschach symbols. It should go far 
in setting up standards that will lead to a general 
understanding and greater usage of the test. 
It will also assist in providing a basis of com- 
parison in evaluating the various techniques of 
the examiners. This should result in a higher 
standard generally in the application of the test. 
The book fills an important need in the literature 
relating to the Rorschach test. 

Psychiatric Q 18:518-20 Jl ^44. * seems to 
be the greatest collection of scoring samples 
available in print * To formalize the scoring 
of responses and make it as uniform as pos- 
sible, Beck placed great emphasis on the word- 
ing of the responses. His attempts at stand- 
ardization are in terms of the verbal form of 
the subject's responses rather than in terms of 
the visual images projected by the subjects into 
the blots. These projected visual images cannot 
be directly studied. The subjects have to de- 
scribe them in order to make them accessible 
to the examiner. In the opinion of this reviewer, 
the verbal responses are indispensable but only 
as a means to a goal, not as the goal itself. 
Rorschach himself made this plain, and it is for 
this reason that he called his test a test of per- 


ception, An exclusive reliance on the verbal 
form of the subject's response leads Beck, e.g., 
to score every "butterfly" response to Plate 1 
as an F+ because "butterfly" is a frequent 
response of healthy persons, even when the 
subject's image of the butterfly is quite different 
from that of the healthy person's and incom- 
patible with the real shape of a butterfly. * For- 
tunately for the beginner, for whom the book 
is intended, Beck is not consistent in his attitude 
of formalized standardization. Now and then, 
throughout the book, he departs from a rigid 
procedure and even overrules a subject. * Beck's 
feeling that interest in the psychological signifi- 
cance of the M is lacking is hardly justified in 
view of a number of articles by several authors 
on this very subject. Beck is occasionally polemi- 
cal and somewhat inadequately presents the 
views of those he criticizes. E.g., Klopfer's FM 
or animal movement response and m or inani- 
mate movement response are not “various forms 
of M" but are varieties of the F or form re- 
sponse. Beck justly warns against the "naive 
largesse with this label M." However, he him- 
self is rather generous with the symbol M and 
scores as human movement “a man sitting in an 
armchair" (projected in a tiny edge detail) or 
“the phallus, hanging down, not the erect posi- 
tion” (another tiny edge detail). There is no 
evidence of any motion or kinesthetic feeling 
in either of these responses which the reviewer 
sees as rather typical nongenuine, secondary 
movement responses, which are not scored as M. 
Tn these and similar examples, we are not deal- 
ing “with a movement sensed in the figure” for 
there is even no motion at all, and no kinesthetic 
feeling of tension or readiness of movement, 
The book will definitely aid the beginner in the 
acquisition of good scoring habits. Increased ex- 
perience will guard the student against its 
shortcomings of formalized uniformity. Beck’s 
“Basic Processes” is a welcome addition to the 
growing number of textbooks of the Rorschach 
method. 

Psychoanalytic Q 13:508-12 O '44. Frederic 
S. Weil. * It was the reviewer’s task to probe 
the clinical material and the statistical method, 
on which the scoring of the ‘literally thousands 
of detailed and individual responses’ is based. 
As sources of the responses the author used: 
patients in milder mental disturbance; adults in 
the community at large, comprising persons of 
average, high average and very superior intelli- 
gence, overtly in good mental health; adults in 
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psychoanalytic treatment ; General Hospital pa- 
tients; patients in acute schizophrenic disorgan- 
izations; a small group of patients with brain 
pathology. Thus, the subjects whose records 
were used represent a mixture of overtly sick 
and of doubtfully healthy people in an unknown 
proportion. Such clinical material used as a basis 
for standardizing values of single factors is def- 
initely objectionable, because standards which 
are used for comparison must be based on nor- 
mal clinical material. Out of Beck's six groups, 
only one group (the second) represents overtly 
mentally healthy and intelligent people. But how 
many even intelligent persons, who ‘overtly’ 
seem to be in good mental health, are neurotics ! 
A considerable number of our analytic patients 
belong in this group. On the other hand, there 
is not one factor in the Rorschach test which 
may not be affected by even an inconspicuous 
neurosis. A case in point is the differentiation 
between Form responses, F+ and F—, in Chap- 
ter III. Beck realizes the definite need for 
such a differentiation as well as the difficulty of 
carrying it through, Roughly his criteria boil 
down to the statement that intelligent persons 
perceive good forms, unintelligent persons and 
` schizophrenics perceive poor forms. Thus in 
Table 18 Beck classifies many poor responses 
as F+, e.g., a good number of anatomical and 
geographical interpretations. His criteria are de- 
batable. Many neurotics score an alarmingly low 
F+% in spite of a definite potentially good 
intelligence. Another case in point is the author's 
differentiation between Detail and Rare detail 
responses, dealt with in Chapter III. Here, as 
well as with the differentiation of Form re- 
sponses, it stands to reason that the standards 
should be derived from mentally healthy indi- 
viduals. Furthermore it is unfortunate that in 
this very important chapter Beck bases his con- 
clusions on the statistics of someone else, Dr. 
Ralph R. Brown; nor is it made clear on what 
material these statistics are based. And, above 
all, there is a lack of specifications as to the dif- 
ferentiation between D and Dd in each single 
card. The same objections hold against the au- 
thor’s mode of dealing with the Approach in 
Chapter VII. Beck cites Rorschach’s incidental 
statement in the latter’s posthumously published 
paper: ‘A normal average distribution would 
be 8 W, 23 D and 3 Dd.’ Beck opposes this with 
his own formula: 6 W, 20 D and 4 Dd, which he 
considers as ‘more consistently representative of 
the validating facts.’ However, he omits the 


statistical proof. Had he started out with un- 
questionably mentally healthy and intelligent 
subjects he would have discovered that the for- 
mula derived from such clinical material is 
nearer to the original Rorschach formula than 
to his own. In Chapter XIV the author doubles 
the Popular responses. The statistics are again 
Brown's. Unfortunately they are not appended 
for reasons of space. In Chapter VIII, dealing 
with Movement responses, Beck duly stresses 
the subjective and objective difficulties in verify- 
ing M responses. He says about movement in 
geometric figures and single lines: ‘I myself 
score such responses M as an act of faith in 
Rorschach and Oberholzer rather than from 
conviction based on validated findings’. He need 
not have relied on faith, had he considered such 
interpretations M responses only in persons who 
have perceived many other clearcut M’s. With 
another less kinesthetic individual the same in- 
terpretation may be a definite F. This compli- 
cation is apparently disliked by the author, nor 
would it fit into his neat rubrics. But here, too, 
Freud’s ‘Ça wempèche pas d'exister holds. 
Mutatis mutandis, the same is valid with regard 
to the Color responses, which are dealt with and 
listed in detail in Chapter IX. The author does 
not point out that the same interpretation of the 
same colored entity represents a ptire Form with 
one subject, a Form color response with another 
subject, and even a Color form response with a 
third subject. In Chapter X, devoted to the light 
determined responses, it appears comforting and 
convincing that the author differentiates only 
between the Vista, i.e., the three-dimensional, 
and the flat Grey interpretations. Grey, however, 
is a color, more specifically a color visibly com- 
posed of black and white. Why, then, substitute 
it for the Black white interpretations, which 
Oberholzer has already described and qualified 
as to their clinical significance in his paper on 
post traumatic conditions (1931), and recently 
in Dubois's book about the Aloris. Thus far the 
author has adhered to Rorschach's original con- 
cepts. However, he has introduced changes con- 
cerning the factors as well as the testing pro- 
cedure. The most striking one is that he has 
omitted Rorschach's Original answer, and 
wrongly so. In the reviewer's opinion no other 
factor reflects the education and background of 
a subject or hints at a neurosis as does the pres- 
ence or absence of Original answers. In Chap- 
ter VI Beck introduces a new factor Z, the 
Organization activity. Its justification is hard 
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to challenge as the discussion of its clinical sig- 
nificance is apparently reserved for a second 
volume. However, two objections can already be 
made. First, although Beck gives a list he omits 
the mode of derivation of the stated Organiza- 
tion values for the ro figures. Secondly, Beck 
has already stated in his previous book, the In- 
troduction, that ‘Assoziationsbetrieb is essen- 
tially organization activity’. In the present vol- 
ume he also translates Assoziationsbetrieb as 
‘combining or organizing activity’ as ‘organiza- 
tion drive’, or as ‘organizing activity’. However, 
to associate and to organize are not the same; 
and the question arises whether this is not more 
than just a linguistic misunderstanding. The 
English translators of Rorschach’s Psycho- 
diagnostics consistently translate Assogiations- 
betrieb as ‘associative activity’ which is accept- 
able. As a third change Beck has modified the 
signature: Rorschach’s Do becomes Hdx or 
Adx. The argument that anxiety rather than 
oligophrenia lies behind this reaction does not 
hold water. Did not Rorschach himself state that 
‘Do occur not only in oligophrenics but also in 
depressives and anxious subjects, as well as al- 
ways in compulsives.’ He could have added that 
they occur in subjects who do not see the forest 
for the trees. In other words they occur when an 
obviously specific form of inhibition of thought 
Processes (Denkhemmung) is at work for 
whatever cause: stupidity, depression, anxiety 
or other factors. With regard to the same Chap- 
ter, XV, the reviewer wishes to add that not 
only should the sex interpretations be separated 
from the human detail and listed separately, but 
likewise the anal interpretations. So much for 
the author's modifications of test factors, An- 
other of his modifications applies to the experi- 
mental procedure, namely the so-called inquiry, 
the repetition of the test in reverse in order to 
verify the entities and qualities of the single 
answers. Rorschach obviously did not consider 
such an inquiry necessary. Why should not the 
examiner, if any doubt arises, make the subject 
show the entity immediately after the response 
was given and induce him through neutral ques- 
tions to verify or exclude the doubtful determin- 
ants? If, in a difficult response, the examiner’s 
question should become too outspoken and thus 
risk a betrayal of possible determinants, one may 
Postpone the clarification of this specific re- 
Sponse until the end. But why, for this one in- 
Stance, repeat the entire test? The immediate 
questioning, directly after a response is given, 


offers the decisive advantage of reliability, a 
reliability which is lessened after a patient has 
gone through the entire test when he is likely 
to change his interpretations or even not to rec- 
ognize them again or to see them in a different 
way. The last possibility, namely additional new 
interpretations, are explicitly disregarded even 
by Beck. One more remark about the author's 
technique: In Chapter IV, on Scoring, he says: 
*.. . for permanent recording of these very rare 
as well as of the poorly outlined Dd the best 
method is to use a facsimile of the Rorschach 
test card.’ The reviewer still considers tracing 
the more exact method for this purpose. The 
facsimiles leaves out the most subtle details, and 
the fact that a Dd was seen may be much less 
decisive for the diagnosis than its specific type. 
A technical review of a technical book neces- 
sarily sounds more critical than it actually is. 
It is to be hoped that the second volume will 
appear soon. 

Psychol B 43:379-80 J1'46. Robert M. Lind- 
ner. It is unquestionably true that Dr. Beck has 
once more demonstrated his competence and 
scholarship by producing this book. The ques- 
tion, however, is just how much is contributed 
by it—and by other like material—to American 
psychology and psychiatry. From where this re- 
viewer sits, the answer to the question is dis- 
appointing. Volume I of Beck's Rorschach's 
Test proposes to bring the presumptive reader 
up to date in Dr. Beck's cherished ambition to 
standardize and "demonstrate the processes 
used in evaluating Rorschach test responses." Tt 
fulfills its announced intention by painstakingly 
and deliberately detailing responses, rationaliz- 
ing the usé of the various symbols, explaining 
and cataloguing, arguing and justifying, through 
sixteen laborious chapters chaperoned by a brief 
bibliography, two appended tables, and ten dis- 
jointed schema of the blots. With commentary 
added to commentary, as one reads it, it takes 
on more and more the character of the Baby- 
lonian Talmud. To read such a book is obviously 
out of the question: to study it requires a fixity 
of purpose amounting almost to a dedication. Its 
real service will be as another standard refer- 
ence work in a young but already needlessly 
over-complicated field. The practicing psychia- 
trist or psychologist is coming to rely upon the 
Rorschach as he does upon almost no other tool. 
It is a good technique, perhaps the best of the 
projective methods, and he appreciates what it 
can do for him by way of evaluating the per- 
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sonality and, indeed, for certain services no- 
where else available. He regards it practically, 
as an invaluable instrument for diagnosis, as an 
index and guide to therapeusis, as a clue to 
treatment progress. Hence the cultism of Ror- 
schachers, their allegiances, their bickerings and 
jealousies, bore and distract him; so also do 
their endless disputes over—let us say—the 
criteria of distinction between one F and an- 
other. What the clinician wants from the Ror- 
schach worker and researcher are ways of in- 
creasing the sensitivity of the test, enlarging its 
scope, and making it more informative. The 
present book, on the other hand, leaves the field 
where it was before it was written. Perhaps the 
implied criticism of this review is unjustified in 
view of the fact that the volume under con- 
sideration really purports to do nothing more 
than reveal "basic processes." If this is so, how- 
ever, then it must be taken as but another symp- 
tom of that compulsive over-zealousness to sub- 
‘stitute cold hieroglyphs for warm humans that 

“so diverts the stream of clinical endeavor in the 
behavior sciences. As for the volume itself, from 
the point of view of sheer book-making it is a 
satisfactory job of work. While the material 
could have been arranged in a manner more con- 
siderate of the reader and more sparing on his 
patience, it must be considered that merely set- 
ting up (much less proof reading) the text was 
a staggering accomplishment. For those who are 
preoccupied with scoring problems in Rorschach 
work and who adhere to the Beck method as 
exposited already in his Manual and other pub- 
lished works, this volume is an amplification and 
extension of what they already know. But for 
those who are seeking help in their efforts to 
help others, this is merely another book bound 
in blue. 

Psychosom Med 6:342-3 O’44. Saul Rosen- 
zweig. Any book concerned with the Rorschach 
test is inevitably of interest to the proponents 
of psychosomatic medicine. Few trustworthy 
and penetrating psychological tools are available 
for the study of the total personality and among 
these the Rorschach procedure is probably the 
most distinguished. In the present volume Beck 
continues his pioneer work with this test, which 
he first expertly represented in this country 
after an apprenticeship in Zurich with Ober- 
holzer, himself a direct disciple of Rorschach. 
Against this historical background the title of 
the book is significantly laconic. Many “Ror- 
schachers” have departed radically and, accord- 


ing to Beck’s implication, dangerously from the 
original test. Some are self-styled experts and 
others have little adequate training or experi- 
ence with the technique in its various clinical 
applications. Since the Rorschach of all tests is 
most technical and least adapted for popular use, 
Beck is concerned over its abuse. The title 
Rorschach’s Test—sets forth in a word what 
he hopes to achieve by the publication. * The 
scoring categories here included do not differ 
very much from those originally used by Ror- 
schach himself, Beck is, in a sense, a defender of 
the faith though he admits readiness to accept 
new scoring categories and symbols once they 
have proved empirically to be an advance over 
the original test procedure. * Shading responses 
are given a somewhat novel treatment * The 
organization response—Z—represents Beck's 
most original contribution though based upon 
suggestions contained in Rorschach’s own state- 
ments. When two or more parts of the blot 
figure are seen in relation to one another with 
a resultant emergence of new meaning the Z 
score is applied. This category aligns itself with 
the W and M factors as indicating creative or 
generalizing capacity but appears to reveal as- 
pects of the personality which the other two 
categories less specifically indicate. On the basis 
of considerable research and careful statistical 
evaluation, examples for the scoring of Z in the 
various figures are given in the text and a table 
in the Appendices makes available the Z values 
for various types of organization encountered in 
the ten figures. An evaluation of the book must 
inevitably emphasize the positive features of 
which it has so many. The conservatism of the 
author will strike many users of the method as 
salutary. It is easy to be carried away by intui- 
tive fancy in almost any projective procedure. 
By emphasizing the objective and, where pos- 
sible, the statistical validation of the scoring 
categories Beck has done much to make the 
already-mentioned conservatism progressive. 
He has not, however, sacrificed psychological 
meaning by making clinical judgment secondary 
to statistical objectivity. On the whole the book 
will probably be easier for most students to use 
than the already available texts because, for one 
thing, it employs a minimum of scoring cate- 
gories. It is true that the examples of responses 
given by Beck are not always easy to follow: 
that some of the discussions of the scoring 
categories, e.g:, F+, could with profit be ex- 
panded; and that an occasional inconsistency 
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mars the going. But such minor weaknesses are 
inevitable in a first edition. In future editions 
improvements may undoubtedly be expected. 
One such emendation that cannot be too strongly 
recommended is a longer and more complete 
discussion of how much constitutes one re- 
sponse for scoring purposes. The second volume 
of the present work, which will complete the 
discussion of the method by adding the all im- 
portant principles of interpretation and provide 
representative examples of total records, will 
be awaited by many with eagerness. 

O R Biol 20:191 Je *45. Wendell Muncie. * 
will be found very useful to Rorschach testers. 

For additional reviews, see 77. 
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X*[Re Rorschach.] Brck, Samuer J. Rorschach's 
Test: II, A Variety of Personality Pictures. Fore- 
word by Roy R. Grinker. New York: Grune & Strat- 
ton, Inc., 1945. Pp. xii, 402. $5.00. * (PA 20:1150) 


Am J Psychol 59:704-9 O 746. Donald E. 
Super. * in no scientific sense an experimental 
work * The work is refreshing, for the author 
appreciates that the last word has not been said 
on the subject and he makes some attempt to 
relate theory of personality as developed from 
Rorschach material to more general theories. 
One oriented to psychological theory and 
method is inclined, however, to wonder, upon 
reading the forword by Grinker, Beck’s chief 
at Michael Reese Hospital, whether Beck was 
ina position to do scientific work. In this brief 
exposition of his views, Grinker demonstrates 
a disregard of scientific method and of psycho- 
logical fact which psychologists associated with 
him in the Air Forces came to know and to 
distrust. Describing the Rorschach test as a 

research tool” of great importance, he pro- 
ceeds to attribute its improvement, not to the 
use of accepted scientific procedures but to clini- 
cal study by investigators who have used “every 
assistance from clinical psychiatry, psychoanaly- 
sis, and sociology . . .’—not, apparently, from 
psychology, with its emphasis upon avoiding 
the contamination of data, upon the use of ex- 
perimental and control groups, upon the accu- 
mulation of facts in quantifiable form, the de- 
velopment of norms, and the use of tests of 
Significance as illustrated in Rorschachean work 
by Munroe’s fruitful research. Within the limits 
set by this unscientific dictatorship, Beck has 
produced a volume of considerable interest to 
students of personality and of Rorschach’s tech- 


nique. In the opening chapter he outlines a 
theory of personality as background for his in- 
terpretation of data. This has been too rarely 
done by followers of Rorschach, despite their 
developilent of a certain type of theory from the 
inkblots. This chapter is very brief (9 pages) 
and sketchy, undocumented by evidence, and 
somewhat dogmatic. It is followed by a longer 
chapter on the psychologic (sic) significance of 
Rorschach test-factors, a discussion which helps 
Beck to achieve his translation of Rorschach 
into the language of the clinic. Here again are 
evidences of the conflict between unscientific 
orientation and a desire to conform to sound 
standards of investigation. * At times the variety 
of qualifying data to be kept in mind in inter- 
pretation is frightening. * it is clear that the 
interpretation of a single protocol is at least as 
complex a matter as the interpretation of a clini- 
cal case-history, making the Rorschach, as used 
by Beck, rather a diagnostic method than a test. 
* There is no hesitancy, it seems, in accepting 
...[an] opinion of Rorschach’s as evidence, de- 
spite a quarter-century of subsequent work with 
the inkblots, without citing evidence on the 
point. * Beck shows commendable caution when 
he refrains (pp. 32, 33) from attaching signifi- 
cance to the little understood phenomenon of 
colornaming * forty Rorschach records, each 
discussed at considerable length (331 pages in 
all) * It would have been helpful had each rec- 
ord been preceded by a systematic discussion of 
the condition, as seen, e.g., in Bochner and Hal- 
pern. There are no records of organic, psycho- 
pathic and various other categories. As detailed 
records and interpretations of cases in the cate- 
gories covered, this part of the book will be use- 
ful to the student of Rorschach. It gives a clear 
picture of the interpretation of the Rorschach 
data and of the derivation of a description there- 
from of personality. As a means of becoming 
familiar with the Rorschach technique this book, 
with its companion volume, is unexcelled in 
clarity and detail. As a source of information 
about, or demonstration of, the validity of the 
test, neither of which it purports.to be, it is 
disappointing, A Rorschach ‘manual’ may in 
time be written with validities included. Or 
possibly—as in the case of the Revised Stanford 
Binet—a separate volume on validity will fol- 
low the manual. But the data must first be sys- 
tematically gathered, tabulated, and analyzed— 
some brands of ‘clinical psychiatry’ to the con- 
trary notwithstanding.. 
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Brit J Psychol 36:177 My 46. * This volume 
continues with the interpretation of responses. 
A detailed discussion of the interpretation and 
its psychological significance, suitable for the 
Rorschach expert, is given first ; and is followed 
by case histories to illustrate differences of in- 
telligence, and studies of maladjusted adoles- 
cents, neurotic adults and schizophrenics. There 
are, however, no cases of normal children. A 
comprehensive bibliography is given. 

J Abn & Social Psychol 41:233-6 Ap '46. 
Albert I. Rabin. The long-awaited volume on 
the interpretation of the Rorschach has finally 
appeared. Beck’s first volume, in which the me- 
chanics and techniques of scoring Rorschach 
records were given detailed exposition, ap- 
peared more than a year ago. It showed a con- 
siderable degree of crystallization of method 
and reflected the author’s rich experience with 
the test, to some extent modulated by his ex- 
perimental attitude. In the two volumes of the 
present series, the Rorschach has come of age; 
it is called a test rather than method or experi- 
ment, the terms with which Beck introduced it 
in his first manual, back in 1937. The feeling is 
implicit that the author now considers the 
Rorschach a more solidified and less tentative 
instrument in the evaluation of the human per- 
sonality than he did eight years ago. The tre- 
mendous amount of work done with this 
method during recent years and the accumu- 
lated experience with it justify to a consider- 
able degree such a viewpoint. The present 
volume is truly a "second" book in the study 
of the Rorschach test. Anyone attempting to 
read this book must be thoroughly familiar 
with the Rorschach "language," the scoring 
symbols, and the method of making a record 
summary. It is the author's assumption that the 
basic material of Volume I or its equivalent 
has been mastered and can be applied in fol- 
lowing the personality analyses comprising the 
bulk of this book. * Part I is devoted to a 
general statement of principles and a descrip- 
tion of personality components as related to the 
Rorschach, while the second part, which covers 
more than 80 per cent of the entire volume, is 
devoted to a detailed and masterful analysis of 
forty-seven Rorschach records representing a 
variety of diagnostic, normal and abnormal, 
groups. Though Beck disclaims any "effort at 
outlining any theory of personality," this is ex- 
actly what he is attempting to do in the first 
brief chapter of the book. It seems to be an 


attempt to bridge the gap between scientific, 
academic, and experimental psychology, on the 
one hand, and clinical psychology and psychia- 
try, on the other. The chapter is all too sketchy 
to accomplish such a task and leaves some 
dichotomies like "abstract" vs. "practical" in- 
telligence, or "excitement" vs. "oppression," 
insufficiently clarified in terms of their rela- 
tionship to the mass of scientific data compiled 
during many years of psychological research. 
Yet, it is difficult to find a better presentation 
in which the "island" of Rorschach personality 
theory approaches any closer to the "mainland" 
of the general psychology of personality. The 
admission that the Rorschach Test does not at- 
tempt to give a “complete” personality picture 
is quite evident; it selects personality factors 
and traits and shows their interrelationships. * 
A notable refinement in his new scoring as 
compared with his previous publications is the 
more precise differentiation between surface 
color (Y) and vista and perspective (V) in the 
black-grey cards. Different determination of 
rare and common details is also obvious, Even 
the “shading shock” (Klopfer), or, as Beck 
calls it, “grey-black shock,” is given a place of 
prominence in the array of Rorschach factors. 
Throughout this part of the book, which draws 
considerably upon psychoanalytic theory, Beck 
remains the cautious and eclectic scientific ex- 
perimentalist dealing, basically, with an em- 
pirical method of personality evaluation. 
Furthermore, it is quite apparent that Beck 
does not consider the Rorschach as a “closed 
and circumscribed system,” but as an instru- 
ment which possesses a number of tentative and 
provisional aspects, and which is in need of 
further refinement, research, and validation. 
The alert reader will find, in this part of the 
book, a number of direct as well as indirect 
suggestions for further inquiry and investiga- 
tion. Part II of this volume is devoted entirely 
to the detailed protocols and interpretations of 
the records of some forty individuals. The first 
ten records cover, in the author’s words, the 
“intelligence curve.” * It is understandable that 
the author was not at liberty to give full and 
adequate clinical personality characterizations 
of these individuals. Had he not been governed 
by the need for the anonymity of these sub- 
jects, a better understanding of the relationship 
between the Rorschach factors and the person- 
ality patterns would have been gained. The nine 
additional records of adolescents,«within the 
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normal range, are a particularly valuable con- 
tribution to the Rorschach literature, Their 
value is to be found especially in the aid they 
may provide in differential diagnosis. Many 
"normal" adolescent records resemble those of 
psychotic patients, Interpretative caution, there- 
fore, is in order. The remaining records, with 
the exception of the last section, deal with 
"schizophrenic solutions" and "neurotic strug- 
gles.” There, a wealth of material, including 
clinical notes, is presented. Beck brings into 
play abundant clinical experience as well as 
intuitive clinical sense and sheer artistic 
Rorschach empathy. The interpretations are 
penetrating and the constant reference to the 
factors and the records is very instructive. In 
the last section, four “before and after” 
Rorschach studies are analyzed. The clinician 
will find the analysis of the first case of special 
value, The occasionally difficult problem of 
differential diagnosis between schizophrenia 
and manic depressive psychosis is brought out 
and made salient. Unfortunately, the circular 
psychoses as such receive little attention in this 
book, The only other case dealing with this 
type of disorder may also be found in this sec- 
tion. The groupings of organic and convulsive 
disorders, psychopathic states, and behavior 
disorders and studies of the Rorschach in child- 
hood are omitted. The reason for this omission 
is not clear. It may well become the subject of 
a third volume in the present series. In his 1937 
monograph, Beck prefaced the records of each 
diagnostic group with a descriptive summary of 
Rorschach relationships and a quantitative 
table of Rorschach signs and their frequency of 
occurrence in the particular group under con- 
sideration. The student found these summaries 
very helpful, though occasionally misleading. 
It is to be questioned, however, whether the 
abandonment of this method of presentation is 
advisable, Lack of a reference point, especially 
for the beginner, makes the task of Rorschach 
mastery even more formidable than it would 
be otherwise. The inclusion of some “inade- 
quate” records would also have been worth- 
While, Every Rorschach worker has the 
misfortune of obtaining such records. How far 
the interpretation should or may go in these 
cases is a point worthy of consideration. The 
present volume represents the culmination of 
the author’s work with the Rorschach method 
over a period of nearly twenty years. It repre- 
sents the cumulative efforts of many investi- 


gators, among whom the author is a pioneer. 
Dr. Beck’s continuous efforts to integrate 
Rorschach’s insights and empiricism with mod- 
ern psychological and psychiatric views of per- 
sonality have had a salutary effect upon many 
clinicians and are amply reflected in this book. 
No psychologist who is interested in the 
Rorschach, whether from the clinician’s view- 
point or from the point of view of the in- 
vestigator or research worker, can afford to 
overlook this volume. It is one of the most 
basic works in the entire Rorschach literature. 
Psychoanalytic Q 15:383-6 Jl '46. Frederic 
S. Weil. * Beck convincingly offers a develop- - 
mental exposition of the meaning of the Color 
responses and also discusses the Vista and 
Gray answers, the color shock (‘neurosis’), 
and the gray-black shock (‘anxiety’) * Con- 
sidering the great importance that Beck at- 
tributes to the associational contents, its 
richness and breadth, his insistence on discard- 
ing Rorschach’s ‘Original Response’ seems 
even more surprising. His main argument, lack 
of frame of reference, can easily be refuted by 
quoting Rorschach’s definition of Original Re- 
sponse: ‘interpretations that are given in about 
one hundred experiments with normal people 
about once . . . according to their quality they are 
either good or poor. Original or Original— 
... Beck begs the question in his polemic by 
substituting for Rorschach’s clearly defined 
technical term Original Response the word 
‘original . . . as it is commonly understood.’ How- 
ever, in one respect he is right: apart from the 
example in his Psychodiagnostics, Rorschach 
did not leave a list of Original Responses, Yet 
lack of sufficient statistics is no reason for dis- 
carding a factor, and one wonders why Beck, 
who checked so many Rorschach factors statis- 
tically did not do so with the Original Re- 
sponse. The answer is that Rorschach’s 
Original Response is derived from a healthy 
intelligent average, whereas Beck’s statistical 
raw material, as criticized in the review of his 
previous book, represents a mixture of overtly 
sick and doubtfully healthy people in an un- 
known proportion, which could neither furnish 
nor check Rorschach’s Original Response. This 
general disregard for the normality of his raw 
material in his first book is also conspicuously 
present in the records of this second volume. 
Thus the Intelligence Curve, that runs with 
ten records from most superior via middle 
range to feebleminded, includes one single case 
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which is not complicated by a more or less 
severe neurosis, a possible psychosis, a psy- 
chopathy or immaturity, although all these ad- 
ditional clinical factors change the Intelligence 
factors in the Rorschach test. Of the eight 
cases of the Adolescent Years some are suspi- 
ciously schizophrenic and all are at least so 
neurotic that their pathology overshadows 
"whatever may be characteristic of the adoles- 
cent years in the Rorschach test. Among the 
eleven records of the Schizophrenic Solutions, 
on the other hand, there are two from non- 
schizophrenic mothers of these patients. One 
mother, according to Beck's Rorschach inter- 
pretations, ‘plays the róle opposite that of her 
daughter’ whereas ‘passivity seen in the (other) 
mother progresses to its ultimate stage (in her 
daughter). "Neither ‘playing of the opposite 
róle' nor ‘extreme passivity’ make for a schizo- 
phrenia as far as we know, nor can the Ror- 
schach prove it, even if Beck’s title Schizo- 
phrenic Solutions suggests a psychogenic etiol- 
ogy of this still enigmatical psychosis. The short 
and insufficient Clinical Notes present another 
defect. The reader, presumably a Rorschach 
student, looks to them for crucial proof of the 
validity of the test, ie, whether Rorschach's 
assumptions, deductions and connections are 
borne out or not. What he gets are interpreta- 
tions of several pages which are verified not with 
a clinical description but with a few lines such 
as: “The fact that this man is the president of a 
university is in itself a validation of the Ror- 
schach findings,’ or, ‘This woman ... is a pro- 
fessor . . .’ Where the Clinical Notes are less 
laconic they show, with the exception of the 
records of schizophrenics and feebleminded, in- 
sufficient relation to the Rorschach interpreta- 
tions—they are neither proof for nor against the 
interpretations, neither modifications nor quali- 
fications of them. Beck seems to be aware of that 
deficiency ; however, excuses such as: ‘Condi- 
tions precluded using biographical information 
or inquiry,’ are hardly admissible, since clinical 
pictures can be given without biographical data, 
or more suitable material could have been 
chosen instead. In this connection it should be 
said that the titles of the cases are misleading, 
as they epitomize the Clinical Notes through- 
out, regardless of whether or not the Rorschach 
interpretation bears out the Clinical Note, For 
instance, the case with the caption Compulsion 
in a Young Woman is according to Beck’s in- 
terpretation a neurosis, although not a compul- 


sive-obsessive one, Nor in A Male Homosexual, 
can Beck or anyone deduce homosexuality 
from the Rorschach record at all (as is often 
the case). More instances could be easily 
quoted. Incidentally, the presentation of each 
case in the following order: Response Sum- 
mary—Interpretation of the Record—Clinical 
Note—Response Record, does not seem logical, 
as the interpretation refers again and again to 
the response record, Any attentive reader nat- 
urally will start reading the response record, 
then the summary, afterwards the interpreta- 
tion which he will finally compare with the 
clinical note. It would go beyond the scope of 
a review to take up each of the interpretations. 
One symptom, however, which is contained in 
almost all of them should be singled out. With 
the exception of the few feebleminded and 
Some schizophrenics there is hardly a record 
from which Beck does not infer anxiety. This 
ubiquity deprives the finding of a more specific 
significance. Even conceding that there is anx- 
iety in the last analysis behind most psycho- 
pathological phenomena, clinically we are more 
interested in knowing whether anxiety is bound, 
for instance, by an obsessive-compulsive neu- 
rosis or is overt, and how it manifests itself— 
as free anxiety, or as part of a basic mood and 
so forth. Here adequate clinical notes could 
have demonstrated that the Rorschach can 
Show more than unqualified anxiety. Beck's 
two volumes on the Rorschach test are com 
ceived as a comprehensive textbook, Such an 
enterprise would serve its didactic purpose even 
better could some amendments be made in a 
future edition. 

Psychosom Med 8:365-6 S-O ’46. Saul Ros- 
enzweig. * Beck has made a distinguished con- 
tribution in this second part of his work. While 
Chapter I is perhaps unduly impressionistic 
and will raise questions in the minds of more 
systematic thinkers than he, the second chapter 
provides a distinct compensation. Tn the latter, 
one finds within the compass of 50 pages the 
best existing exposition of the significance of 
the various Rorschach categories in their mul- 
tiple relationships. The account is by no means 
uniform as to fullness of detail but this treat- 
ment is in agreement with the author's avail- 
able information. He studiously admits, when 
necessary, the incompleteness of knowledge 
regarding one or another test factor, One 
misses, however, an exposition of some of the 
general principles in terms of which the Ror- 
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schach method—in common, to some degree, 
with other projective techniques—must be in- 
terpreted. The student could profit considerably 
from a better appreciation of the postulates 
upon which Rorschach interpretation proceeds: 
the nature of norms, the general significance of 
patterns in Rorschach results and the use of 
“signs,” as well as the more general principles 
connected with the “analogizing” upon which 
Rorschach interpretations are founded. Obvi- 
ously this omission can be supplied by an in- 
structor using the text, but the inclusion of 
such a discussion would have helped the aver- 
age student a good deal in his orientation to the 
less general discussion in Chapter II and to the 
concrete case presentations that follow. The sec- 
ond main portion of the work—the case illus- 
trations—represents a unique contribution to 
Rorschach literature. Nowhere else is to be 
found a similar array of detailed Rorschach 
records with a full and, at times, bold interpre- 
tation of the entailed personality pictures. Re- 
gardless of whether every Rorschach examiner 
—let alone every psychologist—would agree 
with the interpretations made by Beck, the 
mere publication of these records, together 
with the full statement of conclusions from 
each, makes it possible for the method to 
undergo a critical examination at the hands of 
both the initiated and the uninitiated—an op- 
portunity unparalleled in the literature to date. 
The selection of the cases is, on the whole, 
broad, there being included the various levels 
of intelligence, records from adolescent sub- 
jects, the varieties of schizophrenia, and the 
neuroses, as well as four paired records—be- 
fore and after various critical experiences: 
psychosis and remission, murder and imprison- 
ment, before and during psychoanalysis. Many 
readers will miss illustrative records from sub- 
jects younger than the adolescent group and 
some will regret the absence of cases of the 
organic and affective psychoses. But Beck was 
obviously not attempting to be exhaustive; he 
preferred apparently to present cases repre- 
senting the stress of his own experience. Those 
acquainted with Beck’s first book—Introduc- 
tion to the Rorschach Method—will note a 
methodological change in the present volume 
which is at the same time both more and less 
conservative. While in the earlier work the 
various chapters, each devoted to a particular 
diagnostic category, were introduced by a page 
or two of general discussion, preceded by a 


paradigm for the correlated Rorschach pattern, 
the present volume notably omits such schema- 
tizations. Each chapter is baldly empirical. 
Such conclusions as are to be drawn about the . 
feebleminded, the neurotic or the schizophrenic 
individual, for example, must be drawn by in- 
duction from the individual case presentations 
as supplemented by the general comments in 
Chapter II. What one finds in these diagnostic 
chapters, instead of the omitted general charac- 
terizations, is an exhaustive employment of 
detail interpreted on the postulate of psychic 
determinism. The close relationship between 
the psychoanalytic approach and the Rorschach 
method—which was, of course, inherent in 
Rorschach’s own original orientation—is evi- 
dent in the present work, The style in which the 
case interpretations are written will at times 
pique the reader. An occasional flash of dogma- 
tism—a conclusion drawn from evidence seem- 
ingly insufficient or, at least, insufficiently 
presented—is fortified by a too carefully chosen 
phrase. One is sometimes overconscious of the 
care with which the author has selected his 
words or phrases—as if he were attempting to 
avoid somewhat too conscientiously the repeti- 
tion of some essential common expression. The 
defect is, however, not serious since, on the 
whole, the exposition is lucid and one almost 
always knows to which details of the record a 
particular interpretive statement owes its exist- 
ence. In general, this second volume is superior 
to the first. It more nearly represents a unique 
contribution. The importance of the earlier vol- 
ume is nonetheless very real—as was brought 
out in an earlier review. No reader of the total 
work can fail to appreciate the sincere effort of 
the author to establish the Rorschach method 
on a firm research basis, his willingness to pre- 
sent his interpretations—sound or unsound— 
for what they may be worth, and a certain evi- 
dence of liveliness in the understanding of the 
individual human being, which is all too in- 
frequently encountered in textbooks of per- 
sonality. Taken together Beck's two volumes 
constitute the best existing single work on the 
Rorschach method of personality diagnosis. 
Rorschach Res Exch 10:37-9 Mr '46. Philip 
Lawrence Harriman. Forty-seven complete 
Rorschach records, with the subjects’ own re- 
sponses and with Beck's interpretations of the 
structures of personalities revealed therein, are 
a contribution of major importance not only to 
the specialist in Rorschach methodology but 
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also to the general student of human psychol- 
ogy. Here, in fact, are the data which reveal 
the inadequacy of contemporary textbooks pur- 
porting to deal with personality from the stand- 
point of modern psychology. Though the title 
of this volume may deter many psychologists 
from reading it, actually the case materials are 
of such importance that they will justify thor- 
ough study by those who have never been con- 
vinced of the validity of the Rorschach 
technique, The responses, though occasionally 
a bit fragmentary and sometimes edited, are 
adequate for the experimental psychologist; 
and the stimuli are easily identified through 
Beck’s exposition (Volume I, pp. 13-57). 
Apart, therefore, from the technicalities which 
are of particular interest to the student of the 
Rorschach method, here is a contribution which 
should be called to the attention of experimen- 
tal psychologists who are concerned with per- 
ception or with individual differences as well 
as with explorations into personality. Conse- 
quently, the reviewer hopes that, after deleting 
Chapter I, his colleagues in the Rorschach 
group will urge all psychologists to put this 
book on their own list for required reading. 
As a matter of fact, it is somewhat perplexing 
to understand why Beck has developed no 
more tenable theory of personality than that 
which he expounds in Chapter I. The case ma- 
terials are all here (pp. 63-392) from which 
one of the most substantial contributions to the 
theory of personality might be written. Unfor- 
tunately, Beck has expounded his theory of 
personality at the beginning of his book. In 
none of the seven footnotes to this chapter does 
he reveal an acquaintance with the extensive 
literature, particularly the major experimental 
findings and the newer points of view ex- 
pounded in books and journals, that have been 
published in recent years. His own theory 
seems, at least to this reviewer, to be a hodge- 
podge of speculation and dogmatism, Does he 
lack a frame of reference for his appraisals of 
Rorschach records, or is he hesitant about ex- 
pounding his views? Why does he first present 
a superficial theory, and then (p. 9) deny mak- 
ing “an effort at outlining any theory of 
personality”? The records and their interpre- 
tations (pp. 63-392) are ample illustrations of 
the fact that he has disappointed his readers 
with withholding the insights which he has de- 
veloped during his years of research. Chapter 
II deals with the meanings of the various Ror- 


schach factors as scored by the Beck method. 
Here in fifty-two pages, are compressed the 
discussions which must some day be expanded 
into a large volume, each major point being 
supported by reference to experimental or clin- 
ical evidence. Frankly, the reviewer would pre- 
fer to keep this material from the eyes of 
“academic” psychologists until more nearly 
conclusive proofs of Rorschach validity are 
available whereby to combat their scepticism. 
Nineteen separate references given in foot- 
notes, one being a novel by Sigrid Undset and 
another being an article by Alfred Adler, are 
decidedly inadequate to represent the well- 
established principles now available for the in- 
terpretation of Rorschach factors. Though the 
experienced Rorschach investigator will read 
some of this material with benefit, the chapter 
disappoints the inquirer who can no longer be 
satisfied with mysticism, glib generalizations, 
and armchair theories. Were the chapter writ- 
ten twenty years ago, it might be regarded as a 
challenge to investigators. Now, however, many 
dependable principles have been established by 
Beck and others, and the literature abounds in 
reports which validate many procedures in in- 
terpretative work. Of course, much remains to 
be done in as thorough a manner, for example, 
as that which Beck has worked out for the 
feeble-minded and the schizophrenic. Curi- 
ously, he appears to be indifferent towards the 
contributions of many other investigators. This 
chapter, therefore, is by no means the best dis- 
cussion of the psychological significance of 
Rorschach factors that could now be written. 
On the other hand, if the reader be motivated 
to embark upon the fascinating exploration of 
other literature on this subject, the chapter will 
serve a desirable purpose. That which really 
makes this volume a substantial contribution 
to the field of psychology is found in Part II. 
Here for the first time are the raw materials 
for a study of forty-three diverse personalities 
explored by means of the Rorschach technique. 
The reviewer hopes that Beck will publish 
more records in the years to come, particularly 
those of young children and of the aged, both 
of which types are not represented here. The 
Tesponse summaries are given in terms of 
Beck’s own methods of scoring, interpretations 
of varying lengths follow, and then come the 
original responses and the scorings. The clini- 
cal notes on each case are too brief to satisfy 
the reader, and Beck would do a great service 
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to the science of psychology if he would prepare 
cases for subsequent publication in which all 
pertinent data are given to help in evaluating the 
Rorschach findings. The third chapter presents 
records illustrating various grades of intelligence 
from that of a brilliant university president to 
the feeble-minded. Since many books of this type 
present only the records of deviates, it is a de- 
light to find here four records of very superior 
persons and three of people in the middle or 
average range of intelligence, Readers who deal 
with normal adolescents, either as parents or as 
teachers, are likely to raise an eyebrow when 
they come to Chapter IV, “The Adolescent 
Years.” Each of the eight records presents the 
Rorschach test results for a maladjusted adoles- 
cent. Years ago psychologists exploded the myth 
that this period in the life span is marked by 
stress and strain, feelings of insecurity, conflict 
with parents, and stubbornness. Of course, Beck 
may have made an unwise selection from his 
files to represent the Rorschach findings of the 
years of adolescence. On the other hand, his own 
clinical experiences may have given him the 
“jaundiced eye.” More than a quarter century 
of intimate association with large numbers of 
adolescents in school, on the athletic field, and 
at social gatherings make this reviewer question 
the wisdom of offering these records as "typical" 
of what Rorschach investigators find. Since Beck 
himself advocates the need for extensive ac- 
quaintance with various types of people on the 
part of the Rorschach expert, it would not be 
amiss to caution the reader to study modern 
youth before accepting these records as repre- 
sentative. Chapter V, which presents eleven rec- 
ords of schizophrenics, is excellent. It would be 
most enlightening to have Boisen review these 
cases in the light of his own experience and eval- 
uate the interpretations which Beck has given. 
The clinical notes in this section are satisfyingly 
adequate. In short, this is the best chapter in 
the book. Chapter VI, “Neurotic Struggles,” is 
rather disappointing because the writer has a 
limited concept of the neuroses. It is doubtful 
whether they can be interpreted, as Beck insists 
upon doing, in vacuo, with cavalier disregard of 
the milieu in which they occur. At any rate, here 
are the materials to enrich a course in mental 
hygiene as well as to interest the experienced 
student of the Rorschach method. One could 
wish that Beck had written at length on his con- 
cept of the neuroses, especially as he may dif- 
ferentiate the creative person from the individual 


who is simply neurotic. This reviewer is left 
with an opinion that if Beck had a broader 
experience with “normal” people he might have 
given better interpretations of these records. 
The final chapter presents pairs of records for 
single individuals, the first pair being those 
before and after remission from a psychosis in 
which schizophrenia and hypomania are present. 
These records repay careful study. Excitement 
and depression are revealed in the next pair. 
Tests of a murderer, tested on entering prison 
andagain after twenty-nine months of imprison- 
ment, are given in full, together with interesting 
observations and interpretations by Beck. Fi- 
nally, there are the records taken before and 
during the psychoanalysis of a nationally known 
psychiatrist. These records are of especial inter- 
est to all psychologists who followed the sym- 
posium published in the Journal of Abnormal 
and Social Psychology. In fact, they are among 
the best reports that have ever come from the 
author of this important book. After studying 


this volume carefully, one must agree that the . 


Rorschach technique is largely an art, far less 
objective than the Terman-Merrill even in the 
hands of a neophyte. To be sure, the experiments 
in “blind diagnosis” have brought convincing 
proof of agreement among experts in the Ror- 
schach test; but the interpretations of a begin- 
ner would inevitably be far removed from those 
of an experiented Rorschach worker. Beck has 
added little of value for the expert, and his pro- 
cedures would be disastrous in the hands of one 
who would learn the method by self-study. The 
readable, fluent style in which he writes his in- 
terpretations is deceptively simple to the earnest 
but unqualified young student who would “prac- 
tice” the method. On the other hand, the patient 
student can learn a great deal by taking a set 
of cards and working out each of the scorings 
which Beck has given. Only after a full year 
spent in this laborious research can he hope to 
be ready to piece the records together. Then, he 
must gain in knowledge of “reaction types” and 
“clinical pictures” before starting on the more 
stimulating task of interpretation. Beck’s own 
weakness in the area of adolescent behavior 
should be construed as a warning against hasty 
ventures into meretricious interpretations, and 
his assurance in dealing with the Rorschach 
records of schizophrenics ought to inspire the 
student to acquire knowledge which makes in- 
terpretations valid and useful in clinical investi- 
gations. 
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War Med 8:436 N-D '45. This excellent book, 
essentially on personality, is written primarily, 
in the opinion of the reviewer, for the psychol- 
ogist. Only a person well trained in psychiatric 
or psychologic behavior who has studied in detail 
the normal and intact personality in all its phases 
of development can understand the implications 
and meanings of the various processes found in 
the case reports of this book. Obviously a book 
of this kind lends itself to controversial inter- 
pretation because the text is that of "norms" for 
the normal and the abnormal personalities. This 
book is highly recommended to persons inter- 
ested in personality and all its ramifications. 

For additional reviews, see 77. 


77 
*[Re Rorschach.] ip ds J. Rorschach's 
Test, Volumes I and II. New York: Grune & Strat- 
ton, Inc., 1944-1945. Pp. xiii, 223; xii, 402. $3.50 ; $5.00. * 
(PA 18:2837, 20:1150) 

J Clin Psychol 2: 302-3 Jl’ 46. This two-volume 
work by the acknowledged master of Rorschach 
testing in the United States marks another mile- 
stone in the objectification of projective meth- 
ods of personality study. Volume I demonstrates 
in minute detail the processes used in evaluating 
individual associations with little change in the 
theoretical orientation contained in Beck's In- 
troduction to the Rorschach Method published 
in 1937. * The scoring precepts for each basic 
problem are clearly presented in separate chap- 
ters with numerous verbatim responses scored 
for each card to permit the reader to study the 
possible responses with maximum ease. A new 
scoring category, organization or Z responses, is 
introduced on the basis of previously reported 
studies indicating that not all organization re- 
sponses are properly classified under Rorschach's 
W activity. * Volume II is devoted to an inter- 
pretation of the psychological significance of 
Rorschach test factors evaluated in light of the 
author's dynamic theory of personality. In con- 
trast with other texts in this field, Beck has suc- 
ceeded in presenting the theoretical implications 
of different types of activity in simple, easily- 
read style which makes the work useful to be- 
ginner as well as expert. The method is presented 
conservatively as an objective scientific instru- 
ment with no attempt to found a “school” or 
cult. The most important contribution of Volume 
TI is the detailed presentation of 43 well selected 
cases with verbatim response records, response 
summaries and clinical interpretations. Included 


are representative examples at all intelligence 
levels, in adolescence, in schizophrenia, in neu- 
rotic struggles, and, finally, records on four per- 
sons retested at significant periods of develop- 
ment. This work should be on the required book 
shelf of every clinical psychologist and psychi- 
atrist since it is probably the most authoritative 
presentation in the field. It can be thoroughly 
recommended to all who are interested in the 
objective development of projective methods in 
psychological diagnosis. 

J Consult Psychol 10:375-6 N-D '46. Zyg- 
munt A. Piotrowski. These two volumes....con- 
stitute a very good manual for the use of begin- 
ners. The title, Rorschach’s Test, apparently has 
been chosen advisedly. Beck decided to follow 
Rorschach faithfully, turning a deaf ear to the 
innovations and additions of the last quarter 
century on the ground that “examination of the 
published evidence does not result in convic- 
tion." Be that as it may, it is not necessary to 
include new developments in a text for beginners 
since none of the innovations are incompatible 
with any of Rorschach's important methodolog- 
ical principles. Many of the innovations enable 
us to infer more, and without a loss of validity, 
from a Rorschach record than Rorschach him- 
self could infer (for example, from an analysis 
of the human and animal movement responses ; 
cf. The Nervous Child, 1945, 4:344). However, 
this new type of inference about personality 
traits hitherto unmentioned in the Rorschach 
method does nof clash with, and does not in- 
validate, the inferences from Rorschach's orig- 
inal method. It should be added that while Beck 
is uncompromising and orthodox in his preface 
and in a number of paragraphs, elsewhere he 
considers and admits the views of others besides 
Rorschach. Since Beck insists on following Ror- 
schach closely, a comparison of Beck's work 
with Rorschach's Psychodiagnostics is natural. 
All text books on the Rorschach method are of 
course chiefly a paraphrase of the Psychodiag- 
nostics. The latter (available in English since 
1942) is written so concisely that it reads like a 
summary. It is difficult to use it as a textbook 
without the aid of an instructor. Thus the 
Psychodiagnostics requires amplification; its 
ideas must be made explicit and its principles 
illustrated more fully on a larger number of 
individual cases, Beck gives a particularly full 
exposition of the scoring of the subject’s re- 
sponses. * Beck publishes long lists of responses 
he personally classified as'either good or poor 
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forms. This seems to be the weakest portion 
of the Basic Processes; its weakness lies not 
so much in that it is based on the judgment 
of one man, no matter how experienced, but 
in its inconsistencies. Beck's lists seem to be based 
essentially on the statistical frequencies of the 
wording of a response and not on an analysis 
of the relation between the shape of the per- 
cept and the shape of the area of which the 
percept is an interpretation. Thus, for exam- 
ple, in Plate V "squashed animal," a percept 
of indeterminate shape, and for this reason 
alone not a good form, is scored by Beck as 
good form while "beetle" is scored as poor 
form; “insect” however is considered satisfac- 
tory. Certainly "beetle" is not a less plausible 
interpretation of Plate V than "insect" and 
therefore not a worse form than "insect." The 
emphasis should be placed on the percept, the 
visual image, and not on the words which con- 
vey the image. This emphasis upon the percept 
is implicit in Rorschach's own concept of the 
form response. Rorschach himself called his 
method a "diagnostic test based on perception" ; 
his method obviously is not intended to be a 
word association test; words are necessary but 
merely as a means to a goal, not as the goal it- 
self of the test procedure. In case of doubt, 
Oberholzer compares a patient's questionable re- 
sponse with the patient's unquestionably good 
and poor form responses and thus decides 
whether to score the questionable response as 
good or poor. This procedure is fraught with 
less danger than scoring rigidly according to a 
table and considering only the verbal form of 
the subject’s response. In this connection it may 
be recalled that even such a common response 
as “butterfly” to Plate I can bea poor form ; some 
patients with a psychosis or with brain damage 
“see” the “butterfly” in I in a very inexact and 
uncommon manner that does not resemble the 
shape of any real butterfly. The percentage of 
sharply conceived forms is a measure of the sense 
of reality and not of a verbal agreement with 
others. Rorschach’s Test contains special chap- 
ters describing the psychological meaning of the 
scoring categories, These chapters are very re- 
strained and in them Beck does not follow even 
Rorschach all the way. This is most noticeable 
in Beck’s presentation of the shading responses. 
However in the long case presentations (the sec- 
ond volume contains forty-seven complete rec- 
ords and their interpretations) Beck is far less 
restrained and much freer, Tn the individual case 
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studies a great deal is made of the symbolic 
significance of the content, such as anatomy, of 
references to depressing happenings, etc. The 
case studies are the most instructive part of the 
textbook. They prove that the author not only 
has vast personal experience in the interpreta- 
tion of Rorschach records but also knows and 
uses the literature well. There are sample rec- 
ords of healthy, feebleminded, neurotic and 
schizophrenic patients, including incidental re- 
marks concerning specific diagnostic criteria. As 
far as the use of this method as a diagnostic 
aid is concerned Beck's second volume covers 
much less ground than Rorschach's book or other 
textbooks. Beck might have corrected Ror- 
schach's error that color shock did not occur in 
psychosis. Color shock does occur in the func- 
tional psychoses and frequently is associated 
with brain damage. The repetition of Rorschach's 
error may lead to many clinically wrong diag- 
nostic suggestions. The individual records are 
interpreted more fully than they were in the 
1937 edition. In these interpretations Beck him- 
self deviates from Rorschach in many points. 
He utilizes more than Rorschach ever did the 
content of the subject's responses. The case stud- 
ies show that a great deal of personal skill is re- 
quired in addition to intellectual knowledge inthe 
practical application of the Rorschach method 
in its present phase of development. The “se- 
lected pertinent publications” which close the 
second volume are somewhat surprising. They 
include Plato’s Apologia and several novels but 
do not contain any reference to Guirdham nor 
to the second of Rorschach’s two long and im- 
portant individual case studies (see Zeitschrift 
Neurologie, 1937). Ross’ brief report on the uses 
of the Rorschach in the Canadian Army is men- 
tioned but his more significant “Contribution of 
the Rorschach Method to Clinical Diagnosis” 
(Journal of Mental Science, 1941) is omitted. 
In spite of his demonstrative emphasis on or- 
thodoxy Beck actually deviates from Rorschach 
in many instances with benefit to the reader. 
Many of these deviations need a fuller proof 
before they will be generally accepted. Beck in- 
cluded them because he had found them helpful 
in making individual case analyses. In conclu- 
sion, Beck’s volumes cover less ground than 
Rorschach’s Psychodiagnostics. However, they 
cover it in greater detail and in a manner that 
is easy to follow for a beginner, In the reviewer’s 
opinion a careful study of the individual records 
and of their interpretations will be particularly 
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profitable. These conscientiously written vol- , records of compulsive neurotics which should be 


umes are a definite contribution to psychologi- 
cal literature. 


For additional reviews, see 75-6. 
[ 78] 


X*[Re Rorschach.] Bocuner, RUTH, AND HALPERN, 
Frorence, The Clinical Application of the Ror- 
schach Test. Introduction by Karl M. Bowman. New 
York: Grune & Stratton, Inc., 1942. Pp. x, 217. * For 
latest edition, see 79. (PA 16:2315) 


Am J Orthopsychiatry 14:186 Ja "44. L(aw- 
son) G. L(owrey). * The publishers say the 
book proves that the Rorschach is not “a cult in 
which only the initiated may serve.” The latter 
point particularly seems to have caused a con- 
siderable emotional reaction among Rorschach 
specialists. Some reviews have dismissed the 
book as invalid and, in effect, useless. Some of 
the reaction seems dependent on an expressed 
belief that the book was hastily thrown together. 
Actually, the volume is a simple and unpreten- 
tious set of instructions for giving, scoring and 
interpreting the test. It will not make of any one 
a finished Rorschacher; there are errors and 
misinterpretations and it has not the value of 
Beck’s or Klopfer’s books; but it will be useful 
to the beginner and should be judged as such. 

Am J Orthopsychiatry 16:731-2 O ’46. Edna 
Mann. A newcomer to the Rorschach picks up 
many an article on the subject, reads it through, 
puts it down and sighs, “This is wonderful, I'Il 
have to read it over again some time to see what 
it means.” This book aims at being understand- 
able, clear, simple, and practical. In its first edi- 
tion, many army psychologists, with G. I. orders 
to become Rorschachers, found a road their 
everyday feet could follow. Several have men- 
tioned that they found this book the best intro- 
duction to the Rorschach. The second edition 
«Contains twenty new case studies (making a 
total of 40), a new chapter on “Alcoholics” and 
one on "Behavior Problems." One of the most 
helpful aspects of the book, particularly for the 
relatively new Rorschacher ( though it has much 
to interest the most experienced), is the mean- 
ingfulness and down-to-earth quality given to 
Rorschach symbols and concepts. * There is an 
attempt throughout to point up the first section 
of the book (on administration, scoring, analy- 
sis of determinants, etc.) by references to the 
actual cases in the second half. Incidentally, the 
cases do not always prove the point, showing 
perhaps that the Rorschach is too much like life 
itself. The poorest choice seems to be the three 


coarcted but turned out to be dilated in M:C 
relationship. The scoring follows Klopfer’s, as 
does much of the book, but foregoes some valu- 
able refinement. For example, where Klopfer 
and Davidson have elaborated the F score to 5 
levels, Bochner and Halpern reduce it to the 
original two of Rorschach (F+ and F—). This 
reviewer feels that lumping true F + along with 
crude F's beclouds the F+ rating. Of course, 
the psychologist can and will make a final qual- 
itative evaluation of the record but why have 
scores if they do not “translate responses into 
most accurate symbols” —at least to the extent 
to 3 F levels (F+ carefully seen form; p 
poorly perceived form; and F for noncommital 
or crude forms) ? Other scoring simplifications, 
for better or worse, have apparently been a work 
aday response to the rush of the busy psychiatric 
clinics in which the authors have worked. Other 
clinic workers will understand the need to cut 
corners and streamline reports, reluctant as they 
may be to do so. They should also understand 
and hail with unmitigated enthusiasm the au- 
thor’s practical admonition on writing reports, 
When Rorschach workers recommend writing 
reports in English instead of Rorschachese, that 
should be welcome news to the social workers, 
and school and community people, who in the 
past have been heard to leave conferences mut- 
tering “What is that: labile affectivity”? In 
short, this is a good practical readable first book 
on the Rorschach which should not be a substi- 
tute for a course of careful training, but which 
might well follow or accompany it. ? 
Am J Psychiatry 99:471 N’42. R.M. Finney. 
* a simple and understandable presentation of a 
very difficult psychological procedure, the Ror- 
schach experiment. * Their explanation and in- 
structions regarding scoring of the record are 
perhaps a little too simple, especially for Te- 
search purposes. However, for practical, clinical 
use there is sufficient information given. The 
discussions of the meanings of the various sym- 
bols derived by scoring are perfectly adequate 
for orientation of the beginner in Rorschach 
work. The authors present a clear discussion of 
the test cards, although one cannot help feeling 
that reproductions of the cards in color would 
have enhanced the value of the text. One of the 
most commendable features of the book is the 
presentation of twenty actual Rorschach records 
with the interpretations appended. To research 
workers these interpretations might conceivably 
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be thought too superficial, but for the practising 
psychiatrist they might be adequate and serve as 
guides in learning the method. 

Am J Psychol 56:313-5 Ap 43. M. R. Har- 
rower-Erickson. * One is not certain whether 
the authors have decided to put the clock back 
to where Rorschach left it, a perfectly legitimate 
procedure if adhered to conscientiously, or 
whether, in attempting to do this, they are actu- 
ally unaware that their presentation of the Ror- 
schach method incorporates, and is permeated 
with, certain systematic additions, and that even 
their terminology bears witness to the particular 
frame of reference on which they draw, * An- 
other disconcerting feature is the manner in 
which reference to the articles in the Rorschach 
Research. Exchange is handled, namely, by a 
blanket reference to the Journal in the index. 
Although work derived from this source is dis- 
cussed, no citations are given. * Some features 
of the presentation, which, in the main, is clear 
and very readable, seem to show a fundamental 
lack of appreciation of the method, or else are 
careless slips. For example, the "average" and 
"above average" adult records (p. 80-81) both 
have equal FM, m, and K scores. This is in ac- 
tual fact an unusual finding and far from “aver- 
age." Or, again, there is the statement (p. 54) 
that when the number of responses in the last 
three cards exceeds 33 per cent there is an over- 
dependence on external stimuli, but that fewer 
than 33 per cent of such responses indicates a 
withdrawal from the environment. Must we as- 
sume, therefore, that there is no middle ground, 
no way for the well-adjusted individual to show 
his adjustment along these lines? The book falls 
naturally into two parts, the discussion of the 
method per se and the clinical material derived 
from the authors’ own experience. We have felt 
that, for various reasons, this theoretical discus- 
sion was disappointing. With the ideas which 
have prompted the authors to make available 
their case material, we are in complete sympa- 
thy. * The records, case histories, and discus- 
sions which are presented, although by no means 
all-inclusive, are an interesting cross-section of 
the clinical population which may be referred 
for Rorschach examination. Thus, while the 
book cannot provide the beginner with anything 
like the systematic treatment of the subject that 
can be found in Klopfer’s The Rorschach Tech- 
nique, and while the expert will not find in it 
much that is new to him, it may prove of inter- 
est and be somewhat of an eye-opener to psy- 


chologists and physicians in various fields who 
have not as yet realized the applicability of the 
method to their particular sphere of interest. 
Arch Neurol & Psychiatry 48:161-2 Jl’ 42. 
* Although....[the authors] state that their ma- 
terial is essentially derived from the literature, 
they neglect to include in their bibliography the 
two papers of Klopfer, “The Technique of the 
Rorschach Administration” and “Theory and 
Technique of the Rorschach Interpretation,” 
from which most of their material might have 
been derived. This omission is surprising since 
some of the Klopfer theories have never been 
published elsewhere. Their handling of this ma- 
terial tends, unfortunately, toward oversimplifi- 
cation, and they are apparently unaware of cer- 
tain changes which have been made since these 
articles were first published, some years ago. In 
addition, the authors themselves do not seem at 
all clear in their discussion of the more difficult 
problems. Their discussion of the M:Sum C 
ratio tends to be confused, and they contradict 
themselves concerning the meaning of the basic 
ratios on three consecutive pages. It requires 
more skill and experience than the average psy- 
chiatrist possesses to interpret these pages, 52, 
53 and 54. The major difficulties with the work 
appear, however, when the authors approach the 
clinical field. Although the book purports to be 
a study of clinical application, it deals only with 
mental deficiency, neuroses, schizophrenia and 
“organic” disorders. Apparently the authors have 
had no experience with the affective psychosis 
or else consider this broad aspect of psycho- 
pathology as negligible. The clinical data given 
in the last half of the book are scant and filled 
with errors. An example of some of the mis- 
statements is the interpretation of the value of 
popular responses of the mentally defective sub- 
ject. Apparently the authors are unfamiliar with 
the work of Beck (the major investigator in 
this field), who stressed the significance of pop- 
ular responses in indicating the ability of the 
defective person to get along in institutions. It 
is amusing to note in this connection that the 
sample record given for a mentally defective 
person has a high percentage of popular re- 
sponses, in contradistinction to the authors' pre- 
vious assertion, and that their records substan- 
tiate Beck's findings even though their theoretic 
statements do not. Concerning the rest of the 
theoretic discussion dealing with clinical entities, 
there is little to say except for enumeration of 
the frequent errors. The authors do not seem 
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to be aware that there are no true pathognomic 
signs of schizophrenia or that their “neurotic 
indicators” are not only nonspecific but unsub- 
stantiated by any type of research. Their failure 
to recognize variation in the quality of the per- 
formance in cases of damage to the brain is a 
serious oversight. The greatest drawback of the 
book, however, is its misleading oversimplifica- 
tion, The authors would have one believe that 
the technic is an easy one and that diagnoses 
can be made by simply scoring a record and 
then looking up the listed interpretation. Work- 
ers in the field are in complete agreement on the 
fact that there is no specific pattern or response 
for any psychopathic deviation. The Rorschach 
method gives only a personality pattern which 
clinicians must evaluate and interpret. A num- 
ber of case records have been included, but the 
paucity of clinical material and the superficial- 
ity of the histories destroy any usefulness they 
might possess. In conclusion, one can only say 
that the large number of errata, the poor print- 
ing, the incomplete bibliography and the lack of 
an index make the physical structure of the book 
seem as unfinished as the content. 

Brit J Ed Psychol 13:165 N°43. W. D. W (all). 
(Also a review of The Rorschach Technique by 
Bruno Klopfer and Douglas M. Kelley.) Con- 
tinental and American psychologists in the last 
twenty years have widely used the Rorschach 
test as an instrument to investigate the psycho- 
ses, in vocational, delinquency, child and neurotic 
studies and more recently as a group technique. 
In England, Kerr has used it in studies of twins 
and of London school children; and Vernon in 
investigations with university students. Beyond 
the reports of these two investigators there are 
few references to English work with the test 
reported in the journals ; and many English psy- 
chologists regard it rather as an esoteric cult 
than a practicable approach to personality diag- 
nosis. The books at present under review, which 
contain the first full discussions of the method 
and summaries of research in book form, should 
do much to modify this view and lead to an 
urgently needed standardisation on English 
groups, by age and occupation, and statistical 
validation in fields other than the abnormal. At 
present, as the authors admit, the method is pri- 
marily clinical and individual; but it permits of 
wide group application in the study of normal 
and neurotic personalities and is superior to 
most in treating personality as a gestalt. * The 
scoring doubtless will be further simplified and 


standardised, and the diagnostic criteria more 
thoroughly validated and made wider in scope: 
but the basis of the test appears to be sound. * 
Unfortunately, although each book is provided 
with extensive references to the literature, 
neither cites precise figures of either the relia- 
bility or the validity of the method. The latter 
seems likely to be higher in skilful hands than 
that of personality inventories or even of ex- 
perimental situations like those of May and 
Hartshorne. * How far poor visualisers are 
handicapped, the determination of what consti- 
tutes a popular response, what allowances must 
be made for intelligence level, and whether the 
artificiality of the situation is not such as to af- 
fect the responses of adult neurotics, are prob- 
lems peculiar to the test which can be settled 
only by further research with large groups. Such 
research is made immeasurably more easy by 
these two books. The case records cited in Hal- 
pern and Bochner are extremely suggestive, par- 
ticularly for those to whom the test is more than 
a curiosity. Klopfer and Kelley in their third 
and fourth parts deal, somewhat cursorily as is 
inevitable, with thé whole field of application of 
the test. Best of all, both books provide an ex- 
haustively clear account of the procedure and 
niceties of scoring. * At times, however, both 
sets of authors appear to forget that the test is 
in the nature of a series of miniature situations 
and that, without the most stringent statistical 
validation, rigidly detailed conclusions as to per- 
sonality structure cannot safely be drawn from 
reactions to such things as nuances of shading 
and colour. A priori considerations should not 
blind us to the potentialities of the test, even if 
we regard it at present with caution. Its use as 
an adjunct and check upon subjective observa- 
tion is obvious; Dr. Vernon goes so far as to 
say that he is “unable to call to mind any other 
test of personality or temperamental traits which 
will tell me so much about my subjects in so 
short a time,” while Professor Burt has found 
it successful with neurotics : and its use in differ- 
ential diagnosis of the organic and non-organic 
psychoses seems justified. It is to be hoped that 
these two books, ably written and soundly prac- 
tical, will lead to an extensive use of and ex- 
periment with the test in this country. 

Brit J Psychol 33:141-2 O *42. * may be rec- 
ommended without hesitation to any student of 
psychology or medical practitioner who wishes 
to learn how to apply the test, or to any trained 
psychologist who desires to have data on which 
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to form a reasonable opinion of the value and 
limitations of the Rorschach procedure. So far 
as we know there is no other book which covers 
the field as adequately and as fairly in so short 
a space. While the authors do not in any way 
disguise their belief in the test as a means of 
clinical examination, they exaggerate neither the 
strong nor the weak points of its use. They 
give very clear instructions in the matter of how 
to administer the test cards, they list and explain 
all the main categories of interpretation and the 
symbols by which these are expressed, they state 
the interrelations of the cards and why they have 
been placed in the order in which they are 
given, and they give a number of illustrative 
interpretations drawn from actual experiment. 
These interpretations cover records of normal 
people, of children, of mental defectives, of neu- 
rotics, of schizophrenic patients, and of people 
suffering from certain organic disorders. There 
is a very short and admittedly inadequate chap- 
ter dealing with recent developments which, how- 
ever, ought not to be held up against the authors 
as a defect in their presentation, since they are 
perfectly clearly concerned with the procedure 
only as a basis for individual character diagno- 
sis. What impression is the whole likely to make 
upon the interested, but undecided reader? It 
seems likely that he will be attracted and inter- 
ested, and that however cautious he may wish 
to be he will have to admit that the test may 
have very great and expanding value for many 
purposes. He will also see that the technique may 
be very time-consuming, and that the possible 
interpretations are so various and the boundaries 
between one category and another so soft that 
it may become very difficult to strike a balance 
between individual judgment and conventional, 
perhaps esoteric, categorization. But whether he 
will regard this as an advantage or a disadvan- 
tage is likely to depend very much on his gen- 
eral psychological point of view and his personal 
predilections. However, any fair-minded reader 
is likely to agree that this admirable book does 
make out an impressive case. 

J Am Med Assn 120:1347 D 19 "42. The 
avowed purpose of this book, as stated in the 
preface, is to make the Rorschach test "avail- 
able to all psychologists and psychiatrists." It 
is therefore warrantable to evaluate it in terms 
of the degree to which it accomplishes this 
rather ambitious aim. Perhaps, however, we 
Should question the possibility or, indeed, the 
desirability of attempting to put this tool in the 


hands of any total group, whether composed of 
physicians or psychologists. This is not at all 
to concede, as surprisingly enough the authors 
seem to, that the Rorschach has been a cult 
which only the initiated could serve but merely 
to recognize it as a technic fully as complicated 
as the reading of X-ray plates by the roentgen- 
ologist and therefore properly the function of 
the specialist. But, laying this question aside, 
how well does the book succeed in giving to 
any hitherto unenlightened person a working 
knowledge of the Rorschach test? Unfortu- 
nately it is not comprehensive enough or suffi- 
ciently precise for the beginning student and it 
is much too cluttered up with detail, too lack- 
ing in its presentation of underlying Rorschach 
philosophy, for the general reader. Not only 
for historical perspective but because it remains 
the single great classic in a rapidly growing 
literature, the student should begin with the 
Psychodiagnostik itself, now happily translated 
into English. This, followed with Oberholzer's 
article of twenty years ago in the Journal of 
Nervous and Mental Disease, will establish his 
firm foundation. Then if he keeps abreast of 
current material in the psychologic journals 
and the Rorschach exchange he will see both 
the many extensions in the use of the test and 
the new concepts which are being incorporated. 
Undoubtedly the best exposition of the sub- 
tleties and refinements of scoring which are 
used by members of the Rorschach Institute is 
embodied in another new book, The Rorschach 
Technique, by Klopfer and Kelley. The pres- 
ent book, then, like Beck’s, which came out in 
1937, is mainly useful in rounding out the pic- 
ture with supplementary case material for those 
already familiar with Rorschach practice. 
What is still lacking and what the psychologist 
called on to present and defend Rorschach per- 
sonality studies and diagnostic impressions 
longs for is a new Rorschach classic, a small 
book like Hart's on insanity. Such a book, 
stripped of the details that obscure the forest, 
should present the philosophy of personality 
organization on which Rorschach theory is 
based and which constitutes its unique con- 
tribution. This is what should be "available" 
to all psychologists and psychiatrists but, un- 
happily, it is yet unwritten. 

J Consult Psychol 6:274-5 S-O '42. Morris 
Krugman. * a carelessly written work, replete 
with loose statements, contradictions, and mis- 
leading conclusions. Its generalizations are 
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oversimplified to such an extent that a beginner 
in the Rorschach method, attempting to apply 
them, would become hopelessly confused. One 
can picture the confusion of the Rorschach 
learner who reads in the same paragraph (pp. 
55-6) that "Ambiequal individuals are rela- 
tively rare,” and then, “Thus we see why many 
neurotics fall in the ambiequal group.” Then, 
to cite one more of many possible examples, 
let us follow that beginner to page 139, where 
he learns that, in the case of the compulsion 
neurotic, ““The Rorschach protocol will show 
marked coarctation,” but, when the reader ex- 
amines the three cases illustrating compulsion 
neurosis (cases II, 12, 13) all of them are 
markedly expanded. Some of the other illus- 
trative cases likewise contradict the text, or are 
atypical. In the preface, the authors indicate 
that they have “organized and reworked” much 
of the Rorschach literature and have “given in 
brief and simplified form an exposition of the 
Rorschach Test.” In reading the book, one is 
impressed by the small number of sources ac- 
tually employed, and by the extent to which the 
authors leaned upon Klopfer, sometimes giving 
credit, and frequently not. The book was ob- 
viously compiled in great haste. The reviewer 
does not remember another book in which so 
many grammatical errors occur. The English 
is frequently so poor that it would not be ac- 
ceptable in a freshman composition. The naivete 
of the presentation is shown by some of the 
chapter headings. Chapter III is headed, “What 
the Symbols in Column I Mean,” and Chapter 
IV, “What the Symbols in Column II Mean.” 
On the other hand, the last chapter (XV) has 
a rather ambitious title: “Recent Developments 
of the Rorschach Test.” This chapter consists 
of little more than three pages. No more apt 
commentary both on this chapter and on the 
entire book, can be made than that of the au- 
thors in the first sentence of the closing para- 
graph of the chapter and book: "This chapter 
in no way covered all the recent avenues of re- 
search, either as they relate to better under- 
standing of the instrument or as they pertain 
to personality problems.” 

J Nerv & Mental Dis 101:88 Ja’45. A(rthur) 
N. F(oxe). * a very readable volume, some- 
thing of a manual in the use of the Rorschach 
test. It is practical and excellent for the be- 
ginner, * 

Mil Surg 93:226 Ag "43. Webb Haymaker. 
* The style of writing is clear, the explanations 


easy to follow, its directions explicit. This book 
should serve well its purpose as a practical 
manual for psychologists. It should also be of 
interest to psychiatrists, but its value would 
be enhanced by illustrations of the test cards. 
Occupational Psychol 16:149-50 Jl '42. C. J. 
C. Earl. In the field of personality testing the 
Rorschach method is in many ways the ana- 
logue of the Binet scale in the intellectual field. 
Both methods mark a sharp departure from 
anything which preceded them. Both methods 
abandon the precise and accurate measurement 
of strictly isolable or definable elements in 
favour of an approach to the problem as a 
whole. Both methods were devised primarily as 
practical clinical techniques, and both are cer- 
tainly open to objections from the purely scien- 
tific point of view on this account. On the other 
hand both methods have such enormous clinical 
values that their place is assured. A grave diffi- 
culty in the case of the Rorschach which was 
absent in the case of the Binet scale is the diffi- 
culty in the technique of its scoring and inter- 
pretation. Rorschach’s original book Psycho- 
diagnostik was a brilliantly intuitive essay in 
the psychology of personality, but in no sense 
a manual for use in the actual administration 
and scoring of the test, and none of the very 
few books which have appeared since then has, 
in any considerable measure, filled the gap. Al- 
most the whole of the extensive literature has 
appeared in the form of papers in journals of 
widely diverse character. Different authors 
have used different standards in scoring, and 
the very considerable improvements made by 
the American school are accessible only in the 
privately circulated journal of the Rorschach 
Institute. Most Rorschachists learnt their tech- 
nique by personal training rather than by read- 
ing, and indeed the impression was fairly gen- 
eral that only by such training could the method 
be learnt. Even were this the case—as the 
present reviewer is convinced that it is not— 
the need for an elementary manual for prac- 
tical use was very great, and this little book 
makes a brave attempt to meet this demand. 
The method of administration and scoring is 
fully and very simply set forth, the system of 
scoring being, in general, the orthodox Ameri- 
can or ‘Exchange method. One unfortunate 
departure from the method is that in the tabu- 
lation of scores the chiaroscuro responses are 
listed with the achromatic colour responses, 
although the former undoubtedly represent in- 
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troversive trends, while the latter have as cer- 
tainly extratensive significance. The advantage 
of the orthodox listing of introversive and ex- 
tratensive elements to the left and right respec- 
tively of the purely logical F scores is obvious, 
and this departure will tend to confuse be- 
ginners. Apart from this defect, the authors 
have been very successful in the difficult task 
of explaining the scoring method clearly and 
briefly. Numerous examples are given, but a 
discussion of complex responses and their scor- 
ing would have been helpful. The full discus- 
sion of the meaning of the scores, as also the 
final interpretation of a record, involves a very 
considerable discussion of the psychology of 
personality, etc, Rorschach's own essay on the 
Erlebnistypus, for example, is an important 
contribution to personology. Any such discus- 
sion is bound to be lengthy and perhaps tends 
to undue complication in a book intended for 
beginners, The authors have avoided the diffi- 
culty by omitting theoretical discussion and 
confining themselves to a few brief didactic 
Statements and explanations, and the book is 
so much the poorer for this omission. The 
reader whose own knowledge of psychology is 
extensive will, however, be able to read much 
between the lines in this section; indeed, the 
authors in their preface, refer to the importance 
of the psychological background of the ex- 
aminer for obtaining the fullest advantage from 
the test. In the last chapter of the book a num- 
ber of records are given, together with their 
scoring and interpretation, and these should be 
helpful in clinical work with the test. * There 
is a brief mention of possible fields, other than 
the clinical, for the use of the test. The bibliog- 
raphy is fairly full but by no means complete, 
and the table of reciprocals will prove a boon 
in calculating percentages. The general lay-out 
of the book is so clear that the lack of an index 
will scarcely be felt. Though this book is guilty 
of over-simplification, there is no doubt that it 
fulfils a long felt need for the beginner with 
the method. A word of caution is, however, 
necessary on each of two points. The first of 
these is that any Rorschachist should be ex- 
tremely cautious in his interpretation until his 
experience is fairly considerable. Secondly, and 
most importantly, efficiency with the method 
cannot be attained without a knowledge of the 
dynamic psychology of personality. The Ror- 
Schach method is an exquisitely sensitive clini- 


cal technique; it cannot be expected to function 
as a penny in the slot machine. : ; 
Psychiatric Q 16:598-602 Jl "42. * this 
summer appears to mark an important turning 
point in development of the Rorschach method 
for psychiatric use in the United States. Two 
admirable introductory volumes on the tech- 
nique have just been published; and the first 
authorized English translation of Rorschach $ 
“Psychodiagnostik” has been brought out 
For the first time, authoritative and readily 
comprehensible material in this field has be- 
come available for the beginner * While Dr. 
Bowman’s introduction notes that. “the be- 
ginner . . . will find the book well suited to his 
needs,” the authors’ treatment of their subject 
is from a somewhat different viewpoint than 
that of a training manual. Mrs. Bochner and 
Mrs. Halpern have devoted as much attention 
to what might be called the presentation ofa 
birds-eye view of the Rorschach and its possi- 
bilities to the psychiatric profession as they 
have to the problem of teaching administration 
to beginners. * Mrs, Bochner and Mrs, Halpern 
„illustrate the clinical application of the 
method by carefully chosen protocols, selected 
in a fashion to make plain to psychiatrist or 
student beginner the differences in the records 
of the commonly encountered mental states. 
Illustration and discussion of children’s proto- 
cols, with emphasis on caution in accepting 
children's interpretations, seem of particular 
value. * [That the authors] do not consider 
their book complete equipment for administra- 
tion and interpretation of the Rorschach ex- 
amination is made plain in Dr. Bowman s in- 
troductory warning, “All tests... in fhe hands 
of the non-expert may do harm." * Both 
[Klopfer's The Rorschach Technique and this 
volume]..should be important CANARD 
to greater knowledge and use of the Rorschac 
technique. The Bochner and Halpern work— 
based on much experience with the Rorschach 
at Bellevue Psychiatric Hospital—is unques- 
tionably the easier to comprehend for a person 
wishing merely a general knowledge and un- 
derstanding of the method ; as the authors ee 
they have endeavored to present their materia 
‘in brief and simplified form.” They follow, 
in general, the symbols and scoring methods 
developed by Dr. Klopfer at the Rorschach In- 
stitute, but with modifications, notably in their 
chiaroscuro scoring, in construction of per- 
sonality graphs and in summation. lt is, of 
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course, testimony to the objectiveness and scien- 
tific validity of a method—administration of 
which is not only an examination but applica- 
tion of an art, and which, as Mrs. Bochner and 
Mrs. Halpern observe "cannot be scored against 
a stencil"—that such differences in technique 
do not prevent essential agreement on findings 
by all schools of Rorschach workers. The 
Bellevue workers' book is adequate for instruc- 
tion of administrators, given at least a little 
personal supervision; and it gives an extraor- 
dinarily clear view of how the records of the 
various mental states compare. * The Bochner 
and Halpern volume also has an excellent bib- 
liography, though a far less complete one; and 
it contains a useful table of reciprocals for cal- 
culating percentages of replies in any record of 
ordinary range. Regrettably, there is no index; 
and a specimen scoring chart and at least out- 
line representations of the cards would have 
been desirable additions, Keeping in mind the 
somewhat different orientations of these two 
introductory books, either of them can be rec- 
ommended without reservation as a better in- 
troduction to the study, practice and general 
evaluation of the Rorschach technique than has 
heretofore been available. The fact that a set 
of the cards is essential for serious study or 
comprehension of the Rorschach by use of either 
textbook is a matter which may not be entirely 
obvious to persons lacking any acquaintance 
whatever with the method. * 

Psychiatry 5:606 N *42. Ernest G. Schachtel. 
* not quite free from errors in the rendition of 
Rorschach’s as well as other Rorschach work- 
eis’ teachings and it shows traces of being 
hastily written. It avoids, as the introduction 
declares, “lengthy discussions of the theoretical 
points.” In fact, it avoids any discussion of 
such points. Consequently, it does not convey 
an adequate idea of Rorschach’s work. This re- 
viewer does not share the viewpoint, which the 
authors apparently hold, that it is possible to 
understand and use the Rorschach method 
without reference to theoretical concepts of 
personality and of the processes involved in the 
Rorschach test. The book contains a simple de- 
scription of the technical categories used by 
Rorschach or by some of his followers, and 
some interesting case records. 

Psychoanalytic Q 11:587-9 O '42. S. J. Beck. 
Having decided that ‘a certain aura has gath- 
ered about the test, making of it a cult in which 
only the initiated may serve' (p. ix), these au- 


thors are undertaking to make it 'available to 
all psychologists and psychiatrists.’ They do 
so in chapters on How to Score a Record, 
What the Symbols in Column I Mean, and in 
a style reminiscent of the 'easy lessons' style 
of promotion. * To cite in full the numerous 
naivetes committed in this small book would 
require far more space than is alloted this re- 
view. ‘When the white space is used to indicate 
something missing, as a gap . . . in our experi- 
ence this type of response is frequent in the 
records of children and adolescents from broken 
homes' (p. 30). Here the authors seem them- 
selves to be doing some free associating of 
the Klang variety : ‘broken blot’, ‘broken home’. 
Experience of other workers does not confirm 
this superficially simple working of the test. 
‘We feel that individuals of superior intelli- 
gence should not give an F— unless they are 
emotionally disturbed’ (p. 38, n). Scientific 
method happens to be not a matter of feeling, 
but observation. The fact is that the healthy 
superiors always have some F—. They dare to 
make errors. That goes with superiority. With 
their highly endowed personalities they more 
than compensate for their errors as does also 
the rich Rorschach response pattern for the 
F—. Referring to the D (major detail) response: 
"The shape of these details approximates real- 
ity more closely than does the shape of the 
whole blot' (p. 24). Here is a complete lack of 
understanding of the psychological processes 
in the Rorschach test factors. Reality in the 
test is totally independent of whole, detail, or 
rare detail. The ‘butterfly’ to Figure V (whole) 
is as real as ‘muskrat’ to the lateral red of 
Figure VIII. It is in F+, not D, that the in- 
dividual reveals his loyalty to reality. Percent- 
ages and formulas are offered as though they 
represent well-established, quantitative meas- 
ures of psychological processes (e.g., pp. 36-37; 
41, 60-61). There is a too frequent use of quo- 
tations without quotation marks. The descrip- 
tion on page 20 of the whole response as 
related to intelligence will be found in the 
Psychodiagnostik, p. 63 ff., 1932 edition; and 
the ‘good form’ discussion (p. 36) is a fairly 
faithful paraphrasing of Rorschach, pp. 60-61 
ibid. Numerous other instances can be cited. 
Klopfer appears to be the chief sufferer in this 
respect, as in regard to ‘testing the limits’ 
(p. 4). The possible rationalization of such a 
practice is that the hitherto published material 
in a scientific field is the public property of all 
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workers in it. This may be so in respect to the 
foundations of such an ancient science as, say, 
chemistry. But in a field so new as the psychol- 
ogy of personality and so specialized as the 
Rorschach test, it is small compensation for 
each worker to ask that his contribution be 
credited. The one merit of the book lies in the 
twenty complete Rorschach response records 
distributed in the several personality groups. 
It is the reviewer's teaching experience that the 
best way to get to know the test is to make 
comparative studies of the records of different 
clinical pictures. A valuable suggestion will be 
found on pages 34-35, regarding the Suksez- 
sionstypus as spatially determined (not only 
according to whether W, D, Dr follow in that 
order). The bibliography looks good but the 
omission of Skalweit's important monograph 
on schizophrenia looks like evidence of haste 
in preparing the manuscript. The book has no 
index—an inconvenience in a discipline so de- 
pendent on symbols peculiar to it. This work 
reflects no more than a shallow grasp by the 
authors of the psychological dynamics involved 
in the Rorschach experiment. It would have 
helped in judging their preparedness for writ- 
ing such a book if they had informed us what 
is their background of direct experience with 
the test. A single, brief paper by one of the 
authors is their only other publication. Ror- 
schach labored ten years before he published 


anything. Most conscientious workers today , 


consider about three years’ work a necessary 
minimum to enable them to grasp it solidly, let 
alone write a book. The introduction by Dr. 
Karl M. Bowman has a special interest in view 
of his own many years of association with the 
test. He not only has had the opportunity to 
observe it in use at Bellevue, but also in the 
Boston Psychopathic Hospital where he was 
chief medical officer in the years when it was 
receiving some of its early trials in the United 
States, 

Psychol B 41:64 Ja’44. Ralph R. Brown. * 
Bochner and Halpern will at least convince 
PSychologists that the test is not a cult or a 
religion, but this book will not serve to over- 
come the general objection to the test on the 
grounds of undemonstrated validity. * the au- 
thors.... were principally concerned with present- 
ing in a simple, condensed form the principles 
of administration, scoring and interpretation of 
the Rorschach. It is unfortunate that the au- 
thors did not include a chapter on validity. 


Sufficient material is now available to convince 
most of us that the test is fundamentally 
sound, even though subject to misuse by over- 
enthusiastic proponents. For the beginner, the 
book offers a lucid and concise description of 
Rorschach testing. It is not sufficiently com- 
plete, however, to permit the beginner to score 
and interpret the test without consulting other 
references. There is no clear method of differ- 
entiating between major and minor details. The 
scoring of good and poor form quality is also 
left to the imagination, As an independent 
presentation, therefore, this book does not 
compare with Beck's "Introduction to the Ror- 
Schach Method," or “The Rorschach Tech- 
nique" by Klopfer & Kelley. The advanced 
student of Rorschach will find much of inter- 
est in this book. The chapters dealing with in- 
terpretation of the symbols are well written 
and bring together under one heading the opin- 
ions of various Rorschach workers. Many of 
these interpretations are not from Rorschach 
and care must be exercised in the differentia- 
tion of well-established Rorschach principles 
from the more recent and less well-founded in- 
terpretations. The illustrative records are inter- 
esting and expertly interpreted. There are not 
sufficient records presented, however, to serve 
as a satisfactory background for “getting the 
feel” of records in various clinical categories. 
This book is recommended as supplementary 
reading for beginners. For the more experi- 
enced Rorschach worker, it offers a number of 
interesting records and interpretations. 
Psychosom Med 6:277 Jl’44. Martin Grot- 
jahn. * simple, clear and readable * Twenty 
records given in detail should be of special help 
for people using or learning to use this test. 
War Med 3:111-2 Ja ’43. * a handy little 
volume written by two psychologists who have 
had extensive experience in the use of the Ror- 
schach test * so complex is the Rorschach tech- 
nic that it is doubtful whether many, on the 
basis of this exposition alone and without some 
actual tutoring, would be able to start using it 
forthwith * In general the authors appear to 
have followed Klopfer..in the weighting and 
the interpretative significance which they assign 
to some of the test factors. While Rorschach’s 
formulations are not necessarily final—certainly 
not to be accorded the uncritical acceptance of a 
“revealed religion” —neither can all of the more 
recent elaborations of the technic and the new 


meanings derived be accepted until a good deal 


79 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 158 


more work has been done to validate them. 
They are extremely provocative for research 
but the authors make no effort to distinguish 
between the well-tested Rorschach principles of 
evaluation and the newer contributions to the 
theory. Despite these reservations the book 
should have a double usefulness. It provides a 
suggestive compendium for the neophyte to keep 
at hand. Also, though perhaps in somewhat too 
detailed a fashion, it gives any psychiatrist or 
psychologist hitherto unfamiliar with the Ror- 
schach test an opportunity to learn the factors 
and the relations on which the often amazingly 
accurate character evaluations and clinical diag- 
noses of the experienced user are based. It 
should, therefore, do much to eradicate the feel- 
ing that the test is some sort of hocus-pocus and 
to increase the recognition of it as a valid and 
extremely delicate scientific tool. 


[79] 
*[Re Rorschach.] Bocuner, RUTH, AND HALPERN, 
Frorence. The Clinical Application of the Ror- 
schach Test, Second Edition. Introduction by Karl 
M. Bowman. New York: Grune & Stratton, Inc., 1945. 
Pp. xi, 331. $4.00. * (PA 20:472) 


Am J Psychiatry 103:429-30 N '46. Leslie 
Phillips. * Chapters have been added with Ror- 
schach records of alcoholics and individuals 
who fit into the broad category of “behavior 
problems.” The other chapters also have been 
filled out with many new records. * a brief, 
easily understandable introduction * The in- 
troductory chapters are little changed * A table 
of samples of good and poor form perception 
is not included, nor is there a list of common 
and rare details, although reference is made to 
where these may be obtained. These are unfor- 
tunate omissions, as they are needed, particu- 
larly by beginners, for practical work with the 
test. The variety of records chosen to illustrate 
those obtained in health and mental disorder is 
quite rich, although some records, particularly 
among the normals and “organic” cases, seem 
poorly chosen. However, they should be help- 
ful in demonstrating how the test factors are 
pooled to obtain a picture of the personality. 
The interpretations are usually short and tend 
to be superficial, and the accompanying case 
histories seem particularly inadequate. No im- 
provements have been made in this regard in 
the second edition, and consequently the book 
cannot lead to any deep appreciation of the 
Rorschach technique, nor is it adequate for ad- 
vanced work. The addition of new material, 


however, helps to increase its value as an ele- 
mentary introduction to Rorschach testing. 
Am J Psychol 59:704-9 O °46. Donald E. 
Super. * attempts to do in some 330 pages 
what Beck’s two volumes do in twice that com- 
pass. In fact, more is here attempted * Cover- 
ing so much in such limited space, this manual 
is of necessity too sketchy at many points. * 
The chapters on location, determinants, and 
content lack the helpful diagrams which Klopfer, 
Harrower-Erickson, and others, have popular- 
ized. Missing (as also in Beck’s volume) are 
the frequency-tables which serve as useful ref- 
erence points and give one confidence in Beck’s 
earlier work and in that of a number of cur- 
rent publications. Psychoanalytical interpreta- 
tions are frequent and are glibly incorporated 
into other statements with no citation of evi- 
dence to justify them. * There is occasional 
loose thinking of other types. * This book is 
characterized not only by an emphasis on psy- 
choanalytical interpretation which seems to this 
sympathetic reviewer to go beyond the data, 
and by an allegiance to the old school of Ror- 
schachers * It is refreshing and reassuring to 
find a note (p. 228) to the effect that "the 
necessity for confining the use of an instru- 
ment like the Rorschach test to professional 
workers who have had adequate training and 
experience in clinical psychology and psychi- 
atry is clearly evident." It is to be hoped that 
the former is intended to include scientific pro- 
cedure and research as well as practice. In view 
of this word of caution, one is inclined to 
question the place of this book, and others like 
it, in the professional literature. As a manual, 
it is too brief (Beck is much more detailed, 
supplemented by quantitative data from Har- 
rower-Erickson and Steiner, Hertz and others). 
As an introduction, it is too sketchy for the 
critically or scientifically minded (Beck justi- 
fies his views somewhat more adequately, 
Munroe gives evidence) and oversimplified for 
the uncritical novice. (Beck points out more 
complexities and recognizes some unsolved 
problems.) One who knows something about 
the Rorschach but is not an expert may glean 
new insights from it and here and there, or a 
beginner may read it or Klopfer (if already 
grounded in the psychology of adjustment) as 
a first introduction before reading Beck's two 
volumes. He should then, with a critical eye, 
read Harrower-Erickson and Steiner, Munroe, 
and others, who have attempted more or less 
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successfully to approach the Rorschach with 
scientific restraint. Such a reader will, it is 
hoped, then recognize that the Rorschach is 
still just one way among several of getting 
some tentative insights into the structure of a 
personality. 

Brit J Psychol 36:177 My '46. This is an ex- 
cellent manual for the use of the Rorschach 
ink-blot test. It is clear and intelligible to the 
beginner as well as to the expert, and the for- 
mer could not do better than use it in learning 
how to administer the test. Good records and 
interpretations are given for normal adults and 
children, as well as for abnormals—mental de- 
fectives, neurotics, behaviour problems, schizo- 
phrenics, alcoholics and organic cases. There is 
an extensive bibliography of Rorschach litera- 
ture, 

J Abn & Social Psychol 41:233-6 Ap '46. 
Albert I. Rabin. * A little over 25 per cent of 
[the book] is devoted to the procedure, scor- 
ing, and meaning of the various test factors. In 
this section of the book, the authors endeavor 
to accomplish what Beck did in his entire first 
volume and in part of the second. It is a tall 
order ; yet the authors have succeeded in filling 
it with some degree of success. The manner of 
presentation is simple and elementary, though 
it is taken too much for granted that the reader 
has a full understanding of the psychological 
correlates of the Rorschach factors. The per- 
sonalities and traits of individuals in whom the 
given “signs” occur with some frequency are 
described in a rather naive and common-sense 
manner without resorting to too much psycho- 
logical sophistication. There is no attempt to 
“justify” the method or give it a theoretical 
foundation, or, as Beck attempts, to relate it 
more tangibly to the general psychological field. 
Neither is there the distinct “experimental at- 
titude” with its implicit research interest and 
spirit of inquiry. It is a “manual” in the full 
sense of the word. It is a manual, of a "test," 
which as such also suffers from a number of 
weaknesses. In the first place, the section on 
form (F) in general and good form (F+) 
in particular leaves much to be desired. It is 
rather brief and leaves too much to the subjec- 
tive judgment of the Rorschach worker. The 
theory of frequency as the determinant of good 
form is explained, but the beginner or even the 
advanced Rorschach worker who has not com- 
piled his own frequency tables is left out on a 
limb, This manual does not give a set of ex- 


pected responses, with or without their fre- 
quencies, which may be used as a standard of 
judgment of good or poor form. In the second 
place, the striking psychoanalytical and other- 
wise symbolic explanations of various answers 
and bits of independent content are sometimes 
over-dogmatic and not readily digestible by the 
experimentally minded clinician. The explana- 
tion of the isolated symbolism of bays and in- 
lets “as a shelter or as an encircling menace,” 
the suggestion that when details frequently 
seen as female genitalia are seen as vases and 
urns there is an indication of “a glorification of 
the female,” or that the soft light quality of 
card VII gives “this card a feminine quality, 
frequently with maternal implications” (!), 
and that the denial of a detail in card IV is a 
situation in which “the whole problem of cas- 
tration is involved,” appear to this reviewer 
rather far-fetched and in need of better valida- 
tion and substantiation. The remaining three- 
fourths of the volume are devoted to forty 
records of normal as well as abnormal in- 
dividuals and their corresponding personality 
sketches. In addition to the usual schizophrenic, 
neurotic, and normal records, protocols and 
interpretations of Rorschach findings in some 
organic cases, alcoholics, and adolescent be- 
havior problems, as well as those of young 
children, are included. Each group is prefaced 
by a general statement of the diagnostic fea- 
tures and how they are revealed in the test. 
These preliminary discussions are very helpful, 
especially to the beginning student, The scoring 
follows the standard procedures, with slight 
variations. One may question the usefulness of 
the multitude of percentages, for such factors 
as P, F, W, D, etc. The interpretations are 
comparatively brief, but bring out the essential 
features. In some cases one may wish that the 
authors were a bit more penetrating in their 
analysis and more generous with case-history 
notes and reference to them. Bochner and Hal- 
pern have written a readable, though somewhat 
oversimplified, Rorschach manual and have 
eliminated many of the errors included in its 
first edition. Along with Beck's somewhat eru- 
dite, basic two volumes, it may serve the stu- 
dent as a worthwhile reference book. F 
Psychiatry 9:411-2 N°46. Isabelle V. Kendig. 
The fact that a second edition of this book has 
been brought out undoubtedly attests its pop- 
ularity as well as the widely extended use of 
the Rorschach test during the war years. This 
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is both a more comprehensive presentation, es- 
pecially of actual case material, and a some- 
what wiser and more modest one. In the preface 
to the former edition, it was stated the purpose 
of the book was to make the Rorschach test 
“available to all psychologists and psychiatrists.” 
This rather magniloquent goal seems to have 
been abandoned, and the present preface sim- 
ply stresses the importance—and limitations— 
of the technique, concluding that it “must re- 
main an instrument in the service of psychol- 
ogy and psychiatry.” This is a commendable 
change, since to purport to teach the interpre- 
tation of Rorschach data in ten easy chapters 
is about as preposterous as to teach roentgenol- 
ogy by mail. * the authors are somewhat less 
fixed and dogmatic than formerly, A few new 
topics are discussed * the chief enrichment 
comes through the introduction of additional 
case material * Actual case material is always 
of interest to Rorschach workers but inevitably 
raises the question as to the comparability of 
the psychiatric nosology on which diagnosis is 
based. In the records presented here one won- 
ders at the absence of samples of Rorschach 
production in the affective psychoses. The pro- 
tocols taken from schizophrenics, while illu- 
minating certain general characteristics, are 
unsatisfactory because no attempt is made to 
bring out the wholly different patterns shown 
in the various subtypes of the disorder, To the 
last record, that of an alcoholic, is appended 
the opprobrious term "psychopathic personal- 
ity," and this term is also used for one of the 
cases presented as a "behavior problem." In 
both instances it is used quite uncritically al- 
though the concept of psychopathy is generally 
recognized as a most unsatisfactory one. 
Through the entire series, there is a lack of 
diagnostic specificity in the cases presented 
which diminishes their value and certainly im- 
plies that the Rorschach is a much less precise 
tool than it has proved itself to be in the hands 
of many workers. 

Psychosom Med 8:290 Jl-Ag '46. Vernon 
Clark. * The value of the book will vary ac- 
cording to the use the reader means to make of 
it. The section devoted to technique and inter- 
pretation is clear, economically written and well 
organized, although the authors encounter the 
difficulties inherent in discussing piecemeal 
material which must be seen as a dynamic 
whole. The records selected for demonstration 
are rich and challenging, so rich that one is all 
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the more conscious of the fact.that the analyses 
offered are often meagre and somewhat static. 
Moreover, the quest. for typical patterns asso- 
ciated with the clinical entities leads to an over- 
simplification of the problem of differential 
diagnosis with the Rorschach. Thus, to the 
newcomer in the field, the book may seem to 
promise too much too soon. If its limitations 
are kept in mind, it will prove valuable to stu- 
dents interested in a general grounding in Ror- 
schach testing and who wish to postpone 
complex theoretical questions until they have 
mastered the fundamentals of the technique. 

U S Naval Med B 46:1309 Ag '46. * One 
gets the impression that the authors of this 
book, in their attempt to be clinically helpful, 
resort to rather rigid formulae in support of 
empty and rather arbitrary statements. 


[80] 
X[Re Rorschach.] Forn, Mary. The Application of 
the Rorschach Test to Young Children. University 
of Minnesota, The Institute of Child Welfare Mono- 
graph Series, No. 23. Minneapolis, Minn.: University 
of Minnesota Press, 1946. Pp. xii, 114. $2.00. (London : 
Oxford University Press. 12s.) * (PA 21:807) 


Am J Orthopsychiatry 17:733 O "47. (Adolf 
G.) Woltmann. * This book fills a gap that has 
long been painfully noted by many Rorschach 
workers. The author’s scholarly approach to 
the subject matter deserves high commendation. 
Her suggestions for future study should encour- 
age research so that the application of the Ror- 
schach test to young children becomes less a 
means of individual intuition and more a valid 
clinical tool. * 

El Sch J 48:167-9 N ’47. Lee J. Cronbach. 
* Ford gave the test to 123 children, ranging 
from three years to eight years in age. For many 
of the children, data were also available on 
other tests and rating scales. From this group, 
although it is in no sense considered an ade- 
quate sample of children, Ford derives norms 
which, for the first time, give a frame of refer- 
ence for dealing with individual preschool test 
records. She explores thoroughly such ques- 
tions as the changes in responses with age and 
sex. In an interesting section on stability, Ford 
finds that, for most scores, retests after one 
month correlate .65 or better with original 
scores. Ford also uses a split-half technique but 
contributes more by a sharp and highly accurate 
statement of the reasons why the split-half 
method is inappropriate, inadequate, and mis- 
leading with the Rorschach test. Two chapters 
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on validity will have especial interest for all 
readers. One chapter studies the ability of the 
Rorschach test to estimate intelligence. It is 
assumed, without qualification, that the Stan- 
ford—Binet test is an adequate measure of intel- 
ligence. On these terms, Ford finds substantial 
relationships between mental age and the fol- 
lowing scores: R—The total number of re- 
sponses to the series of ten ink blots. Obj.—The 
total number of persons or objects mentioned in 
response to the series. F--—Response in which 
"good" form is the sole determinant of the in- 
terpretation. M— Response indicating an ex- 
perience of felt movement. M-+—Since M re- 
sponses are ipso facto F responses, movement 
always being movement of forms, all M re- 
sponses are either M+ or M— according to 
whether the form element is F+ or F—. These 
correlations are often around .70. There is 
little relation to the intelligence quotient, of 
course, because of the uncontrolled influence of 
chronological age, but it appears that reason- 
able estimates of the intelligence quotient could 
be made from Rorschach results. A more basic 
question is whether, when the two estimates dis- 
agree, the Rorschach scores may not be the 
more valid. The Maurer study [see 288] casts 
doubt on the meaning of certain items used in 
the Binet tests for early ages. Rorschach theory 
sets up a dynamic concept of intelligence in 
which attention is drawn to the interaction of 
drive, caution leading to self-criticism, imagi- 
nation, experience background, and keenness 
of observation. If there is any validity to the 
assumptions commonly made from the Ror- 
schach in clinical descriptions of mental func- 
tioning, it is unlikely that any single composite 
or quotient can adequately represent the infor- 
mation which it provides about these dimen- 
sions. If the Rorschach, treated mechanically, 
correlates with Binet performance as well as 
Ford finds, it should yield remarkable results 
when used in the objective but insightful man- 
ner for which it was designed. The chapter on 
validity of indicators of personality is inevitably 
disappointing, since Ford chooses in this study 
to consider single scores, whereas the meaning- 
ful contributions of Rorschach results to per- 
sonality diagnosis lie in interpreting each indi- 
vidual's Pattern of scores. Furthermore, criteria 
known to be valid are almost unobtainable. 
Ford has a few striking results. Teachers' rat- 
ings of emotional adjustment correlate .50 with 
the Rorschach ratings of adjustment—and again 


one is faced with deciding which of the two 
is invalid! Ratings on the Marston Introver- 
sion-Extroversion Scale correspond with the 
Rorschach index of introversive-extroversive 
tendencies. Ford's characterization of the cor- 
respondence as "relatively low" is not surpris- 
ing, because the traits bearing such deceptively 
similar names are differently defined by Ror- 
Schach and Marston. In general, one must con- 
clude that Ford has made a superior contribution. 
She has gathered excellent data which can be 
reconsidered by persons who are not willing to 
accept her interpretations always. She has es- 
tablished beyond question that the ink-blot tests 
will work well with young children and has 
provided hints on administering the test at these 
ages, which will be widely appreciated. Where 
her data are inadequate or her methods conceal 
essential characteristics of Rorschach data, she 
can rightfully leave the development of fuller 
knowledge to subsequent studies, * 

J Consult Psychol 11:154 My-Je 47. 

J Ed Psychol 38:190-1 Mr ’47, C. M. 
Louttit. In this monograph....Dr. Ford reports 
her investigations on the responses of children 
from three to eight years of age to the Rorschach 
test. The reader cannot help but be impressed 
at the evident objectivity and care with which 
this research has been planned and carried out. 
There were four major aims of the investiga- 
tion: (1) the refinement and simplification of 
procedures adapting them to young children, 
(2) the study of variations in determinants as 
related to age, intelligence and sex, (3) the 
determination of reliability of determinants at 
young age levels, and (4) a check on the valid- 
ity of claims as to the meaning of certain test 
determinants. The one hundred twenty-three 
children examined were attending nursery or 
elementary schools in Minneapolis and Ithaca, 
New York. Except for a restriction on rotating 
the cards, the administrative procedures were 
standard. There was no evidence that this one 
restriction influenced the responses. The results 
indicate that the Rorschach is definitely useful 
at the young ages studied ; the children utilized 
all the ways of responding which are utilized 
by adults, but not to the same extent, The per- 
centage rejecting at least one card decreased 
rapidly from thirty-two per cent at three years 
of age to none at six years of age. The mono- 
graph reviews the literature, especially of chil- 
dren, involved with work with inkblots. The 
procedures and results of the investigation are 


in detail, and illustrated with both 
graphs. It is hardly necessary in this 
attempt to summarize the details. The 
conclusions are that the test proves 
with children, although there is a great 
for norms at young ages. The reliabilities 
with some determinants showing high 
tions between test and retest, and with 
being low. The results with these young 
do not always support certain validity 
based on work with adults, This study 
an important contribution to the Rorschach 
terature. A particularly significant contribu- 
tion for the Rorschach layman is a glossary of 
terms and symbols used in Rorschach inter- 
Psychiatrie Q 21:156 Ja '47. Dr. Ford....has 
made a very valuable contribution to the scien- 
tific literature on the Rorschach Test. Her con- 
monograph summarizes the statis- 

tical obtained by other investigators 
and the author's own carefully gath- 
ered data from 123 boys and girls ranging in 
age from three to eight. On the whole, the 
author found that the data obtained by various 
agree rather well. The differences 
appear to be due to intrinsic psychological 
among the various of chil- 
ied. The study throws light upon the 
of the influence of age, intelligence and 
© Rorschach components de- 
more upon the age of the child than 
intelligence. A clarification of this 
great importance both practically and 
ly. The fact that during childhood the 
growth and maturation is intense 
the interpretation of children’s 
records. There are essential differ- 
between the records of children and of 
but there are also striking similarities, 
Dr. Ford's constructive study will enable psy- 
chologists to draw more valid and more com- 
plete conclusions from children’s Rorschach 


Prychol B 44:570-81 N "47. L. Joseph Stone. 
Dr. Ford's study was initiated in 1936; many 
of its goals and methods were set at that time. 
In the face of the enormous proliferation of 
Rorschach interest, Rorschach studies and Ror- 
schach users (and misusers) in the past decade, 
what would once have been of pioneering sig- 
nificance is now more in the nature of con- 
firmation, elaboration (or, occasionally, refuta- 
tion) of studies published in the interim. The 
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first problem Dr, Ford set herself, to refine and 
simplify Rorschach administration, was not re- 
solved by research, but rather by preliminary 
experience. Most notable modifications are the 
use of a trial blot 4 la Hertz, a prohibition 
against turning the cards, and a certain amount 
of “prodding” for further responses. Undoubt- 
edly there will be disagreement regarding the 
need for such changes. Although she appears 
to have been unusually successful in carrying 
through her inquiries, Ford, like others who 
have worked with young children, suggests the 
desirability of a brief inquiry at the time each 
card is shown. * Her second problem was the 
study of “variations in the test determinants” 
(scoring categories) and their relation to sex 
and age. No normative value is claimed be- 
cause the 123 subjects "represent a highly se- 
lected group both in intelligence and in socio- 
economic status" (p. 39). Existing adult and 
adolescent norms were used. This section can- 
not but remind the reader of the regrettable 
lack of final agreement on scoring nomencla- 
ture, In the few instances where the present 
study is at variance with the Klopfer-Margulies’ 
findings this appears to have been a determin- 
ing factor. (For example, including among 
whole responses the "cut-off whole," by Klop- 
fer and Margulies but apparently not by Ford ; 
or again, "One possible explanation for the 
low incidence of C responses in....the present 
study is the fact that Cn responses were not 
included in C responses" (p. 47) as is custom- 
ary and was the case in the other report.) Cer- 
tain other discrepancies raise the more serious 
question of disagreement on scoring. For ex- 
ample, the author points to the higher inci- 
dence of M in her study in spite of the fact 
that “all the investigators including the writer, 
use Rorschach's instructions in scoring M re- 
sponses" (p. $4). Yet in the three sample 
protocols 


of bugs crawling around.") The reviewer would 
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relationship between intelligence and certain 
specific determinants suggested by Rorschach. 
Satisfactory correlations were found for most 
of these but not for W and 96A. These excep- 
tions "make sense" in view of the fact that 
children's whole responses are often crude and 
undifferentiated, unlike the more structured W's 
of adults, and in view of the fact that animal 
concepts in children's thinking do not have the 
banality that they do for adults. A third con- 
tribution of the study was in establishing test- 
retest reliabilities for the specific determinants 
of the Rorschach with young children. * Her 
fourth major aim, the study of the "validity of 
some of the determinants of ," Dr. 
Ford undertakes with great modesty, but it is 
difficult to evaluate because data so meticu- 
lously supplied in other sections of the mono- 
graph are inadequate here. As one example, in 
evaluating emotional adjustment on the Olson 
scale we are told that the subjects were "di- 
vided into three groups on the basis of emo- 
tional adjustment as indicated by the Rorschach 
Test" (p. 82), but we do not know how this 
was done. A new scoring concept of organisa- 
tional links (OL) is tantalizing because it is 
always referred to allusively. Defined in the 
glossary as "The number of verbalized, logical 
connections within a response or within a rec- 
ord," this would appear to be a valuable indi- 
cation of complexity of response. However, the 
sample protocols make it difficult to determine 
whether one counts links or items. "A man 
walking a fence" receives a score of 2 OL; "a 
partly burned tree trunk" is also scored 2 OL; 
while “a balanced snake" is scored 1 OL. For 
the clinician there is some disappointment in 
the mass treatment of data and the paucity of 
individual records or even scoring lists. How- 
ever, he will find much of interest in the dis- 
cussion of administration of the Rorschach and 
on the frequencies of locations, determinants 


the use of the Rorschach to study 
in general, and the development of the process 
of logical thinking in particular, 
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come to those who wish to objectivate changes 
of personality in subjects under prolonged 
clinical observation, because the new set allows 
for repeat performances which would not be 
affected by the influences of memory and past 
experience, The manual gives detailed statistical 
information on the responses obtained from 
225 normal subjects and from 160 institutional- 
ized patients. This makes it easy to use the new 
cards successfully without acquiring personal 
experience. The only difference between the 
old and the new series is found in the fact that 
the new series makes it easier to give form- 
color responses. This may be of practical value. 
Furthermore it raises again the theoretical 
question whether the original set is not un- 
necessarily limited in its usefulness by the per- 
sonal bias of Rorschach himself. (See G. A. 
Roemer: Vom Rorschachtest zum Symbol- 
test. Zentralblatt fuer Psychotherapie 10:310, 
1937/38.) Aside from its main purpose the 
manual is valuable because it includes the re- 
sults of a study of clerical workers and of en- 
gineers. It appears that different personality 
traits are required for success in each of the 
two groups. This reviewer feels that the relia- 
bility of technical aptitude tests could be greatly 
improved if used jointly with the projective 
ink blot technique. 


[82] 
*[Re Rorschach.] Harrower-Erickson, M. R., AND 
STEINER, M. E. Large Scale Rorschach Techniques: 
A Manual for the Group Rorschach and Multiple 
Choice Test. S ringfield, Ill.: Charles C Thomas, 
Publisher, 1945. Pp. xi, 419. $8.50. * (PA 19:1720) 


Am J Orthopsychiatry 15:737 O '45. Louis 
A. Lurie, * If the conclusions of the authors 
are borne out by other workers, their tech- 
nique will take its rightful place with the other 
objective psychological personality tests, For 
the present it must be remembered that the 
records of but 340 subjects provided the mate- 
rial for the analysis, * Not satisfied with the 
results of group Rorschach technique insofar 
as shortening and simplifying the procedure 
was concerned, the authors devised another 
modification of the Rorschach test which they 
call the multiple choice test. * The underlying 
assumption of the authors has been “that those 
individuals most likely to give certain types of 
responses when responding freely in the Ror- 
Schach method will pick such responses when 
confronted with them in a multiple choice sit- 
uation." The weakness of this line of reasoning 


[ 164 


is obvious. Among other things, no allowance 
is made for the effect of suggestion or the 
effect of confusion due to ability to choose. * 
Despite the need for further evaluation, this 
monograph is well worth reading, not only by 
those interested in the Rorschach technique but 
by everyone interested in objective psychologi- 
cal personality tests. 

Am J Psychiatry 102:141-2 Jl '45. Fred V. 
Rockwell. * excellent organization * Specific 
directions for the administration and scoring 
of the group Rorschach are presented simply 
and clearly. * Eight thousand responses have 
been evaluated in a painstaking manner from 
the viewpoint of location, determinants and 
content. There is nothing in the literature to 
compare with these carefully worked-out and 
beautifully charted standards. The subjects in- 
clude normal adults, psychotics, college age 
individuals and a group of prisoners. By com- 
parison of these diverse groups, it has been 
possible to determine accurately the stimulus- 
producing properties of each individual card. 
Such standards are of fundamental and lasting 
importance both in the evaluation of group 
Rorschach tests and in connection with the origi- 
nal individual technique. * The multiple choice 
test was developed in an effort to permit tests 
to be given and scored by untrained persons. 
Here, in the opinion of this reviewer, the au- 
thors violate the fundamental principles in- 
volved in Rorschach interpretation. by at- 
tempting to reduce the results to a numerical 
score. * Any person who has worked with the 
Rorschach technique is bound to experience a 
certain feeling of reverence and awe at the 
enormous amount of painstaking work which 
the authors have put into this most authoritative 
volume. The book is highly recommended as an 
essential part of any Rorschach reference 
library. 

Am J Psychol 58:410-5 JI "45. Donald E. 
Super. The sub-title of this volume and the de- 
scriptive material on the jacket will lead many 
clinical psychologists to look upon the appear- 
ance of this book as a hopeful indication that 
the Rorschachean techniques have now been 
adapted for their general use. The scientifically- 
minded psychologist may look upon it as a help 
in setting an estimate upon the test in its sim- 
pler forms. The table of contents and the in- 
troduction should confirm the reader's hopes. 
* Some readers may be more impressed by the 
possibilities of the method than by the incon- 
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clusive nature of much of the evidence, and 
some may note certain favorable, but subjec- 
tive, valuations of the authors and of their 
authorities. The authors have at times—to be 
sure—successfully attempted to be scrupulous 
in their approach, with important results, For 
example, two careful and convincing studies of 
the value of the Group Rorschach in academic 
prediction, made by Harrower-Erickson at 
McGill and by Munroe at Sarah Lawrence, 
are described in some detail (pp. 18-27). In 
another section (p. 70 ff.), the stimulus values 
of each card were studied with the finding that 
the distribution of responses by location is rela- 
tively constant among normal and abnormal 
groups, but with slight group-differences. The 
incidence of anatomical responses in medical 
students, who might be expected to see such 
content more frequently than other groups as 
a result of training, was studied (p. 137), and 
the authors depart from traditional Rorschach 
theory in concluding, quite properly, that ex- 
perience does affect the content of responses. 
But no consideration is given to other possible 
training effects, e.g. vista responses in persons 
trained in drawing. * [These] studies, despite 
lack here and there of statistical refinement, 
should interest the scientifically-minded reader 
in further research with large-scale Rorschach 
techniques. Unfortunately, however, these stud- 
ies are in the minority. Despite the book's prom- 
ise, its methods are frequently weak, its data 
sometimes inadequate, its statistical analyses 
often incomplete and unsophisticated, its conclu- 
sions generally colored by bias in favor of the 
techniques investigated, and cautious conclusions 
are too frequently discarded for clinical impres- 
sions and observations which smack of wishful 
thinking. The experimenters will be divided 
from the clinicians early in the book, where the 
statement is made (p. 8) “. . . it is clear that 
minor deviations in technique, in materials, in 
administration and scoring, will matter as little 
here as they do when the test is given individ- 
ually." On the one hand, this attitude seems to 
involve a lack of scientific care and accuracy ; 
on the other hand, it is clinical in its approach, 
attempting to maintain uniformity by adapting 
the situation to each patient rather than by 
standardizing the situation itself. Herein lies an 
old quarrel between clinicians and psychometri- 
cians. On pages 13 and 14, impressionistic con- 
clusions reached by various users of the Group 
Rorschach are cited without quoting evidence 
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to justify the conclusions. For example, Krug- 
man is quoted: "Although far from complete, 
preliminary results indicate that . . . the data 
yielded possess far greater usefulness for per- 
sonality evaluation than any other pencil-paper 
‘personality’ test now known to this writer and 
his colleagues." This is not evidence. In an- 
other place (pp. 85-86) trends "considered" 
typical by Klopfer are referred to and used as 
“well-recognized norms," thereby converting 
opinion into fact. It is stated (p. 36) that, in 
the group-test, the location of responses on 
cards by subjects is sometimes confused, but is 
sometimes spontaneously clearly delineated 
with varicolored pencils. No mention is made 
of the desirability of standardizing the method 
of locating the response to make possible more 
accurate scoring of locations; instead, in line 
with Rorschachean tradition, it is suggested 
that the method of locating responses be used 
as additional diagnostic material. It may be 
useful as such, but this unscorable material is 
gained at the sacrifice of the accuracy of the 
scoring of the more basic data on location. The 
authors decided to run a check (p. 190) on the 
identity of interpretations of group and indi- 
vidual responses in Rorschach. Forty subjects 
were therefore given an individual inquiry. 
Some general conclusions concerning the ac- 
curacy of scoring of data secured by group- 
methods and a number of examples of deter- 
minants scored for data gathered first by group 
and then by individual inquiries are presented. 
But there is no tabulation of data to justify 
the conclusions—nothing more than impres- 
sionistic judgment and selection of examples. 
Section viii (two pages) is entitled “Discussion 
of Content Categories,” and is followed by 
eight pages of graphs and tables showing the 
distribution of responses according to content 
and clinical group. But there are no measures 
of the significance of the differences, and no 
discussion of the results, This leads the re- 
viewer to wonder, with embarrassment, why it 
is that clinicians, and Rorschacheans particu- 
larly, have such strong leanings toward words 
without figures, on the one hand, and incom- 
pletely analyzed figures, without words, on the 
other? No critical ratios, chi-squares, or similar 
statistics are reported for any of the authors’ 
distributions of responses, whether of clinical 
or occupational groups. This consistent failure 
to use measures of significance leads to over- 
emphasis of differences of unknown signifi- 
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cance. * The total percentages of failures seem, 
on the surface, to vary considerably from group 
to group: College Students 12 per cent, Adults 
6 per cent, Prisoners 27 per cent, and Psychot- 
ics 54 per cent. But when reference is made 
to earlier sections of the book and it is ascer- 
tained that these groups consisted of 224, 34, 
41, and 41 subjects respectively, one is inclined 
to wonder how significant these differences in 
total negative responses really are. Again, these 
figures tell only the percentage of negative 
responses in each group of subjects, instead of 
indicating the mean and standard deviation of 
the negative scores of each group. If the data 
were analyzed by the accepted method for com- 
parisons of test-performances, and if critical 
ratios were computed, the differences might, or 
might not, seem "very striking." We do not 
know. Due, Wright and Wright also are guilty 
of unscientific methods and reasoning in their 
sections of the book, although they use occa- 
sional measures of significance. Their 731 psy- 
chiatric patients from the Navy are not com- 
parable to their 369 "normal" recruits in age or 
education, and the authors state that officers 
“with frequent contact” with the normal group 
estimated, we are not told how, that from 20 to 
30 per cent were moderately or severely mal- 
adjusted. No attempt was made, apparently, to 
weed out the maladjusted members to secure a 
truly normal group, or to match the two groups 
for age and education by dropping cases, The 
Naval authors report a significant difference be- 
tween the mean introversiveness-extratension 
scores of their two groups (p. 198), but with so 
much overlapping as to make the scores “of little 
value.” They then recommend a cut-off score of 
54 which would single out 31 per cent of the pa- 
tients and 19 per cent of the normals; whether 
these latter are the maladjusted members of the 
so-called normal group is not known, but “subse- 
quent experience with the test at an embarkation 
center has strongly validated this inference,” If 
the experience yielded evidence, the evidence 
could be statistically treated and analyzed. These 
high-scoring subjects are the extratensives ; no 
mention is made of the low-scoring patients 
and normals who should, by definition, be in- 
troversive, obsessive, and repressed, and who 
are not differentiated at the lower ends of the 
distributions (Table I, p. 198) nor by the sug- 
gested critical score. These same authors re- 
port (pp. 199-200) that the distributions of 
negative scores of their psychiatric and normal 
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subjects overlap so much as to be nondifferen- 
tiating. This conflicts with Harrower-Erickson 
and Steiner's conclusion (pp. 157-161) that 
negative scores of 4 or more are indicative of 
maladjustment, but agrees with a study by 
Wittson, Hunt, and Older, referred to in the 
text. The only attempt to reconcile these con- 
flicting findings is the suggestion that, in view 
of the possibility of maladjustment in some 20 
per cent of the normal group, a critical score 
of 5 be used, thus selecting 37 per cent of the 
patients and 23 per cent of the normals. Statis- 
tical evidence failing, resort is again made to 
"clinical experience," which has "repeatedly 
demonstrated to the authors that an individual 
with 5 negative responses shows marked be- 
havior disturbances." Again, if experience dem- 
onstrated this to the authors, the data would 
demonstrate it to the reader. That a recent 
study (M. B. Jensen and J. B. Rotter, The 
validity of the multiple choice Rorschach Test 
in officer candidate selection, Psychol. Bull. 42, 
1945, 182-185) reports that 22 per cent of a 
group of Armored Force officers rated excel- 
lent by immediate supervisors made negative 
scores of 5 or more on this test, is perhaps an 
indication of the dangers of discarding objec- 
tive evidence in favor of the impressions of 
clinical experience. In the light of these facts, 
even the positive results reported earlier in this 
review hardly justify the conclusion—or per- 
haps it is a hypothesis—of Due, Wright and 
Wright on page 212: "Experience with the test 
has shown that it may be useful in estimating 
the relative amenability or resistance of the 
patient to psychotherapeutic treatment. It is 
quite probable that the test can also be used as 
an objective evaluation of therapeutic prog- 
ress." It seems to this reviewer that, as the 
result of too glib summaries of the results of 
research, leading by easy transitions to state- 
ments based on observations and impressions 
and from there to wishfully phrased hypothe- 
ses, the authors permit themselves to reach 
conclusions which have not actually been dem- 
onstrated for these two forms of the test. As 
a final illustration of the book's clinically ori- 
ented weaknesses, one might cite Section vii, 
an interesting and helpful discussion of inter- 
pretative principles for differential diagnosis. 
‘Helpful,’ that is, if true. The material is all 
subjective, based on clinical impressions and on 
single-case illustrations. True, these interpre- 
tations are in line with Rorschach practice, and 
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clinical experience suggests that they are sound. 
But unorganized experience, unanalyzed data 
and tradition are often misleading. Validation 
should be accomplished by evidence gathered 
in controlled investigations and analyzed objec- 
tively, not by the opinions of authorities and 
the impressions of observers. A few defects 
and errors such as are found in most books 
might be mentioned. Tables are numbered seri- 
ally in each section, thus making it necessary 
to identify tables by number and page; graphs 
are not numbered at all. The graphs on pages 
188-189 -have some unexplained hollow and 
broken-line extensions to otherwise solid bars. 
On page 31 the name “Snellen” is misspelled. 
Pages 101-103 are unnumbered, H. J. Older 
is referred to as “T. J.” on page 239. Lt. Due 
is referred to as “(MC), USNR” on pages 
195 and 205, but as “MC V-(S), USNR” on 
page 214, which, if true, should presumably be 
written “MC-V (S), USNR.” But these are 
minor defects. Despite its major shortcom- 
ings, the book contains much valuable material, 
including norms for the distribution of re- 
sponses by location, the frequency of determi- 
nants, content, popular responses, and rejec- 
tions or negative responses. The beginning of 
the process of standardization, incomplete 
though it is, is important. Future studies will 
no doubt be made, building on the foundation 
laid by these investigations and leading to the 
development of a practical and well-validated 
instrument, 

Arch Neurol & Psychiatry 54:72-3 Jl "45. 
In the first part of the book the authors deal 
extensively with the development of the group 
Rorschach method * Adequate directions for 
administration and for scoring are given, and 
numerous illustrations appear. The second part 
of the book, in which an analysis of the find- 
ings of an experimental investigation is pre- 
sented, is much less convincing, from both the 
statistical and the clinical viewpoint. The statis- 
tical procedure open to most serious challenge 
is the small and poorly selected sample of sub- 
jects used for standardization purposes, involv- 
ing, as it does, a college group in which more 
than one half are medical students and nurses. 
With the individual Rorschach method, such 
a professional group is generally conceded to 
report more than the usual number of anatomic 
responses. The authors recognize the need for 
contrasting control groups and include them, 
although a considerable amount of overlap- 


ping appears to exist in their choices, Subjects 
consist of (1) unselected males, (2) ‘patients 
(psychotic and psychopathic) and (3) two 
groups of prison inmates, composed of “sex 
offenders” or persons “serving terms for mur- 
der, burglary, grand larceny and forgery.” The 
data do not suggest, however, that in the third 
group psychopathic personalities have been 
eliminated. Little attention is given to the basic 
problem of reliability of the group method it- 
self, although it appears to be fairly sensitive 
to variations of administrative technic, Evi- 
dence is also lacking that qualitative aspects of 
the individual Rorschach test principles may not 
have been altered radically under group con- 
ditions, Nevertheless, interpretation is based 
primarily on principles of the individual Ror- 
schach method, seemingly on the assumption 
that the two technics reveal personality patterns 
which are qualitatively similar. If, as the au- 
thors suggest, their book is to serve as a “man- 
ual” for the group Rorschach method, these 
theoretic considerations must be met before an 
adequate evaluation can be made of the method 
itself. Their statistical tabulation of responses 
is of value, inasmuch as it constitutes a new 
and genuine contribution to the literature, To 
date, no such analysis of responses obtained to 
the Rorschach cards presented by the group 
method has appeared. * Three sections of the 
book concern themselves with validation of the 
multiple choice test in terms of its effectiveness 
in the psychiatric differentiation of military 
personnel. The contributors....present evidence 
which, although favoring the use of the mul- 
tiple choice test among trained workers, ac- 
knowledges its limitations when employed by 
persons unskilled in Rorschach interpretation. 
They qualify their endorsement of the test by 
stating that their “blind analyses were made 
from the interpretative principles of the indi- 
vidual Rorschach combined with subjective 
clinical hunches” (page 214). For a test which 
purports to objectify and quantify the Ror- 
schach method to a degree which makes it 
usable by workers untrained in Rorschach in- 
terpretation, such evidence is not convincing. 
Some of the material in the section on the ap- 
plication of interpretative principles definitely 
suggests a need for further research Clinicians 
experienced in the use of the individual Ror- 
schach method will find it difficult to avoid 
feeling that much of qualitative significance 
has been sacrificed, from such statements as the 
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following : “Organic and convulsive states are 
pooled in this discussion inasmuch as they are 
indistinguishable, interpretively, in the Multiple 
Choice Test" (page 231), or, "Three case rec- 
ords will be cited to illustrate this group, one 
with grand mal epilepsy and two cases demon- 
strating i evidences of organic intel- 
lectual impairment, one with post-traumatic 
y and the other having nervous 

oye syphilis with paresis” (page 232). 
rom their grouping and interpretation, the 
contributors seem to suggest that organic dam- 
age to the brain is to be expected with convul- 
sive disorders, a fact which has yet to be estab- 
lished, * The fourth, and last, part of the book 
contains classified lists of content obtained from 
group record responses based on the amplified 
version of the multiple choice test. This mate- 
rial is definitely of interest, although no con- 
clusions are drawn from group records and no 
comparison made with individual records taken 
as a group. Throughout the entire book, ad- 
ministrative details and appraisal of them are 
handled by trained Rorschach workers, not by 
untrained laymen. There is evidence of consid- 
erable success with the use of the group Ror- 
schach methods under these conditions. Much, 
however, remains to be done. In commenting 
on the book in a recent publication (Rorschach 
Research Exchange 9:46-53, 1945), Dr. Mar- 
guerite Hertz summarizes its contribution rather 
well when the says that "'Large Scale Ror- 
Techniques’ is valuable as a pioneer 
effort,” but that “it is still unsuited for wide- 
spread application.” Just how far this falls 
short of the authors’ intentión is suggested by 
the publisher's statement on the jacket flap of 
the book, that the authors “have devised an 


cator, or counselor may now avail himself of 
the fruits of research of many years in the 
Rorschach method." Many Rorschach workers 
skilled in the use of the individual method will 
agree with Dr. Hertz when, in speaking of the 
group Rorschach and the multiple choice test, 
she further concludes : “Both tests may be used 
to advantage only by workers who are steeped 
in Rorschach methodology, who are willing to 


subject the tests to thorough study, and to 
treat their results with suspended judgment.” 
Many will question the wisdom of attempting 
such oversimplification for the purpose of mass 
consumption and will regret that such a poten- 
tially dangerous weapon has already been 
placed at the disposal of the novice without 
proper validation, 

Brit J Psychol 36:45-6 S '45. This book is 
intended mainly for those who already have a 
good knowledge of the Rorschach technique, 
though it will also be of considerable interest 
to psychologists in general. * a very interest- 
ing section by F. O. Due, M. E. Wright and 
B. A. Wright shows why a simple pass or fail 
score often gives inaccurate results. The par- 
ticular responses of various maladjusted types, 
eg. of different neurotic types, vary greatly ; 
and it is the pattern of particular ‘good’ and 
‘bad’ responses which is most likely to be really 
diagnostic. Although it requires the highly 
skilled Rorschach technician to work out these 
patterns, it seems possible that a trained psy- 
chologist might learn to use them in selection 
testing with less labour than is required to 
learn the whole Rorschach technique. The sim- 
ple ‘pass or fail’ form of the Multiple Choice 
test can of course be given and scored by psy- 
chologically untrained personnel. It has the ad- 
ditional advantage over the older Rorschach 
techniques that it is less dependent on the ver- 
bal facility of the subject; but a possible dis- 
advantage in that the choice of responses might 
be found unduly constricting by a highly imagi- 
native subject. Moreover, the validity of both 
group techniques depends to a high degree on 
the subjects' co-operativeness. For one of the 
principal signs of emotional disorder is said to 
be the failure to respond (or the response 
"Nothing at all"). English subjects, often less 
amenable to testing than are Americans, might 
frequently fail to respond for reasons other 
than emotional disorder, e.g. because they 
"couldn't be bothered." However, it would of 
course be necessary to try out these new tech- 
niques on large representative groups of Eng- 
lish subjects before deciding the best manner 
of using them in this country. They might be 
found exceedingly valuable if they could be 
used with a reasonable degree of accuracy by 
psychologists who had not the time for long 
periods of specialized training. 

J Am Med Assn 127:955 Ap 7 45. * a well 
written, comprehensive manual of large scale 
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Rorschach technics by writers who are well 
known and respected for their work in this 
field. Methods used for group testing and mul- 
tiple choice tests,...are clearly delineated, * 
The book is well indexed * The work repre- 
sents an extremely valuable contribution to 
both military and civilian psychologic investi- 
gation, 

J Consult Psychol 9:310-1 N-D '45. Bruno 
Klopfer. For the last four years the authors 
have been pioneering under the pressure of 
war demands for modifications of the Rorschach 
Technique which would make it feasible for 
use on a large scale, * The present volume is 
the first comprehensive account of these efforts 
previously only reported in articles and oral 
reports. The bulk of the volume....consists 
of tables containing the quantitative output of 
the experimentation with the modified tech- 
niques * The book is furthermore enriched by 
three original articles by F, O. Due, M. E. 
Wright, and B. A. Wright which contain some 
of the most stimulating experiments and 


ground effects for the pictorial presentations 
of specific blot areas in the stimulus cards (the 
areas under discussion appear as black blots 
on a lighter gray background) ; the large and 
easy-to-read print; the spacious tables; the 


deals with the modifications of the adminis- 
tration of the Rorschach Technique * The dis- 
cussion covers quite adequately the important 
modifications of the recording of Rorschach 
responses: namely: (a) the substitution of 
slides for the ink-blot cards; (5) - — 
tion of self-recording techniques for re- 
cording of the responses by the examiner; and 
(c) the substitution of a modified self-inquiry 
for the inquiry by the examiner as to how the 
subject arrived at his responses. The function 
of the volume as a "manual" is limited by the 
authors to this phase of the administration. 
For the scoring and interpretation of the re- 
sponses thus collected, the reader is referred 
almost exclusively to The Rorschach Tech- 
nique. * An ingenious technical device should 
be mentioned....which is used by the authors 
throughout Part IL. The ten stimulus cards 
share the statistical spotlight with the group of 


and extremely valuable tool for the experi- 
enced rei v rie lend n h 
Wright, and Wright give an amazing con- 
vincing account (especially in the detailed 
analysis of multiple choice records published 
on pages 215-238) of the use an experienced 
clinician and Rorschach worker can make of 
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Choice. Test is too extreme an abbreviation of 
the Rorschach method to be of practical value. 
However, the authors' experience also demon- 
strates that the Rorschach, applied to groups, 
can be helpful provided these group records 
are interpreted in a way in which the individ- 
ual Rorschach records are usually interpreted. 

Nerv Child 5:276 Jl '46. * the subtitle....in- 
dicates that the attempt is made to use Ror- 
schach plates to develop a means of mass test- 
ing, a task which must be considered most 
peripheric for a testing method with a great 
individual sensitivity. Since the Rorschach ad- 
herents are divided, the applicability of the test 
will cause much controversy among those who 
are most interested in it. It seems to us a mis- 
take to use the word "group" as it is here 
applied; we should prefer the use of collective 
application of the Rorschach test as title. 

Prison World 8:23 Mr-Ap ' 46. 

Psychiatry 8:516 N ’45. M. F. Ashley Mon- 
tagu. * [This method for the group Rorschach] 
has been well tested by the authors and several 
other investigators whose independent reports 
are interestingly presented in this volume. The 
test works, and it works remarkably well. * 
The Group Rorschach is destined to be widely 
used; and its possibilities are great, particu- 
larly now that peace is restored to the world, 
for use in educational and industrial institu- 
tions. An entirely new group test, the multiple 
choice test, is also described in detail. * There 
are some remarkable reports on the results of 
the application of this test to naval personnel 
in which a 9o per cent accuracy in diagnosis 
was obtained, even down to organic brain 
lesions. Obviously this is a test full of the 
promise of immediate usefulness. The reader 
should be warned that some knowledge of the 
Rorschach technique is assumed for both tests. 

Psychol B 43:285-7 My '46. Robert C. 
Challman. * 'This book purports to be a manual 
for the Group Rorschach (a technique of ad- 
ministering the Rorschach ink-blots to groups) 
and for the Multiple Choice Test (a technique 
in which the subject selects his responses to the 
Rorschach ink-blots from a list). * The litera- 
ture on the Group Rorschach is well summa- 
rized, and a convincing case for the usefulness 
and merit of this approach is made. The au- 
thors contend on the basis of one study and 
a passing reference to two others that the group 
method yields essentially the same material as 
that yielded by the individual Rorschach. Ac- 
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cording to Dr. Marguerite Hertz, however, 
other studies not cited by the authors do not 
bear out this contention. [n the section serving 
as a manual for the Group Rorschach, excel- 
lent directions for administration are given. 
The recommended methods for conducting the 
inquiry for determinants involve, however, 
what is tantamount to the use of leading ques- 
tions—a practice which, for the individual 
Rorschach, both Beck and Klopfer strongly 
condemn. The chapter on scoring is not in- 
tended to aid the prospective scorer of the 
Group Rorschach since the authors believe that 
the scoring principles of the individual Ror- 
schach apply. It seems likely, however; that all 
“F—” scores might not be the same with the 
two techniques and one might infer even from 
the authors’ own data that “popular” responses 
and probably “usual detail" responses do not 
correspond entirely with their counterparts in 
the individual Rorschach. Over 200 pages are 
devoted to data concerning locations, determi- 
nants, and content—mostly in graphic and tab- 
ular form. Unfortunately the value of this 
material is seriously weakened by the follow- 
ing major and minor flaws. (1) No account of 
the procedure of administration of scoring is 
given. (2) The sampling is inadequate. Of the 
340 subjects tested, 224 were of college age, 
and the rest consisted of relatively small groups 
of adult males, prisoners, and a mixed group 
of psychotics and psychopathic personalities. It 
is very doubtful whether any meaningful dif- 
ferences can be found between groups so het- 
erogeneous within themselves. (3) No meas- 
ures of dispersion or measures of reliability of 
differences are presented. (4) The authors 
state that their “main interest and concern 
[was in] providing norms for the college age 
group".., but since their group is composed 
mainly of medical students and nurses, even 
this aim is not fulfilled. (5) Instead of at- 
tempting to ascertain what are the “usual de- 
tails" from their own data, the authors use 
Klopfer's series and add other details which 
were seen by their subjects at least as often as 
the most infrequent one he lists. This results 
in their identifying as usual a large number of 
details whose actual frequencies are less than 
I per cent, i.e., unusual “usual details.” (6) 
Although average figures for the different de- 
terminants would have been helpful in inter- 
preting a Rorschach record, only percentages 
are given. * Although the original validating 
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study of the authors [for the Multiple Choice 
Test] showed a very good discrimination be- 
tween normals and abnormals, subsequent stud- 
ies (one of which is presented in full in the 
book) have found such imperfect discrimina- 
tion as to make its value as a screening instru- 
ment very dubious. The authors attempt to ex- 
plain away these findings, but the more recent 
study of Jensen and Rotter showing that the 
test places 45 per cent of “actually excellent 
[Army] officers” in the abnormal group makes 
such attempts rather futile. In an effort to 
overcome some of the shortcomings of the test, 
the authors developed an amplified form. In 
this version, the number of items was tripled, 
objectivity of scoring was increased, differen- 
tial weighting was introduced and a new criti- 
cal score adopted. These changes were made 
on the basis of an analysis of "more than 
50,000 test blanks," but only fragmentary data 
are presented to justify most of the innova- 
tions. The reliability and validity of the new 
instrument are not mentioned. Fortunately a 
recent study allows a tentative evaluation of 
this form, R. F. and D. I. Malamud, compar- 
ing the responses of roo normals with those 
of 100 abnormals, found an almost complete 
overlapping of the two distributions. Item 
analysis, with the standard of C.R. of 2 or 
over, revealed only 1o per cent of the items as 
discriminative while 6 per cent discriminated 
in the wrong direction. Some material of value 
to those interested in experimental work can 
be gleaned from the book, but these hardly 
compensate for the many minor inaccuracies, 
the unsupported speculations, the overly op- 
timistic attitude of the authors, the lack of or- 
ganization of the material and the unduly low 
scientific standards employed. Since it is clear 
that both the Group Rorschach and the Multi- 
ple Choice Test are still in the experimental 
stage, the book offers little to the clinician. 

U S Naval Med B 45:181-2 Jl "45. * The 
general reader who is not familiar with the 
Rorschach Test will be hopelessly lost among 
its professional intricacies, but the Rorschach 
specialist will find it a thorough and stimulat- 
ing treatise which he will want on his reference 
shelves. * 

War Med 7:274 Ap '45. * the group Ror- 
schach test is a distinctly different test, and its 
usefulness is entirely different from that of the 
individual Rorschach test * The group pro- 
cedure may be likened.to a group mental test 


procedure. The group Rorschach test is useful 
as a screening device and as a means of esti- 
mating in a rough sort of fashion personality 
adjustment, just as a group mental test meas- 
ures mental ability. For a more accurate, precise 
measurement of mental ability an individual 
mental test administered by a trained examiner 
is preferred. For a more comprehensive, more 
accurate and more precise measurement of per- 
sonality, the Rorschach test individually ad- 
ministered and interpreted is preferred. The 
book presents what is undoubtedly a forward 
step in personality testing. It is the feeling of 
this reviewer, however, that the book is for 
persons who have had training in the Ror- 
schach technic, despite the fact that the authors 
state that they have written it for the psychol- 
ogist, psychiatrist, educator, social worker or 
probation officer untrained in Rorschach inter- 
pretive principles. 
[ 82a ] 

X*IRe Rorschach.] Kasack, Gorpre Ruts. Voca- 
tional Personalities: An Application of the 
Rorschach Group Method. Columbia University, 
Teachers College, Contributions to Education, No. 
924. Harry D. Kitson, faculty sponsor. New York: 


Bureau of Publications, the College, 1946. Pp. xi, 
116. $2.10. * (PA 21:1929) 


Occupations 26:135-6 N '47. Joseph Zubin. 
This study attempted to determine whether the 
responses of two professional groups (account- 
ants and pharmacists) differ significantly on 
the Group Rorschach Test and whether the 
students of these two professions also differ 
from each other in their Group Rorschach re- 
sponses. This study marks a new departure for 
the Rorschach Test, since up to the present it 
has been largely limited to the clinical field. Tts 
extension into this new field should be watched 
with interest both by Rorschach technicians as 
well as by students of personality and vocational 
guidance. Once before in the history of the 
Rorschach Test an attempt was made to use it 
technologically outside of the clinical field. 
After the first World War about 1921 when 
German psychologists began to reconvene and 
reorganize, the fame of the newly born test was 
spread by A Rómer and many of the German 
psychologists lent a willing ear. It was Stern, 
however, who prevented a wholesale. accept- 
ance of the new test on the ground that it was 
not yet fully validated and depended upon ar- 
tistic intuitive evaluation. The present study 
comes at a crucial time. We are now facing a 
somewhat similar situation in this country that 
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German psychology faced in 1921. Many of our 
applied psychologists, newly returned from their 
wartime experience, may not have had the ex- 
perimental and academic training required for 
evaluating the validity and reliability of tests. 
For this reason it is high time that the Ror- 
schach Test were subjected to a validation of its 
various claims, with special reference to its 
positive contributions as well as its limitations. 
The study itself takes an objective, neutral tone 
regarding the interpretive values of its findings. 
It generally maintains this dispassionate point 
of view despite its difficulty, except for one or 
two lapses. For example, on page 33 the author 
states that “data obtained from case histories, 
clinical records and psychoanalytical material 
confirm Rorschach’s claim that human move- 
ment responses are representative of creative 
thinking, inventiveness, originality and imagi- 
nation and that artists, imaginative subjects and 
abstract thinkers offer the largest number of 
M responses.” This claim is not documented 
and furthermore some of our leading Rorschach 
experts freely admit that the meaning of the M 
response is still not settled, Only recently a study 
of outstanding artists failed to confirm Ror- 
schach’s claim. Several of the interpretations 
made on the basis of the Rorschach findings do 
not seem to fit into our general knowledge 
about the groups under investigation. On page 
28 the finding that “Liberal Arts students are 
less inclined toward philosophic, theoretical or 
abstract thinking than the two other student 
groups” and that they also possess less aesthetic 
and ethical understanding seems somewhat 
strange. One wonders whether the Allport- 
Vernon Test of Values would have corroborated 
that finding. The statistical treatment of the 
data leaves little to be desired. In fact, in her 
attempt to squeeze out all the possible relation- 
ships, the author presents the same material in 
several guises. For example, not only are both 
the critical ratios as well as the chi squares for 
the same data computed, but a third measure of 
the same differential is provided by the point- 
bi-serial. This threefold presentation of the 
material leads to confusion rather than clarity, 
especially since the author does not seem to be 
aware of the fact that the three different meth- 
ods should theoretically lead to identical results, 
and if they do not lead to the same results, some 
explanation on the basis of the sample structure 
should be made, Another difficulty is the au- 
thor’s interpretation of the correlation between 


Original responses (O) and large details (D). 
A high correlation between these two needs not 
necessarily lead to the conclusion that “when 
an original concept (O) is perceived by the 
members of a group, it is quite likely to be 
based upon the interpretation of a large detail 
(D), etc.” High correlation does not neces- 
sarily mean a high incidence of concurrence. 
Such a fact can be derived only from a pattern 
analysis, not from a correlation coefficient. * 
Finally, the use of the multiple r between the 
two professions regarded as a dichotomous vari- 
able and the twenty-four Rorschach Factors is 
somewhat of a puzzle. The exact method used 
is not indicated, but it sounds so much like the 
method of the discriminant function invented 
by Fisher that it is somewhat surprising that 
no mention is made of it. Despite the above 
critical comments, the study is well worth the 
reading. Out of the host of variables analyzed, 
no special pattern of personality characteristic 
of the two professions emerges. This holds true 
of both the men engaged in the profession as 
well as of the students. Apparently the interest 
and personality trait patterns for occupations 
discovered by Strong and his followers still re- 
main the method of choice in vocational guid- 
ance. The personality trait differences observed 
by previous workers as characteristic of the two 
occupations need not be regarded as disproved 
by this study, for this study suffers from sev- 
eral basic difficulties inherent in the Rorschach 
Test which no experimental design can over- 
come. The first difficulty is the rather nebu- 
lously defined scoring categories with which the 
Rorschach worker contents himself. These are 
good enough for clinical usage but they can 
never serve to advantage in an experiment. We 
must provide some more rigorous psychometric 
frame of reference which will yield more sys- 
tematic scores that can be operationally defined 
independently of their alleged personality cor- 
relates, if we are to use these scores experimen- 
tally. The second difficulty inheres in the un- 
defined personality correlates of the Rorschach 
factors. Such personality traits as anxiety, cre- 
ativity, etc., need some type of objective opera- 
tional definition. How these two ends are to be 
achieved lies beyond the scope of the present 
study, but they are the foundations on which 
further attempts at studying personality via the 
Rorschach Test must be based. 

Psychol B 44:581-2 N "47. Max Hertzman. 
* The bulk of the monograph is devoted to a 


173] TESTS & REVIEWS: CHARACTER & PERSONALITY 


84 


meticulous statistical analysis of inter-group 
differences based on most oí the large variety 
of scorable Rorschach factors that have been 
recognized to date. The statistical analysis is 
conscientiously performed and well presented 
but the scope of the study does not go much 
beyond statistically defined problems. A limita- 
tion is also placed on the data by dealing with 
the Rorschach factors in a purely quantitative 
fashion. The interpretations of the data are thus 
based only on the sheer number or the propor- 
tions of various significant items. The omission 
of qualitative considerations can at times be 
disastrous to an intensive character analysis. * 
A fuller understanding of the relationship be- 
tween personality and vocational choice re- 
quires a broader investigation than the one re- 
ported here. Vocational choice is related to the 
prestige, economic reward, economic security 
afforded by the vocation as well as to the skills 
demanded by it. The prestige may vary with 
the social stratum from which an individual 
comes, as may the standards which determine 
adequate reward and adequate security. An 
analysis of what the appeals of given occupa- 
tions are could be the basis of more sharpened 
hypotheses as to which personality factors to 
investigate. An attempt to obtain such informa- 
tion, in the present study, through a question- 
naire was not very successful. When asked the 
reason for their choice of profession, over 60 
percent of the 300 subjects either did not an- 
swer this question, or said they selected the 
profession because they liked it or were inter- 
ested in it. Intensive interviews of a few sub- 
jects might have been more fruitful, perhaps 
yielding a picture of the relationship between 
self evaluation and occupational choice. Such 
findings as the greater anxiety among pharma- 
cists could be clarified if there were greater 
understanding of the meaning of the profession 
as a way of life to the people in it. The discov- 
ery of a high degree of overlap between the 
pharmacists and the accountants, despite the 
wide differences in the nature of these profes- 
sions, focusses attention on the limitations of a 
personality approach to vocational choice in 
present day society. As the author indicates, the 
present study cannot be taken as a final state- 
ment. She points to the need of a more diverse 
sample than was used and to the need for fur- 
ther experimentation with the Group Rorschach 
Method to clarify interpretations which were 
carried over bodily from the individual test. We 


would stress the value in future research of an 
intensive approach which would not limit per- 
sonality description to the special characteristics 
of a single test and which would investigate 
motivational and social determinants as well as 
personality structure. 


83 
*[Re Rorschach.] an. Bruno, AN» DAVIDSON, 
Heen H. The Rorschach Technique: 1946 Supple- 
ment. Yonkers, N. Y.: World Book Co., 1946. Pp. 431- 
75. Paper. $0.48. * 

J Consult Psychol 11:154 My-Je '47. * The 
first part of the supplement is a brief review of 
new trends in Rorschach development since 
1942. A second part is a chapter on a method 
of form-level rating. This is of particular in- 
terest because of Klopfer's earlier rejection of 
Rorschach's F-plus per cent as a score. His new 
proposal rates each response for form on a new 
scale of from minus 2 to plus 2. A number of 
examples of scoring are given. * 

[84] 

X[Re Rorschach.] Krorren, Bruxo. The Rorschach 
Technique: A Manual for a Projective Method of 
Personality Diagnosis. With clinical contributions 
by Douglas McGlashan Kelley. Yonkers, N. Y.: World 
Book Co., 1942. Pp. xi, 436. (London: Geo. G. Harrap. 
& Co., Ltd.) * For supplement, see 83. For latest edi- 
tion, see 85. (PA 16:3170) 

Am J Orthopsychiatry 12:739 O '42. (Ira 
S.) Wile. * As an interested and enthusiastic 
protagonist for this mode of analysis. [Klop- 
fer) presents a well-organized discussion of its 
nature, technique, and problems of scoring. 
His enthusiasm is reflected in the section de- 
voted to the general problems of interpretation 
and the nature of personality structure. When 
discussing the intellectual aspects of person- 
ality one notices an undercurrent of defensive- 
ness. It is heartening to read: “It would natu- 
rally be absurd ever to expect to establish 
standardized tables based on statistical research 
which would enable one to determine whether 
a subject is schizophrenic, neurotic, or any 
other definite personality type—normal or ab- 
normal.” As personality diagnosis can only be 
made by inference, there is special interest at- 
tached to Part IV, dealing with clinical diag- 
noses, by Dr. Kelley. This section is not always 
clear nor is it very convincing, This may be 
exemplified by his reference to “the so-called 
pathognomonic signs” and then, “none of them 
is limited too exclusively to one single clinical 
entity.” After referring to clinical entities that 
produce general constellations of Rorschach 
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response, typically reflecting the personality 
structure, he states: "A quantitative validity is 
not to be expected from studies of these general 
signs, since, clinically, all that one may hope for 
is that a majority of patients showing the signs 
will be found to possess the disease entity in 
question and, conversely, that the majority of 
patients not suffering from the disease entity 
will not show these Rorschach responses." He 
is quite right in stating: "There is a grave dan- 
ger of misusing such general Rorschach patterns 
by blindly applying them as diagnostic criteria 
without considering other qualitative factors 
and extenuating interrelationships which may 
be present." A careful analysis of the data de- 
rived from the Rorschach technique has mean- 
ing only to those thoroughly familiar with the 
scoring methods and their interpretation. Hence 
this volume has special value for Rorschach 
testers and clinicians. It will be useful also to 
those who are uncertain concerning the validity 
of the test as an instrument for diagnosis, prog- 
nosis or possible therapy. 

Am J Orthopsychiatry 13:179-80 Ja °43. 
Edna Brand Mann. For anyone interested in 
Rorschach, this book is by so much the best in 
the field that it earns the rank of the authorita- 
tive text on the Rorschach. It does not sacrifice 
scholarship to simplicity. It does not minimize 
the living complexity of human nature to achieve 
a pseudo-simplicity. It is the outcome of rich 
clinical experience with the Rorschach in Eu- 
rope, followed by seven years of “learning by 
teaching” in this country. When Klopfer started 
his first seminars, at a time when his English 
was as mystifying as the Rorschach still seems 
to a beginner, he attracted excited groups of 
select students who wished to follow him into 
the then artistic realms of his creative analyses. 
In those early days students thought, “This is 
wonderful, but is it Klopfer or Rorschach? 
Who else but Klopfer could get at an interpre- 
tation like that?” We longed for something less 
inspired and more usable. And now it is here, 
a beautiful job of logic, a neatly organized struc- 
ture of thought; a practical, expertly written 
handbook which is safe for the beginner and 
illuminating for the Rorschach expert. Many of 
the questions we have been asking in our daily 
clinic work are here raised and answered. A 
few of the technical points this reviewer is par- 
ticularly pleased to see emphasized in the scor- 
ing section are the differentiation in the Form 
(F) realm, e.g., the difference between a simple 


popular or noncommittal F, the superior elab- 
orated F (F plus), and the poor F (F — ). Many 
clinical workers have found only confusion in 
lumping the two first F's into one F plus cate- 
gory or 46, thus obscuring the significant diag- 
nostic feature implicit in a genuinely high F 
plus, based in superior F. A similarly helpful 
discussion of Whole (W) responses, breaking 
the category down into the kind of helpless, 
unorganized Wholes characteristic of brain 
diseases, the vague noncommittal Wholes com- 
mon among those who would put on a big front, 
and the carefully elaborated Whole showing 
genuine substance behind the Form. Only in the 
FC-— refinement does there seem question as to 
whether the elaboration may not be too deli- 
cate for comfort. Would not such answers fit 
better under Cf or Cf—, where they can later 
be sumiated more appropriately under the CF 
than the FC category? An important section of 
the book is devoted to “Problems of Interpre- 
tation of Personality," based on the Major Scor- 
ing Categories. Included is a full case study to 
illustrate in neat detail just how the Rorschach 
is given, how the "Inquiry and Testing the 
Limits" are done, how each answer is located 
by the examiner on a location sheet and scored, 
and then how (and why) the finale is inter- 
preted. The last section of the book, prepared 
by Dr. Douglas Kelley, will be of special inter- 
est to clinical psychologists and psychiatrists. 
It presents a well-documented account of Ror- 
schach studies in the field of organic, psychotic 
and neurotic disorders and mental defective- 
ness. It contains a great deal of material which 
might be organized more tellingly. The final 
chapters introduce the reader to latest events in 
the Rorschach field, Dr. Harrower-Erickson's 
recently devised group method of administering 
the Rorschach, and its possibilities as an adjunct 
in military medicine and in selection of per- 
sonnel. The book lives up to the hopes of early 
Rorschach and Klopfer enthusiasts for a text 
on the Rorschach that should be comprehensive 
and comprehensible. 

Am J Psychiatry 90:901 My ’43. R. Milton 
Finney. * the most comprehensive exposition 
of the Rorschach experiment published in Eng- 
lish. * clearly and simply written, but such a 
wealth of detail and information is provided 
that repeated reading may be necessary by the 
student. On the other hand, the book will serve 
as an excellent reference for those familiar 
with the Rorschach techmique. One would wish 
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that the authors had felt free to use colored 
reproductions of the .cards employed in the 
method, and the inclusion of more Rorschach 
records would probably have made the tech- 
nique more intelligible to the beginner. 

Am J Psychol 55:609-10 O "42. Z. A. 
Piotrowski, Of the several available books on 
the Rorschach method this volume best meets 
the needs of the beginner. * readable * pre- 
sented in a clear and systematic manner * no 
exaggerated claims for the method are made * 
good index and a comprehensive bibliography 
* Klopfer gives a careful analysis of scoring. 
The differentiation between sharply and vaguely 
perceived forms is very difficult and yet impor- 
tant. Klopfer’s treatment of this vexing prob- 
lem is a definite contribution. His differentiation 
between F-plus and plain F-responses leads to 
a more valid estimate of the subjects’ intelli- 
gence. He suggests that the F — % rather than 
the F+% be computed since the former pos- 
sesses the same meaning for all investigators. It 
is this reviewer's considered opinion that Klop- 
fer's tendency to super-refined scoring does not 
always achieve results commensurate with the 
effort expended on the scoring. Thus the chia- 
roscuro responses subdivided into as many as 
I2 categories are treated with disproportion- 
ately greater detail than other—perhaps even 
more important—types of response, e.g. the 
human movement-response. Furthermore, the 
numerous chiaroscuro categories create compli- 
cated problems of reliability and validity. Kelley 
»-Blves a good survey of results obtained in 
various psychiatric categories of patients. * 
There are several....misstatements, but on the 
Whole Kelley's review gives the reader a good 
idea of the type of problem encountered in the 
application of the Rorschach method to psychi- 
atric diagnosis as well as of the results achieved 
to date. * Only those who will use the book as 
a guide in their work with the method will 
appreciate fully the many fine points which the 
authors raise. It must be borne in mind, how- 
ever, that neither this nor any other book will 
enable the reader to use the method success- 
fully. At least at the present time, the help of a 
qualified instructor seems indispensable, as 
Klopfer himself emphasizes, and as is attested 
by a number of articles in which half-baked 
views are offered in the naive conviction that 
the method was used as Rorschach intended it 
to be used. 

Am Sociol R 8:369-70 Je ’43. M. R. Har- 


Az 
rower-Erickson. * a reliable and authentic text- 
book to which the enthusiastic would-be learner 
can turn, after his first contacts with the method 
through popular or pseudoscientific channels. 

The Rorschach Technique...is described by the 
senior author as follows "This book attempts 

for the first time to present the accumulated 

experience of twenty years of application (of 

the method) in terms of a technical description 

of the problems the beginner will encounter in 

his attempt to administer, score, and interpret 

a Rorschach record. While this book is hardly 

sufficient as a basis for self-training it is hoped 

that it will be of considerable help when used 

in conjunction with other training facilities." 

This is a modest statement, and the book more 

than measures up to the authors' hopes. * To 

[the] scoring system of Rorschach's, Klopfer, 

both as author and teacher, has done much not 
only to sharpen and refine it as a tool, but to - 
make it understandable and significant to the 

bewildered beginner. Part III deals with the 

interpretation of the scored record * Klopfer's 

wide experience has enabled him to give to this 

phase much that is essential to the beginner, and 

of value to the expert. Part IV (Douglas Kel- 

ley) is an able presentation of the use made of 

the Rorschach method in clinical situations. * 

This section is obviously not intended to bring 

the completely uninitiated reader to the point 

where he can make a differential diagnosis be- 

tween the patient with a brain tumor and the 

hysteric! It will be of great interest, however, 

to the clinical psychologist, and to the physician 

who has perhaps referred such patients for a 

Rorschach test, but whosé knowledge of the 

actual characteristics of the personality, as re- 

vealed in the test, is scant. The psychiatric social 

worker, who may have felt the need for some 

objective method of estimating personality, will 

be amply rewarded by a study of the type of 

clinical problems which come within the scope 

of the test. All in all, the book is an important 

landmark in the development of a method of 

personality analysis which is showing that it has 
a vitally important contribution to make at the 
present time, 

Arch Neurol & Psychiatry 49:030 Je '43. 
The Rorschach method of personality analysis 
is still a mystery to most neurologists and psy- 
chiatrists. It cannot be said that the present 
volume makes the matter simple, but the book 
does provide a great store of information about 
the test, which is far easier of access than that 
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in Rorschach's original book or any others at 
present available. This history of the test is 
interestingly described; then precise directions 
are given for administering it. * an exhaustive 
bibliography * The test is useful in the differen- 
tial diagnosis of various psychoses and neuroses. 
In addition, definite patterns of response are 
seen with certain types of structural disease of 
the brain. The specificity of the pattern in epi- 
lepsy is open to question. It would be difficult, 
but not impossible, to learn to give and to inter- 
pret the Rorschach test from this book alone. 
The manual will, however, be of great value 
to all those who are working with the procedure, 
or who ate attempting to utilize its results. 

` Brit J Ed Psychol 13:165 N ’43. For review, 
see 78. 

J Abn & Social Psychol 37:572-3 O ’42. 
Claire Myers. * the most thorough and complete 
summary of Rorschach material available * Al- 
though statisticians may disagree with many of 
the arguments advanced against standardiza- 
tion and validating statistics, nevertheless the 
authors make some sound objections to the rou- 
tine application of statistical formulae and the 
development of a mechanical interpretative pro- 
cedure. * In general the system of interpreta- 
tion given is based on the use of formal scores, 
and the content of the responses is neglected. 
Many of the criticisms which might be made of 
this section are of the authors' theories of per- 
sonality structure rather than of the system of 
interpretation per se. * One of the best features 
of the book is a complete bibliography of Ror- 
schach publications. One important fault is an 
absence of statistical evidence for many of the 
statements made. Rorschach workers and stu- 
dents of the method will probably find the whole 
book of value. Psychologists and psychiatrists 
may find the two parts covering administration 
and interpretation too thorough and detailed, 
but the sections on history and clinical diagnosis 
should be of interest to them. 

J Appl Psychol 27:209-11 Ap '43. Robert E. 
Harris. * the criteria for scoring are carefully 
enough defined so that the agreement between 
trained scorers should approach that on, say, the 
vocabulary items of the Revised Stanford- 
Binet. * A scoring category does not have a 
single interpretational significance; it may have 
different meanings in different records, depend- 
ing on the values in each of the other categories. 
That every record has unique configurational 
aspects is continually emphasized, but in spite 


of these recurring holistic strictures, Klopfer 
often seems to assume a one-to-one correspond- 
ence between a scoring category and a person- 
ality variable. For example, he says, “....the 
proportion of usual details may indicate the 
awareness of the obvious and the immediate 
problems of everyday life..." (p. 203) * More 
commonly, though, the multiple interpretational 
possibilities of the variables and their ratios 
are discussed against the background of differ- 
ent kinds of records. It is unfortunate that this 
section [Part 3] contains no validating infor- 
mation of the kind usually found in the descrip- 
tions of more objective tests. Instead the reader 
must be satisfied with such statements as, “A 
predominance of human profiles, based on parts 
of the contours of all the blots, has been found 
in cases of anxiety neuroses, combined with a 
strongly introversial personality constellation" 
(p. 264). * Most of the interpretations assume 
a great deal more stimulus-equivalence and re- 
sponse-generalization than most experimental 
psychologists would be willing to grant without 
extensive empirical evidence. It is true that vali- 
dating the interpretative significance of fifty to 
a hundred scoring categories, the meaning of 
each of which may vary with a change in the 
value of each of the others, presents a for- 
midable task, logically and practically, and, in- 
deed, some Rorschachists would claim, a work 
of supererogation. There are, however, many 
indications of the validity of the Rorschach in 
the clinical and experimental literature, and this 
section could have been made more convincing 
to the general reader and more useful to clinical 
and research workers if citations from these 
studies had been included. Such facts are an 
integral part of a technical manual. * Klopfer 
and Kelley have made a significant contribution 
to the study of personality. In offering a scheme 
for analyzing the projective material of the 
Rorschach which may well become standard, 
and summarizing what is known about inter- 
pretation and diagnosis, they have provided 
clinicians with an immediately useful tool. Re- 
search workers will be intrigued by the perplex- 
ing problems of validation, and they will be 
forced to use (or invent) more elaborate experi- 
mental designs than the simple measures of asso- 
ciation used in validating tests of nomothetic 
abilities and traits. The field of personality 
measurement has been too long dominated by 
over-simplified conceptualizations of personal- 
ity structure derived from methods successful 
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in other areas of measurement; we have thought 
of personality as being made up of those things 
which we had the tools to measure and validate. 
It is good that Klopfer and Kelley have stated 
their conceptualization so clearly. It makes ob- 
vious the need for new methods. 

J Consult Psychol 6:274-5 S-O '42. Morris 
Krugman. * a careful, scholarly presentation of 
a very difficult subject, and is extremely well 
written. The authors have presented a sys- 
tematic, comprehensive exposition of the method 
and have added their own original contributions 
to the procedures. They have refined and elabo- 
rated many of the Rorschach concepts. Some 
Rorschach workers, including the reviewer, do 
not agree with all of these refinements and elab- 
orations. However, the authors usually present 
opposing views, and are free to admit when their 
changes in scoring technique or interpretation 
are inconclusive or tentative. They have de- 
parted radically, for example, from Rorschach’s 
original concept of “good form," or F+, and 
have eliminated the F+ per cent. The latter is 
so generally used that Klopfer and Kelley were 
forced, in discussing the Rorschach in connec- 
tion with pathological conditions, to reintroduce 
this concept. The experienced Rorschach worker 
will find some of Klopfer’s scoring refinements 
elaborated out of proportion to their importance. 
The beginner will probably find the very sin:ple 
explanation of scoring symbols, included in the 
Record Blank attached to the end of the book, 
a more useful and concise treatment of scoring 
categories than the three chapters on that sub- 
ject. Chapters V, VI, and VII could have been 
much clearer if some of the more technical and 
rarer aspects of scoring had been removed and 
placed in the appendix. The examination and 
scoring procedures also seem unnecessarily com- 
plicated, and very few experienced Rorschach 
workers would employ the full process as out- 
lined in the book. The steps involved in per- 
formance, inquiry, testing the limits, location 
chart, scoring list, tabulation sheet, determinant 
graph, table of relationships among factors, and 
interpretation, have value for students. Most 
Rorschach workers adhere to the much simpler 
process of Rorschach's Psychodiagnostik, which 
places the responses, inquiry, and scoring ad- 
jacent to each other, followed by one sur mary 
tabulation of scoring factors. This simpler type 
of recording presents a single gestalt of the 
performance rather than a bewildering array of 
tabulations. Only one case is presented in the 


entire book. The reviewer feels that poor judg- 
ment was shown in selecting this case. It is 
atypical in many ways. Numerous pathological 
factors, admirably discussed elsewhere in the 
book, make their appearance in the sample case, 
but their significance is not pointed out. With 
the exception of the case, and some minor items, 
the treatment of the Rorschach method in this 
book is excellent. To say that it is the most com- 
plete manual on that method available in the 
English language would be saying very little 
indeed, since there are so few. It is difficult to 
see how a clearer manual can be compiled for 
years to come. One of the major merits of the 
book is the fact that the authors have not at- 
tempted to condense the complex, dynamic 
method into over-simplified, stagnant, magic 
formulae, so that he who runs may make snap 
judgments on personality. They present the sub- 
stance of the method so that serious students of 
clinical psychology or psychiatry who are willing 
to put forth the necessary effort may learn to 
apply the method. f 

J General Psychol 30:105-11 Ja '44. Philip 
Lawrence Harriman, * ever since D. M. Levy 
acquainted the American psychiatrists and psy- 
chologists with the technique, and particularly ~ 
after the publication of S. J. Beck's disserta- 
tion, interest in the Rorschach has increased 
with each passing year. Now the corpus of liter- 
ature on Rorschach methodology rivals in bulk, 
if not in quality as well, the published research 
on the Binet-type of test. In part, no doubt, the 
increased attention is due to the novelty of the 
procedure; but a main contributory factor has 
been the rise of new theories of personality, 
especially those identified with Murphy and 
Jensen, G. W. Allport, Lewin, and others. These 
new viéws, emanating from the Gestalt protest 
against “atomistic psychology," have done much 
to raise questions about the serviceability of per- 
sonality inventories and to pave the way for 
acceptance of a test which purports to measure 
the “total personality.” Consequently, the pub- 
lication of a book by the Director of the Ror- 
schach Institute and by a psychiatrist who has 
contributed at least 13 research articles is an 
event of no little importance. * As all students 
of Rorschach technique know, there are three 
principle “schools” or methodologies in this 
country: the Beck, the Hertz, and the Klopfer. 
Unfortunately, there is little published research 
on the relative merits of these three groups ; but 
some evidence supports the conclusion that 
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“blind diagnoses” by each method bring unanim- 
ity of agreement in interpretations. Disagree- 
ments among the experts may pertain only to 
the superficial and mechanical phases of the 
procedure. At any rate, this book does not dis- 
cuss the matter. It presents a direct and un- 
ambiguous exposition of the Klopfer technique. 
* In view of the current interest in the Har- 
rower-Erickson and the Linder-Chapman ex- 
periments in group Rorschach testing, it is à 
little surprising to find only five short para- 
graphs devoted to this topic. Presumably, Klop- 
fer, like many other exponents of the technique, 
believes that it should continue to be, what 
Rorschach intended, an individual examination. 
* Klopfer attaches a great deal of importance 
to the inquiry, which is the phase of the admin- 
istration wherein the examiner learns how the 
subject arrived at the responses and wherein the 
subject has an opportunity to make elabora- 
tions. * Klopfer....agrees with Beck in postpon- 


- ing the inquiry until after all 10 cards have been 


B 


presented, By the use of a location chart, the 
examiner records the response area. Some read- 
ers will have difficulty in appreciating the need 
for making precise localizations for each re- 


^ sponse discussed during the inquiry, since Klop- 


fer seems to have little faith in tables of norms. 
Zubin has pointed out the need for statistical 
refinement in scoring Rorschach responses ; 
Hertz has published elaborate scoring tables; 
and Beck appears to follow a somewhat rigorous 
method. On the other hand, Klopfer says that 
there is no likelihood of schematizing the 
method or of developing “foolproof, mechani- 
cal" tables. In fact, such a procedure obscures 
the "individual nuances and facets of any given 
record" and thus vitiates the basic purpose of 
the Rorschach. Of great importance is the phase 
of administration which is called testing-the- 
limits. At this time the examiner tries to elicit 
responses which were avoided during the initial 


- administration of the cards and to clear up 


vague or evasive material. Especially is this 
phase of the administration to be conducted 
thoroughly if the record displays signs of block- 
ing, “misunderstanding” the directions, absence 
of response to color, non-committal material, or 
failure to make the popular responses. In the 
sample record presented as an appendix to 
Part 3, the examiner had asked 12 brief ques- 
tions in the testing-the-limits phase. No doubt, 
had the subject given a smaller number of re- 
sponses, these notes (p. 308) would have been 
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a less disappointing example of a technique 
which Klopfer considers to be an essential part 
of a Rorschach record. The scoring problems of 
the Rorschach investigator are infinitely com- 
plex. This book gives a great deal of help in 
the form of apposite comments, but it does not 
outmode Rickers—Ovsiankina’s Rorschach Scor- 
ing Samples for a neophyte who wants to de- 
velop assurance in scoring responses. * One of 
the outstanding contributions which Klopfer 
has made....is to develop a detailed scoring plan 
which, presumably, reveals more of the subtle- 
ties in the responses than would appear in any 
other transmutation of associations into letter 
symbols. Each one of these symbols is, appar- 
ently, a logical development of the unfinished 
researches of Rorschach. One original addition 
is to discriminate the “cut-off whole” response, 
as to Cards III and VI, from a D (Beck) or 
a W. Responses to the entire figure must be 
carefully studied in order to estimate, qualita- 
tively, their construction. Small details (d) are, 
of course, to be differentiated from usual de- 
tails (D) ; and the only table in the book lists 
them in the approximate order of their fre- 
quency. Of particular interest is the account ' 
of tiny details (dd), edge details (de), and in- 
side details (di). Kelley, unlike Beck, seems to 
regard the oligophrenic detail (do) as occur- 
ring in the records of feebleminded (p. 329), 
whereas Beck and Loosli-Usteri regard it as 
symptomatic of anxieties. Klopfer does not 
mention the do in his discussion of detail re- 
sponses (pp. 91-105). Although it seems im- 
portant to differentiate the exact type of detail 
response, neither the adequacy of the directions 
for scoring nor the exposition of interpretative 
significance is wholly satisfying to the careful 
reader. Probably there are advantages in going 
beyond the Beck procedure of scoring normal 
details, rare details, and the white space details 
(D's and Dr's) ; but as yet it seems a bit like 
multa in parvo. One possible danger is that 
revelatory material may become obscured when 
converted into symbols which have only a spe- 
cious exactitude. Yet, the advantages in manipu- 
lating symbols rather than cumbersome and 
lengthy responses are obvious, and the diagnos- 
tic significance of various types of details seems 
to justify a tentative adherence to the Klopfer 
procedure rather than to the more general cate- 
gories of Beck, In his discussion of the deter- 
minants, Klopfer makes a number of unique 
contributions to Rorschach methodology. First, 


179 ] 


é . 


TESTS & REVIEWS: CHARACTER & PERSONALITY 84 


he differentiates three types of “kinesthetic” 
responses: human or human-like movement 
(M), animal or animal-like action (FM), and 
minor movement (m). * In the section dealing 
with interpretations of Rorschach records, 
Klopfer does not include much information 
about the diagnostic value of minor movement, 
except to state that these responses are “an 
almost literal reflection of tensions within the 
personality structure....where the subject expe- 
riences his promptings from within as hostile 
and uncontrollable forces working upon him 
rather than as sources of energy at his dis- 
posal.” In a sense, this quotation is representa- 
tive of much of the presentation in the book. 
Broad generalizations and intuitive comments 
abound; but, except in the final part of the 
book, there is practically nothing in the way of 
experimental or statistical evidence to support 
them. No doubt, all these broad statements are 
the outgrowth of an extensive and varied expe- 
rience with the Rorschach technique. Certainly, 
they are presented in a plausible manner, and 
they appear to have pragmatic justification. If 
the reader be an “academic psychologist” of the 
“old school,” he may lift the quizzical eye-brow 
now and then during his perusal of the book. 
If, on the other hand, he has attempted to inter- 
pret a number of Rorschach records, he will 
be constantly delighted by Klopfer’s argute, 
shrewd generalizations, even though they be 
clinical insights rather than conclusions based 
upon laboratory investigations. The scoring of 
the chiaroscuro responses (Beck’s FY and Y) 
is the second important contribution in this text- 
book. * Unquestionably, this book presents the 
most elaborate plan for scoring these responses 
that has yet appeared in print. * The account 
of how to score the color determinants includes 
only a few novel contributions, * It is disappoint- 
ing to find a very brief and inadequate chapter 
devoted to the Record Blank....which is one of 
the best devices for setting up a Rorschach 
report. Since the gute Formen responses are 
"really signs of the integrity of the Ego," some 
readers may wonder why Klopfer denies the 
value of tables of norms (such as Hertz, Beck, 
and Gardner have supplied) or why he does not 
furnish more examples of his own scoring plan. 
The outstanding merit of the exposition on how 
to score a Rorschach is that it provides symbols 
for almost every conceivable type of response. 
Whether or not each of these symbols stands 
for a definite referent is a troublesome ques- 


tion. Since great diagnostic significance attaches 

to the sum of various symbols in the total con- 
figuration, inaccuracies in converting the re- 
sponses into symbols could be disastrous. This 
reviewer feels a need for a book which will 
include every possible type of response, classi- 
fied by experts, and which will provide prac- 
tice material for the student. Klopfer gives an 
adequate exposition of the system of scoring 
now used by the Rorschach Institute, but he 
does not at all points furnish enough examples 

to illuminate his account. Neither does he pre- 
sent evidence to justify the extension of the 
conventional symbols which have been in use 
for the past 10 or 12 years. His plan for re- 
cording "secondary responses," trends, and re- 
fined differentiation is a distinctive contribu- 
tion. The section of the book which is devoted 

to interpretations advances many hypotheses 
which will stimulate investigators for years to 
come. It may not be too extreme to say that this. 
part is one of the finest examples of clinical 
insights to appear in recent years. Of course, 
the laboratory psychologist and the statistician 
will deplore the absence of experimental data 
and controlled investigatory evidence. * the seri- ... 
ous inquirer about the Rorschach technique may 

be disappointed by one feature of this section— 

its comparative brevity. Whereas the intrica- 
cies of scoring methods take up about one-third 

of the text, only a quarter of the space is given 
over to interpretations, The sample interpreta- 
tion presented as an appendix to Part Three 
makes the reader wish that the senior author 
had been less cautious and had included more 
examples of clinical insights. * the Rorschach 
method is not suitable to be introduced-as a 
new course of study on the undergraduate level ; 
hence the book is directed particularly to those 
psychologists and psychiatrists who want to 
learn more about the procedure. It might find 

a place as a parallel reading for advanced stu- — 
dents in clinical psychology who wish to ascer- 
tain whether they have aptitude for specialized 
study in the Rorschach. For the instructor who 

is using the test in research, the publication of 
this book is an event fully as important as the 
issuance of a new revision of the Binet to a 
psychometrist. Unquestionably, this book, to- 
gether with Beck's Introduction, will establish 
the Rorschach technique as one of the standard 
topics in psychology. Those who are interested 

in bringing the potentialities of this test to the 
attention of other psychologists find genuine 


* 


| 


84 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


satisfaction in being able to recommend these 
two excellent books. Except for a rather poor 
apportionment of space and for the use of “due 
to" in an adverbial sense, the Klopfer-Kelley 
book might also serve as a model for college 
classes in expository writing. 

Psychiatric Q 16:598-602 J1'42. * this sum- 
mer appears to mark an important turning point 
in development of the Rorschach method for 
psychiatric use in the United States. Two ad- 
mirable introductory volumes on the technique 
have just been published; and the first author- 
ized English translation of Rorschach's “Psy- 
chodiagnostik" has been brought out * For the 
first time, authoritative and readily comprehen- 
sible material in this field has become available 
for the beginner * Klopfer notes modestly that 
“this book is hardly a basis for self-training,” 
but expresses the entirely justified hope "that it 
will be of considerable help when used in con- 


» junction with other training facilities." * Drs. 


Klopfer and Kelley, after presenting a thor- 
ough exposition of the problems of administra- 
tion, have devoted their section on clinical diag- 


* nosis to material “of value primarily to the medi- 


cally trained professional worker or to those 
readers who have had extensive experience in 
clinical psychopathological fields." Dr. Kelley's 
discussion of clinical findings, prepared during 
two years of research as Rockefeller Fellow at 
the New York State Psychiatric Institute and 
Hospital, is highly technical. It is intended for 
the expert and will be of use chiefly to the 
expert, an end deliberately sought by the authors 
to avert the possibility that inadequately trained 
workers would attempt to put the Rorschach to 
clinical use. The criteria of differential diag- 
nosis are discussed thoroughly, but their appli- 
cation is not illustrated. * Dr. Klopfer feels that 
"Proficiency as a Rorschach administrator can 
be gained within a few months." To interpret, 
however, what has been administered and 
scored, he warns, requires “a wealth of general 
psychological experience" and the remaining in 
“a earning stage’ for two or three years." * 
Both [Bochner and Halpern’s The Clinical Ap- 
plication of the Rorschach Test and this volume] 
...Should be important contributions to greater 
knowledge and use of the Rorschach technique. 
* The Klopfer and Kelley volume is a far more 
complete training manual for any person who 
expects to do serious and continued work with 
the Rorschach. It covers the ground so compre- 
sively and so minutely that, despite Dr. Klop- 
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fer’s expressed doubts, it might almost seem 
possible for a psychologist, by earnest study of 
the book and the cards to learn administration 
and scoring without personal instruction. The 
refined scoring system developed at the Ror- 
schach Institute and through publication of the 
“Rorschach Research Exchange” is carefully 
and clearly explained, a protocol of a normal 
subject is presented in full, the standard "Indi- 
vidual Record Blank" printed for the Rorschach 
Institute is shown in facsimile, and the “loca- 
tion chart" with its gray, lithographed repro- 
ductions of the cards is reproduced and its use 
demonstrated, A bibliography of the Rorschach 
method through January, 1942, is included in 
the volume, and there is an excellent index. * 
Keeping in mind the somewhat different orienta- 
tions of these two introductory books, either of 
them can be recommended without reservation 
as a better introduction to the study, practice 
and general evaluation of the Rorschach tech- 
nique than has heretofore been available. The 
fact that a set of the cards is essential for serious 
study or comprehension of the Rorschach by 
use of either textbook is a matter which may not 
be entirely obvious to persons lacking any ac- 
quaintance whatever with the method. * 
Psychiatry 5:604-6 N '42. Ernest G. Schach- 
tel. Since the publication of Hermann Ror- 
schach's Psychodiagnostik in 1921 quite a few 
manuals, textbooks, and introductions to his 
method have been published in various coun- 
tries. Although varying in scope and thorough- 
ness they all refrain from examining the psy- 
chological foundations of the Rorschach method. 
Klopfer’s book is no exception to this rule 
and it is for this reason chiefly that—despite 
many valuablé contributions and a painstaking 
thoroughness in technical details—it nowhere 
approaches the level of Rorschach’s own pre- 
sentation. Without an understanding of the psy- 
chological foundations of the Rorschach test and 
without clear concepts concerning the dyna- 
misms of personality and interpersonal relations 
no amount of technical labor with regard to the 
descriptive scoring of a subject’s interpretations 
of the Rorschach ink blots will add to the under- 
standing of the human psyche. To contribute to 
such an understanding is the aim of all psycho- 
diagnostics and certainly was the aim of Ror- 
schach’s Psychodiagnostics. That in his book the 
reader never loses sight of this aim, while in 
Klopfer’s book it hardly ever comes into the field 
of the reader’s vision, is a consequence of his 
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partly vague, partly self-contradictory concepts 
regarding the nature of the Rorschach test and 
the nature of personality. Rorschach has given 
a prominent place to his inquiry into the psychic 
processes which constitute a subject’s “Ror- 
schach responses,” and he has reached the con- 
clusion that the interpretation of the ink blots 
is a perception in which the effort to integrate 
available memory pictures—engrams—with re- 
cent complexes of sensation—produced by the 
ink blots—is for most persons so great that it 
is consciously felt as an effort and that hence 
the interpretation of these chance forms “falls in 
the field of perception and apperception rather 
than imagination.” This insight—although in- 
complete as an explanation and theoretical foun- 
dation of the test—is basic for an understanding 
of Rorschach’s ideas and of the nature of his 
method. Klopfer does not even mention it. In- 
stead he speaks on one occasion of an “interplay 
between the structural characteristics of the 
stimulus material and the personality structure 
of the subject” as reflected in concept forma- 
tions, and a few pages further on—page 12— 
he follows Frank's and others' definition of the 
method as a "projective" procedure, adding, 
rather obscurely, that Rorschach's interest was 
in the person's personality structure and that 
"individual differences, as revealed by the ink 
blot method, were only incidental to this primary 
concern." It is never stated whether these two 
explanations of the test are offered as an alter- 
native or what their relation to each other and 
to Rorschach's own explanation is. Both ex- 
planations, that of concept formation based on 
an interplay between stimulus material and per- 
sonality and that of "projection" of the person 
into his interpretations of the test places, are 
less specific than Rorschach's, and the widely 
accepted projection theory is, in addition to 
that, misleading. Either the term "projection" 
is to be taken merely as another way of saying 
that every personal activity and feature is char- 
acteristic of the person engaging in this activity 
or having this feature and, thus, is a “projec- 
tion" of his personality; in this case the use of 
the term "projection" would explain nothing 
which is specific to the Rorschach method—or to 
any other of the so-called projective techniques 
of personality testing—as distinguished from 
any other method of evaluation or interpreta- 
tion of personal human behavior. Or else “pro- 
jection" has the accepted psychiatric meaning 
that a person projects conscious or unconscious 


attitudes, fears, desires on other persons or 
objects and assumes that the other person or 
object has this attitude, fear, desire. This fre- 
quent psychic dynamism plays an important róle 
in the movement responses to the Rorschach ink 
blots and in some dynamic form responses, but 
it is quite insufficient as an explanation of the 
entire range of reactions to the Rorschach test, 
since the great majority of all responses do not 
show any such projection at all. An adequate 
explanation of the Rorschach test would have to 
analyze the complexity of the Rorschach situa- 
tion, which changes considerably even during a 
single test, it would have to show the share that 
memory, perception, comprehension have in it, 
it would have to consider the test material, the 
tester, and the subject under the aspect of an 
Aufgabe and authority situation, it would have 
to describe the aspects of freedom and compul- 
sion inherent in this situation or transferred to 
it by the subject, and it would have to examine 
the Rorschach categories and their significance 
which undergoes changes with the changing 
meaning of the test situation. Unless the Ror- 
schach situation can be understood and analyzed 
in such a way, as a particular kind of human 
experience related to all other human experi- - 
ence, the test will become a sterile technique 
rather than the ingenious instrument for the ex- 
ploration of man as which it was conceived and 
presented by Rorschach. It is obvious that the 
concept of personality in the mind of any stu- 
dent of human nature and personality will be 
of considerable importance for his insight into 
people, regardless of whether such a concept be 
articulate and explicit or vague and even uncon- 
scious. When dealing with a diagnostic tool for 
the exploration of the individual mind and per- 
sonality it is impossible to make even a single 
statement without explicit or implicit reference 
to personality concepts, of which the person 
making such a statement may be aware or un- 
aware. Klopfer thinks it unnecessary, for pur- 
poses of the Rorschach test, to have any theory 
of personality except the assumption that people 
are prompted "from without and from within" 
—page 221—and that the “susceptibility to be 
stimulated from within, or to be stimulated from 
without is distributed in mankind according to a 
normal curve"—fage 222. But he constantly 
uses such concepts as spontaneity, inhibition, 
control, pseudo-control, repression, sublima- 
tion, some of which seem to stem from the 
psychoanalytic theory of personality which has 


84 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 182 


influenced Rorschach's personality concept. Es- 
pecially the term control is used freely through- 
out the book, a term which makes little sense 
unless at least two agencies are assumed within 
the human personality one of which controls 
the other, as they are assumed in Freud's theory 
of the ego and the id, Klopfer, then, makes many 
more assumptions concerning the human per- 
sonality than he seems to be aware of—as most 
people do. But unfortunately in addition to that, 
he abrogates in one place even those concepts 
and assumptions which he uses quite explicitly 
in many other places. Also, he never clearly 
explains the two factors of his own avowed per- 
sonality concept, the promptings from within 
and those from without and why they should 
be distributed according to a normal curve. 
While the basic concepts concerning the nature 
of the Rorschach method and of the "subjects" 
to whose benefit it is to be applied are thus 
vague, shaky, or absent, the merits of Klopfer's 
book lie chiefly in the accuracy of some of his 
descriptions of the properties of Rorschach's ink 
blots and in some of his elaborations of Ror- 
schach’s scoring categories, for instance the vari- 
ous stages of acuteness or dullness of form per- 
ception, and various ways to arrive at a “whole” 
interpretation of the ink blots. However, where 
the symptomatic and interpretative significance 
of Rorschach’s scoring categories and of Klop- 
fer’s elaborations on them is concerned—and 
it is on this significance that the diagnostic pic- 
ture of the personality rests—the lack or the 
opaqueness of the psychological concepts ob- 
structs insight adequate to the labor spent on 
Scoring or approaching the insights gained by 
Rorschach. Thus, for instance, Rorschach's de- 
scription of the factors involved in acuteness 
of form perception, or his derivation of the 
significance of movement responses, is far richer 
in psychological content and insight than Klop- 
fer's remarks on the same subjects, although he 
goes to much greater lengths than Rorschach in 
subdividing both movement and form responses. 
Altogether too often the reader has to be content 
with the mere assertion that certain scores have, 
or seem to have, certain symptomatic values, 
without any attempt by the author to prove his 
assertions. Klopfer succumbs to the same dan- 
ger to which so many Rorschach workers be- 
fore him have succumbed, namely of construct- 
ing a kind of immanent “Rorschach psychology” 
which uses the categories of scoring implicitly 
as categories of human behavior and human 


psychology, without obtaining at first the neces- 
sary insight into their relation to the basic con- 
cepts of psychiatric and psychological theory 
and to the totality of human experience. It was 
the merit of Rorschach never to lose sight of 
these relations. A comparison of the one sample 
analysis of a Rorschach record in Klopfer’s 
book with the three extensive analyses which 
have been published from Rorschach’s own 
practical work shows that the elaboration of 
scoring categories in Klopfer’s manner does not 
seem to yield as plastic and accurate diagnostic 
pictures as Rorschach achieved with less such 
elaboration, but with greater awareness of the 
relation between his—fewer—scoring categories 
and the general psychiatric and psychological 
categories on which his concept of the human 
personality was based. The fourth part of the 
book, written by Kelley, contributes a concise 
summary of the clinical diagnostic findings of 
Rorschach and others who have used his method. 
The best chapter of this part is that on intra- 
cranial organic pathology, while the chapter on 
psychoneuroses is confined to a symptomatology 
which does not penetrate to the present state of 
insight into the structure and core of neuroses. 
The difficult and-interesting problem of the psy- 
chopathic personality is not discussed. A bibli- 
ography concludes the book. 

Psychoanalytic Q 11:583-7 O 42. S. J. Beck. 
* Klopfer’s contribution....offers the latest and 
most complete description of the Rorschach 
test technique as he has been teaching it. The 
reader wishing to acquaint himself with the 
form that the test is taking under his molding 
can, therefore, do no better than read this book. 
From this point of view it is quite as authorita- 
tive as its advertisements announce it, How- 
ever, this review is being written by a student 
trained in the Rorschach-Oberholzer discipline. 
Judged from that sphere of reference, Klopfer 
has superimposed on the Rorschach test so 
much that is strange to it as to make it, except 
for the broad outlines, essentially unrecogniza- 
ble. With certain new emphases in interpreta- 
tion, he has made of the test another instru- 
ment. Whether for better or for worse is 
another matter ; the point is that it is something 
different. The first and most obvious transfor- 
mation is in respect to the multiplicity of sym- 
bols which Dr. Klopfer has introduced. In the 
‘detail’ category (responses to parts of the ink- 
blot), Rorschach’s two classifications, D, Dd, 
are expanded to six: D, d, dd, de, di, dr. This 
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is not to mention such inventions as DS; or 
DS (apparently not identical with Rorschach's 
Dzw, the usual white space response). The 
variation becomes even more astonding in re- 
spect to movement, color and shading responses. 
In these categories, Rorschach was able to han- 
dle the material of his test with the simple M, 
C, CF, FC, and F(C) (translating his German 
letters), five symbols. The Klopfer repertory 
with M, FM, m, mF, Fm, k, kF, Fk, K, KF, 
FK, Fe, c, cF, FC’, C'F, C’, F/C, C/F, C, Cn, 
Cdes, Csym, is bewildering. As one studies the 
responses for which these Klopfer symbols are 
shorthand, there is no disputing their occur- 
rence. They have been observed by various 
workers and commented upon. As precisely 
identifiable refinements, they are, however, too 
rare to warrant separate classificatory rubrics. 
They are those qualitative events which may 
overlay any Rorschach response record and 
help complete the diagnostic picture, the finer 
strokes that paint in the high-lights and the 
shadows once the structure itself has been 
established. The objection to designating them 
by special signs is that these symbols become 
part of the response summary which, in turn, 
becomes part of a formula; eg., (p. 254) 
(Fm + m) : (Fc 4- c C^). Or, (p. 229), ‘there 
are several ways of measuring the degree of 
outer control with the aid of the quantitative 
Rorschach results' and several ratios are offered 
as: FC to (CF + C); (FC + CF + C) to 
(Fc + c + C^), by which to gauge ‘outer con- 
trol,’ ‘inner control,’ ‘crude control,’ ‘refined 
control,’ ‘extreme constriction,’ ‘modified con- 
striction,’ and others. This gives an impression 
of a quantitative foundation which simply does 
not obtain for the psychologic events tested. 
The few formulas which Rorschach and his 
followers use are not intended as a representa- 
tion of exactly measured events. They are sym- 
bols of directions of psychic trends, not meas- 
urements. They do not presume to go further 
in interpreting psychologic events than is possi- 
ble by inference from regularly occurring Ror- 
Schach response categories in clinical pictures 
of known dynamics. Rorschach's own ten years 
of careful labor, his psychologic insight backed 
by an experimenter's temperament, and Ober- 
holzer's long experience with the test, on the 
background of his thoroughgoing clinical knowl- 
edge, are the bases for the personality interpre- 
tations that have been made from the Ror- 
Schach test. To offer formulas is to go much 


further than is known experimentally concern- 
ing the symbols as referring to mental life. It 
is striking the pose of a quantitative method 
not warranted by the present state of knowl- 
edge. Deriving from this, is a second more seri- 
ous objection. The creation of many symbols 
and formulas steers the examiner into a search 
for them. It encourages that superficial ama- 
teur psychologism which has been so much the 
bane of psychometrics, happily less so today 
than twenty years ago. It takes the student 
away from psychological dynamics, and leads 
him to hunt for signs and the magic of num- 
bers. To make symbols—used by Rorschach 
only as a convenience in recording—so large 
a portion of the test is to lay the emphasis on 
sign at the expense of substance. This mis- 
placed emphasis in turn leads to a third, and by 
far more fundamental, criticism, Klopfer in- 
forms us in his first sentence that the ‘book 
grew out of seven years of "learning by teach- 
ing"'. He makes no statement of having first 
set his own foundation in study with any of the 
men who developed the test from its inception. 
Oberholzer is of course the most important of 
these and Zurich is not far from Bavaria where 
Dr. Klopfer lived. Now that both are in New 
York the distance is much less. Klopfer started 
his Rorschach work without profiting from the 
experience of those who devised and developed 
the test, and the results show it. This accounts 
for his fundamental error in regard to the all- 
important M, or movement (Rorschach's B or 
Bewegung) response, and particularly respect- 
ing movement in animal forms (pp. 114, ff). 
A single session with Oberholzer would have 
disabused Klopfer of the notion that ‘the open 
mouth of a crocodile . . . devouring its victim’ 
(pp. 115-116) has any M in the Rorschach 
sense; similarly with ‘movement’ in inanimate 
objects : ‘something falling apart’ (p. 117). The 
extreme to which this distortion of M may go 
is finding it in ‘animal skin, nailed on a board 
... itis beginning to contract’ (p. 119). The M 
response in the Rorschach test is something the 
individual does; it is of deep significance to 
him; it is something other than what he states 
in the manifest content of the association, in 
other words, it is the latent dream in the literal 
freudian sense. To designate any Rorschach 
association that has a verb in it as M is to apply 
this symbol to words, and miss its value for 
identifying events in the unconscious living of 
the individual. Some M responses obtained 
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with the test have been shown to communicate 
the innermost personal needs of the individual. 
This value of M is what makes it so important 
a discovery and Rorschach’s great contribution 
to the problem of objective ingress into person- 
ality. Accuracy in identifying it, therefore, is 
of the greatest importance. It makes the differ- 
ence between the psychologist trained to search 
for subsurface mental activity, and the ‘sign 
psychologist.’ The merits of Klopfer’s portion 
of the book are in the valuable suggestions for 
further exploration. One of the best is what he 
calls ‘Testing the Limits,’ In this he is making 
a regular rule of procedure of a practice that 
had formerly been employed only in those pa- 
tients who had been extremely uncommunica- 
tive. Without doubt, this additional effort to 
elicit associations can also be used profitably in 
many instances where we now stop after ob- 
taining that minimum set of responses consid- 
ered essential for personality delineation. In 
extremely productive patients, Klopfer ac- 
knowledges the probability of adding little by 
‘testing the limits’ (p. 52). The author devotes 
more space than have former publications— 
excepting, naturally, Binder’s monograph—to 
the shading response. He makes some valuable 
suggestions as to possible significances of the 
different shading nuances. Especially important 
may be the lead that the ‘surface’ shading re- 
action (not the three dimensional, or vista per- 
cept) expresses a need for contact sensation. 
Beginners will be grateful for the considerable 
more space devoted to administration and re- 
cording than have earlier publications. The 
scoring blank (Klopfer and Davidson), which 
is a formal element in Klopfer’s method, is 
pasted into the book, This author closes (Ap- 
pendix to Part Three) with one sample Ror- 
schach record, scored and interpreted. In this 
reviewer's opinion, teaching is best done by an 
abundance of illustrative material. The chapter 
on history includes some hitherto unpublished 
and valuable details. This reviewer regrets the 
omission of Campbell and Wells, in Boston, 
The American Orthopsychiatric Association, 
and Lawson Lowrey in New York. The first 
two were largely responsible for the early re- 
search with the test in this country. The Asso- 
ciation was the forum before which the first 
reports were made, and Lowrey’s persistent 
interest made possible many of the earlier 
American publications. * The best of [Kel- 
ley’s] chapters are the ones on brain pathology 


and schizophrenia, which is to be expected, 
these being the fields which have been most 
studied. The least satisfying is the one on psy- 
choneuroses. The fault is due to the paucity of 
published reports on the Rorschach test in neu- 
rosis. It is indeed strange that this clinical 
group, which comprises so large a percentage 
of the population, should yield so few reports, 
Here is a big field, still to develop. The chap- 
ters by Kelley are much the most satisfying in 
the book. They reflect a caution born of clini- 
cal seasoning, and a balanced approach to Ror- 
schach problems, unhappily missing in most of 
the rest. The book has a good bibliography, 
370 titles, covering the literature through Jan- 
uary 1942. It is well indexed and paginated, an 
important feature in any instruction manual. 
The introduction is appropriately by Doctor 
Nolan D. C. Lewis, one of the men whose sup- 
port, material and moral, has done much 
toward developing this test in America. 

Psychol B 40:144-6 F '43. S. J. Beck. 
[Practically the same as his review in Psycho- 
analytic Quarterly, excerpted above.] 


[85] 
*[ReRorschach.] Krorrer, Bruno. The Rorschach 
Technique: A Manual for a Projective Method of 
Personality Diagnosis. With clinical contributions by 
Douglas McGlashan Kelley. With 1946 supplement by 
Bruno Klopfer and Helen H. Davidson. Yonkers, N. Y.: 
World Book Co., 1946. Pp. xi, 475. $4.80. * (London: 
George G. Harrap & Co. Ltd. 215.) 

[ 86 ] 
*[Re Rorschach.] Masiow, Paut. The Analysis 
and Control of Human Experiences: The Individ- 
ual Seen Through Rorschach, Volumes 1 and 2. 
Brooklyn 2, N. Y.: the Author, 16 Court St., 1947. Pp. 
iii, 233; iv, 195. Paper, mimcographed, $3.50 per vol- 
ume. * (PA 21 :2326) 


Am J Psychother 1:529-30 O '47. Rose 
Palm. * Maslow has touched on most of the 
principal topics covering the field of human ex- 
perience. Included in his discussion are psy- 
chology, psychoanalysis, philosophy, mythology, 
science, art, psychiatry, politics, sociology, 
ethics, culture, religion, anthropology and physi- 
ology. In addition, Maslow has ambitiously 
attempted to set up a new Rorschach system 
which would throw light on the topics under 
discussion and establish new uses for the Ror- 
schach test as a future tool of social, cultural 
and political significance. Although it would 
appear that Maslow is in many respects an origi- 
nal and independent thinker, it is doubtful if 
this book will make its way as a guide for stu- 
dents of human nature in general or for Ror- 
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schach students in particular. To be certain, 
Maslow possesses a vast stock of information, 
a fact which has apparently misled him into 
covering too much ground. His statements tend 
to be too generalized and he is inclined to “over- 
interpret” his Rorschach material. * What is 
most lacking in Maslow’s work is an empirical 
backlog of material to substantiate the many 
ideas which are proposed and developed. Mas- 
low’s presentation is subjective and speculative 
and is in many places contrary to established 
psychological or psychoanalytical concepts. * 
Another way in which Maslow stretches his 
point beyond the accepted limits is in the way 
he attempts to set up a Rorschach typology by 
picking out certain elements of a Rorschach 
record as representative of a type or of a group. 
* In interpreting the single features of a Ror- 
schach record, it is necessary to remain within 
the framework of the total structure. Although 
the author himself stresses this point, he has 
often ignored his own teaching. The author's 
unorthodox use of the Rorschach symbols de- 
tracts greatly from the general merit and appli- 
cation of the book. It is true that....different 
schools are....using varying systems. While Mas- 
low is thus perhaps justified in introducing his 
own symbols, he fails to prove in what way his 
system is preferable to the old ones. We feel 
that if the imaginative and creative faculties of 
this author were applied to certain more spe- 
cialized areas of research and were backed by 
careful clinical experimentation, the results 
might afford valuable contributions to our 
knowledge of human nature and the use of the 
Rorschach test. We see the value of the book 
mainly in the way it may stimulate the reader 
to select some of the author’s ideas as starting 
points for further investigation. 

J Consult Psychol 11:155 My-Je '47. These 
volumes represent an attempt to explain all 
human characteristics and activities in terms 
of the Rorschach pattern. Problems of eco- 
nomics, war, fascism, politics and many other 
fields are subjected to a confused and usually 
inaccurate Rorschach appraisal. 

[87] 
*[Re Rorschach.] Mastow, PAUL. Rorschach Psy- 
chology. Brooklyn 2, N. Y.: the Author, 16 Court St., 
1945. Pp. ii, 149. Paper, mimeographed. For latest edi- 
tion, see 86, (P.4 19:3409) 


Am J Psychol 59:330 Ap "46. D. R. Miller. 
* covers four topics. Present methods of in- 
terpreting the Rorschach test are summarized ; 


new additions are made to the interpretation of 
scoring symbols; a number of new scoring 
symbols are proposed, and an attempt is made 
to relate interpretation to social psychology. 
The summary of present methods of interpre- 
tation contains numerous misconceptions and 
careless generalizations. The attempt to relate 
interpretation to social psychology consists in 
the discussion of some strongly environmental- 
istic theories with occasionally interspersed 
‘Rorschach’ terms; there is almost no attempt 
to relate these theories to the mechanics of in- 
terpreting the test. The proposed additions to 
interpretation are often so vaguely and incau- 
tiously phrased (the extravert “lives for and 
with the body, has the physical courage of 
might and the proud”) that it would be almost 
impossible to apply them clinically. The pro- 
posed interpretations and scoring symbols have 
not been empirically obtained nor have they 
been objectively validated. The primary evi- 
dence offered in their support is a group of 
miscellaneous references in the fields of philos- 
ophy, art, literature, cinema, and theoretical 
and clinical psychology. Typical of the care 
with which many of these references have been 
chosen is the citation of Jung in the develop- 
ment of the author's conception of introversion- 
extraversion despite Rorschach's statement that 
the clinical picture of introversion obtained by 
the test has nothing in common with Jung’s 
conception except the name. The carelessness 
with which the book has been organized, edited 
and published places many obstacles in the path 
of the reader. The book does not contain a 
statement of purposes and organization, nor 
does it contain an index. The grammar is often 
clumsy and occasionally incorrect. Finally, 
every two-pages average about one error in 
spelling or typing. 

Nerv Child 5:276 Jl '46. This is the most 
thorough attempt so far at a philosophical and 
psychological evaluation of the Rorschach test. 
However, the author has failed to finally clarify 
his thoughts and reduce them to definite con- 
cepts. The manuscript therefore is more like 
an essay than a scientific treatise. 

Psychiatry 8:517 N ’45. Milton H. Erickson. 
In commenting on the author's book, Ror- 
schach Theory, I suggested that the author 
secure an elementary textbook on the Ror- 
schach Test and a competent instructor so that 
he might acquire some knowledge of the sub- 
ject. Now he comes forth with a volume en- 
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titled, Rorschach Psychology, which is an even 
more pitiful demonstration of his tremendous 
desire to communicate something and his utter 
incompetency, at least in this field. The book is 
essentially a conglomeration of words, some 
new, some old, some especially adapted, ar- 
ranged in phrases and sentences without regard 
for spelling, grammar, coherence, or relevancy, 
and expressing most often vague, unformulated 
ideas, sometimes related to the Rorschach 
Test. * 

Psychoanalytic Q 15:387-90 Jl '46. William 
F. Murphy. * This manual is a perplexing and 
callow attempt to present the Rorschach Test 
as an all-revealing expression of the individ- 
ual’s personality to be used not only as the 
basis for a complete character analysis, but for 
psychotherapy, as a philosophy and even for 
social reform! It is exceedingly difficult to as- 
certain how much of the content of this book 
is the thought of the author and how much has 
been torn from the context of the one hundred 
and four items listed in the bibliography rang- 
ing from Freud's Collected Works to Ernie 
Pyle's Brave Men. The author begins by stat- 
ing that the Rorschach test makes possible a 
quick and accurate insight into the total person- 
ality structure of the individual, and then goes 
on to say, "When we are quite sure of the 
Subject's personality then we can make the 
proper recommendations with the hope that 
correct comprehension will be followed by 
therapeutic success". If only it were so! In the 
discussion of personality types, which he di- 
vides into intraverts and extraverts according 
to the amount, type and ratio of Movement to 
Color responses, Maslow confuses the Inner 
Living of Rorschach with the Introversion of 
Jung and írequently becomes involved in a 
vague, discursive and naive discussion of char- 
acter traits. * Such confused thinking has not 
been helped by a liberal use of Rorschach sym- 
bols not ordinarily utilized. * At times the 
characteristics attached to the different types of 
responses are reminiscent of the fortunetelling 
cards that drop from the pennyweight ma- 
chines. * Rationality, Anxiety, Emotionality, 
the Conscious, Unconscious and Foreconscious, 
Anal and Oral Complexes are all discussed 
with vague wordiness and superficial under- 
standing. The most interesting chapters deal 
with the author's Social Psychology. After dis- 
cussing shading and perspective responses in- 
dicative of sympathy and empathy he opines 
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that these qualities may be based on mental 
telepathy. * In another chapter we are intro- 
duced to "The Rorschach" who is apparently a 
combination of priest, analyst and social re- 
former. * As a therapist the "Rorschach," 
unlike the psychoanalyst who gives the patient 
"only the opportunity of seeing how bad he is," 
will “coolly reconnoiter the field before giving 
advice and then act quickly to restore a meas- 
ure of objectivity by reducing mounting fright- 
ening ideas to their proper size.” Similar 
absurdities can be found on practically any 
page. In conclusion, this book is useless for the 
student of Rorschach and embarrassing to 
those interested in the development of this 
excellent projection test. It is poorly printed in 
mimeograph form. Errors in spelling are nu- 
merous and pages are occasionally duplicated. 


[88] 
*[Re Rorschach.] Mastow, PauL. Rorschach The- 
ory. Brooklyn 2, N. Y.: the Author, 16 Court St., 1944. 
Podi; 89. Paper, mimeographed. For latest edition, 
see 86. 


Psychiatry 8:257 My '45. Milton H. Erick- 
son. This book purports to deal with the 
Rorschach Theory. Actually, however, it is 
essentially a disorderly, ungrammatical, unin- 
formative, often irrelevant collection of poly- 
syllabic phrases that demonstrates the author's 
need to study, under competent supervision, 
some elementary exposition of the Rorschach 
Test. 

[89] 

*[Re Rorschach.] Muxnoz, Ruru LEARNED. Predic- 
tion of the Adjustment and Academic Perform- 
ance of College Students by a Modification of the 
Rorschach Method. Foreword by Gardner Murphy. 
Applied Psychology Monographs of the American As- 
sociation for Applied Psychology, No.7. Stanford Uni- 
versity, Calif.: Stanford University Press, 1945. Pp. 
104. Paper. $1.25. * (PA 20:918) 


Am J Orthopsychiatry 16:734 O '46. Marion 
Cowin. To find a carefully organized report of 
a research project maintaining a genuinely 
human and dynamic emphasis is a source of 
satisfaction to students of human behavior. * 
For years Rorschach workers have been de- 
bating the question of the extent to which the 
Rorschach should be standardized. Some have 
urged intensive efforts in this direction in the 
interests of objectivity, others have resisted too 
much statistical treatment of the data lest the 
essentially dynamic quality of the individual 
record be lost. Dr. Munroe’s method to some 
extent reconciles these points of view. She does 
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use a standardized check-list and obtains a nu- 
merical rating, but her appreciation of the 
compensatory processes in individual behavior, 
her recognition of their important Rorschach 
equivalents, and her ingenious provision for 
them in the check-list reduce the danger of a 
mechanical or stereotyped concept of adjust- 
ment. * Appendices give detailed explanations 
of the check-list and instructions for use. The 
author believes that it may be used satisfac- 
torily by persons with less background and 
experience than is usually required for Ror- 
schach workers, though a good knowledge of 
scoring is essential. Whether it would prove 
true on protocols obtained from a less intelli- 
gent and cooperative group than at Sarah Law- 
rence is not certain, nor is it certain whether 
in this case the method itself could be applied 
with any such degree of success. Further ex- 
perimentation in this direction would be of 
interest. 

Am J Psychol 59:315-8 Ap '46. Donald E. 
Super. The history of the Rorschach 'psycho- 
diagnostic technique' seems to parallel that of 
a successfully treated aggressive child, in that 
it first refused to conform to accepted stand- 
ards (by becoming scientific and objective) 
while rejected by its family, but, having finally 
won a degree of acceptance and security, it is 
now proceeding to behave in a socially ap- 
proved manner (by using quantitative tech- 
niques and scientific controls), apparently of 
its own volition * a slim but satisfying mono- 
graph which bears the earmarks of scientific 
endeavor. * As Munroe points out..she has 
'demonstrated' that the Rorschach can be used 
as an ordinary psychological test, with a stand- 
ard scoring technique requiring no more train- 
ing than the Binet. That this was an unexpected 
outcome of the experiments makes it none the 
less significant, and perhaps somewhat more 
palatable to orthodox Rorschachers. * This re- 
view proposes to summarize some of the ex- 
perimental procedures, results and conclusions, 
pointing out certain shortcomings and certain 
excellencies, Thirty-five freshmen were first 
tested with the standard technique, ‘blind’ per- 
sonality descriptions were written and were 
then submitted, in groups of five and with a 
random list of the students’ names to selected 
teachers for matching. The complete absence 
of matching errors demonstrates the reliability 
and validity of the technique when used by 
expert examiners and teachers who know their 


students well. When the experiment was twice 
repeated, using the Inspection (briefer) Tech- 
nique and more teachers, the results the first 
time yielded a p of less than .or, and, on the 
second repetition, “were not quite so good, 
though still clearly significant” (p. 15). Mun- 
roe has given her conclusion but has withheld 
the evidence upon which it is based, a type of 
error too frequently committed by clinicians. 
The reader, not knowing the probability of the 
last experiment, cannot judge for himself how 
much less good it was. Lack of sufficient care 
in presenting results is manifest again in the 
next paragraph, in which the author refers to 
the “1941” experiment, but does not tell the 
reader whether this is the first or second ex- 
periment with the Inspection Technique. * The 
occasional looseness of her experimental meth- 
ods typified by....variations of procedure is 
subject to criticism, but they may have been 
the unavoidable concomitants of research in a 
practical situation (we are not told) ; that these 
variations in procedure are at other times 
glossed over or forgotten (e.g. Summary, p. 
37.) typifies occasional careless reporting, al- 
though this study is on the whole a good exam- 
ple of experimenting and reporting, especially 
when compared to most Rorschach work. Un- 
scientific methods and errors of the types too 
frequently committed by clinicians without 
experimental orientation occur in a few other 
places. * 

J Appl Psychol 30:660-2 D '46. Paul E. 
Meehl. One of the facts about the Rorschach 
literature which is disturbing to those Amer- 
ican psychologists who are interested but 
skeptical is the pitiably small number of studies 
which can really be called “validation” studies 
in any respectable sense, buried among a great 
mass of investigations whose titles would sug- 
gest that they were systematic studies of valid- 
ity, but which turn out not to be so at all upon 
reading. It is, therefore, gratifying to come 
across this work of Dr. Munroe’s, which is a 
contribution both to college guidance and to the 
case for the Rorschach. She presents data from 
three successive freshman classes, totalling 348 
girls, at Sarah Lawrence College. Her aim was 
to determine the efficiency of the Rorschach in 
predicting “adjustment” and academic achieve- 
ment. * Because of the nature of the criteria 
available, data are not presented in correla- 
tional form, but are expressed in terms of con- 
tingency tables. In one respect this turns out to 
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be a decided advantage, since it brings out 
certain relationships which a Pearson r as 
ordinarily employed would possibly obscure. * 
The study is unfortunately marred by a few 
minor defects and deviations from.perfect con- 
trol which leave a way out for anyone who is 
adamantly skeptical about the Rorschach. For 
example, it would have been better had all of 
the tests been scored completely “blind,” with- 
out any opportunity for clinical impressions to 
be formed in face-to-face contact. The author 
states that the results on those cases she tested 
personally were not “significantly” better than 
the others. The third freshman class might well 
have been excluded to increase the purity of 
design, since their records were filed and avail- 
able to teachers before the year’s close. Here, 
however, Dr. Munroe states that the results for 
the third year were not significantly better than 
those of the first two. One would like to have 
the data separately analysed for the middle 
year, when the most rigid control was exer- 
cised. On the whole, however, the study could 
be a pretty good model for other Rorschachers, 
and it certainly is an important contribution to 
Rorschach literature. 


89a 
*[Re Rorschach.] haat gen NONE MARIA. 
Rorschach Scorigg Samples: Compiled From 
Various Sources for Private Circulation. [Revised 
Edition]. Norton, Mass.: the Author, Wheaton Col- 
lege, 1942. Pp. v, 197. Paper, mimeographed. $4.00, 
postpaid. * 

Rorschach Res Exch 7:184 O '43. Zygmunt 
A. Piotrowski. * contains...over two thousand 
Scored responses to the original Rorschach 
plates. The sample responses and their scores 
were taken from published case studies in Eng- 
lish and other languages. The responses in for- 
eign languages have been translated into Eng- 
lish, and the English scoring symbols are used 
in the book. On each page there is an outline 
reproduction of the particular Rorschach card 
with the part to which the interpretations listed 
below refer, conspicuously marked. Sometimes 
there is doubt concerning scoring ; the differen- 
tiation between F+ and F— responses is par- 
ticularly troublesome in some cases. If the sub- 
ject did not cooperate sufficiently to enable the 
examiner to decide about the quality of some 
. of his form responses, the scoring samples by 
Dr. Rickers will assist in scoring and they have 
been collected for this purpose. A valid as well 
as an entirely objective method of distinguish- 


ing F+ from F— responses in the Rorschach 
cards still remains a problem to be solved. 
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*[Re Rorschach.] Rorscuacu, Hermann. Psycho- 
diagnostics: A Diagnostic Test Based on Percep- 
tion: Including Rorschach’s paper, “The Applica- 
tion of the Form Perception Test” (Published 
thumously by Dr. Emil Oberholzer), Second 
dition. Translation and English edition by Paul 
Lemkau and Bernard Kronenberg, Edited by W. Mor- 
genthaler. Berne, Switzerland: Verlag Hans Huber, 
1942. Pp. 238. (New York: Grune & Stratton, Inc. 

$5.75.) * (PA 16:4443, title only) 


J Am Med Assn 120:1076 N 28 '42. * The 
translators found the task of adequate transla- 
tion exceedingly difficult, but it would appear 
that they have done an excellent piece of work. 
The Rorschach test was devised and organized 
by Hermann Rorschach, who was born in Zu- 
rich, Switzerland, in 1884 and died in 1922. 
His early death was a great loss to psychiatry 
in general and to Swiss psychiatry in particu- 
lar. He was an extremely versatile man with a 
sound character and a practical grasp of scien- 
tific knowledge together with an inexhaustible 
interest in research. The Rorschach test, often 
referred to as the “ink blot test,” is being used 
more and more for the differential diagnosis of 
personality problems. When given and evalu- 
ated by one who possesses a thorough knowl- 
edge of Rorschach's method and technic it 
offers an excellent guide relative to the prog- 
nosis and choice of treatment to be employed 
when some form of psychotherapy is indicated. 
The translation is intended primarily for stu- 
dents of the Rorschach Technic, but the book 
should prove of interest to all students of 
human behavior. * Anyone who is at all inter- 
ested in attempts to measure or evaluate prob- 
lems of human personality structure by means 
of academic laboratory methods is advised to 
study Rorschach's basic work very carefully. 
They seem to offer greater possibilities for ac- 
curate evaluation than any other method or 
methods that have been presented up to the 
present time. Improvements in the technic itself 
and evaluation of the material collected are 
constantly being worked out by Rorschach's 
followers. 

J Consult Psychol 7:132-3 Mr-Ap "45. 
Bruno Klopfer. 'The English translation of this 
book, appearing twenty-one years after Her- 
mann Rorschach first published his Psycho- 
diagnostik in Switzerland, is the first author- 
ized English translation. * Lemkau, one of 
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Adolph Meyer's co-workers in Baltimore... 
should be congratulated at his admirable at- 
tempt "to present his (Rorschach) ideas ade- 
quately and at the same time preserve as much 
as possible of the personality of Rorschach as 
it is revealed in his choice of words, his sen- 
tence structure and his delicate shades of 
meanings." * the sometimes confusing double 
meaning of the word "interpretation" referring 
to the Rorschach responses of the subject as 
well as to the interpretation of these responses 
by the examiner might have been avoided. * 
This translation is much more faithful and 
adequate than previous attempts. * Rorschach's 
...Psychodiagnostik....remains, even when eval- 
uated in the light of more than four hundred 
books and papers published since that time, the 
most outstanding theoretical and practical con- 
tribution to the "Rorschach method" of per- 
sonality diagnosis. Readers not thoroughly 
familiar with Rorschach work will be unable 
to comprehend the full importance and value 
of this book for the technical reason that it is 
anything but a manual. Even as an experimen- 
tal report it might appear strange to many 
American trained psychologists and psychia- 
trists. Instead of statistically reliable data, ar- 
ranged in the usual manner, they find a whole 
book full of rough quantitative estimates, hy- 
pothetical suggestions and theoretical hunches, 
interwoven with a most original and dynamic 
psychology of personality. On the other hand, 
twenty-one years of routine research have 
proven more than ninety per cent of his as- 
sumptions correct and even many of his 
hunches which seemed wrong at first glance 
proved to be pointing in the right direction on 
further scrutiny. * Taking for granted that the 
Psychodiagnostics are not meant to serve as a 
manual, one need not be concerned about such 
lack of the usual equipment for such books as 
an index. The third and fourth German edi- 
tions contain valuable survey tables by Ewald 
Bohm, which substitute to some extent, for an 
index. They have not been included in the 
translation. The only new addition is a “Bibli- 
ography of the Important Contributions on the 
Rorschach Method." One-fourth of the 243 
titles listed refer to non-Rorschach papers, 
dealing with related medical or psychological 
subjects or with similar methods like the “cloud 
picture test.” Among the Rorschach listings 
are only 26 titles not contained in recent Amer- 
ican Rorschach bibliographies. * On the other 
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hand, all the Rorschach titles listed comprise 
just about half the titles to be found in other 
Rorschach bibliographies....covering about the 
same period and there doesn't seem [to be] 
any systematic selection. Rorschach's first re- 
port about a new kind of personality diagnosis 
is naturally most valuable as a historic docu- 
ment but most of his intuitive assumptions 
have proven so adequate that the Psychodiag- 
nostics will point the way for research in the 
field of personality diagnosis for many years to 
come. 

Psychiatric Q 16:598-602 J1'42. * Hermann 
Rorschach published his original work “Psy- 
chodiagnostik” ("Psychodiagnostics") in 1921. 
The following year, his further important con- 
tribution on the subject was published by his 
friend and colleague, Emil Oberholzer, the 
psychoanalyst, shortly after Rorschach’s own 
untimely death, This extraordinary “blind” 
Rorschach diagnosis of one of Oberholzer’s 
analysands is included in the present English 
translation of the “Psychodiagnostik” under 
the title of “The Application of the Form 
Interpretation Test.” (“To Psychoanalysis” 
might well have been added.) Despite a vol- 
uminous and still rapidly growing literature 
compiled by a large number of able and en- 
thusiastic scientific workers in the past two 
decades, the original research of Rorschach 
remains by far the most important single con- 
tribution ever made to the understanding and 
application of the diagnostic method he devised 
—as fundamental to its theory and practice as 
the “Interpretation of Dreams” and the “In- 
troductory Lectures” are to psychoanalysis. 
The translation by Drs. Lemkau and Kronen- 
berg is the first authorized one and the first to 
be made generally available in English to stu- 
dents and Rorschach workers. For understand- 
ing of the development of the method, of the 
dynamic psychology involved in its operation, 
of the underlying concepts which account for 
its empiric validity, this work of the originator 
would still appear to be essential, Rorschach, 
in the “Psychodiagnostics,” sets forth explicitly 
the conditions for conducting and interpreting 
his test; but his book is less a manual of in- 
structions than a careful report of the aston- 
ishing results of an experiment; and the 
volume could not well be used for modern 
training, in view of the advances in application 
of the method which have taken place since 
Rorschach’s death. What Rorschach reported 
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and what the work here translated concerns is 
the application and theory of a procedure 
which must have seemed to many of his col- 
leagues—as it does to many present-day psy- 
chiatrists—too absurdly simple for serious 
attention, let alone scientific usage. One may 
observe in Rorschach’s text his extensive 
knowledge of psychoanalysis and his applica- 
tion of psychoanalytic principles in developing 
the theory of his inkblot method. Before he 
published *Psychodiagnostik," he had studied 
results of the test as applied to patients before, 
during and after analysis. Here is a dynamic 
background for the method which is nowhere 
made clearer than in Rorschach's own writings 
and which is, perhaps, not sufficiently appre- 
ciated. It may be interesting to speculate 
whether it was not his psychoanalytic under- 
standing which gave to Rorschach sufficient 
confidence to make public the results of his 
seemingly simple test. The publication is evi- 
dence of possession of insight which might 
justly be termed genius. It should not be over- 
looked that Rorschach, despite his youth—he 
was 37 when he died—was a psychiatrist of 
commanding reputation in Switzerland at the 
time he published his researches; and he must 
have known that he staked his repute on their 
scientific validity. The present long-overdue 
translation seems admirable. The volume in- 
cludes a brief biographical sketch of the author 
and the introduction to the second German edi- 
tion, both written by the editor * Typographical 
errors are comparatively few, considering that 
the volume was set by German-speaking com- 
positors; and those undetected in this will 
doubtless be remedied in future editions. The 
original symbols of Rorschach are translated 
into generally used English forms. Both stu- 
dents and clinical workers will be interested to 
note that Erlebnistyp, which has given trouble 
to most translators, is rendered here as "expe- 
rience type." The volume is eminently satisfac- 
tory and well calculated to take its place as a 
necessary part of every Rorschach worker's 
professional library. The translation will also 
perhaps bring home for the first time, to stu- 
dents and workers who are not familiar with 
German, Rorschach's repeated emphasis on the 
need for parallel series of figures to the plates 
used in “Psychodiagnostics” and all succeeding 
Rorschach work. Rorschach himself doubtless 
would have published such a series had he 
lived. It may or may not be coincidence that 


publication of the translation of Rorschach's 
original work comes shortly after publication, 
also by Hans Huber, of “Einfuerung in den 
Behn-Rorschach-Test" by Hans Zulliger. This 
is the long-awaited test of 10 parallel plates 
which differ from the original and may be used 
for control purposes. The German text, 232 
pages, is priced at $3.80, the new Zulliger 
plates at $3.75; the American distributors are 
Grune & Stratton. There is no announcement 
of an English translation: * 

U S Naval Med B 46:771-2 My '46. * This 
translation is a fitting tribute to a brilliant pio- 
neer. 
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*[Re Rorschach.] Hak dl Hans. Einfiihrung in 
den Behn-Rorschach Test. Berne, Switzerland: 
Verlag Hans Huber, 1941. Pp. 232. (New York: Grune 
& Stratton, Inc, 1942. $4.00. Accompanying plates, 
$6.00.) (PA 15:5232) 

Psychiatric Q 17:715-6 O '43. When Ror- 
schach published “Psychodiagnostik,” he al- 
ready felt the need of another series of inkblots, 
which would parallel his own and would serve 
as a control series for reexamination, Behn- 
Eschenburg developed, with the aid of Ror- 
schach himself, such a’ series in 1920. He used 
these blots in an experiment with school chil- 
dren and published his results in 1921 ; but the 
new series was not made available to the public 
until 1941. (H. Huber, Bern). In his "Ein- 
fiihrung,” Zulliger offers an excellent introduc- 
tion to the Rorschach method, and specifically 
to the use of the new series of blots. Twenty- 
two pages of his book are given over to a 
presentation of some 500 sample interpretations 
of the new inkblots and include a description 
of the “normal details,” without which a record 
could not be scored. More than half of the 
book consists of scored and interpreted Ror- 
schach records obtained from 25 subjects. Both 
the original Rorschach plates and the new 
Behn-Rorschach plates were shown to most of 
these subjects. Zulliger suggests the abbreviated 
form, Bero, be used to designate the latter 
series. Zulliger proceeds on the assumption that 
there is an extremely close correspondence be- 
tween the Rorschach and the Bero series and 
that consequently the same number of whole, 
detail, movement, color, form, etc., interpreta- 
tions can be expected from both series, His 
own examples, however, frequently show a 
marked difference between the Rorschach and 
the Bero records, Invariably, Zulliger ascribes 
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these differences to changes in the psychologi- 
cal condition of the subjects. This procedure 
does not appear justified, because Zulliger does 
not provide experimental evidence in support 
of his assumption that the two series of blots 
correspond perfectly. This observation, how- 
ever, is not a very serious objection since the 
present norms appear to be satisfactory for 
most cases and since the correction of norms 
can be made relatively easily by anyone work- 
ing regularly with the Rorschach and the Bero 
plates. Everything Zulliger writes about the 
administration and interpretation of records is 
equally applicable to the Rorschach and the 
Bero plates. His book is modelled after Ror- 
schach’s "Psychodiagnostics." Rorschach was 
very concise. Zulliger has succeeded in express- 
ing explicitly many important ideas which 
Rorschach presented in a condensed and im- 
plicit form. Another significant contribution is 
the author's application of the Rorschach 
method to character analysis for educational 
and vocational purposes. Zulliger is known to 
have been very successful in using the Ror- 
schach method as an aid in the solution of 
problems such as: “Why does John steal?" 
"Should Edward be placed in a reformatory or 
should he be given another chance?” “For what 
vocation is Mary best fitted?” “Is the testimony 
of Miss Smith trustworthy?” Zulliger has been 
able to answer these difficult questions well, 
not only because he has mastered the Ror- 
schach technique but also because he has very 
wide and deep psychological and pedagogical 
experience. Never were the Rorschach findings 
the only basis for his recommendations. It is 
Zulliger’s ability to link the Rorschach findings 
with sound psychological principles that ac- 
counts to a large degree for his success with 
the Rorschach method. His Rorschach record 
interpretations should be required reading for 
all Rorschach analysts. Zulliger is a Swiss edu- 
cator, well known for his application of psy- 
choanalysis to classroom situations. His articles 
on this subject appeared in the "American 
Journal of Orthopsychiatry." His previous 
contribution to Rorschach literature consists of 
a book on juvenile thieves and articles illus- 
trating the use of the Rorschach method as an 
aid in education and vocational guidance. The 
author refrains in the present, as in his previ- 
ous publications, from discussing the Ror- 
schach as an aid in psychiatric diagnosis and 
prognosis, but concentrates on character or 


personality analysis. The present book belongs 
to the best works on the Rorschach method. 
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*Schrammel-Gorbutt Personality Adjustment 
Scale. Grades 7-16 and adults; 1943; Form A; 90¢ per 
25, postpaid ; 15¢ per specimen set, postpaid; nontimed 
(20) minutes; H. E. Schrammel and Dorothy Gale 
Gorbutt; Bureau of Educational Measurements, Kan- 
sas State Teachers College of Emporia. 

Raleigh M. Drake, Professor of Psychology 
and Head of the Department, Kent State Uni- 
versity, Kent, Ohio. This is a self inventory 
consisting of 117 items of the yes-no type 
constructed to discover and measure the neu- 
rotic personality. It was standardized on high 
school and college freshmen populations, the 
norms being based on 602 high school students, 
grades not stated, and 349 college freshmen, It 
is presumed that the population included both 
boys and girls in about equal proportions. 

The authors report a correlation of .88 with 
Thurstone’s Personality Schedule, Since it con- 
tains only 117 items as against 223 for the 
Thurstone test, its chief merit appears to be 
that it measures the same thing as this test but 
with an economy of almost half the items. The 
reviewer correlated the Schrammel-Gorbutt 
scale with the C-factor (cycloid disposition) in 
Guilford's An Inventory of Factors STDCR 
and obtained a correlation of .46 for thirty 
sophomore women in college. Author-reported 
reliability by the split-half method (corrected ?) 
is given as .9o which seems satisfactory for so 
short an inventory. 

It suffers, along with all other yes-no type 
inventories, in objectivity of representing life- 
like situations by forcing the testee to choose 
an all-or-none answer to a question. Degrees of 
response are provided in some cases by includ- 
ing the words “often” or “usually” in such 
questions as “Do you often feel you are some- 
one else rather than yourself ?" but many ques- 
tions are categorically stated, e.g., “Can you 
listen to a lecture without fidgeting?” 

Students taking the inventory complain that 
many of the questions are not realistic to them, 
that they cannot answer out of actual past ex- 
perience and therefore must guess at what they 
would do under the circumstances. Such ques- 
tions as “Do you think that you would be 
afraid to dive from a springboard?” is hard to 
answer if one has never tried diving. Others 
are ambiguous, such as, “Have you ever bitten 
your finger nails continually over a period of 


93 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 192 


time?" or “When you are eating your meals, 
do you usually like to have someone eat with 
you?" One can hardly expect an objective an- 
swer from "Do you believe that you are self- 
ish?" This inventory may not sin any more 
than similar inventories in this regard, but it 
is no better in this respect. 


Nelson G. Hanawalt, Associate Professor of 
Psychology, New Jersey College for Women, 
Rutgers University, New Brunswick, New Jer- 
sey. This is just another test for neurotic 
tendencies. The reviewer can think of no rea- 
son for its publication or use. Although the 
reported reliability is as good as other tests of 
its kind, no attempt has been made to establish 
its validity beyond correlating it with Thur- 
stone's Personality Schedule and testing the 
items for internal consistency. No test was 
made to determine whether or not it would 
differentiate between known adjusted and mal- 
adjusted groups of students. Yet the authors 
state that “because the interpretations are thus 
based on the distribution of over nine hundred 
representative cases, they possess a reasonable 
degree of validity and significance." This is 
quite misleading. The number of cases upon 
which the distribution of scores is based has 
nothing to do with its validity and significance 
as far as neurotic tendencies are concerned, If 
the test does not have these qualities to begin 
with, no amount of giving it will supply them. 

The authors give the impression that a score 
made by the individual student on this test is 
as valid as a score made on an achievement test 
and that it can be of the same use to the stu- 
dent if he knows where he stands. They suggest 
that the mere knowledge that a student has a 
neurotic score is a valuable means of therapy 
and that he can improve his adjustment by this 
knowledge alone. Certainly no student should 
be given his rating on this test, nor on any 
other of its kind, unless it is supplemented by 
an adequate explanation of its meaning, limita- 
tions, etc.; and this is especially true for those 
in the lower brackets where the authors say 
that information of their standing will do the 
most good. The authors make a statement con- 
cerning the reliability of the individual score 
on the test. The assumption is made that this is 
a statement of the reliability of the adjustment 
of the individual, which is a different thing al- 
together from the reliability of the test score. 
No evidence of any kind is given of the relia- 


bility of the adjustment score. Adjustment 
Scores are given in terms of percentiles. The 
statements concerning degree of adjustment 
are made upon a statistical basis. No empirical 
test of the adjustment of students with the 
various scores is made. High school students 
and college freshmen are shown to have quite 
different distributions of scores, and yet they 
are combined in one distribution for the pur- 
pose of a verbal statement of adjustment. 

The test is marked in terms of plus and 
minus. This is a great disadvantage on many of 
the items because it requires that natural yes- 
no questions be translated to plus and minus. 
It also allows for no question-mark answers. 
Just what effect forcing a true-false answer to 
all of the questions has on the results is un- 
known. 

The only excuse for publishing another test 
of neurotic tendency in this day and age is in- 
creased validity over other tests in the field. 
This test is grossly lacking in this respect. Tt is 
unfortunate that the authors recommend its 
widespread use with or without the guidance of 
experienced and competent workers in the 
field. Such testing programs can only bring dis- 
credit upon the testing movement. If one has 
to test neurotic tendencies, it would be wise to 
stick to tests like Bernreuter's Personality In- 
ventory and Bell's Adjustment Inventory 
which have a long list of research results avail- 
able to assist in their interpretation. 


[93] 
*Selective Vocabulary Test. Ages 13 and over ; 1944; 
a disguised test of masculinity-femininity; 1 form; 
35. 2d. per 25; 4d. per single copy; 3s. per manual; non- 
timed (15-30) minutes ; Patrick Slater; George G. Har- 
rap & Co. Ltd. 
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Jack W. Dunlap, Dunlap, Morris, and Associ- 
ates, New York, New York. This test is designed 
to analyze the interests of individuals in such a 
manner as to give a measure of the masculinity- 
femininity of the subject. The test consists of 
80 words, 40 of which are primarily familiar 
to boys and 40 of which are basically more 
familiar to girls than to boys. The subjects are 
required to write out a definition of each word. 
In scoring the test the examiner must consider 
whether the definition is sufficient and whether 
it is true. Correct answers are often falsified 
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by unnecessary details, and sometimes these 
details reveal genuine misunderstandings of the 
meaning of a word, Therefore, answers must 
be both sufficient and true. 

When the subject's answer sheet has been 
corrected, two scores are obtained, one for mas- 
culinity and one for femininity. For a boy it is 
necessary to determine whether he knows more 
or less feminine words than is usual for a boy 
of his age. In interpreting the results it is nec- 
essary to use both the masculine score and the 
feminine score in order to account for differ- 
ences of vocabulary levels. The author has 
worked out detailed charts for evaluating the 
masculinity-femininity of individuals in vari- 
ous age groups. 

This test is an ingenious method for measur- 
ing the masculinity-femininity and it seems to 
have been based on sound experimental work. 
However, like all such new instruments, fur- 
ther research is needed in order to establish 
more completely the reliability and the validity 
of the instrument. For anyone interested in 
measurements in this field it would be wise to 
a carefully the Selective Vocabulary 
Test. 


Starke R. Hathaway, Professor and Clinical 
Psychologist, Department of Psychiatry and 
Neurology, University of Minnesota, Minne- 
apolis, Minnesota. Although this little test has 
been made available without adequate experi- 
mental background or clinical study, it is prob- 
ably a good measure of male-female differenti- 
ation among school-age children in the cultural 
environment of that portion of England where 
it was developed. The subject is asked to write 
a short definition of each of 80 words and his 
definitions are scored correct or incorrect ac- 
cording to examples provided by the author. 
The vocabulary words that are used include 
such definitely English words as "scrum" and 
“googly” which illustrate the cultural factors 
involved. 

The 80 words were chosen from among an 
original list of 365. Forty are characteristically 
defined correctly more frequently by males and 
40 by females. The interpretation is based upon 
the relative number of feminine words cor- 
rectly defined as contrasted to the number of 
masculine words. 

Tt seems unfortunate to this reviewer that 
among tlie many ways in which we may differ- 
entiate male and female the author chose the 


subjective definition of words that in many 
cases are so obviously derived from a local 
culture. The departure from a simpler, more 
objective item would seem to be justified only 
on the basis that a particular variable could 
not be identified otherwise. No evidence is pre- 
sented that this particular test variable is pecu- 
liar to this method of measurement. 

It should be emphasized in all tests of this 
type in which the items have been derived from 
male-female differentiating items that, without 
experimental extension, the variable involved 
is no more than the extension of those differ- 
ences into the interest field. The author pro- 
vides no data on validity beyond the fact that 
the items correlate with sex. 

The author is suitably cautious in warning 
users of the test against labeling subjects as 
excessively masculine or feminine. He empha- 
sizes that extreme scores should merely indi- 
cate that “, . . a problem exists, for which some 
explanation should be sought. Abnormal sexual 
development is only one possible explanation." 
In another place he is not so clear on the point. 
He says, "We do not want to differentiate 
members of different sexes, but members of 
the same sex. This the test will do. It reveals 
abnormalities in the bias of an individual's 
interest towards subjects characteristic of one 
sex or the other: excessive masculinity and ex- 
cessive femininity can be effectively differen- 
tiated in either sex.” In the context it is clear 
that the author means only to say that the two 
sexes can each be sorted among themselves by 
scores on the test, but the use of the words 
“abnormalities” and “excessive” in the above 
sentences is clearly suggestive of something 
more. 


[94] 
XSense of Humor Test, Second Edition. Ages 16 
and over; 1939-43; 1 form; $4 per 10; per single 
test; $1.50 per set of scoring sheets; 60) minutes; 
A. A. Roback; Sci-Art Publishers. 


[ 95] 
XShipley-Institute of Living Scale for Measuring 
Intellectual Impairment, Adults ; 1940-46 ; formerly 
entitled Shipley-Hartford Retreat Scale for M easuring 
Intellectual Impairment; 1046 copyright changed in 
title only; 1 form; limited distribution gratis; 20(25) 
minutes; Walter C. Shipley ; Institute of Living, Hart- 
ford, Connecticut. 
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E. J. G. Bradford, Senior Lecturer in Edu- 
cation, The University of Sheffield, Sheffield, 
England. This pair of tests, devised originally 
to measure the deterioration of intellectual 
power resulting from mental illness, can be 
usefully employed in estimating the ability of 
persons in normal health. The American stand- 
ardisation fits the British population fairly 
closely if the total score of the Abstraction 


Test, without the multiplication by two, is 
paired with the shortened Vocabulary Test 
recommended (by the reviewer) for use in 
Britain (14). 

The Vocabulary Test differentiates well 
down to the mental age of eleven years, and 
the same can be said but with less confidence 
about the Abstraction Test. Since the ceiling 
for both tests would be 20 points, it is doubtful 
if scores above 18 have much differential sig- 
nificance. In a battery of seven tests, including 
reading, repetition of sentences, and perform- 
ance tests, the Shipley tests work very well, 
but by themselves the difference in score be- 
tween them would be very shaky evidence on 
which to base a verdict of mental illness. A 
difference between the reading test and the rep- 
etition test is as reliable and usually confirma- 
tory, while in cases of neurosis the difference 
between reading or vocabulary and perform- 
ance scores may be more suggestive of mal- 
adjustment. Thus, a random sample of five 
diabetic cases gave a mean Vocabulary— 
Abstraction difference of 1 year; five gastric 
cases, 4 years; 8 attempted suicides, 4 years. 
The corresponding Reading-Repetition differ- 
ences were 1⁄4 year, 2 years, and 2% years. For 
the Reading-Performance differences, 114 
years, 214 years, and 434 years were obtained 
respectively. There is no doubt that age and 
sex play a part in producing these differences, 
for proneness to these illnesses varies with the 
age and sex of the person, just as the intelli- 
gence of the patient seems to be related to the 
type of syndrome which his maladjustment ex- 
hibits, whether it be obsessional, psychopath, 
anxiety, or hysteria. 

Experience indicates that a reading (me- 
chanical) test should, at a clinical interview, 
precede the vocabulary test, because a reading 
age of less than 11 years invalidates the vocab- 
ulary test as a test of intelligence. Indifferent 
or scanty schooling could through a weakness 
in reading result in vocabulary scores similar 
to those of a moron. 

Responses to the Abstraction Test (Series 
Test would be a better name) when given 
orally often show that a dependence on audi- 
tory clues, meaning clues, or visual clues re- 
sults in failures when clues of a different type 
are required. Full success at the test requires 
a flexibility of approach, the tríal of alterna- 
tive clues. This type of rigid or one-track mind 
is not disclosed in the self-administered use of 
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the test. A suggestion such as “Read it aloud" 
will sometimes enable a patient who has been 
vainly struggling for a visual clue to give a 
correct solution from an auditory clue. Would 
a failure of this type be due to the decay of 
intelligence or to a bias which might be de- 
scribed as an unwillingness or inability to ex- 
periment, a lack of flexibility in other words? 


William A, Hunt, Professor of Psychology, 
Northwestern University, Evanston, Illinois. 
This scale for measuring intellectual impairment 
or deficit is based upon the established tendency 
in such cases for a discrepancy to appear be- 
tween vocabulary level and the ability to handle 
abstract thinking. Vocabulary level is measured 
by a 40-word, multiple-choice vocabulary test 
with a 10-minute time limit. Abstract reasoning 
is measured by a 20-item abstraction test in 
which the subject is asked to grasp the relation- 
ship involved in a progressive series of items 
and complete the final missing member of the 
series. This also has a 10-minute time limit. 
Mental age norms for vocabulary, abstraction, 
and total score are available based on a norma- 
tive group of 1046 subjects on whom group in- 
telligence test scores were available. Reliabilities 
of 87, 89, and .92, respectively, were obtained 
for vocabulary, abstraction, and total score. In- 
tellectual impairment is expressed by a "Con- 
ceptual Quotient” or C.Q. based on the differ- 
ence between vocabulary level and the level of 
abstract reasoning. The clinical value of the 
scale was established on a group of 374 mental 
patients. 

Subsequent studies by other investigators here 
and abroad have confirmed the clinical value of 
the test as a measure of intellectual impairment 
within the usual limits of such tests. Correla- 
tions of .77 and .65 also have been reported be- 
tween the Shipley total score and the Wechsler- 
Bellevue scale. This indicates that the test may 
serve satisfactorily as a quick measure of intel- 
ligence. 

The reviewer has used the Shipley-Institute 
of Living Scale widely in both functional and 
organic mental disorders. In his experience it is 
as satisfactory as, if not more so than, the other 
available measures. Brevity and ease of admin- 
istration are definitely in its favor. The answers 
on the Abstraction Test also offer relatively rich 
material for clinical interpretation despite the 
pencil-and-paper nature of the test. Some weak- 
ness develops at the lower ranges of intelligence, 


suggesting the need for further extension at 
this level. 


Margaret Ives, Psychologist, St. Elizabeths 
Hospital, Washington, D. C. This scale is in- 
tended to provide a sensitive but quick self- 
administering test which may be used by psy- 
chiatrists or even by general practitioners to aid 
in detecting early inefficient intellectual func- 
tioning. It is based upon the finding, first pub- 
lished by Babcock (1930) and now widely used, 
that, where there is intellectual decrement, 
yocabulary is usually least affected but that the 
capacity for conceptual thinking is among those 
abilities which decline rapidly. This concept, 
although it has proved highly useful, must be 
used with caution since frequently the vocabu- 
laries of chronic psychotics do vary in succes- 
sive retests even as much as 5 years of mental 
age. It is doubtful, therefore, whether those who 
are not clinical psychologists should be encour- 
aged to interpret results. 

The Shipley vocabulary test is arranged in 
multiple-choice form with one explanatory sam- 
ple. Additional explanation is allowed when 
necessary and although the 10-minute time limit 
is ample, the penalty for those whose thinking 
is unusually slow is minimized by allowing one 
point of credit for each four items not at- 
tempted. 

Each item of the Abstraction Test requires the 
subject to “induce a general principle and from 
it to deduce a specific answer.” There is no 
sample, but additional help is permitted includ- 
ing, at the examiner’s discretion, explanation of 
the first, easiest item which cannot then be cred- 
ited. Such a procedure seems to penalize those 
who are not well educated and test-wise, a dis- 
crimination which could be obviated by routine 
additional explanation including a sample. Also, 
although the time allowance is adequate, it is not 
nearly so generous as the vocabulary limit and 
there is no alleviation of the penalty for slow- 
ness. This tends to lower the conceptual quotient 
of those whose thinking has been slowed by their 
illness and introduces another factor in addition 
to impairment of ability to abstract. 

The test provides four ratings—first, a vocab- 
ulary age to measure the patient’s prepsychotic 
or original level; second, an abstraction age to 
measure the level of his abstract thinking; third, 
a total mental age which gives a rough estimate 
of the present functional level; and fourth, the 
“C.Q.” or “conceptual quotient,” which is the 
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ratio of the patient's abstraction age to that of 
the “normal” person receiving his vocabulary 
score. A “C.Q.” below 9o is suggestive of im- 
pairment; below 70 it indicates probable pa- 
thology. 

Standardization is on 1046 individuals, all stu- 
dents from the fourth grade up through college 
who had had standardized group intelligence 
tests. One would expect such a group to set the 
norms somewhat higher than an unselected sam- 
ple and at this hospital we find that the Binet 
vocabulary levels tend to run higher than the 
Shipley vocabulary ages; e.g., in a group of 12 
paretics the median Binet vocabulary level was 
at 14 years as opposed to 11.7 on the Shipley. 
Since the test is not considered valid for those 
with a Shipley vocabulary age below 14 years, 
this excludes not only feebleminded and border- 
line individuals but the entire lower half of the 
general population, as measured by the Binet or 
Bellevue. 

Also the standardization group includes no 
older people and therefore no allowance is made 
for normal retrogression. This is important in 
any patient population, but perhaps not so sig- 
nificant for verbal abstract thinking in a better 
than average group as it might be for a differ- 
ent kind of test (e.g., nonverbal new learning) 
or for a less intelligent population. 

The results reported by Shipley (1941) show 
consistent differences in C.Q. between normals 
and mental hospital patients as well as among 
various diagnostic groups. The lowest C.Q.’s are 
reported for organics; next come the functional 
psychoses ; while psychopathic personalities, psy- 
choneurotics, and alcoholics fall within normal 
limits. The test does not claim to differentiate 
between functional and organic causes of decre- 
ment but unfortunately the interchangeable use 
of the words deterioration and impairment im- 
plies “irreversible diminution in intellectual ca- 
pacity” in both types of cases (7). 

This reviewer believes that wherever possible 
a complete psychological examination using a 
standard general intelligence test supplemented 
in doubtful cases by a Rorschach or other projec- 
tive technique is preferable to any brief exami- 
nation for special defect. However, for screening 
purposes or to supplement information from 
other tests, the Shipley scale provides valuable 
information regarding impairment in abstract 
thinking when restricted to the select group for 
which it is suited—above average in intelligence, 
reasonably well educated with no language hand- 


icap, test-sophisticated, not too disturbed to be 
cooperative, and preferably young. 


[ 96 ] 
Social Intelligence Test: George Washington Uni- 
versity Series. Grades 9-16 and adults; 1930; 1 form; 
$12 per 100; 20¢ per specimen set; 49(55) minutes; 
F. A. Moss, T. Hunt, and K. T. Omwake; Center for 
Psychological Service, George Washington University. 
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Glen U. Cleeton, Director, Division of Hu- 
manistic and Social Studies, Carnegie Institute 
of Technology, Pittsburgh, Pennsylvania, Test 
administration is divided into two parts. Four 
minutes is allowed for preliminary study of a 
sheet containing twelve photographs. The photo- 
graph sheets are then collected, and the subjects 
are instructed that they will have 45 minutes to 
complete the examination. Except for prelimi- 
nary study of photographs, the test is self-ad- 
ministering. 

Percentile norms for college freshmen and 
upperclassmen are reported along with quartile 
scores for high school students, college fresh- 
men, upper-class students, and college graduates. 
Quartile scores are also reported for industrial 
and commercial groups including administrative, 
secretarial, sales, engineering, clerical, and other 
office- and store-workers. A bibliography of 
twenty publications covering a period from 1920 
to 1940is includedin the manual. No reliability or 
validity coefficients are reported in the manual. 

Tests of social intelligence are open to two 
main criticisms, both of which apply to this test : 
(a) The results of such tests usually correlate 
highly with measures of abstract intelligence, in- 
dicating that evidence of existence of an inde- 
pendent factor of social intelligence has not been 
established, and (b) any measure or work sam- 


ple in which the person is permitted to record 
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the results of "thinking over" the thing to do in 
hypothetical situations rarely indicates how the 
person would respond in actual social situations. 


Howard R. Taylor, Professor of Psychology 
and Head of the Department, University of Ore- 
gon, Eugene, Oregon. This test is an attempt to 
make the distinction suggested by E. L. Thorn- 
dike in 1920 between abstract, mechanical, and 
social intelligence. The revised form (1931) does 
not differ materially in organization or assump- 
tions from the original test. The reliability of 
individual scores even in the limited range of col- 
lege sophomores is high enough to identify with 
reasonable accuracy those who are superior, aver- 
age and inferior in whatever the test measures. 
A retest with the same form gave a correlation 
of .89, and the estimated reliability coefficient 
based on self-correlation is .88. Unfortunately, 
however, as a means of differentiating between 
social and abstract intelligence the test has little 
or no validity. The results of two studies are 
fairly typical and conclusive. 

Stein (26) reported that the intercorrelations 
of the five subtests of the Social Intelligence Test 
were of the order of .34; those of the Mental 
Alertness Test constructed and sold, by the same 
publisher, as a measure of abstract (verbal) in- 
telligence were of the order of .39. But the in- 
tercorrelations of the mental alertness subtests 
with the social intelligence subtests were also of 
the order of .34. Thus these measures of two 
supposedly distinct kinds of intelligence have as 
much in common as the presumably similar sam- 
ples of each ability comprising its subtests. 

A factor analysis of the same set of intercor- 
relations (R. L. Thorndike, 1936) indicates that 
“the comprehension and use of words” accounts 
for most of whatever either test measures. The 
covariance of this general factor is nine times 
as great as that of a second factor which has 
small predominantly positive loadings with the 
subtests of the Mental Aleriness Test and equally 
slight negative loadings with those of the Social 
Intelligence Test. 

Clearly then, the test can only function as a 
somewhat disguised and to that extent imper- 
fect measure of verbal intelligence. The differ- 
entiations between occupational and educational 
groups provided by the norms should be attrib- 
uted primarily to selection in terms of such ca- 
pacity instead of to some unique aptitude for 
dealing with people or social situations in gen- 
eral. If the ability to understand others and meet 


social problems skillfully is not so specific to 
the individuals and situations involved as to be 
largely unmeasurable, some very different ap- 
proach to the isolation of the generalities un- 
derlying it would seem to be called for. 


For a review by Robert L. Thorndike, see 
40:1253. 


[971 
XSocial Personality Inventory for College Women. 
1942; 1 form; $1.50 per 25; 10¢ per specimen set; non- 
timed (20-30) minutes ; À. H. Maslow; Stanford Uni- 
versity Press. 
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Nelson G. Hanawalt, Associate Professor of - 
Psychology, New Jersey College for Women, 
Rutgers University, New Brunswick, New Jer- 
sey. This inventory is based upon sound research 
and is recommended, provided it is used with 
due caution and only as directed by the author 
in the manual. Its chief use at the present stage 
of development should be as a research tool, In 
the experienced hands of a clinical worker it 
should be of great assistance in gaining insight 
into and understanding of the individual per- 
sonality. The test measures one broad phase of 
personality, self-esteem or similar phases of per- 
sonality, and is not intended as a measure of the 
personality as a whole. The name is a disguise 
to hide the real purpose of the test since the 
author has found that knowledge of purpose can 
change the results to a considerable extent. The 
test is not a measure of adjustment. The same 
score made by two individuals might mean ad- 
justment for the one and maladjustment for the 
other. Adjustment depends upon the personality 
as a whole and, in the case of the amount of 
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self-esteem, especially upon the feeling of secu- 
rity. With these facts in mind it is quite evident 
that for use with the individual an expert knowl- 
edge of the dynamics of personality is required. 

The test is satisfactory from the point of re- 
liability. Its validity is better established than 
that of most other tests in the field of personal- 
ity. It has the advantage of being clinically de- 
rived and is validated against estimates made on 
the basis of careful interview by an expert in 
this phase of personality. It has stood the test 
of subsequent research by other investigators 
working in this field of personality. Neverthe- 
less, it should be used with caution when applied 
to the individual case, for, as the author sug- 
gests, serious mistakes can be made in one out 
of twenty cases unless the test results are veri- 
fied by personal interview. Another factor to bear 
in mind is the effect of culture upon the results. 
The test should be used only for college women 
unless research has established norms for other 
groups. Even among college women the average 
score varies from one school to another, from 
one locality to another, etc. Before it is used in 
a college clinic, it would probably prove a valu- 
able asset to have it standardized on the group 
of women selected by that particular college. In 
any case it should be interpreted in the light of 
the background of the individual student. 

Like other personality tests, the value of the 
test is dependent upon the research back of it. 
This test appears to measure something differ- 
ent from other personality tests. It is correlated 
to some extent with the Allports' A-S Reaction 
Study (r = .55) and with the self-sufficiency 
and dominance scales of Bernreuter’s Personal- 
ity Inventory (r = .43 and .37, respectively). 
These correlations are low enough to indicate 
that it is measuring something different from 
the other tests but high enough to substantiate 
the fact that it is not independent of them. The 
test is in its infancy. Self-esteem is an important 
aspect of personality. Future research will have 
to determine its ultimate usefulness and value. 


[98] 
Student Questionnaire. Grades 7-14; 1932-38; 1 
form; $9.90 per 100; 35¢ per specimen set; nontimed 
(40) minutes; Percival M. Symonds and Virginia Lee 
Block; Bureau of Publications, Teachers College, 
Columbia University. 


Simon H. Tulchin, Consulting Psychologist, 
30 East óoth St., New York 22, New York. 
This questionnaire consists of 100 items which 
attempt to obtain data on the feelings and atti- 


tudes of students toward the curriculum, social 
life of the school, the administration, the teach- 
ers, other pupils, home and family, and a mis- 
cellaneous group of personal evaluations. For 
each item, the student is offered a selection of 
five statements and is to select the one which best 
expresses his own feelings. For the most part 
the choice can be made in terms of always, often, 
sometimes, seldom, or never. Depending on the 
item, however, the statements may be worded in 
more qualitative terms. 

Two groups of high school seniors were used 
to establish the reliability of the test. The ques- 
tionnaire was given to one group twice at an in- 
terval of two weeks and resulted in a coefficient 
of correlation of .83. With another group of 
seniors a correlation of .72 was obtained when 
the questionnaire was repeated after one week, 
and a correlation of .71 when it was repeated 
after three months. The correlation between the 
second questionnaire and the one given three 
months later was .80. 

In order to establish validity the results of the 
questionnaire were checked against recorded 
statements of individual difficulties as well as 
check lists submitted by parents and teachers. 
Correlations of .72 and .70 were secured. 

On the question of norms the authors take the 
position that general norms are not helpful and 
that “judgment must be made in the light of 
the standards of adjustment that the home, the 
school, and the community are setting.” They 
feel that the adjustment of any particular group 
cannot be taken as “a valid basis for making an 
appraisal of the adjustment of any particular 
group of students in any other city.” They feel 
that an item-by-item analysis is more important 
than total scores and that serious maladjustment 
in one or two areas may be covered up if only 
general scores are considered. However, the au- 
thors do state that in a study of 5,000 pupils in 
various parts of the country those who showed 
good general adjustment had scores between 76 
and 85. 

While it can be agreed that numerical scores 
on this type of questionnaire may be mislead- 
ing, norms are essential in that they can serve 
as a background or standard for interpretation. 
Otherwise it is difficult to see the reason for 
calculating scores at all. Further questions are 
raised in relation to the wide range of ages cov- 
ered by the test. A given statement must neces- 
sarily carry different connotations to different 
age groups. Some statements suitable for a 
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seventh grade pupil are obviously out of place 
when presented to a college sophomore. 

This test is also subject to the general criti- 
cism of all paper-and-pencil questionnaires. We 
do not know what motivates the choice of any 
given statement by any individual taking the test. 


[99] 
Study of Values: A Scale for Measuring the Dom- 
inant Interests in Personality. College and adults; 
1931 ; $2.25 per 25; 15€ per specimen set; nontimed (20) 
minutes; Gordon W. Allport and Philip E. Vernon; 
Houghton Mifflin Co. 
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Paul E. Meehl, Assistant Professor of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota. Considering its a priori method of 
construction, the problematic validity of its the- 
oretical foundation (Spranger's types), and the 
relatively small number (45) of items which are 
used to measure six value dimensions, this seems 
to be a remarkably good test. It is administered 
either singly or in groups in around 20 minutes 
and is fairly easy to score. The reliabilities in 
the sense of internal consistency are not as high 
as are usually required, if this is taken to be of 
any great consequence; test-retest reliabilities 
range from around .70 to .9o with the exception 
of one score, the “social,” which is reported 
by the authors and by numerous investigators as 
being of doubtful meaning and stability. Valid- 
ity,as indicated by ratings, correlation with other 
tests, and differentiation of various academic 
and occupational groups, is fairly good when 
the many attenuating factors in such a situation 
are taken into account. Other investigators have 
reported normative data in fairly good agree- 
ment with those of the manual. The test scores 
have been shown to be related to field of college 
work, nature of the college itself, stated occupa- 
tional preference, actual occupation, sex, relig- 
ious affiliation, and various patterns of vocational 
interest as measured by the Strong and other 
vocational interest tests. Almost all of the trends 
which have been found are what the definitions 
of the five reliable value scores would lead one 
to expect theoretically. There is evidence of a 
slight relation to academic achievement. 

The criticism has been made that it is impos- 
sible to value many things at once, since the items 
of the test force a choice between situations in- 
volving the “opposed” values of the subject. The 
manual has been quite explicit in pointing out 
that it is the relative strength of values which 
the test is supposed to get at, so this is hardly 
a real criticsm. There are reasons for believing 
that this pairing of valued alternatives is actu- 


ally preferable to another method which, while 
allowing a subject to value two things equally 
strongly rather than being forced to choose, also 
introduces the new variables of differences 
among persons with regard to rating in general. 
While the norms compare one person with 
another, what they really compare is not the 
strength of a value in A versus its strength in 
B, but the extent to which the value competes 
with other values in A as contrasted with the 
corresponding extent in B. 

As was mentioned above, there is consider- 
able evidence for not paying much attention to 
the social value score, which is highly unreliable 
and in any case rather ambiguous in meaning. 
Factor analytic studies and other correlational 
work tends to indicate that the clearest case for 
primary component values is for the theoretical 
component. There seems to be a considerable 
overlap between the economic and political val- 
ues, which some other investigators, using both 
this and other item sets, have referred to by 
various names stich as “philistine,” “practical,” 
"business," and the like. 

At Minnesota this test has proved itself useful 
and stimulating when administered in a class in 
personality. In some unpublished work with 
which the reviewer was connected, all of the in- 
vestigators were very favorably impressed with 
the rather neat descriptive power of the values 
profile in some blind matching studies. In elim- 
inating certain persons as possibilities for a 
matching, or in the final choice of one rather 
than the other, this instrument was often of cru- 
cial importance, even though in competition with 
several other fairly well-validated personality in- 
ventories. With suitable caution as to its use 
with the less educated and as to the untrust- 
worthiness of the social value score, this test 
can be recommended as one of the few struc- 
tured personality devices having considerable 
value. 


[ 100 ] 
XSzondi Test. Ages 4 and over ; 1937 ; $7.50 per set of 
48 picture cards; $6 per copy of manual, German edi- 
tion ; an English translation by Susan K. Deri is in prep- 
aration; L. Szondi; Grune & Stratton, Inc. 
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Susan K. Deri, Visiting Lecturer in Psychol- 
ogy, The City College of New York, New York, 
New York. The Szondi Test is a projective 
technique originated by the Hungarian psychi- 
atrist Lipot Szondi. The test has been in use 
in Europe since 1936. The test material consists 
of 48 photographs, representing various types 
of mental patients, grouped into six sets, each 
set consisting of eight photographs: a homo- 
sexual, a sadist, an epileptic, an hysteric, a 
catatonic schizophrenic, a paranoid schizophre- 
nic, a manic-depressive depressive, and a 
manic-depressive manic. 

The administration of the test requires about 
ten minutes. The subject is presented one set 
of pictures and asked to pick out the two he 
likes and the two he dislikes the most. This 
procedure is repeated with all six sets consecu- 
tively, finally resulting in the choice of twelve 
liked and twelve disliked pictures. These choices 
are recorded graphically on a scoring sheet. 
The test must be administered several times 
with at least one day of interval between ad- 
ministration in order to be able to give a valid 
clinical interpretation of the personality. For 
various research purposes, however, the use of 
one or two profiles for each subject is permis- 
sible. 

The minimum age for which the Szondi Test 
can be given is approximately four years. It is 
administered individually although there are 
experiments under way trying it out as a group 
test by projecting the series on a screen. The 
same limitations which hold for the use of single 
test profiles hold for the group administration 
to an even greater extent. There are a limited 
number of research problems not entailing qual- 
itative analysis of the individual profiles, in 
which cases the group administration is per- 
missible. 

Something has to be said about the psycho- 
logical theory underlying this test although an 
adequate presentation of it is almost impossible 
within the framework of this short review. 
Szondi in his own theory contends that choices 
in the test are based on some kind of similarity 
between the hereditary factors (genes) of the 
choosing subject and the particular mental pa- 
tient represented in the chosen picture. Actually 
one can very well work with the test without 
accepting this "gene-theory." 

The eight different types of mental disorders 
have to be looked upon as extreme manifesta- 
tions of psychological tendencies present to 


greater or smaller extent in all of us. Thus we 
have an eight-dimensional concept of the per- 
sonality. Depending on the state of tension in 
each of the eight "basic" psychological needs, 
the pictures representing the corresponding 
needs will be chosen in various proportions. To 
make it more concrete: more pictures of sadists 
will be chosen by a person whose aggression is 
pent up than by somebody who is able to find 
ways to discharge his aggression. 

Whether the particular types of pictures are 
chosen as liked or as disliked depends upon the 
person's conscious or unconscious attitude 
towards the particular need in question. To con- 
tinue our above example: the person whose 
aggression is dammed up and ready to be dis- 
charged as open aggression will choose a great 
number of pictures of sadists as liked, while 
somebody whose aggression is dammed up but 
strongly repressed, will choose the same amount 
of pictures of sadists as disliked. The psycho- 
logical meaning of the choices in the remaining 
seven categories has to be interpreted according 
to the same dynamic principle but varying in 
the specific content of the needs represented by 
the various categories. Since all eight of these 
basic psychological tendencies have a wide 
range of potential manifestations ranging from 
the most normal psychological phenomena to 
psychotic or antisocial symptoms, the fields in 
which the test can be used are extremely wide. 

Accordingly, the fields of application of the 
test include: clinical practice, normal person- 
ality testing, vocational guidance, experimental 
social psychology, and a variety of fields of re- 
search, for example, following up the effects of 
various psychiatric treatments such as psycho- 
therapy, drug treatments, shock treatments, etc. 
It is particularly suited to follow up certain psy- 
chopathological processes such as the psycho- 
logical changes during the paroxismal cycles of 
epileptics or the personality changes during the 
process of a developing psychosis. 

From the above-mentioned points it follows 
that the test is not an instrument which is sup- 
posed to measure static traits of the personality 
but one which regards personality as a dynamic 
process. Therefore the usual criteria of relia- 
bility do not hold for this test since retesting is 
not supposed to show identical test profiles. 

However, the degree and type of changes are 
the main diagnostic signs which indicate 
whether the person is psychologically adjusted, 
neurotic, or psychotic. The validity of the test 
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was established against case histories and clini- 
cal data. Like in other projective techniques, 
the line of development followed detailed quali- 
tative analysis of individual cases rather than 
collecting large-scale statistical data. 

Another reason which makes statistical treat- 
ment of the data difficult lies again in a common 
characteristic of all projective techniques, 
namely in the fact that configurational patterns 
of the global profile have greater diagnostic 
value than the single factors. However, many 
tables with quantitative data are given in Szon- 
di’s new book Experimentelle Trieblehre. The 
data given in the tables are based on the test 
profiles of 2,237 subjects without any clinical 
symptoms ranging from the age of 3 to go, and 
1,880 subjects representing the various types of 
neuroses and psychoses. The tables indicate the 
frequency of the various configurations in dif- 
ferent age groups, vocational groups, and patho- 
logical groups. 

A critical appraisal was practically inherent 
in the above description. The very same aspects 
which make this test a unique instrument for 
the “measurement” of dynamic processes unap- 
proachable for any of the other personality 
tests, account for the shortcomings of the 
method. To sum it up, the strength of the test 
lies in the following points: (a) Unlike most 
well-known projective techniques, the Szondi 
Test does not involve verbalization, thus ex- 
cluding the possibility of certain types of verbal 
covering up or evasion well known to every 
Rorschachist. (b) Through the choice-reactions 
the subject reveals unconscious aspects of his 
personality which cannot be detected by any 
other projective test. (c) It is the only instru- 
ment which records the constantly fluctuating 
dynamisms of the personality. 

Besides Szondi’s theory about the role of the 
genes—which remains an hypothesis—the fol- 
lowing points of criticism could be made: (a) 
More experimental work is needed to clarify 
the basic psychological processes involved in 
our reactions to the pictures, Not having a par- 
allel series we do not exactly know the extent 
to which we react to these specific pictures or 
to the clinical entities as such. (b) In spite of 
all the aforementioned inherent difficulties, more 
statistical data are required for the purpose of 
standardization and validation. (c) A limitation 
or strong point of the test—depending from 
which angle we look at it—is the fact that a 
great deal of psychiatric and psychoanalytic 


knowledge is required of the person who uses 
the test. Because of dealing with concepts orig- 
inally derived from psychopathology, the dan- 
ger of misinterpretation due to superficial 
knowledge is greater than in any other person- 
ality test. 


[ 101 ] 
XTest of Etiquette: George Washington Univer- 
sity Series. Grades 10-16; 1941-42; 1 form; $8 per 
100, postpaid; 15¢ per specimen set, postpaid; 50(55) 
minutes; Creelman Rowland and Mitchell Dreese; 
Center for Psychological Service, George Washington 
University. 
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Test of Knowledge of Social Usage, 1942 Edition. 
Grades 7-12; 1933-42; 1 form; $5.50 per 100; 25¢ per 
specimen set, postpaid; nontimed (30-60) minutes; 
Ruth Strang, Marion A, Brown, and Dorothy C. Strat- 
ton; Bureau of Publications, Teachers College, Colum- 
bia University. 
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*Thematic Apperception Test. Ages 7 and over; 
1943; commonly known as TAT; individual; $5 per 
testing outfit; nontimed (120) minutes ; Henry A. Mur- 
ray; Harvard University Press, (28s. Oxford Univer- 
sity Press, London, 1944.) 
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Arthur L. Benton, Associate Professor of 
Psychology, University of Louisville School of 
Medicine, Louisville, Kentucky. The Thematic 
Apperception Test is a projective technique con- 
sisting of 20 pictures, the nature of which dif- 
fers in certain instances according to the age 
and sex of the subject. The subject is requested 
to make up a story about each picture—to con- 
struct a plot about the characters indicating what 
they are doing, what events have led up to the 
pictured situation, how the situation is going to 
turn out, etc. The purpose of the test is to elicit 
the dominant "thema" in the subject's life, i.e., 
the drives, tensions, relationships, conflicts, and 
environmental pressures that have played a de- 
termining role in his behavior. These thema are 
inferred from analyses of the subject’s stories, 
the fundamental assumption underlying the test 
being that the tensions, relationships, etc., pro- 
jected in his stories represent actual past or pres- 
ent tensions, relationships, etc., of the subject. 

This review will be for the most part a brief 
examination of some of the empirical data bear- 
ing upon this fundamental assumption. A sizable 
body of literature has accumulated on this topic, 
most of which provides favorable suggestive, 
rather than critical, evidence. Among the more 
critical assessments may be mentioned the re- 
ports of Harrison (8, 9), Murray and Stein 
(32), and Combs (61, 62, 63). 

Harrison gave the test to 40 psychiatric pa- 
tients whose backgrounds were unknown to 
him. Utilizing a personally developed method 
of analysis which “was eclectic and emphasized 
common-sense psychology,” he found that a 
good deal of biographical and personality data 
could be inferred from the patients’ perform- 
ances (83 per cent agreement with case history 
data). In 39 of the 40 cases the test perform- 
ances yielded sufficient information for the 
investigator to venture to offer a psychiatric 
diagnosis. The inferred diagnosis was correct 
in 30 cases (77 per cent). The validity of a 
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"blind diagnosis" of the TAT records of 15 
other patients was investigated. A slightly 
smaller amount of biographical and personality 
data was inferred (75 per cent agreement with 
case history data). In 11 of the 15 cases the 
inferred psychiatric diagnosis was correct (73 
per cent). 

Murray and Stein, using only 5 pictures which 
were adapted for group use by projection on a 
screen and having the subjects write their sto- 
ries, report that it was possible to make judg- 
ments concerning the leadership qualifications 
of a group of officer candidates which correlated 
fairly well (rho — .65) with the judgments of 
the candidates’ superior officers. 

Combs, employinga method of analysis “which 
would be consistent with accepted psychological 
theory and in agreement with accepted clinical 
practice," compared TAT stories written by col- 
lege students with their written autobiographies. 
He found that 31 per cent of the written stories 
included significant thema drawn from their 
own life experience, as indicated by autobiogra- 
phy. The individual pictures varied greatly in 
respect to their effectiveness in eliciting such 
thema, ranging from 58 per cent for Picture 2 
to 5 per cent for Picture I1 (1943 edition). 

The clinical significance of the various types 
of behavior elicited by the test and of the con- 
textual and formal characteristics which may 
appear in the stories has been noted by many 
workers. Sound interpretation of a subject’s 
performance requires considerable experience 
and skill and, above all, a sound, workable 
knowledge of psychology and psychopathology 
on the part of the examiner. Murray’s (4) dic- 
tum that “one must know a lot to comprehend a 
little” applies with particular force to the prob- 
lem of interpreting the segments of behavior 
elicited by a TAT or a Rorschach. 

One can say that the general validity of the 
test is fairly well established if one means that 
in the hands of a competent examiner the test 
data permit true, meaningful inferences. At the 
present time the greatest hindrance to its use- 
fulness is the lack of normative data on the test 
productions, and the collection of such data 
would seem to be a necessary step in the devel- 
opment of the test as a clinical tool. 


Julian B. Rotter, Associate Professor of Psy- 
chology, The Ohio State University, Columbus, 
Ohio. The Thematic Apperception Test (TAT) 
is a projective device which purports to reveal 


the basic personality characteristics of individuals 
by interpreting the stories they tell or “imagine” 
about a series of pictures. It is expected to re- 
veal to the trained interpreter the subject’s more 
central conflicts, attitudes, and goals as well as 
underlying inhibited motivations which the sub- 
ject cannot or will not admit as he is unaware 
of them. Unfortunately, these personality char- 
acteristics are not revealed in some simple or di- 
rect manner, but they appear mixed with cultural 
stereotypes, relatively unimportant recent events, 
and non-ego involved attitudes, and may reflect 
immediate emotional experiences. 

The clinician or experimenter seeking to find 
basic personality characteristics has the problem 
of separating the basic or central characteristics 
from the more superficial, the immediate and spe- 
cific from the more permanent and generalized, 
the cultural stereotypes from the idiosyncratic 
responses, and the directly reflected attitudes 
and conflicts from the distorted ones. 

Like other projective devices dealing with 
content material, TAT stories may be analyzed 
in a variety of ways, and a number of articles 
have been published attesting to the value of the 
TAT by clinicians interested in divergent prob- 
lems and with marked differences in personality 
theory. These differences include interest in 
“disease entity” diagnoses, in symbolic interpre- 
tations along freudian dream lines, and in “com- 
mon sense” behavioral description. The method 
of reducing stories to a system of “needs and 
presses” advocated by Murray in the manual 
for the test is not followed by many clinicians. To 
the reviewer it appears to tend to mask impor- 
tant individual differences and to be too closely 
tied to a theory of personality which has not 
been fully validated. In addition to the test au- 
thor’s brief manual, somewhat different methods 
of analysis and procedure have been published 
by Rapaport (73), Henry (85), and Rotter (78) 
and still other manuals are in preparation. 

Many investigators using projective tech- 
niques for clinical purposes are generally agreed 
that test-retest reliability is of little significance 
as the characteristics measured may change, the 
attitude towards the test and the stimulus-func- 
tion of the material may change, or superficial 
content may change between testings. Split-half 
reliabilities are not appropriate as the tests are 
not made up of equivalent items which can be 
conveniently divided into comparable halves. 
Inter-scorer reliability is of greater concern as 
it tests the stability of interpretations from one 
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investigator to another. Harrison and Rotter’ 


(48) found a reliability of .77 on a five-point 
rating scale for emotional stability of officer 
candidates for interpretations made from five 
written stories by two psychologists. Combs (61 ) 
using agreement between judges in determining 
the "needs" expressed in TAT stories from a 
list similar to Murray's found 50 to 60 per cent 
agreement. 

Harrison (8, 9) has presented a statistical, 
controlled study of validity. He showed high 
validity for "blind guesses" regarding descrip- 
tive characteristics which could be checked 
against an adequate case history. Although this 
study established the validity of the hypothesis 
that people "projected" or reflected their per- 
sonal experience in their stories, it did not wholly 
validate it for its more common use for “depth” 
interpretations. In this same study, guesses of 
IQ for 37 cases correlated .78 with obtained 
Stanford-Binet IQ's. 

"There are several different sets of pictures in 
use but the one in most common use is that of 
Murray's published in 1943. A number of inves- 
tigators have felt that the more bizarre and ab- 
stract pictures in this set are less productive with 
children and adults of average and low ability 
than ones which contain real people with whom 
the subject can easily identify. An experimental 
study by Gerver (68) ‘recently demonstrated this 
for children ages five to ten. 

Although some crude attempts at collection 
of normative data which can serve as a back- 
ground for interpretations have been made, the 
lack of standard methods of analysis and proce- 
dure for different investigators has prohibited 
combination of the results. Individuals have 
tended to use their own experience for implicit 
norms, and the beginning user is generally at a 
loss to understand in what way the subject's 
productions are unusual and significant. 

In spite of the limitations of nonstandard pro- 
cedure and lack of experimental evidence of re- 
liability and validity for the different methods 
of analysis, this kind of technique reveals to 
many examiners the kind of information they 
find most helpful for making judgments regard- 
ing diagnosis and treatment, and its value as a 
clinical tool is considered unquestioned by those 
who have had sufficient practical experience with 
it. 

"However, in its present state of development 
the TAT cannot be considered as a clinical in- 
strument apart from the person who uses it. The 


value, significance, nature, and validity of the 
tests are dependent upon the interpreter, his ex- 
perience, and his approach to the field of per- 
sonality. Different examiners may use the TAT 
as a source of material for finding characteris- 
tics of the subject which each thinks is impor- 
tant and significant. In one case the clinician 
may use the TAT solely for help in determin- 
ing diagnostic characterizations of a subject's 
behavior ; another may use it only to find what 
he considers to be the important emotional re- 
lationships of the subject to the world in which 
he lives. In all cases interpretations from the 
TAT should be considered as hypotheses or 
leads for further investigation. 


J. R. Wittenborn, Assistant Professor of Psy- 
chology, Yale University, New Haven, Con- 
necticut, The Thematic Apperception Test, a 
story-telling projective device, has been de- 
veloped largely through the efforts of Henry 
A. Murray (4). The formal testing proce- 
dure recommended by him is set forth in a 
manual of directions which accompanies the set 
of twenty stimulus pictures. This procedure for 
the administration of the test requires both care- 
ful observation of the subject as he produces 
his stories and expert inquiry into the origins, 
implications, and extenuations of the responses. 
This is a time-consuming procedure, and the 
tester, if he is to profit both from his observa- 
tions and the inquiry, should not only be an ex- 
pert clinician but also be acquainted with the 
history of the subject. 

Another procedure for administration is de- 
scribed by Rapaport (73), and in the litera- 
ture there is indication that a variety of devices 
have been employed to secure responses to 
pictures in the form of stories or dramas. The 
story-telling technique is an extremely flexi- 
ble device and the Thematic Apperception 
Test pictures may be used in a variety of ways. 
For example, the writer in his work with 
college students has found that the following 
procedure is not only economical of adminis- 
tration time but also highly rewarding in the 
quality of the stories produced : ten of the Mur- 
ray pictures known to elicit clinically useful ma- 
terial are presented with instructions that they 
constitute a test of creative imagination and that 
the student is to write a highly dramatic inci- 
dent about each. Invariably college students ac- 
cept this as a challenge to their imagination and 
produce stories which usually exceed in rich- 
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ness those elicited through individual, formal 
administration. 

Because of the all too prevalent limitations of 
time and clinically trained personnel, the use of 
the Thematic Apperception Test as Murray rec- 
ommends is almost prohibitive, Nevertheless, be- 
cause of the richness of the material which the 
pictures elicit and the useful insight which they 
afford, their general use in both vocational and 
educational guidance is to be recommended. For 
these reasons, the flexibility of the story-telling 
device should always be recognized, and local 
procedures devised which make its frequent ap- 
plication feasible. 

The exhaustive interpretation of the produc- 
tions described by Murray in the manual is not 
always employed by those who use his pictures. 
The sheer labor of his analysis or interpretation 
may in some instances exceed not only the time 
available for analysis but perhaps the demands 
of the case as well. His concepts of personality, 
moreover, are somewhat different from those 
held by many clinical workers. Accordingly, the 
procedures actually employed in interpretations 
are probably as varied as the procedures em- 
ployed in administration. 

The quality of the interpretations seems to this 
writer to be based on at least three characteris- 
tics of the interpreter: general knowledge of 
behavioral dynamics, including symbolism and 
other expressions of a dynamic unconscious; 
experience in guidance or therapy with the type 
of problems under investigation ; and experience 
in observing the relationships between the con- 
tent of the Thematic Apperception Test produc- 
tions and the content of subjects’ problems and 
their neurotic expressions. 

The use made of TAT material naturally va- 
ries with the needs of the case, the habits and 
skill of the counselor, and the policy of the clinic 
or guidance bureau. The Thematic Appercep- 
tion Test gives an expression of the content of 
the individual's problem, whereas the Rorschach 
expresses the structure of his personality. In 
this statement the word "content" should not be 
taken too literally, for the thematic productions 
not only give evidence of the nature of the in- 
dividual's problems but also indicate his charac- 
teristic modes of reaction and the intensity of 
his conflicts. Such data have great value in guid- 
ance, short-term therapy, and psychiatric diag- 
nosis (73). In almost any situation in which a 
serious effort is made to understand the individ- 
ual, the Thematic Apperception Test in some of 


its possible modifications will be found to be a 
genuinely good and useful technique. 

Aside from their obvious value in the appraisal 
of an individual, the TAT stories may have ex- 
ceptional value when employed in the interview 
as a means of guiding or facilitating the course 
of guidance or therapy. The writer has found in 
work with college students and adults numer- 
ous individuals who in the interview can read 
over, elaborate upon, and interpret their own 
Thematic Apperception Test productions, This 
procedure may have a supportive value for many 
cases and may be very useful for insight therapy. 

There is little to be said about the standard- 
ization and objectification of a projective instru- 
ment such as the Thematic Apperception Test. 
Although it would be of academic interest and 
of probable clinical value to know how the thema 
vary with the society in which the individual 
lives, his age, his sex, and his status in society, 
true standardization of this device must wait for 
a theory of personality which is worthy of the 
name. In the meantime, norms for interpreta- 
tion accrue from the experience of the clinician, 
and the chief validation of the test lies in the 
clinician's satisfaction with its use. 

A recent review of the literature on the The- 
matic Apperception Test has been prepared by 
Martin Mayman and includes a useful bibliog- 
raphy (71). 

Brit J Psychol 36:107 Ja '46. This revised 
form....consists of 30 pictures, and a manual of 
instructions and procedure. A brief description 
of methods of analysing the results is also given. 
The test is now suitable for use with men, women 
or children. 


For related reviews, see 104 and 104a. 


[ 104 ] 
X*[Re Thematic Apperception Test.] Crarx, RUTH 
Mireur. A Method of Administering and Evalu- 
ating the Thematic PE eeepuon Test in Group 
Situations. Genetic Psychology Monographs, Vol. 30, 
No. 1. Provincetown, Mass.: Journal Press, August 
1944. Pp. 55. Paper. $4.00. * (PA 18:3777) 


J Consult Psychol 9:309 N-D ’45. Percival 
M. Symonds. * There will undoubtedly be a 
number of attempts to find methods of scoring 
projective techniques. This reviewer, however, - 
is of the opinion that not only will such methods 
prove of little value but that attempts to use the 
results of projective techniques along psychomet- 
ric lines will bypass the more valuable methods 
of a direct analysis of the stories and the themes 
they contain. This study proves little because 
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there is no reference to independent judgments 
of the individuals who are subjects in the study. 
It may be of interest to know that two different 
methods of evaluating the Thematic Appercep- 
tion test lead to similar results. Of considerable 
more importance, however, would be to know 
that the Thematic Apperception test agrees 
with other and independent evidence concerning 
the personality characteristics of these same sub- 
jects. The student of projective techniques will 
be interested in the schedule for the analysis of 
the material which contains five headings. * This 
study is an honest attempt to find an objective 
method of evaluating the results of the The- 
matic Apperception test but its failure to meas- 
ure the results against an independent criterion 
limits its value. 


[ 104a ] 


X*[Re Thematic Apperception Test.] Tomxrns, 
SiLvAN S.; WITH THE COLLABORATION OF ELIZABETH 
n Tomxins, The Thematic Apperception Test: 

he Theory and Technique of Interpretation. 
New York: Grune & Stratton, Inc., 1947. Pp. xi, 297. 
$5.50. * (PA 22:1658) 


J Consult Psychol 12:61 Ja-F ’48. Tomkins’ 
book is a substantial contribution to the litera- 
ture of the TAT, perhaps the first sizable book 
intended primarily as a manual for this test. 
Chapters review the history and development 
of the TAT, administration, scoring (which de- 
parts somewhat from usual methods), and level 
analysis. The major part of the volume is then 
devoted to diagnosis in the regions of family, 
of love, sex and marriage, of social relation- 
ships, and of vocation. The final chapter ex- 
plores the use of the TAT in therapy. Through- 
out, the book is exceptionally rich in clinical 
illustrations that will be of value both to the 
student and to the experienced worker. The 
bibliography has rro entries, a tribute to the 
vitality of a technique scarcely twelve years old. 


[ 105 ] 
*[Torgerson’s Inventories and Record Forms.] 
Grades kgn-12; 1947; these inventories and record 
forms appear in Theodore L. Torgerson's Studying 
Children (Dryden Press, 1947) ; until such time as they 
are published as separates, permission to mimeograph 
the inventories and record forms may be obtained b 
writing the author at the University of Wisconsin ; 
Theodore L, Torgerson. : 
a) CLASS SUMMARY OF BEHAVIOR SYMPTOMS AND DIS- 
ABILITIES, 
b) BEHAVIOR INVENTORIES. 
4 HOME ENVIRONMENT INVENTORY. 
d) SPEECH INVENTORY. 
€) DEVELOPMENTAL INVENTORY OF BACKGROUND FACTORS. 
f) CASE STUDY FORM. 


REFERENCES —— DE 
a KA C eL iR Teaching Kee works Dryden 
Press, Inc., 1947. Pp. x, 230. $2.75. * (PA 21:4656) 

Harold H. Abelson, Associate Professor of 
Education and Director, Educational Clinic, The 
City College of New York, New York, New 
York. In conjunction with his book, Studying 
Children, Dr. Torgerson has prepared a series 
of pupil-study forms which he claims “will en- 
able teachers to make effective and objective 
observations of children with a view to discov- 
ering problems and identifying disabling fac- 
tors, (provide) an objective rating scale for 
evaluating standardized tests, (result in) a 
cumulative record of the child’s mental, educa- 
tional, physical, personal, and social develop- 
ment, (and help) teachers to systematize and 
facilitate the writing of case histories.” The 
inventories and record forms appear in Chap- 
ters 3, 6, 8, and 9 of the text. The forms are 
published separately as well. 

a) The Class Summary of Behavior Symp- 
toms and Disabilities is a form for indicating 
the number of specific symptoms or disabilities 
noted for each child under various subheadings 
arranged under the larger heading of scholar- 
ship, reading, spelling, arithmetic, vision, hear- 
ing, health, social behavior, and speech. 

b) Used in connection with the foregoing 
Class Summary is a series of Behavior Inven- 
tories, corresponding to each of the major head- 
ings previously enumerated. In all, over two 
hundred characteristics are presented, No dis- 
tinction is made in the tabulation as to the 
importance of the several items. 

c) The Home Environment Inventory consists 
of a list of forty-three negatively expressed char- 
acteristics of such home factors as child training, 
parent-child relationships, child-to-child relation- 
ships, and socio-economic status. The record 
form itself makes provision for checking the 
statements that apply to each pupil. 

d) The Test Rating Scale assigns numerical 
weight to “two major criteria: (1) validity and 
(2) reliability, and three minor criteria: (1) 
scoring, (2) norms, and (3) administration." A 
maximum of 50 points is given for validity, 25 
for reliability, and so on down to 5 for admin- 
istration. Detailed tables show just how many 
points to assign for such supposed indicators of 
the worth of the test as validity coefficients and 
reliability coefficients. 

e) In the study of individual cases there is 
added a Developmental Inventory of Back- 
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ground Factors (case history) consisting of 
thirty-five items. This is to be used in addition to 
the Behavior Inventories previously described. 
In addition a Case Study Form is provided for 
summarizing various features of the case. 

It should be added that the entire book explains 
how these various forms, along with such other 
approaches as anecdotal records, interview, and 
standard test procedures, can be used in a com- 
prehensive program of child study in the school 
situation. The book concludes with an enumera- 
tion of 120 "practices and conditions essential 
in an effective child study program." 

The forms developed by Dr. Torgerson will 
undoubtedly be appraised differently by persons 
with different backgrounds. Few clinically ori- 
ented child guidance workers whose concern is 
with dynamic relationships within the child will 
find palatable the atomic, enumerative approach 
implicit in the use of the inventories. The Test 
Rating Scale, with its almost indiscriminate ap- 
plication of numerical ratings to qualities that re- 
quire finely differentiated interpretations, should 
have very little appeal to anyone. On the other 
hand, the beginning teacher may find the con- 
crete aids developed by Dr. Torgerson helpful 
in directing attention to aspects of the child that 
might otherwise be overlooked and in encour- 
aging active experimentation with child study 
procedures. 


[ 106 ] 

V.G.C. Personality Adjustment Indicator or In- 
ventory. Grades9-16; 1944; an adaptation of Hugh M. 
Bell's Adjustment Inventory, Student Form (40:1200) ; 
Form S; separate answer sheets must be used; $2.25 
per 25; 40 per 25 answer sheets ; 25¢ per specimen set ; 
specimen sets must be purchased to obtain the manual ; 
nontimed (25) minutes; adaptation by M. D. Parmen- 
ter; Vocational Guidance Centre. 


For reviews of Bell's Adjustment Inventory 
by S. J. Beck, Raymond B. Cattell, John G. 
Darley, J. P. Guilford, Doncaster G. Humm, 
C. M. Louttit, Percival M. Symonds and 
Austin H. Turney, see 38:912 and 40:1200. 


[ 107 ] 
Vineland Social Maturity Scale. Birth to maturity ; 
1935-47 ; individual; 1 form ; $1.25 per 25 record blanks ; 
72€ per manual (1947) ; $1 per specimen set; 20(30) 
minutes; Edgar A. Doll; formerly published by Vine- 
land Training School; Educational Test Bureau. 
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Capacities of a Group of Young Delinquents.” Brit J Psycho 
39:02- 7. 

88. LANDISBERG, SELMA. "A Personality Study of Institution- 
alized Epileptics." Am J Mental Def 52:16-22 Jl '47. * 

C. M. Louttit, Professor of Psychology and 
Dean of the Undergraduate Division, Univer- 
sity of Illinois, Galesburg, Illinois. 'The Vine- 
land Social Maturity Scale, first published in 
1935, has been so widely used that it hardly 
needs extended comment. As a means of evalu- 
ating social competency, this scale is a unique 
instrument, being neither test, nor rating scale, 
nor questionnaire. The present reviewer has 
used the scale both clinically and for research 
since its first publication and is particularly 
struck by the former function where four prin- 
ciple values may be enumerated: (a) In the 
differential diagnosis between mental deficiency 


and physical or personality conditions which 
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result in subnormal achievement in verbal 


learning» which is commonly the criterion of 
psychological inadequacy. (b) The scale pro- 
vides a systematic description of actual behavior 
as observed day by day in a variety of sitüa- 
tions, Without some type of guide, the ques- 
tioning of parents or other informants con- 
cerning a child’s behavior is very apt to produce 
a picture colored by the informant's prejudices 
and by the interviewer’s lapses or preconceived 
ideas. Further, in this respect, the use of the 
scale frequently helps to establish rapport with 
parents and to make it easier for them to de- 
scribe behavior. (c) Both of the preceding 
values are pertinent in the study of children 
with personality and conduct problems. Com- 
parison of responses given by mother and 
father or by other persons significant in the 
child's environment sometimes reveals .differ- 
ences in attitudes important in the etiology of the 
complained-of behavior. (d) Finally, the meth- 
odology of the scale provides a means of secur- 
ing an evaluation of behavior level without 
subjecting the child to possibly disturbing con- 
ditions such as may be encountered in the 
usual type of test. Also, the data for evaluation 
may be secured by observation of the subject, 
or from reports of several observers who are 
in prolonged contact with the child. 

There is a real need for a systematic analysis 
of the abundant literature on this scale. The 
present manual is primarily a guide only and 
needs extensive supplementation, especially in 
respect to the data on standardization and the 
empirically determined information for clinical 
interpretation. Such an enlarged manual has 
been in preparation for some time and it is 
hoped will appear shortly. 


John W. M. Rothney, Associate Professor 
of Education, The University of Wisconsin, 
Madison, Wisconsin, This scale does not pur- 
port to be a psychometric test system although 
both point-scale and age-scale techniques are 
used. It is not intended to be a rating scale but 
rather a guided interview with someone who 
knows something about the behavior of some 
person between the ages of birth and twenty- 
five plus. The behavior is described in terms of 
self-help and self-direction generally, and par- 
ticularly with respect to eating, dressing, loco- 
motion, communication, social relations and 
occupation. The evidence is “hearsay” rather 
than the result of direct observation. Cautions 
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addressed.to the interviewer about the possi- 


bilities of bias, or distortion by attitude and 
questioning methods, suggest that only the 
most skillful and thoroughly trained interview-" 
ers. would obtain similar evidence about the 
same subjects. Scoring is highly subjective and 
there would be no objection to this if the stand- 
ards were not as specific as those given in the 
table of social-age values, These values are 
based upon the appraisal of 620 subjects com- 
posed of ten normal persons of each sex from 
ages birth to thirty. 

There is need for a scale of this kind and 
this reviewer is in sympathy with all the stated 
objectives of the author. He also approves 
highly of the attempt to get away from rating 
which involves the making of observations and 
judgments simultaneously. He does, however, 
question the value of using standards which 
suggest rather refined measurements when the 
user of the scale, even if he is trained as fully 
as the author suggests, is allowed so much 
freedom in gathering the data. 

The author promises a book with more data 
and more information about his scale. Until it 
appears it would seem desirable to restrict the 
use of the scale to the purpose of getting sup- 
plementary materials for the preparation of 
case histories. At present it appears that it 
could be used very effectively in the discussion 
of a person’s behavior with parents, teachers, 
and employers and in teaching them about gen- 
eral stages of development which can be ex- 
pected at various age levels, The manual con- 
tains 59 references to articles about the scale 
in which such uses are proposed. Study of 
these references will convince the clinician that 
he should have some forms of the scale avail- 
able for use when the occasion arises. 


J Consult Psychol 11:158 My-Je '47. * The 
scale is neither a test nor a questionnaire, but 
a series of behavioral observations, ranging 
through “does not drool,” “eats with fork,” 
“bathes self assisted,” “makes telephone calls,” 
“buys all own clothing,” to “advances general 
welfare.” The user of the scale is enjoined to 
obtain in detail the facts on which to base the 
scoring of each item. Herein lies a potential 
weakness, not of the scale itself, but of those 
who may use it to evaluate inadequate or un- 
reliable observations and reports. The items 
are subdivided into categories: self-help (gen- 
eral, eating, dressing), locomotion, occupation, 


communication, self-direction, and socialization, 
which permit some diagnostic use. The final 
rating is a point score of the number of items 
passed, which is converted to a.Social Age by 
mearis of a table. The scale is a valuable instru- 
ment for psychologists who recognize the wide 
value of developmental studies, including as- 
pects of personality in addition to intellect. 


For reviews by Paul H. Furfey, Elaine F. 
Kinder, and Anna S. Starr of Experimental 
Form B, see 38:1143. 
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* Visual Motor Gestalt Test. Ages 4 and over; 1938- 
46; individual ; 1 form; $1 per sct of cards and booklet 
of instructions ; $3.50 per copy of the complete manual 
A Visual Motor Gestalt Test and Its Clinical Use; 
(10) minutes; Lauretta Bender; American Ortho- 
psychiatric Association, Inc. 


REFERENCES 

1. Benner, Laurerra, “Disturbances in Visuomotor Gestalt 
Function in Organic Brain Disease Associated With Sensory 
pies Arch Neurol & Psychiatry 30:514-37 S '33. * (PA 

33581 

2, Benper, LAunETTA. "Gestalt Function in Mental Defect," 
pu clea MICE 38:88-104, discussion 104-6, 9 plates '33. * 

11103, 

3. ScuirpEm, PAUL. “Space, Time and Perception.” Psyche 
14:124:38 '34. * (PA 9:400 MT 

4. BENDER, LAURETTA, "Gestalt Function in Visual Motor 
Patterns in Organic Disease of the Brain: Including Dementia 
Paralytica, Alcoholic Psychoses, Traumatic Psychoses an 
Acute Confusional States.” Arch Neurol & Psychiatry 33: 
300-28, discussion .328-9 F $5. * (PA 9:5705) 

5$. BENDER, LAURETTA. A Visual Motor Gestalt Test and Its 
Clinical Use. Preface by Paul Schilder. American Orthopsy- 
chiatric Association, Research Monographs, No. 3. New York: 
the Association, 1938. Pp. xi, 176. $3.50. * (PA 13:1720) 

6. Scuriper, PAUL. "Notes on the Psychology of Metrazol 
Treatment of Schizophrenia.” J Nerv & Mental Dis 89113344 
F ’39. * (PA 14:356) E xb S s 

7. Fastrax, A, A. "Vertical Rotation in Visual-Motor Per- 
tormata: Its Relationship to Kaun Reversals.” J Ed Psy- 
chol 36:129-54 Mr '45. 19:2363 

8. Horr, Max L. "The Use of Projective Methods of Per- 
sonality Measurement in Army Medical Installations.” J Clin 
Psychol 1:134:40 Ap '4s. * (PA 19:2250) 


For related reviews, see 40:B843 and 109. 
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[Re Visual Motor Gestalt Test.] BENDER, LAURETTA. 
A Visual Motor Gestalt Test and Its Clinical Use. 
Preface by Paul Schilder, American Orthopsychiatric 
Association, Research Monographs, No. 3. New York: 
the Association, Inc., 1938. Pp. xi, 176. $3.50. * (PA 
13:1720) 

Am J Psychiatry 97:498-9 S '40. W. Line. 
This monograph brings together some results 
of the author’s investigations previously pub- 
lished, and some new material. * The theoreti- 
cal statements are hard to summarize, and at 
this stage, to evaluate. The main facts are that 
as children grow older their copies of presented 
figures approach more and more closely those 
of adults; that defective and abnormal subjects 
often give primitive and bizarre performances ; 
that the variations from the original undoubt- 
edly reflect phenomena that, if fully interpreted, 
would be of great assistance to our psychologi- 
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cal understanding. But the procedure as adopted 
by the author as diagnostic, seeking differences 
in performance which characterize subjects of 
known and significant difference (e.g., age, de- 
gree of intelligence, pathology, etc.), and thence 
interpreting those differences by recourse to a 
somewhat elaborate story of perceptual unfold- 
ing. The validity and practical value of that 
story, as well as its clarification, must await 
further research. Undoubtedly, as Schilder 
affirms in his preface, it is clinically stimulat- 
ing. It is too early, also, to comment very fully 
on the "Standardization of the Gestalt Func- 
tion in a performance Test for Children" 
(chapter XI). The method is not reported in 
detail, and relatively few cases have as yet 
been used as subjects. The results to date are 
both interesting and promising. A final com- 
ment on the monograph as a whole. It is obvi- 
ously ambitious in its scope, embracing as it 
does problems of development, deterioration 
and recovery. The general thesis is phenome- 
nological, with frequent recourse to complex, 
broad and unclear concepts. The method is not 
rigidly experimental, and being diagnostic in 
its direction it leans heavily (for purposes of 
maintaining a "dynamic" rather than "static" 
outlook) on a great deal of somewhat specu- 
lative interpretation. The practical application 
or experimental verification of the theoretical 
formulations will accordingly be difficult. (The 
reviewer believes that the type of concept used 
does not lend itself to clear-cut investigation in 
these directions.) Nonetheless, the monograph 
represents a very stimulating contribution to a 
challenging and complex field. 

J Abn & Social Psychol 36:128-9 Ja '41. M. 
A. Rickers-Ovsiankina. * Numerous reproduc- 
tions of drawings offer very interesting mate- 
rial to those readers who are not familiar with 
these phenomena. Essentially they are not new, 
however, the novelty lying chiefly in very sweep- 
ing statements and theoretical generalizations 
made by the author. The theory expressed is 
certainly not a theory in the sense that it leads 
to predictions and conclusions or even factual 
statements of a specific nature. What is impor- 
tant to know is not that these influences exist 
but exactly in what way they are responsible 
for a particular perceptive phenomenon. The 
author forgets that psychologists themselves 
are very conscious of the fact that they have 
not yet mastered certain dynamic features of 
perception as well as of other fields of psychol- 


ogy. More is needed, however, than just point- 
ing this out. The clinical part of the book 
presents a contribution of greater value than 
the theoretical. One regrets only that so little 
use is made of statistics. Except for the chapter 
dealing with the standardization of the test on 
children, one never knows how frequent and 
consistent the described patterns of behavior 
are in the various clinical groups. The test 
gives an estimate of mental age in children and 
defectives in a way similar to the Goodenough 
test. The present test, however, since it em- 
ploys meaningless figures, has the advantage 
that the picture of intelligence is not obscured 
by language and training. For the same reason 
it is valuable in following up the progress of 
transient states of mental disturbances, e.g. 
aphasia, With respect to malingering the study 
demonstrates the use of such drawing tests, 
when well standardized. In schizophrenia, 
manic-depressive psychosis, and psychoneurosis 
the subjects apparently behaved as one would 
expect a patient of that type to behave. On the 
whole, despite the criticisms which one might 
make especially of the theoretical section, the 
study is a stimulating one and the test has 
considerable promise for clinical purposes. 
For additional reviews, see 40:B843. 
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Washburne Social-Adjustment Inventory, Thas- 
pic Edition. Ages 12 and over; 1936-40; IBM; 1 
form; separate answer sheets need not be used; $1.70 
per 25; 20¢ per manual ; 35¢ per specimen set, postpaid; 
$1.20 per 25 machine-scorable answer sheets; $2.40 per 
set of stencils for scoring answer sheets; nontimed 
(30-50) minutes; John N. Washburne; published in 
1940 by the World Book Co. 


REFERENCES 

1-4. See 40:1262. 

. 5. WasHBURNZ, Joun N. “Factors Related to the Social Ad- 
DX ET of Suege Girls.” J Social Psychol 13:281-9 My '41. * 
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J Excep Child 7:212 F "41. Harold Postel. 
* will prove a valuable instrument for the sec- 
ondary school or college counselor, dean, or 
psychologist that desires to secure a better ad- 
justment for an individual, or that endeavors 
to select those students in need of psychiatric 
advice. The test is rather complicated in its ad- 
ministration and interpretation ; the teacher not 
trained in testing and without some knowledge 
of mental hygiene would have difficulty in 
making proper use of the test. It is this re- 
viewer's opinion that Dr. Washburne's inven- 
tory is best adapted for use by a specialist in 
the administration of an individual test, espe- 
cially when it is used on the high school level. 
This psychometric scale is well constructed, the 
result of ten years of trial, revision, and im- 
provement. It should find general favor in the 
secondary and college field, and will be a valu- 
able aid in solving those problems of youth 
that are often responsible for serious social and 
academic maladjustments. The instrument will 
be increasingly used as teachers become more 
enlightened regarding the important role played 
by the use of mental hygiene in the develop- 
ment of a well-integrated personality. 

J Nerv & Mental Dis 93:537-8 Ap ’41. * an 
inventory designed to reveal in secondary school 
children divergences from the “norm” in so- 
cial or affective maladjustment; thus calling 
the need for individual counsel and help to 
those charged with their guidance * If such 
necessarily rough classifications lead to the ful- 
fillment of needed individual attention along 
mental hygiene lines, their use in group educa- 
tion is of value. But the whole approach of 
adjustment scales, tests, evaluations, etc., is so 
essentially superficial from the standpoint of 
the protean dynamic determinants of adjust- 
ment and maladjustment—to say nothing of 
the ever-present external social and economic 
determinants—that it is difficult to believe that 
any newcomer to the tribe is going to prove an 
exception to the rule. 

For review by Daniel A. Prescott of the 
original edition, see 38:928. 
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*Weitzman's Inventory of Social Behavior. Ages 
16-25; 1941; 1 form; $1.75 per 25; 40¢ per specimen 
set; nontimed (20) minutes; Ellis Weitzman; Sheridan 
Supply Co. 
REFERENCES 


1. Weitzman, Erris. Test of Social Maturity and Its Use in 
Comparing People Sixteen Through Twenty-Four Years of Age. 
Unpublished doctor's thesis, University of Nebraska, 1940. 
(Abstracts of Doctoral Dissertations . . . , 1940, PP. 94-100. 


" 7 Pane EUR AES on dens ge MOM Competence,” 

P AER EU “A Study of Social Maturity in Per- 
sons Sixteen Through Twenty-Four Years of Age.” J Genetic 
Psychol 64:37-66 Mr '44. * (PA 18:2196) 

Louis Long, Director of Division of Testing 
and Guidance and Assistant Professor, De- 
partment of Student Life, The City College of 
New York, New York, New York. An inven- 
tory, developed for the purpose of assessing 
the level of social maturity, and which consists 
of only 33 items, must necessarily sample a 
limited number of activities. Questions about 
the reliability and validity of the inventory 
cannot be satisfactorily answered by reading 
the manual. A test-retest reliability coefficient 
of .88, based upon 42 college students, is re- 
ported. A correlation of .55 between scores and 
chronological age is reported as indicative of 
the validity of the inventory. Further evidence 
of the validity is found in the fact that the 
mean score on the test is different for the fol- 
lowing groups, equated as to chronological age: 
employed subjects, college students, Civilian 
Conservation Corps enrollees, and unemployed 
persons. The average score decreases (i.e, 
index of maturity drops) from the first to the 
fourth group. Of the six critical ratios between 
these groups two are below 2, two are below 
3, and two are above 4 (unemployed vs. em- 
ployed and unemployed vs. college). The use 
of such indices as the only measures of validity 
is questioned by the reviewer. 

The inventory probably will find its greatest 
use as an interviewing aid. If, however, an 
interviewer wishes to use the total score, it 
would seem absolutely necessary for him to 
build up a set of norms for his own local pop- 
ulation. Three different types of normative data 
are presented in the manual: centile ratings 
based upon total score, mean score for each 
year of chronological age from 16 through 25, 
and mean scores for first-, second-, third-, and 
fourth-year college students. 


Goodwin Watson, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. Dr. Ellis Weitzman, for his 
doctoral dissertation, prepared this inventory 
of 33 questions designed to measure social 
maturity (some items are from Doll's Vineland 
Social Maturity Scale) in young people 16 to 
24 years of age. High scores are achieved by 
youth who report that they handle their own 
finances, go out nights without restriction, 
have regular health checkups, and make their 
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own decisions. Items are weighted in accord 
with their capacity to differentiate older from 
younger adolescents. 

Norms are based on goo Nebraska young 
people during the 1930’s, with young men at 
all ages scoring slightly higher than young 
women. No account is taken of the dependence 
of norms upon other factors such as urban vs. 
tural culture, economic status, foreign-born vs. 
native-born parents, or the particular time 
period. Reliability of .88 is reported (42 cases 
retested after one week), but the problem of 
accuracy of self-report on such items as “Do 
you follow current events?” or “How many 
evenings a week do you stay at home without 
guests or visitors?” or “Do you assume re- 
sponsibilities beyond your own needs, which 
you might avoid?” or “Do you consider your- 
self the type of person who inspires the con- 
fidence of others in you?” is not discussed. 

The inventory seems useful as a basis for 
group discussion among adolescents on the 
freedom they have or ought to have. Granted 
a desire to answer truthfully, it would enable 
an investigator to identify the more emanci- 
pated and the overprotected in a group of col- 
lege or employed youth. Unemployed and un- 
derprivileged youth are penalized by several 
items requiring comfortable income. 


[ 112 ] 
*WilsonScales of Stability and Instability. Grades 
9-16 and adults; 1941; 1 form; $1.15 per 25, postpaid; 
15¢ per specimen set, postpaid; nontimed (20-30) min- 
utes ; Matthew H. Wilson; Bureau of Educational Meas- 
urements, Kansas State Teachers College of Emporia. 


Paul E. Meehl, Assistant Professor of Psy- 
chology, University of Minnesota, Minne- 
apolis, Minnesota. This is a pair of tests 
printed in a single booklet but not to be com- 
bined into a single score as the title might 
seem to imply. The stability test consists of 42 
items, each of which names or briefly describes 
some personal experience or situation, e.g., 
“animal pet," “picking own work,” or “friends 
with high standards.” The testee is asked to 
rate each of these experiences on a ten-point 
scale with respect to how much of a feeling of 
stability it gave him. The total score is the sum 
of these weighted ratings. The instability test 
consists of 38 items of an obviously adverse or 
unpleasant nature, e.g., “hurt in childhood," 
“restless,” “family quarrels," on each of which 
he is asked to give a similar rating with respect 
to how much of a feeling of instability they 


gave him. A short paragraph in the instructions 
briefly describes what is meant by the key 
words stability and instability. 

Raw score distributions and percentile norms 
are given for 400 high school sophomores and 
roo college freshmen. Phrases describing 
amounts of stability are coordinated with vari- 
ous percentile ranges. The test is supposed to 
be used with junior and senior high school and 
college students and “other adult groups.” 
Corrected split-half reliabilities seem to be satis- 
factory (.91 to .95 for various groups). 

No validation data are presented, nor have 
any appeared in the abstracted literature. The 
selection of items was from a larger pool, 
whose initial construction is not described in 
the manual. Basis of selection (which the 
author mistakenly considers evidence of “valid- 
ity") was a self-judgment by the students as 
to the importance of each experience in deter- 
mining their feelings of stability and instabil- 
ity. The items are uniformly of the obvious 
sort which abound in traditional personality 
inventories, and there is every reason to expect 
the usual influence of distortions in these self- 
ratings. The definition given to the testee of 
stability and instability is in the reviewer's 
opinion too vague to be of any real value. The 
author gives no consideration to the presumed 
effect of individual differences in constant rat- 
ing tendencies on the scores. In the case of the 
instability subtest, large regions on the rating 
scale remain substantially unused, as might be 
expected from the nature of the situation pre- 
sented. There is no accompanying definition of 
the measured variable for the use of the psy- 
chologist employing this test, and it is difficult 
to know what can be asserted about a person 
who achieves a deviant score, since no defined 
criterion groups of any sort are mentioned. Tt 
is not clear whether the terms stability and 
instability are to be conceived as extremes of 
a continuum ; remarkably enough, although the 
same adjectival descriptions are used by the 
author to characterize deviates on each test, no 
correlation between the two is reported. Thus, 
a clinician would not know how to interpret 
a pair of scores indicating moderate stability 
on the first test and extreme instability on the 
second. The directions and the nature of the 
task itself are hardly appropriate to use with 
adults generally, although no caution regarding 
education or intelligence is included. 

It is frankly difficult to say anything at all 
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favorable about this instrument. Although the 
complete absence of validity data makes it diffi- 
cult to decide, there seems to be no reason for 
supposing that this is more than merely an- 
other "personality inventory" having all of the 
defects and none of the merits of the tradi- 
tional variety. From everything available to us 
it would hardly seem justified to suggest any- 
one's using it except in an effort to determine 
whether any appreciable validity exists. Even 
this latter process would be quite a task since 
one does not get any clear idea of what sort of 
criterion ought to be used for the trait. 


Katherine W. Wilcox, Chief Psychologist, 
Traverse City State Hospital, Traverse City, 
Michigan. These are self-rating scales intended 
to measure stability and instability. The items 
on the stability scale include such topics as 
"My love for my parents,” “Friendliness,” 
"Physical exercise," and *Work promotes in- 
dependence" which are to be evaluated by the 
individual himself as to the degree to which 
these experiences contributed to his own sense 
of security and feeling of stability. On the in- 
stability scale the subject is asked to decide on 
such items as to what extent "Inferior" under 
the heading of emotional reactions and “Fam- 
ily relations" under the work and business 
heading contributed to feelings of insecurity. 

"These scales violate standard procedures of 
test construction in the following important 
ways: (a) The author assumes validation be- 
cause of split-half reliability without any com- 
parison of scores against any independent cri- 
terion. (b) The college data were from 100 
freshmen attending 64 different colleges. Obvi- 
ously there were some selective factors at work 
in the selection of cases which presumably 
influenced the results. The nature or direction 
of such influence is not considered. (c) The 
ratings are done on an eleven-point scale for 
material which is difficult to quantify at all and 
should certainly not be handled on more than 
a three-point scale. (d) The test items are 
briefly stated and are often ambiguous. (e) 
'The author states his theory of stability of 
personality and expects the subject to have 
insight into his own longitudinal adjustment 
on the basis of the author's theory. The author 
does not give any consideration to the vari- 
ability of results when such insight is lacking 
or defective. The arrangement of items by 
areas of experience, as in the instability scale, 


increases the halo effect and tends to make the 
self-judgments more biased. The author con- 
cludes that adjustment in a life situation is the 
same thing as a score on a self-rating scale. 
The only legitimate use of these scales in 
their present form is in bringing up topics for 
discussion between student and counselor. 
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*Work Preference Inventory. Applicants for em- 
ployment; 1946; IBM; 1 form; separate answer sheets 
must be used ; $25 per 100; $2 per 100 machine-scorable 
answer sheets ; $2 per 100 profile sheets; $6 per sct of 
18 keys (personality traits: reliability, perseverance, 
emotional stability, creativeness, conservatism, ambition, 
masculinity, introversion, anxiety-depression, neurotic 
index ; interests : persuasive, social service, theoretical, 
artistic, mechanical, economic, scientific) ; 35¢ per single 
copy ; 25¢ per manual ; nontimed (15-75) minutes ; Rob- 
crt W. Henderson: the Author, University Personnel 
Office, University of Kentucky, Lexington, Kentucky. 


Edwin W. Davis, Director of Washington 
Counseling Center and Associate Professor of 
Psychology, The George Washington Univer- 
sity, Washington, D. C. A hidden personality 
inventory within a regular vocational inventory 
makes this test about the most complete con- 
tribution to this technique of employee selec- 
tion and vocational choice. The only question 
remaining for such an excellent idea is the 
effectiveness of this dual instrument. The ten 
personality traits are reliability, perseverance, 
emotional stability, creativeness, conservatism, 
ambition, masculinity, introversion, anxiety- 
depression, and neurotic index. The vocational 
interests are persuasive, social service, theo- 
retical, artistic, mechanical, economic, and sci- 
entific. Although the test is mainly for adult 
employees and counselees, it can be used for 
adolescent youth. 

Actual use of these tests on 25 cases in the 
Washington Counseling Center revealed the 
personality traits unreliable indicators for per- 
sons well known to the counselors. The occu- 
pational levels were a little more reliable when 
compared to the Kuder and to the revised 
Brainard. 

The test would carry the technique of a 
combination personality-occupational inventory 
farther than any other test. Once this technique 
is perfected, it will be a great time-saver as 
well as provide a more acceptable personality 
test for over-sensitive clients. Immediate re- 
search should be undertaken to compare this 
test with the best comparable occupational in- 
ventory and personality inventory given to the 
same persons. The wording of the items in a 
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few cases can be improved. A person giving 
too many neutral answers, as experienced 
adults seemed to do, tends not to give a very 
distinctive profile. The author reports fine ac- 
ceptance of this test in industrial circles. 


John C. Flanagan, Professor of Psychology, 
The University of Pittsburgh, Pittsburgh, 
Pennsylvania. This inventory consists of a list 
of 130 pairs of various types of activities, For 
each pair, the individual is asked to state his 
preference by marking on the answer sheet the 
first choice if he likes the first alternative much 
more than the second, the second choice if he 
likes the first a little more than the second, the 
third choice if he likes them both equally well, 
the fourth choice if he likes the second alterna- 
tive a little better, and the fifth choice if he 
likes the second alternative much more than 
the first. 

Items are listed under nine headings, the 
first five of which refer to preferences regard- 
ing kinds of work or work environment. The 
other categories involve other types of prefer- 
ences. One section asks for choices regarding 
living conditions; another section, choices re- 
garding reading materials; another section in- 
volves school subjects; and the last section is 
a miscellany consisting mainly of recreational 
activities. E 

The selection of specific items impressed this 
reviewer quite favorably, The general plan of 
the test seems good, The manual states, 


The Work Preference Inventory gives ten personal- 
ity scores and seven interest scores, although most 
testees will not realize that it is anything other than 
an interest test. Therefore it has the advantage of 
being partially disguised and will give a true personal- 
ity picture in many employment situations. 

The scoring keys for the components defined below 
were developed through item analysis. Clinical ratings 
were compared to test responses and the resultant keys 
include the majority of those who were rated high on 
a trait or interest and exclude the majority of those 
rated low. 


The personality traits for which scoring keys 
are available include reliability, perseverance, 
emotional stability, creativeness, conservatism, 
ambition, masculinity, introversion, anxiety-de- 
pression, and neurotic index. The scoring keys 
available for obtaining information regarding an 
individual’s interests include persuasive, social 
service, theoretical, artistic, mechanical, eco- 
nomic, and scientific. 

Norms are provided for converting the scores 
obtained from each of the traits into the letter 


grades, A, B, C, D, and E. The corrected 
odd-even reliability coefficients based on 50 
cases vary between .57 and .92, with a major- 
ity of them being in the .70's. Two illustrative 
individual profiles are given, one for a success- 
ful sales manager and the other for an indi- 
vidual discharged from the Army because of 
"anxiety state severe." 

Biserial correlation coefficients are reported 
in a "validity table." These coefficients were 
obtained by correlating the test scores with the 
ratings of associates for the surface traits and 
interests and with clinical ratings for the other 
scales (introversion, masculinity, anxiety-de- 
pression, and neurotic index). In a letter to the 
reviewer Mr. Henderson states that in develop- 
ing the scoring keys, “only those for whom 
there was agreement among the 2 to 4 raters 
were used. Consequently, in some cases, as few 
as IS pairs of ratings were available. For ex- 
ample, 15 homosexuals versus 15 men rated as 
definitely masculine were compared item by 
item, and the items selected for the masculinity 
scoring key were those which differentiated the 
masculine from the nonmasculine. Sixty-five 
was taken as the lowest exclusion-inclusion per- 
centage and the average for all items on all 
scales is about 77." 

The “validity coefficients" reported in the 
table are based on between 25 and 85 individ- 
uals, the median number being 34. The biserial 
correlations range from .51 to .98, with seven 
of the coefficients reported above .90. The test 
author suggests that the high values may be 
due to the wide differences between the groups. 
This certainly seems to be a relevant explana- 
tion of part of the exceptionally high values 
obtained. It appears that it would have been 
more appropriate to have used the triserial 
correlation coefficient in this instance. In a com- 
munication to the reviewer the test author 
stated, “The validity s were not obtained 
from the same individuals (as used to develop 
the scoring keys), but the rating psychiatrists 
and clinicians were the same." 

The test materials and method appear prom- 
ising. However, the keys were developed on 
very small numbers of cases, and the norms 
currently available were also apparently based 
on a very small sample (the number was not 
reported). These facts raise serious doubt re- 
garding the advisability of using this test in its 
present stage of development. 

In order to obtain an impression regarding 
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the type of results provided by the inventory 
in its present form, scores were calculated on 
all 17 of the keys for the writer and eight of 
his associates. An indication of what appears 
to the reviewer as an inadequate basis for some 
of the scores obtained from the present blank 
is illustrated by the following finding. One girl 
obtained a score of 2 out of a possible score of 
13 on Emotional Stability, which, according to 
the manual, indicated that she was in the last 
category of the scale and had strong tendencies 
toward being neurotic and unstable. This very 
low score was obtained by marking the follow- 
ing choices: a slight preference for clerical 
over mechanical work; a strong preference for 
criticizing, editing, and examining rather than 
planning, organizing, and creating; a strong 
preference for rewriting rather than writing 
rough drafts of reports; a slight preference for 
work which enables you to use hunches and 
intuition over work which requires you to 
think things out carefully; a slight preference 
for people who are very religious as compared 
with people who are not religious; a slight 
rather than a strong preference for people who 
are realistic over people who are impractical ; 
a slight preference for living among poor 
people rather than among rich people; a slight 
rather than a strong preference for the comics 
or funnies over movie magazines; a slight 
preference for reading about the theater rather 
than reading about medicine; a slight prefer- 
ence for studying drama rather than account- 
ing; and a slight preference for watching 
rather than playing your favorite game. It is 
suggested that the above constitutes very in- 
adequate evidence on which to label a person 
as “emotionally unstable”; and it is believed 
that obtaining and reporting such scores may 
not only be of little value, but may actually do 
harm in some instances. 

In conclusion, it is suggested that the inade- 
quacy of the present test is primarily due to a 
lack of available resources for its develop- 
ment. It is believed that, for so important a 
topic as the diagnosing of personality, a very 
comprehensive and intensive research and de- 
velopment program is essential. 

Although the materials and procedures used 
in the development of this test are superior to 
those uséd in most published tests in this field, 
the inadequate research basis for the present 
scoring keys suggests that it should be used 


only with full knowledge of its present limita- 
tions and by experienced psychologists. 


Gilbert J. Rich, Director, Milwaukee County 
Guidance Clinic, Milwaukee, Wisconsin. This 
inventory attempts to serve both as a person- 
ality test and as an interest inventory. It con- 
sists of 130 paired items relating to the sub- 
ject’s preferences in regard to type of work, 
conditions of work, persons with whom one 
works, conditions of living, reading, studying, 
and leisure-time activities. On each pair of 
items, response is made upon a fivefold scale 
extending from one extreme through “liking 
both” to the opposite pole, with instructions to 
omit items where neither is liked. Scores are 
obtained for 10 personality characteristics: 
reliability, perseverance, emotional stability, 
creativeness, conservatism, ambition, mascu- 
linity, introversion, anxiety-depression, and 
neurotic index; and for 7 interests: persuasive, 
social service, theoretical, artistic, mechanical, 
economic, and scientific. 

The individual questions were picked by a 
process of item analysis, but no information is 
available as to number of subjects used other 
than that it varied from one trait to another. 
Biserial correlation coefficients are given for 
groups of persons who either were rated high 
or low in the particular trait or interest or who 
fell into groups that could be so classified. This 
procedure yields coefficients varying from .51 
to .98, but which are based upon small popu- 
lations ranging from 25 to 85. Moreover, it 
must be noted that many of these were soldiers 
for whom such inexact criteria were used as 
lack of referral to a neuropsychiatric clinic 
(well-adjusted) and discharge from the army 
for neuropsychiatric reasons (neurotic). Any- 
one familiar with military referral procedures 
and military psychiatry will readily recognize 
the inadequacy of such classifications. 

Moreover, a study of individual items like- 
wise indicates the difficulties in selection of 
them within such a situation. Items chosen to 
indicate neurotic tendencies are heavily loaded 
with those which might unfit the subject only 
for military life, such as artistic and musical 
interests and unwillingness to accept respon- 
sibility. 

There is a considerable degree of overlap 
between the items utilized to obtain different 
scores. For example, the scale for anxiety- 
depression contains 26 items, 25 of which are 
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also found among the 36 items on the scale 
for neurotic index. Similarly, the definitions of 
emotional stability and neurotic index are ex- 
-actly opposite, so that one should be the con- 
verse of the other. The inclusion of these three 
scales measuring the same characteristic (one 
in reverse) is a redundancy which gives to the 
inventory an appearance of exactness which is 
not there. 

The plan of testing personality characteris- 
tics under the guise of work interests should 
have the definite advantage of lessening the 


tendency of the subject to describe himself as 
he wishes to be, but it involves a danger that 
he may try to show himself as he thinks his 
employer or the prospective job may require. 
On the whole, however, it would appear to be 
more advantageous than disadvantageous. 

Taken as a whole, the inventory appears to 
be an effort to measure more characteristics 
than are justified upon a basis of 130 items. It 
also appears to be poorly standardized and to 
be validated upon too small a sample as well 
as upon inadequate criteria. 


ENGLISH 


Reviews By Harold H. Bixler, Paul S. Burnham, Ruth D. Churchill, John Cohen, Charlotte W. 

Croon, Edgar Dale, John S. Diekhoff, J. R. Gerberich, Harry A. Greene, Chester W. Harris, 

Joseph E. King, Gerald V. Lannholm, J. Paul Leonard, Robert C. Pooley, H. H. Remmers, 

Henry D. Rinsland, Holland Roberts, Verner M. Sims, Robert H. Thouless, Guy M. Wilson, 
Clifford Woody, Louis C. Zahner. 
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XAbility for English (Language): Fife Tests of 
Ability, Test 1. Entrants to secondary schools (Eng- 
land and Scotland); 1947; 1 form; ss. 6d. per 25; 
. per manual; 30(35) minutes; Frank M. Earle; 
University of London Press Ltd. 


REFERENCES : 

1, EAzLE, FRANK M. Tests of Ability for Secondary School 
Courses. Publications of the Scottish Council for Research in 
Education, No. 10. London: University of London Press, 1936. 
Pp. xiii, 138. 5s. * 


For a review by James Maxwell of the entire 
battery, see 8. 
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Iowa Placement Examinations: English Apti- 
tude: Series EA1, Revised. Grades 12-13; 1925; 
Forms A, B; $4 per 100; soé per specimen set in- 
cluding the other ro tests in the series, postpaid; 
43(50) minutes; prepared under the direction of 
C. E. Seashore and G. M. Ruch; M. F. Carpenter 
and G. D. Stoddard; Bureau of Educational Re- 
search and Service, State University of Iowa. 
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Lafayette, Ind.: Division of Educational Reference, the Uni- 
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10,000 Iowa igh chool Seniors. University of Iowa Studies in 
Education, Vol. s, No. 3, Iowa City, Iowa: the University, 
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8. MizLER, Lawrence WiLLIAM. An Experimental Study of 
the Towa Placement Examinations, University of Iowa Studies 
in Education, Vol. s, No. 6. Jwa City, Iowa: the University, 
ven, ane Connie, “Prediction of College Marks." J Exp 
Ed 8:303-7 Mr ’40. * (PA 14:4739) 

Robert C. Pooley, Professor of English, The 
University of Wisconsin, Madison, Wisconsin. 
A test of English aptitude for high schools and 
colleges consisting of four pages, to be ad- 
ministered in 43 minutes of actual test time. 
Punched keys are provided for manual scor- 
ing, and answer sheets for machine scoring. 
Norms are provided in the form of percentiles 
for the test, but these are undifferentiated for 
grade levels, or for the comparison of high 
school students with college students. 

The test is in four parts. Part I tests the ap- 
plication of two rules of grammar; Part II is a 
reading comprehension test based on about 
thirty lines of prose; Part III is a fact-deter- 
mining test from the reading of thirty-six lines 
of narrative poetry; Part IV is a test of the 
quality of construction of a prose passage, with 
questions on the beginning, the paragraph con- 
struction, punctuation, word choice, etc. 

This test is put forward by its authors as 
serving the following purposes: (a) to afford 
a basis for prediction of the character of the 
work that each student will do in college; (5) 
to aid in selecting and admitting students; (c) 
to serve as an entrance examination in lieu of 
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the more time-honored essay-type content ex- 
amination; (d) to section classes for instruc- 
tional purposes on the basis of mental ability ; 
etc. 

Such claims would be difficult to defend. To 
judge from this test the outcomes of four years 
of instruction in English in high schools are: 
(a) to be able to apply a grammatical rule, 
such as, “Avoid incongruous metaphors,” to 
printed sentences; (b) to read a composition 
textbook and understand it; (c) to read a 
poem for such valuable experiences as, “Was 
the sea calm ?” “How many statues were there ?" 
“What work did the men do on the island?” 
(d) to criticize a passage in print from the 
point of view of an editor or an English 
teacher. 

Nowhere, in any course of study available, 
can such goals be found as the principal out- 
comes of the teaching of English. If the test 
does not test what the teachers are directed by 
courses to study to teach, the test is invalid. It 
is grossly unfair to judge the results of in- 
struction by such a test. Nor is it just to sort 
or segregate students for further instruction by 
such means, Widespread use of this test will 
have one positive effect; it will divert teachers 
from the true goals of English instruction to 
drill upon rules, emphasis on proofreading 
skills, and the reading of poetry for the cata- 
loguing of irrelevant facts. 
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Basic Language Skills: Iowa Every-Pupil Tests 
of Basic Skills, Test C, New Edition. Grades 3-5, 
5-9; 1940-45; IBM, grades 5-9; Forms L, M, N, O; 
2 levels ; 44¢ per specimen set of either level; H. F. 
Spitzer in collaboration "E Ernest Horn, Maude 
McBroom, H. A. Greene, and E. F. Lindquist (Gen- 
eral Editor) ; Houghton Mifflin Co. 
a) ELEMENTARY BATTERY. Grades 3-5; $1.50 per 25; 
46(60) minutes. 
b) ADVANCED BATTERY. Grades ; IBM; separate 
answer sheets need not be used ; ido per 25; 63¢ per 
25 machine-scorable answer sheets; 30¢ per set of 
machine-scoring stencils; 55 or 58(70) minutes. 
REFERENCES 


1. Evans, Paut C. A Critical Analysis of the 1936 Iowa Basic 
Skills Test in Language, Test C. Unpublished master’s thesis, 
University of Iowa, 1936. 

2. Victor, GromciA C. An Analysis of Pupil Responses to 
Items in Iowa Every-Pupil Test in Basic Language Skills for 
1935. Unpublished master's thesis, University of Iowa, 1936. 


For reviews by Frederic L. Ayer, Gustav J. 
Froehlich, and Ralph C. Preston of the entire 
battery, see ro. 
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Clapp-Young ERE Test: Clapp-Young Self- 
Marking Tests. Grades 5-12; 1929; Forms A, B; 
9oé per 25; 8¢ per specimen set; 25(30) minutes; 
Frank L. Clapp and Robert V. Young; Houghton 
Mifflin Co. 
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Gerald V. Lannholm, Assistant Director, The 
Graduate Record Examination, New York, New 
York. This test is divided into three parts, osten- 
sibly designed to measure the pupil's ability: 
(a) to capitalize and punctuate correctly, (5) 
to use correct word forms, (c) to recognize 
various aspects of formal grammar. The test 
can be scored quickly, easily, and objectively 
by means of the authors’ patented “self-mark- 
ing" technique. According to the authors, the 
skills selected for inclusion in the test were 
based on studies of error frequencies. 

Published in 1929, this test was probably in 
advance of others in the field at that time. Al- 
though the evidence of research concerning the 
relative effectiveness of objective techniques 
for measuring a pupil's mastery of the funda- 
mentals of correct writing is neither conclusive 
nor complete, the evidence is sufficiently strong 
to raise doubts concerning the effectiveness of 
the techniques employed in this comparatively 
old instrument. Ability to capitalize and to 
punctuate is measured in this test by what the 
reviewer has called the "locate the error" tech- 
nique. The pupil need locate only the position 
of the error; he is not required to indicate the 
correct usage. In the word-form part of the 
test, two choices are given for each item, the 
pupil being directed to mark one of them as 
correct, The items in the third part of the test 
require the pupils to identify parts of speech 
and parts of sentences. 

The sampling in the test is much too limited 
to have the diagnostic value implied by the 
authors. This may be illustrated by the fact that 
the test includes less than ten different punc- 
tuation usages. Interpretative material provided 
by the authors is limited and likely to be mis- 
leading. For example, the authors state that 
any pupil who is just completing the work of a 
given grade should make a score equal to the 
median score earned by all pupils finishing that 
grade. Similar standards are decreed for pupils 
of various age levels. The reviewer questions 
seriously the appropriateness of such an arbi- 
trary definition of satisfactory performance 
for all pupils. 3 

A coefficient of reliability of .85 is reported 
by the authors, who do not state whether this 
is based on the performance of pupils in all of 
the eight grades for which the test is intended 
or on the results for a single grade. Nor do 
they indicate the method used in estimating the 
coefficient of reliability. The authors rest their 
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claim for validity for the test on its content, 
choosing to ignore the serious effect which other 
factors may have on its validity. 

It is difficult to believe that a test in this 
field would retain its original form for so many 
years. In view of the many improved tests now 
available in this field, it is not anticipated that 
enlightened teachers or supervisors will select 
this edition for use in present-day classrooms. 
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*College Placement Test in Hinglish. College; 
1941-43; Forms A, B; sold only to colleges, $1.60 per 
25, postpaid; sample test free; 120(125) minutes; 
Hector H. Lee; Turner E. Smith & Co. 
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X*College Preparatory Test in English. High 
school; 1943; 1 form; $1.25 per 25, postpaid; sample 
test free; (45) minutes; Hector H. Lee; Turner E. 
Smith & Co. 
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X*Cooperative English Test: Lower and Higher 
Levels. Grades 7-12, 13-16; 1941-47; IBM; three 
tests available separately and in a single-booklet edi- 
tion; separate answer sheets need not be used; Co- 
operative Test Service. 
a) SINGLE BOOKLET EDITION. Grades 7-12, 13-16; 1941— 
43; Rom S, T; $375 per 25; 30 per specimen 
set of eitlier level, postpaid; $1 per 25 sets of ma- 
chine-scorable answer sheets; 45¢ per set of stencils; 
120(130) minutes. 
b) TEST A, MECHANICS OF EXPRESSION. Grades 7-16; 
1941-47 ; Forms R, S, T, X; $175 per 25; 25¢ per 
specimen set, postpaid; 40€ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 40(45) minutes; Geraldine Spaulding, W. W. 
Cook (Form R), and Herbert Danzer (Form X). 
C) TESTS BI AND B2, EFFECTIVENESS OF EXPRESSION: 
LOWER AND HIGHER LEVELS. Grades 7-12, 13-16; 1941- 
47; Forms R, S, T, X; 2 levels; $1.75 per 25; 25¢ per 
specimen set of either level, postpaid; 4o¢ per 25 ma- 
chine-scorable answer sheets; 154 per stencil for 
Scoring answer sheets; 40(45) minutes, Geraldine 
Spalding, W. W. Cook (Form R), Miriam May 
(Forms S and T), and Miriam M. Bryan (Form X). 
d) TESTS CI AND C2, READING COMPREHENSION : LOWER 
AND HIGHER LEVELS. Grades 7-12, 13-16; 1941-43; 
Forms R, S, T; 2 levels; $2 per 25; 25¢ per speci- 
men set of either level, postpaid; 40€ per 25 machine- 
scorable answer sheets; rsé per stencil for scoring 
answer sheets; 40(45) minutes; Frederick B. Davis, 
Harold V. King (Form S), and Mary Willis 
(Form T). 
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J. Paul Leonard, President, San Francisco 
State" College, San Francisco, California. [Re- 


view of Lower Level Forms R, S, and T.] 
These tests represent recent achievements in 
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the refinement of test procedures in reading 
and mechanical usage. Norms are provided on 
many cases for different types of schools. The 
tests are quickly scored, available for machine 
scoring, rather comprehensive in coverage, 
quickly and easily administered, and excellent 
to aid the average English teacher who wants 
to use them to supplement theme writing and 
observation. They are not individually diag- 
nostic but are chiefly valuable for giving an 
average comparable level of achievement and 
for securing comparable scores on different 
phases of reading and expression. 

The test consists of three major sections, 
each 40 minutes long, testing reading compre- 
hension, mechanics of expression, and effec- 
tiveness of expression. More specifically these 
areas are measured through vocabulary and 
reading comprehension; grammatical usage, 
punctuation, capitalization, and spelling; sen- 
tence structure and style, active vocabulary, 
and organization. They are based upon careful 
analysis of many studies in English and in the 
main are designed to measure the thinking 
process in relation to the mechanics of reading 
and expression. As such, their motives are 
commendable, their construction unusually care- 
fully planned and checked by sampling of items, 
and the scoring is designed to eliminate in so 
far as possible the nonessential factors fre- 
quently affecting testing. Careful precautions 
are given in the manual to the teacher, the 
claims for the test are guarded and modest, and 
every modern statistical device and caution have 
been used to make the test as accurate for its 
purpose as possible. 

One has the feeling that superior workman- 
ship has gone into these tests and that within 
the framework of statistical standards and ease 
of scoring they are superior. They are not 
meant to be individually diagnostic and thus 
have limited use; they are so short in length 
that they reduce their own reliability. In a few 
places one feels that a load of understanding is 
placed upon the pupil for quickly sensing the 
different ways of responding to the test items 
so that they may be scored easily and that in 
some instances, because of the time element in 
the test, he may be penalized for proceeding 
carefully. Some of the items in the grammati- 
cal section of the test can probably be “felt” 
by the pupil, some of the punctuation items are 
probably overplayed (apostrophe of possession, 
for instance), and the active vocabulary section 


is probably more of a crossword puzzle than- 
anything else. On the whole, however, the test 
is excellent for a short, quick survey of general 
abilities and is far superior to most tests in the 
statistical procedures used in building it. 


Edward S. Noyes, Associate Professor of 
English, Yale University, New Haven, Con- 
necticut. [Review of Higher Level Forms R, S, 
and T.] These three forms are nearly identical 
in plan and in number of items; scores.on any 
one form should be comparable with those on 
any other, a fact which may be useful to teach- 
ers with several divisions. 

The subtest Vocabulary (60 five-choice items, 
15 minutes) is limited to synonyms. The items 
are said to use “words of increasing difficulty" 
and to demand "increasingly close discrimina- 
tion among the choices.” But is “piebald” (Item 
60, Form R) a word more difficult than 
"writhe" or “cornea” (Items 1 and 4)? This 
reviewer sought in vain to find evidence of the 
need for closer discrimination among later than 
among earlier choices, in any one of the forms. 
In any item, the student who knows the mean- 
ing of the given word will find little difficulty 
in his choice; he who does not know must 
guess. The number of items seems small for 
exploration of vocabulary range. 

The subtest Reading Comprehension (90 
items, 25 minutes) is so arranged as to afford 
two scores: one on speed, the other on level of 
comprehension. Since students who finish the 
vocabulary section are told to go on at once to 
the comprehension section (some will and 
some will not), it would seem that speed scores 
would not be reliable. The selections are good 
in that they cover a wide range of material and 
are reasonably complete in themselves. The 
statements made about them, however, often 
seem dubious. Item 22, Form T, states that 
"the humor in this passage depends on the 
meaning of the word . . .” (five single words 
follow). Humor, in this as in most passages, iS 
too elusive and pervading to be pinned down to 
any single word. Again, Item 33, Form T,. 
states, “This passage is intended principally to 
be (1) humorous (2) informative (3) scien- 
tific (4) apologetic (5) critical.” From the 
passage itself, (1), (2), and (5) are almost» 
equally possible as answers. Similar difficulties 
are presented by Items 21, 22, 31, 84 in Form 
S and by Items 16, 48, 51 in Form T. It would 
be unfortunate were students to infer from 
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such statements that writing which is informa- 
tive or critical cannot at the same time be 
humorous. 

Occasionally, students are expected to know 
more than the selection itself reveals. In Item 
76, Form S, the passage simply does not pro- 
vide the evidence necessary for a reasoned 
choice. Similarly Items 33 to 36, Form S, 
would be difficult for anyone not familiar with 
the terms "romantic" and "classic" in their 
literary significance. In such items, the tests 
are not really objective despite their form. The 
testmaker had an attitude toward the passage; 
the choices he offers to the testee are based on 
that attitude. If the student does not adopt the 
same attitude, his anwers will be scored as 
"wrong" even though they may be entirely 
logical from his own premises. 

The test Mechanics of Expression (three 
subtests on grammatical usage, punctuation and 
capitalization, and spelling; 40 minutes) con- 
sists of a series of proofreading exercises. As 
lias long been pointed out, a student's ability to 
spot errors on the printed page and his ability 
to avoid similar errors in his own writing are 
two different abilities. College freshman Eng- 
lish sections are full of students who have done 
reasonably well on such exercises as these but 
who are still lamentably prone to solecisms al- 
most as bad. Actually, this section of the tests, 
in sharp contrast to the preceding one, seems 
hardly designed for a "higher level." Witness 
the emphasis on such glaring errors as "had 
of," double negatives, and misuse of cases and 
tenses. 

The test Effectiveness of Expression (three 
subtests on sentence structure and style, active 
vocabulary, and organization; 40 minutes) 
provides in the first and last subtests more or 
less jumbled sentences to be compared with 
more correct ones and more or less jumbled 
topics to be arranged in their "best" order. 
Here again, the critical rather than the creative 
faculty is being measured ; there is no necessary 
connection between scores on the test and the 
ability of the students to write accurate sen- 
tences or logically planned themes. If it is 
clearly understood what is being tested, I can 
see no objection to these exercises; but one 
should beware accepting their results as a true 
indication of “effectiveness of expression,” 
which, after all, can only be discovered by giv- 
ing the student a blank page and a pencil and 
setting him to write. 


The second subtest, called Active Vocabulary, 
is a guessing game: the student is given a brief 
definition of some word, the number of letters 
in the word, and five initial letters from which 
he is to select the initial letter of the word. “A 
man who does not belong to the army or navy 
is a—(8)—(1)B (2)C (3)D (4)F (5)G.” 
This is a fearful and wonderful example of 
testing. (2)C, standing for “civilian,” an eight- 
letter word, is presumably the “right” answer. 
But why not (4)F, standing for “farmhand,” 
also an eight-letter word defining one who is 
in neither the army nor the navy? For this 
particular item, (4) F seems indicated, at least. 
This game might be fun on rainy evenings, 
but what it has to do with active, as opposed 
to passive or contemplative or retrospective, 
vocabulary is a bit hard to understand. Since 
the testee does not even have to write out the 
word he has in mind, the element of chance is 
inevitably increased: he may guess the right 
letter without knowing any word, or he may 
have a reasonable word, fitting the definition 
and the number of letters, which will be 
“wrong.” 

The strength of these tests lies in their sec- 
tions on reading comprehension and in their 
comparability; the weakness, in their varying 
attempts to provide substitutes for writing. 


Robert C. Pooley, Professor of English, The 
University of Wisconsin, Madison, Wiscon- 
sin, [Review of Higher Level Forms R, S, and 
T.] These tests are a combination of a reading 
skills test and two English skills tests, me- 
chanics of expression and effectiveness of ex- 
pression, brought together to permit in two 
hours of testing time a rather full analysis and 
diagnosis of English abilities at the high school 
and college levels. The test booklets contain 
answer spaces for numerical responses to be 
scored manually, or the booklets may be used 
with separate answer sheets for machine scor- 
ing. The mechanical construction of the test is 
excellent from the point of view of ease and 
objectivity in scoring and recording results. 
Norms are furnished at the college level for 
each of the parts of the test and for the total 
test for freshmen, sophomores, juniors, and 
seniors at the end of the school year and for 
entering freshmen. Moreover, the norms are 
divided into three groups according to type of 
college or course: I, pre-professional; II, 
liberal arts; III, junior and teachers’ colleges. 
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Similar norms are provided for high schools: 
Group I, end-of-year norms for grades 7 
through 12 in northern high schools; Group 
II, grades 8 through rr for southern high 
schools. There is a sixteen-page manual of 
directions. 

The test Reading Comprehension contains 
two parts: vocabulary—meaning tested by five 
choices, one of which is a synonym of the given 
word; and speed and level of comprehension— 
tested by responses to 17 brief reading selec- 
tions drawn from widely different sources, 
informational, scientific, and literary. The test 
Mechanics of Expression contains 60 items of 
grammatical usage placed in sentences, 45 items 
of punctuation, and 24 items of capitalization, 
the latter two types presented in running 
prose. Spelling is presented in 60 items, each 
in a choice between a misspelled and a correctly 
spelled word. The test Effectiveness of Ex- 
pression contains three parts. Part I measures 
sentence structure and style by the comparison 
of passages of prose placed in parallel columns 
and by an exercise in the choice among four 
versions of the same sentence. Part II is a test 
of active vocabulary in which the student must 
guess the word intended by definition and by 
clues to first letter and length of word. Part 
III measures organization by rearranging dis- 
organized paragraphs and by completing a par- 
tial outline. 

The materials of the tests are well chosen 
and clearly presented. The directions are sim- 
ple and concise and make clear to the student 
the purpose of each test. Dubious and contro- 
versial usage has been avoided; so far as is 
possible in an objective test, the materials of 
English have been cast into natural settings of 
sentences and paragraphs. Mechanics are tested 
functionally rather than in isolation from Eng- 
lish expression. It is one of the best tests avail- 
able in the field of English skills. 

Its principal defect is shared by all other 
objective tests in English: it does not test abil- 
ity in English, if ability is defined as the power 
to use English effectively in speech and writ- 
ing. It does test the power to correct errors, to 
proofread, to organize or reorganize material 
composed by others. It does not test the power 
to compose English and should therefore be 
used cautiously in the placing of students in 
ability groups or in the sectioning and exemp- 
tion of college freshmen. 
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For reviews by Robert M. Bear and J. B. 
Stroud of Tests C1 and C2, see 497. 
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XCorrectness and Effectiveness of Expression: 
Tests of General Educational Development: 
High School Level, Test 1. High school; 104-45; 
IBM; Form B; separate answer sheets must be 
used; $2 per 25; 40¢ per 25 machine-scorable answer 
sheets; 15¢ per scoring key; 25¢ per specimen set, 
postpaid; worklimit (120) minutes; prepared by the 
Examinations Staff of the United States Armed 
Forces Institute; published by the American Council 
on Education; distributed by the Cooperative Test 
Service. (Also distributed by Science Rescarch Asso- 
ciates: $2 per 25; 65¢ per 25 machine-scorable answer 
shects ; 50¢ per key; 50€ per specimen set.) 

For reviews by Herbert S. Conrad and War- 


ren G. Findley of the entire battery, see 20. 
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*Correctness and Effectiveness of Expression: 
Tests of General Educational Development: Col- 
lege Level, Test 1. College; 1944-45; IBM; Form B; 
separate answer shects must be used; $2 per 25; 40¢ 
per 25 machine-scorable answer sheets; 15¢ per scor- 
ing key; 25¢ per specimen sct, postpaid; worklimit 
(120) minutes ; prepared by the Examinations Staff of 
the United States Armed Forces Institute; published 
by the American Council on Education; distributed by 
the Cooperative Test Service. (Also distributed by Sci- 
ence Research Associates: $3 per 25; 65¢ per 25 ma- 
chine-scorable answer sheets; so per key; Sof per 
specimen set.) 

Charlotte W. Croon, Test Research Service, 
1213 McChestney Avenue, Nashville, Tennessee. 
Although this test is one of a battery constructed 
for the special purpose of assisting in the guid- 
ance and placement of those who have been in 
the armed services, there is no reason why its 
use should be thus limited. The content is not 
dependent on military experience, and the 
norms provided are based on regular students 
in three types of colleges who are just complet- 
ing a freshman course in English composition. 

The test is not set up to be diagnostic and 
yields just one score. It is entirely objective 
and can be rapidly scored by hand or machine. 
The first 25 items are spelling ones of the type 
in which the testee selects from four words 
the one which is misspelled or, if he thinks all 
are correct, marks the fifth choice, "none mis- 
spelled." As has often been said, this technique 
does not require him actually to spell anything 
or even to recognize the correct spelling of the 
misspelled words. It does permit the partial 
testing of a large number of words in a mini- 
mum of space and time. The selection of words 
seemed satisfactory to this writer. 

The second part consists of four themes in 
which errors and infelicities of the types listed 
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below are introduced. Places where these occur 
(and a few where they do not) are underlined, 
and the testee is required to select the best of 
four ways to improve the underlined portion. 
This technique has been used many times and 
is always open to the criticism that areas need- 
ing rewriting are inevitably pointed out to the 
testee. To some degree, this objection is met 
by having the first choice identical with what 
is already in the selection and occasionally hav- 
ing that as the right answer. 

A rough analysis of this part yielded the 
following distribution of items: 


Punctuation (including capitalization) . 10 


Grammar . . . LAS PEE. 
IDicBonn TOES o uev e XE arc e 30. 
Diction taste and style . . . . . . . 42 
Paragraph and sentence organization. . 17 


In view of the intent to measure "self-educa- 
tion . . . as opposed to formal education," con- 
siderable emphasis on diction and especially on 
taste and style is desirable. Whether over half 
the items should fall in this category is ques- 
tionable. The use of running text tends to limit 
both the sampling of points tested and the 
methods of testing (e.g., paragraph organiza- 
tion, an essential aspect of effective expression, 
is tested meagerly and indirectly by four items 
in which the desired answer is to leave out 
completely a sentence or clause). Many com- 
mon errors, important enough to justify one 
item, are not touched on. One wonders whether 
the author used an outline in building the test, 
i.e., decided what points to cover and shaped 
his material accordingly, or let his material 
determine the items. 

The first and last themes are interesting, and 
a good deal of ingenuity has been used in devis- 
ing the items, which range from gross errors to 
fairly subtle distinctions. The punctuation and 
grammar items are clear-cut, but as soon as the 
author enters the subjective area of taste and 
style, he is on dangerous ground and should 
have taken every precaution to use only items 
that a large majority of experts will agree upon. 
This part could have profited from thorough- 
going criticism prior to publication, which would 
have eliminated most of the items that have 
debatable answers and ironed out rough places 
in the selections themselves. 

The test is apparently photo-offset from Vari- 
Typed copy, and its appearance and legibility 
are marred by too much use of boldface type. 
There are a few misprints, one of which (in 


Item 61) happens to eliminate the correct 
answer. 

No data are given on reliability or validity 
'The scoring method makes no correction for 
chance, An examination of the percentile norms 
arouses some suspicion regarding the distribu- 
tion of item difficulties and the efficiency and 
reliability of measurement. For example, only 
16 raw-score points separate the 20th and the 
Both percentiles in the Type I colleges. This is 
spreading the items pretty thin and increasing 
the error of measurement over the middle three- 
fifths of the scores. One would judge that there 
are too many easy and too many hard (or 
ambiguous) items. Probably the “correctness” 
items tend to be easy, and the “effectiveness” 
ones hard, There is no mention of the tryout 
of the items or the criteria used in selecting 
them. 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 
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*Cotswold Measurement of Ability II: English. 
Ages 10-11; 1947; 2 forms (Series 1, 2) ; 6s. per 20; 
1s, per manual; 6d. per single copy; 40(50) minutes; 
Sade Fleming; Robert Gibson & Sons (Glasgow), 

td. 

[ 124 ] 
“Dingwall” Test in English Usage. Ages 9-10, 10- 
12; 1037; 1 form; 2 parts; 2 levels; 6d. per manual; 
1s. 3d. per specimen sct, postpaid; prepared hy the 
Educational Institute of Scotland; University of Lon- 
don Press, Ltd. 
a) PUPILS AGED 9-10: PUPILS’ BOOKLET I. 3s. 6d. per 25. 
Lb) PUPILS AGED 9-10: PUPILS’ BOOKLET II. 45. 6d. per 25. 
C) PUPILS AGED 10-12: PUPILS’ BOOKLET I. 35. 6d, per 25. 
d) PUPILS AGED 10-12: PUPILS’ BOOKLET II. 45. 6d. per 25. 


Robert H. Thouless, Reader in Educational 
Psychology, Cambridge University, Cambridge, 
England. These tests are designed to give a 
measure of achievement in ability to write “cor- 
rect” English: to use the past tense, to use the 
correct cases of pronouns, to use appropriate 
conjunctions, to punctuate, to insert capital let- 
ters, etc. Even for the older age range, there 
seems to be no attempt to assess skill in English 
usage as distinct from mere correctness. No use 
is made of alternative answers to be underlined 
or the like, so an amount of writing is required 
which seems undesirable in tests to be done with 
a time limit. The usefulness of the two forms 
is reduced by the fact that some of the test 
items are identical. 

There is no indication of any measurement of 
the validity or reliability of the test; the small 
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number of items in each part would probably 
make these low. Tentative norms are given at 
the end, but it is not stated from how many 
subjects these have been calculated, so one can- 
not judge how accurate they are likely to be. 
In any case, these norms are simply single fig- 
ures indicating the average score for the chrono- 
logical age for which each test was designed. 
There is no measure of scatter or other means 
of assessing how much better or worse a given 
score is than this average. This information is 
insufficient to be of any practical use for anyone 
who wishes to use the tests. 


For review by Charles Fox, see 40:1275. 
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English Survey Test: Ohio Senior Survey Tests. 
Grade 12; 1935-41; Forms A, B; 3¢ per test; 85(90) 
minutes ; S. L. Presscy and Maurice E. Troyer (Form 
B); Ohio Scholarship Tests, Ohio State Department 
of Education. 


REFERENCES 
1. Miter, Dorotny A, A Comparison of Scores on the 


Pressey-Troyer English Survey Test With Point-Hour Ratios 
ip, Business Organization, Accounting and Business Education 
[oce Unpublished master's thesis, Ohio State University, 

Charlotte W. Croon, Test Research Service, 
1213 McChestney Avenue, Nashville, Tennes- 
see. This test yields a total score and five subtest 
scores: grammar, spelling, capitalization, punc- 
tuation, and sentence structure. The maximum 
possible score on each subtest is 80. However, 
the sigmas (estimated from the percentiles for 
Form A) gives a better idea of the weighting 
of the five parts in the total score: grammar I5, 
spelling 16, capitalization 9, punctuation 20, 
and sentence structure I5. 

The two forms were apparently constructed 
to cover about the same material at the same 
difficulty level. The percentile norms for Form A 
apply to pupils before they have had a remedial 
course, and those for Form B apply after this 
course. There is no information given about 
reliability. 

Except for the sentence structure subtest. 
which is entirely objective, the scoring would 
probably have to be done by the teacher, since 
the pupils write in their answers. A correction 
for guessing in the three-choice sentence struc- 
ture section would have been desirable. On 
Form A, a testee who knew 25 items and omitted 
the other 15 would score at the soth percentile ; 
another testee who also knew 25 items but 
guessed on the others would be most likely to 
score at the 75th percentile. 


The items are crowded, and the format is 
poor. In the grammar section of Form A, the 
pupil has to decide which one of three italicized 
words in each line of running text is incorrect. 
The printing is so bad that it is hard to pick 
out the words. 

With 80 items, one would expect good cover- 
age of grammatical usage. Over a third of the 
errors are in verb number. In each form there 
are three items on "to" for "too." There are no 
items on such common errors as "hadn't ought,” 
“of” for "have" in such a phrase as “should 
have gone,” “can’t hardly,” “them” for “those,” 
“these kind” for “this kind,” double negatives, 
double subjects, "let's us,” “use to,” “like” for 
“as,” etc. On the other hand, some of the errors 
included are rarely heard in real life, e.g. “all 
of we cheered” and “I have clumb.” Also there 
are cases where there are two errors in one line 
(Item 1o in A), other cases where the error 
keyed is really not an error (Item 62 in B— 
"none" may be construed as either singular or 
plural), and other cases where an error can be 
corrected in two ways (Item 74 in B—change 
either the “each” to “all” or the “seem” to 
"'seems"). 

The second section of the test is called Spell- 
ing Vocabulary. According to the key, it yields 
a score in spelling only. The items require the 
pupil first to think of a word from a definition 
used in a sentence, e.g., “His life was filled with 
many (disastrous circumstances, fatal happen- 
ings)," and then to write the word. If, in the 
example quoted, "tragedies" did not come to 
mind, the testee would obviously have no chance 
to spell it. This double-barreled technique is not 
in accordance with present principles of test 
construction. 

The other parts of the test are also open to 
many criticisms. Six items in the punctuation 
section of Form A require the pupil to use a 
comma before the third member of a series of 
three, While the writer believes this is prefer- 
able, present usage permits omitting this comma. 
At least one of the punctuation sentences can be 
interpreted two ways (Item 7 in A). A number 
of sentences in the sentence structure section 
are keyed as wrong because a pronoun is used 
to refer to a clause; present usage allows this 
practice. Other items depend on technicalities ; 
the sentence, "She received a letter from her 
employer that was postmarked in Washington," 
is perfectly clear even though "that" does not 
directly follow the word to which it refers. 
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The two forms of this test were copyrighted 
in 1935 and 1937. The decade since then has 
seen many improvements in test-construction 
techniques, such as preparing a careful outline 
of points to be covered, testing for one thing at 
a time, trying out the items experimentally, sub- 
mitting the test to outside criticism before pub- 
lication, etc. There are now several tests in this 
field that are more carefully planned and con- 
structed than this one. 


J. Paul Leonard, President, San Francisco 
State College, San Francisco, California. This 
is one of the senior survey tests of the Ohio 
State Department of Education and is designed 
"to help the high school senior to survey his 
ability in these important elements of educa- 
tion." The makers claim the tests were based 
upon research data but do not specify any defi- 
nitely except the Thorndike word list for the 
spelling test. The test covers grammar, spelling, 
punctuation, capitalization, and sentence struc- 
ture. The grammar test is based upon a running 
story with grammatical errors which the pupil 
is asked to identify and correct. The section on 
punctuation and capitalization is made up of a 
group of sentences without appropriate mark- 
ings, and the pupil is required to correct them. 
The spelling section is composed of sentences, 
in which the word to be spelled is defined but 
not mentioned. The pupil is required to guess 
the word and spell it. The sentence structure 
section is made up of groups of three sentences, 
one of which has some incorrect form. The 
pupil is asked to identify the wrong structure 
but not to correct it. 

The testmakers supply norms in terms of raw 
scores and percentiles. The test is rather easily 
scored, with the exception of the punctuation 
and capitalization section. Very little special 
Statistical work has gone into the test. It is 
chiefly an instrument which any good teacher 
could make, supplied with norms for a large 
group of pupils. In the spelling test the pupil 
may guess other words than the actual one 
desired and will hence be marked incorrect. A 
good deal of time is spent trying to determine 
the word desired. The test is based entirely upon 
mechanical errors and represents nothing novel 
or distinctive in test building. It is a reasonably 
simple, comprehensive instrument to determine 
how well pupils can perform the operations for 
which it calls. 


[ 126 ] 
i E ional Achievement Tests. 
Lo ep inicio Forms A, B; $1.50 per 25 
tests for grades 3-8; $1.75 per 25 tests for grades 7-12; 
15¢ per specimen set; nontimed (40) minutes; Robert 
K. Speer and Samuel Smith; Acorn Publishing Co. 

Harry A. Greene, Director of the Bureau of 
Educational Research and Service and Profes- 
sor of Education, The State University of Iowa, 
Iowa City, Iowa. The test for grades 3-8 is a 
six-page survey test of 100 items. It consists of 
six parts: (a) capitalization, 10 items; (b) 
punctuation, Io items; (c) language usage (sen- 
tence), 20 items; (d) language usage (words), 
30 items; (e) expressing ideas, 20 items; and 
(f) letter writing, 10 items. The test for grades 
7-12 is an eight-page booklet covering six 
phases of language skill: (a) word usage, 30 
items; (b) punctuation, 20 items; (c) vocabu- 
lary, 20 items; (d) language usage—sentences, 
45 items (15 points) ; (e) expressing ideas, 15 
items; (f) expressing feeling, 10 items. A per- 
fect score on this booklet is 110 points. 

The selection of the item content of these 
tests seems to have been based largely on text- 
book analysis and personal judgment. While 
some objection might be registered against some 
of the socially improbable errors listed, the chief 
criticism must be of the extremely limited sam- 
pling in the several skill areas tested. Not only 
are capitalization, punctuation, and usage at 
both the word and sentence levels sampled in- 
adequately, but many skills in these areas that 
receive much instructional emphasis are crowded 
out by a section on expressing ideas which ap- 
pears to be more a reading test than a language 
test. In the general survey form for grades 3-8 
the test of letter writing is extremely brief and 
unreal. 

With the exception of the punctuation sec- 
tions the pupil response is given by recognizing 
and underlining the correct form. In most exer- 
cises three choices are offered. Not only is this 
space-consuming in the booklet, but in some of 
the items the third choice adds very little to the 
point of the exercise. No evidence on item diffi- 
culty or item discrimination is presented. No 
demonstration of the validity of the testing 
techniques is made in the material available to 
this reviewer. 

The tests have a number of unique features, 
one of which is the clever system of marking 
the correct choice under each item by letters 
found only in a specified code word. This ar- 
rangement greatly simplifies the scoring key 
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but does make for a rather eye-tiring scoring 
process. The supplementary material provided 
with the tests is limited to a single printed sheet 
bearing directions to the teacher, the class rec- 
ord, the answer form, and the norms. While 
time limits are not stressed in these tests, they 
are clearly set forth in the directions. Grade 
levels corresponding to point scores earned on 
the tests are the only norms reported. No indi- 
cation is given of the size or nature of the 
population on which the norms are based. No 
statistical evidence of the reliability of the tests 
is reported. 

It is possible that these tests would prove very 
effective if treated as a part of a complete 
achievement test battery; but when isolated as 
subject-matter tests, they seem to lack the defi- 
niteness and reliability which should be expected 
of a useful survey or diagnostic instrument. 


[ 127 ] 
English Usage: Every Pupil Test. Grades 3-6, 7-9, 
10-11; new form usually issued each April and De- 
cember; forms April, December 1946; 2¢ per test; 1¢ 
per answer key; 40(45) minutes; Jessie Bechtel, Helen 
Kaser, and W. G. Findley under the direction of John 
W. Evans; Ohio Scholarship Tests, Ohio State De- 
partment of Education. 
REFERENCES 

1. Epmrnston, R. W., AND GtNGERICH, C. N. “The Relation 
of Factors of English’ Usage to Composition.” J Ed Res 
36:269-71 D '42. * (PA 17:2169) 

$ J. R. Gerberich, Director, Bureau of Educa- 
tional Research and Service, and Professor of 
Education, University of Connecticut, Storrs, 
Connecticut. [Review of the April and De- 
cember 1946 forms.] The Every Pupil Ohio 
Scholarship tests of English usage are issued 
at three levels and in April and December forms 
annually. Both of the 1946 forms for grades 3 
to 6 were constructed by teachers and adminis- 
trators of the New Philadelphia, Ohio, public 
schools, while committees of Toledo junior high 
school teachers and of Mercer County teachers 
constructed the April and December forms re- 
spectively for both the junior high school and 
the senior high school levels. A program of test 
use suggested by the Ohio State Department of 
Education, by which the Every Pupil Tests are 
sponsored, consists of December testing pri- 
marily for diagnostic purposes and of April 
testing for measurement of progress and of 
achievement for three-fourths of the school 
year. The April and December forms for 1946 
here reviewed consequently are not subject to as 
exact comparisons as would have been true of 
forms covering a school year. However, many 


characteristics of the tests commented upon be- 
low are entirely independent of any expected or 
desirable parallelism between the two forms for 
one school year. 

All six of the four-page tests are printed by 
a photo-offset or similar method from what 
were primarily typed originals. With the 40- 
minute total working time and 100 scoring 
points common to all, any observable uniform- 
ity or parallelism ends except in the intermedi- 
ate-grade forms. Here there are five parts, 
allotted 20 points each, in both the April and the 
December forms. Tests for the junior and 
senior high school levels vary in number of 
parts from six to eight, and the tests constructed 
by Toledo junior high school teachers con- 
tain parts including such small numbers of items 
as 4, 5, and 6 and as many items as 35 and 40. 
Although there is no especial merit in complete 
parallelism and certainly there is no optimum 
number of items per test part, this situation 
nevertheless suggests to the reviewer some lack 
of balance among skills distinguished by num- 
bered and named parts. The reviewer has a pre- 
disposition against 100 scoring points on any 
test, for the reason that interpretations based on 
percentages are thereby suggested to persons 
who still believe in percentage marking systems. 

In both intermediate-grade tests, 20 scoring 
points each are allotted to correct usage, spell- 
ing, capital letters, punctuation, and sentence 
structure, with what may be an overemphasis 
upon capitalization and punctuation. However, 
the items in these tests are primarily functional 
rather than formal. Items are alternate response, 
multiple choice, and completion in nature. 

The April tests at the two higher levels con- 
sist predominantly of items requiring correction 
of errors or at least of writing by the examinee. 
Formal grammar to the extent of at least 45 
items of the total 200 is found in parts on sub- 
ject and predicate, parts of speech, phrases and 
clauses, verbals, classification of sentences, and 
sentence sense. Other parts deal functionally 
with word usage, sentence structure, spelling, 
punctuation, and capitalization. 

The December tests at the junior and senior 
high school levels are completion in nature to 
a smaller degree than are the April tests. How- 
ever, formal grammar again accounts for 30 of 
the 200 items, in parts on punctuation and capi- 
talization, parts of speech, and sentence pat- 
terns, whereas a more functional approach is 
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used for parts on word usage, sentences, spell- 
ing, vocabulary, and sentence recognition. 

All six tests make wide use of a completion 
item form which requires writing by the pupil 
of one or more words and in some instances 
also underlining of a capital letter or punctua- 
tion mark in the correction of usage errors. 
The scoring keys provide many instances of 
two, and a few cases even of three, different 
acceptable forms of response to items of this 
type. Elsewhere, such words as preposition, 
conjunction, etc., are to be written by the pupil, 
although in one part he is allowed to use Ph. 
and Cl. for phrase and clause. Certainly many 
standardized tests accomplish similar purposes 
with a greater degree of scorability and econ- 
omy of time. For example, the Cooperative 
English Test A, Mechanics of Expression em- 
ploys a rather complex but highly objective 
technique in measuring ability to detect and 
correct English usage errors. 

This test series makes frequent use of items 
requesting the pupil to distinguish between sen- 
tences and sentence fragments or among com- 
pound sentences, simple sentences, and sentence 
fragments, a type of ability or aptitude often 
called sentence sense. Three designations ap- 
pear at different levels for parts measuring 
this type of ability—sentence structure, sen- 
tence sense, and sentences. Although sentence 
structure is used in this manner at the inter- 
mediate-grade level, the same term in the April 
junior high school test designates items for 
which the pupil is asked to choose the better 
(or best) sentence of two (or three) for mean- 
ing exactly what it says. Furthermore, the sen- 
tence structure part of this junior high school 
test is almost exactly paralleled in the Decem- 
ber senior high school test under the part head- 
ing of sentence recognition. Apparently various 
schools of thought are represented in the titles 
of parts for the different tests of this series. 

Supplementary materials provided for scor- 
ing each test and for item counting the results 
are a mimeographed scoring key and a “Uni- 
versal: Class Error Sheet.” The former is in 
ruled columns designed for strip key use, and 
the latter provides for recording the errors and 
omissions of each pupil on every item of the 
test. Instructions on the error check sheet con- 
flict by recommending in one place that wrong 
answers be indicated by a check mark and in 
another place that wrong answers be indicated 
by a dash line. Provision is made in summary 
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rows and columns of the sheet for obtaining 
individual pupil and entire class data on errors, 
omissions, and correct responses. Additional 
supplementary forms not available to the re- 
viewer are a score distribution sheet and an 
item report sheet to be used by each school in 
forwarding results to the Ohio State Depart- 
ment of Education. 

A supplementary tool for use in the inter- 
pretation of results is the booklet of “Percen- 
tile and Item Norms.” The percentile norms 
for total scores of Ohio pupils are in skeleton 
form—deciles, quartiles, and percentiles 1, 5, 
95, and 99—to the nearest one-tenth of raw 
score values. Interpolations from such tables 
are necessarily rather involved. In a section 
devoted to what are called item norms, per- 
centages of Ohio pupils answering each item 
correctly are reported. The suggestion is made 
that the teacher can compare the percentage of 
success of her pupils on each item with the 
comparable percentage for Ohio pupils in gen- 
eral at the same grade level 

Still another supplementary too!, for use in 
summarizing local results, is the “Universal 
Class Percentile Record Blank.” This blank is 
designed for comparing pupil and class per- 
formance with the state-wide norms and for 
determining pupil and class progress between 
tests. However, progress can be evaluated by 
this device only relatively and primarily within 
the class or grade group, by means of the direc- 
tion and degree of change effected in the 
pupil’s placement relative to state-wide norms 
and to local results. In the absence of some 
form of comparable scores, as embodied in the 
various types of derived scores based on the 
standard deviation, the amount of pupil growth 
is not subject to estimation. Percentile norms 
based on raw scores, used for both December 
and April tests, are valid only for relative 
evaluations of pupil growth. 

Percentile norms for total scores show that 
the highest score made in eight of the ten 
grades covered by the test series was a perfect 
score and that ggth percentiles ranged from 
85.0 for the third grade to 98.7, 99.3, and 99.9 
of a possible roo for the sixth, ninth, and 
twelfth grades respectively. Clearly some pupils 
were not truly measured. At the other extreme, 
Ist percentiles ranged from 15.0 for the third 
grade to 35.0, 43.7, and 45.9 for the sixth, 
ninth, and twelfth grades respectively. The re- 
viewer infers that these tests lack sufficient top 
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for adequate measurement of superior pupils 
in the highest grade at each test level and that 
the tests in general are at a difficulty level con- 
siderably below that usually recommended for 
standardized tests. Perhaps the unfortunate 
tendency, if not desire, of many teachers to 
think and to test in percentage terms is re- 
flected here. 

These tests probably are not standardized, 
in the sense in which that term is ordinarily 
used. At least there is no evidence to indicate 
that the items were put through initial tryouts 
and that those finally included were chosen in 
terms of their difficulty levels and discrimina- 
tive power. Comparability is obtained, more- 
over, only by means of percentile norms. This 
type of comparability is often approximated in 
specificity and for smaller groups by classroom 
teachers who determine marks by some arbi- 
trary method based on the arithmetic mean and 
standard deviation or mean deviation. 

The reviewer's general reaction to these 
tests is that they merit placement somewhere 
between a standardized test and a teacher- 
made informal objective test. He does not be- 
lieve that a very high quality of workmanship 
went into their construction, It seems probable 
that many schools would be able to enlist the 
services of several teachers who could rather 
casually, with a good background of English 
studies but little understanding of objective 
testing techniques, cooperatively construct a 
series of tests of at least equal merit. 


128 
Essentials of SE EA Grades 7-12; 1939-44; 
Forms A, B, C; $1.50 per 25; 35¢ per specimen set, 
postpaid; 45(50) minutes; Dora V. Smith and Con- 
stance M. McCullough ; Educational Test Bureau. 

Charlotte W. Croon, Test Research Service, 
1213 McChestney Avenue, Nashville, Tennes- 
see. This test is attractive, carefully constructed, 
and easy to take. The authors are to be com- 
mended on “confining the contents to essentials 
upon which there is fairly universal agreement 
among English authorities” and on their em- 
phasis on validity as judged by experts in the 
field. 

The test is divided into five parts: Spelling, 
25 items; Grammatical Usage, 44 items; Word 
Usage, 15 items ; Sentence Structure, 20 items; 
and Punctuation and Capitalization, 53 items 
(between one-fourth and one-third test capi- 
talization). The publishers point out that per- 
sons using the test for survey purposes should 


notice that the parts vary in length but say 
that, since there is so little agreement upon the 
relative importance of the various phases of 
English usage, the authors have not attempted 
to apply arbitrary weightings. They apparently 
do not realize that, as soon as they build a test 
composed of several parts, they have inevitably 
weighted the parts in the total score. It would 
be better to weight the parts as they think best 
than to leave the weighting to uncontrolled 
factors. 

The authors minimize the value of the test 
as a survey measure and stress its diagnostic 
aspects, They provide a Diagnostic Key to Er- 
rors, which states the point tested by each 
item and which appears to be a helpful teach- 
ing aid. However, norms are provided giving 
selected percentiles for part and total scores for 
grades 7 through 12 so that it is possible to 
determine roughly where a pupil or class stands 
in relation to pupils in general. 

The correlation coefficient between total 
scores on comparable forms is given as .87 to 
.89. While the writer agrees that undue em- 
phasis has sometimes been put on obtaining 
high coefficients, it should be pointed out that 
the publishers err when they imply that “in- 
troducing items so easy that all pupils would 
recognize them” will alter the “statistical re- 
liability.” It will merely raise all the scores 
equally. No coefficients are given for the par*s. 
The writer wonders whether combining the 
short, and probably unreliable, Part III with 
Part II would not save time for both the testees 
and the scorers with little loss in diagnostic 
value. 

The sampling is good. The punctuation part 
is particularly well constructed to cover the 
points to be tested in a natural and interesting 
way. Most of the scoring would probably have 
to be done by the teacher since only one part 
is completely objective and there are places 
where the scorer's judgment might enter in. It 
may be a little dangerous not to correct for 
guessing on Part IV. A tenth-grade pupil who 
knew eight items and omitted the rest would 
score at about the 20th percentile; another 
pupil who also knew eight items but guessed 
on the others would be most likely to score at 
the soth percentile. The writer noticed only 
one item that was definitely in error, Item 42 
in Part V of Form A. The key requires the 
capitalization of “junior” in the phrase “junior 
class.” In Forms B and C, the authors reverse 
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themselves, and the testee is marked wrong if 
he capitalizes the names of school classes. 

Part IV, although completely objective in 
scoring, is really the most subjective in content. 
"The testee is asked to choose the one of four 
sentences that "most correctively and effec- 
tively states the idea.” In a number of items, 
two or more of the choices state different ideas 
equally correctly, e.g., Item 8 in Form A, Item 
4 in Form B, Item 7 in Form C. There are 
a few other items where the testee will find 
himself trying to select what the testmaker 
wants rather than being able to pick out one 
sentence as clearly superior, e.g., Items 9 and 
15 in Form C, Items 9 and 20 in Form B. 

The directions for administering might be 
improved by requiring the testees to go on to 
succeeding parts at appropriate intervals. In 
many classes, there are likely to be a few slow 
pupils who will spend an undue amount of 
time on the first parts of a test unless they are 
forced to move on. If the testing time were 
40 minutes instead of 45, the test could be 
given in the average school period. 

On the whole, the authors have done an ex- 
cellent job in their avowed purpose of provid- 
ing "teaching materials designed to orient the 
teacher with her instructional job in each class." 
The writer believes that this test will have its 
greatest value where the individual teacher is 
interested in giving it and is willing to do the 
scoring and interpret the results herself. 


Gerald V. Lannholm, Assistant Director, 
The Graduate Record Examination, New York, 
New York. In their Manual of Directions, the 
authors state that this test may be used as a 
survey test. They also state that its chief value 
“probably lies in its diagnosis of individual 
strengths and deficiencies in the English abili- 
ties.” A diagnostic key to errors helps to serve 
the diagnostic function of the test. Percentile 
norms based on total scores are presented for 
each of the grade levels for which the test is 
said to be appropriate. 

The abilities which the test purports to 
measure are implied by the names of the five 
parts of the test: Spelling, Grammatical Usage, 
Word Usage, Sentence Structure, and Punc- 
tuation and Capitalization. The number of 
items in these parts varies from 15 in the word 
usage section to 53 in the punctuation and 
capitalization section. The items in the test are 
said to be based on skills selected according to 


the findings of studies of frequencies of usage 
and of errors. These studies are neither iden- 
tified nor described. Eliminated from consid- 
eration for selection were skills upon which 
authorities disagree regarding the correct usage. 

The authors minimize the value of a statis- 
tical evaluation of a test’s effectiveness. How- 
ever, they admit selecting “items which dis- 
criminate between students scoring high and 
those scoring low.” They report that the cor- 
relation between scores on the various forms 
was found to be between .87 and .89. 

That the authors recognized the importance 
of a close relationship between the purpose of 
a test and the nature of the task imposed upon 
the examinee is evident in their selection of 
testing techniques. In all but one of the five 
parts of the test, the examinee must make a 
written response to the items. This reviewer 
suggests that greater reality might have been 
achieved without increasing the difficulty of 
scoring if other techniques had been employed 
for measuring certain of the abilities. For ex- 
ample, the spelling test requires the examinee 
to spell only those words which he thinks are 
spelled incorrectly in the test copy. In the case 
of such words, the pupil must first examine the 
distorted spelling presented and determine 
what word it represents. Use of the list-dicta- 
tion technique would eliminate this possible 
source of ambiguity and introduce a greater 
degree of reality, without increasing the scor- 
ing difficulty. 

Ability in grammatical usage is measured by 
having the pupil evaluate the correctness of 
designated words or groups of words, some of 
which are presented correctly and some incor- 
rectly. He is directed to correct those which 
he thinks are incorrect. A similar technique is 
employed in the word usage section. In the 
sentence structure section, each item presents 
four variations of the same thought. The pupil 
is to select the best of these statements. In the 
punctuation and capitalization section, mate- 
rial is presented without punctuation except for 
periods at the ends of some sentences. Some 
words are capitalized; some are not. At each 
of a number of underlined places, the pupil is 
to insert punctuation marks or capital letters if 
he thinks they are needed. 

This reviewer considers misleading the prac- 
tice of test authors generally to classify as 
diagnostic almost any test that covers more 
than one small phase of any field of learning. 
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If this practice persists, it is imperative that 
those who select and use tests be aware of the 
fact that tests vary greatly in the degree to 
which they may be of value in the process of 
diagnosis. At best, a test may assist in the 
identification of the specific strengths and 
weaknesses of individual pupils. The effective- 
ness of a test for accomplishing even this one 
step in a program of diagnosis will be limited 
to the extent that it fails to include a complete 
coverage of the specific skills involved. The 
effectiveness of the testing techniques em- 
ployed is another limiting factor. 

The Essentials of English Tests contain a 
total of 157 items. In the area of punctuation 
alone, authorities have identified nearly that 
many different usages, with several times that 
many variants. This test meets the criterion of 
complete coverage of skills more adequately 
than many other tests on the market. However, 
it still represents only a relatively small portion 
of the field it purports to cover. It is this re- 
viewer’s considered opinion that, despite the 
claim of the authors that their test has only 
incidental value for survey purposes, the test 
will be more useful for survey than for diag- 
nostic purposes. With the substitution of im- 
proved testing techniques and the adjustment 
of the test's difficulty level to a smaller grade 
range, the test might well serve as a useful 
instrument for the measurement of general 
achievement in these aspects of correct writing. 


Ed Res B 2r:27-8 Ja 14 '42. William J. 
Jones. * no norms are reported for the college 
Freshmen * Each form is composed of 157 
items divided into five different parts as fol- 
lows: spelling, grammatical usage, word usage, 
sentence structure, punctuation and capitaliza- 
tion. More than one-third of the items are in 
the last part. This is probably too much em- 
phasis on punctuation and capitalization if the 
test is to be used for survey purposes, as the 
authors suggest it may be used. Only 15 items 
are included on word usage, moreover. The 
content of the test was selected "according to 
the findings of studies concerning frequency 
of use and frequency of error." A hint con- 
cerning the way in which the authors expect 
their test to be used is found in the accompany- 
ing manual wherein they state that they have 
confined the "contents to essentials upon which 
there is fairly universal agreement among Eng- 
lish authorities," believing that the presence in 


English tests of items upon which authorities 
are not agreed “has an undesirable effect upon 
current practice in the teaching of English." 
This seems to imply that the test items should 
be the guide for teaching practice. The tail 
wags the dog again! Although the authors 
claim that "the chief value of the examination 
probably lies in its diagnosis of individual 
strengths and deficiencies in the English abili- 
ties tested," reliabilities for part scores are not 
reported. The probable error of a test score for 
some of the parts may be high, however, for 
in the accompanying norms there are instances 
where only 1 point change in part score makes 
a percentile difference of 25. The reliability of 
the total score is reported as .89. The test is 
well printed and seems easy to take. The scor- 
ing stencil for Part V is ill-fitting and various 
answers are acceptable for certain items in the 
other parts. 

[ 129 ] 
*Examination in English: College Level. First 
year; 1944; 1 form (usually called Form B);2 parts; 
separate answer sheets must be used; $2 per 25 of 
either part; 180( Vox E prepared by the Exami- 
nations Staff of the United States Armed Forces Insti- 
tute; published by the American Council on Educa- 
tion; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates : $2 per 25 
of either part; 6sé per 25 machine-scorable answer 
sheets of either part; 50¢ per key for either part; 50¢ 
per specimen set of either part.) 
d) BOOK I, READING AND LITERARY ACQUAINTANCE. Form 
CEn-2-B-4; 75¢ per 25 sets of machine-scorable an- 
swer sheets; 30¢ per scoring key; 35¢ per specimen 
set, postpaid; 120(125) minutes. d 
b) BOOK 1t, COMPOSITION. For administration only to 
students who have first taken Book I; Form CEn-3- 
B-4; 40€ per 25 machine-scorable answer sheets; 15é 
per scoring key ; 25¢ per specimen set, postpaid ; 60(65) 
minutes. 

John S. Diekhoff, Acting Chairman, Depart- 
ment of English, Queens College, Flushing, 
New York. Part I of this USAFI test has vir- 
tues which are not always found in such tests 
and which deserve special commendation. The 
four passages upon which questions are based 
are worth anybody's reading, for they are se- 
lected from Thoreau’s Walden, from the Gos- 
pel according to St. Matthew, from Fitzger- 
ald’s Rubdiydt, and from Pater’s The Renais- 
sance. The passages themselves are well chosen. 
They present a variety of reading problems 
and are yet sufficiently alike in content to pro- 
vide a basis for thought-provoking comparative 
questions. The passages are sufficiently difficult 
and the questions are sufficiently searching to 
tax the reading ability of very good under- 
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graduates and to stretch their minds. The test 
will enhance the students' skill in reading 
whether or not it succeeds in measuring that 
skill—and I think it will measure it, too, If the 
selections and the questions are thought by 
some to demand too much of undergraduates, 
that is a good fault and a rare one in a time of 
relaxing educational standards. 

The second part of the test will no doubt 
perform its function of measuring “literary 
acquaintance,” but this is not so important a 
function. It is important to know whether a 
student has read Aristotle, Boswell, Corneille, 
and Zola. It is perhaps more important to know 
whether he can read them intelligently. It is 
certainly less important to know whether he 
has ever heard of them and whether he knows 
which is “classic,” which is a biographer, which 
is a dramatist, and which is a “realist.” This 
mere hearsay acquaintance, however, is what 
is measured by the test. Granted the limited 
objective, this portion of the test is good of 
its kind. 

One may say the same of Book II, Part III, 
the test in composition. But I am doubtful of 
the ability of this kind of test to measure the 
really important composition skills—doubtful 
whether any objective-type test yet developed 
will do the job. It is one thing to discover 
whether a student recognizes correct or incor- 
rect writing, relatively effective or ineffective 
writing, in what he reads for that purpose; it 
is another thing to discover whether he can 
plan and organize a composition and then write 
it correctly and effectively. No doubt the two 
skills are closely related, but they are not the 
same. Few teachers of composition will be 
willing to substitute the USAFI or any other 
objective test for theme reading. Many, seek- 
ing a means of supplementing and correcting 
their “subjective” judgment, will be glad to 
use the USAFI test to measure the student's 
ability to analyze the syntax, style, and struc- 
ture of the writing of others. 
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XExamination in English: High School Level. 
Grades 10-12; 1944; IBM; 1 form (usually called 
Form B); 2 parts; separate answer sheets must be 
used; $2 per 25 of either parts; 40¢ per 25 machine- 
scorable answer sheets for either part; 15¢ per key for 
either part; 25¢ per specimen set of either part, post- 
paid; 180(190) minutes; prepared by the Examina- 
tions Staff of the United States Armed Forces Insti- 
tute; distributed by Cooperative Test Service and 
Science Research Associates. 

@) BOOK I, READING AND INTERPRETATION OF LITERATURE 
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AND LITERARY ACQUAINTANCE. Form SR and L-1-B-4; 
120(125) minutes. Medias 
b) BooK 1, coMPosrrION. For administration only to 
students who have first taken Book 1; Form SCp-2- 
B-4; 60(65) minutes. 
REFERENCES 
1, WHITE, VERNA, AND ENOCHS, J. B. "T 
and Interpretation of Literature.” Engl J 33 
2. Wuite, Verna. The Construction of 1 H 
School Students in the Reading and Interpretation of Litera- 
ture. Unpublished doctor's thesis, University of Chicago, 1945. 
3. Wuite, Verwa. "Measuring Achievement in. High-School 
English." Sch R 55:474-83 O '47. * (PA 21:4173) íf 
4 Wuite, Verna. "Measuring Competence in Ye 


Armed Services Personnel.” Sch R 55:345-55 Je ^47." 
21:1449) 

Holland Roberts, Educational Director, Cali- 
fornia Labor School, Inc., San Francisco, 
California. These two wartime tests are mod- 
ern in the sense that they not only reflect the 
conscious educational philosophy of the test- 
makers but in their construction form a unified 
program that anticipates the wide variation in 
conditions under which they may be used. A 
major effort has been made to insure that the 
tests are not mechanically applied, but that 
through comparative interpretations they are 
employed to meet the real needs of teachers 
and students. Adaptation to particular schools 
and systems has been provided for by planning 
the tests so that the results may be interpreted 
on the basis of the development of each stu- 
dent and be used in educational planning and 
individual guidance. 

This provision for the great differences that 
exist between and within communities is an 
important advance in test construction and out- 
modes a large body of materials now used in 
educational measurement. Teachers who have 
recognized the inadequacy of the Procrustean 
tests that have dominated measurement can 
look forward to a wave of new development in 
the entire field of tests and measures. Testing 
in the past has often created a crisis in the 
minds of teachers and students, and at the 
same time a sense of fatality. If the results 
reached national norms, everyone relaxed and 
the program in operation was approved; when 
they were below national norms, the whip was 
applied and/or the program re-examined. In 
either case the test results were generally en- 
tombed in the files; when they have been 
referred to in counseling, the data have some- 
times been considered in the light of a revela- 
tion from Mount Sinai. 

These two tests, the one in literature and 
the other in composition, are designed to be 
administered as a unit. The literature test can 
be used alone but since the composition test is 
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based upon the reading of the literature it 
cannot be used separately. Taken together they 
constitute the beginning of a core program in 
English. 

The two-hour test, Reading and Interpreta- 
tion of Literature and Literary Acquaintance, 
is organized in the two main divisions stated 
in the title. Part I on reading and interpreta- 
tion is based upon four modern selections: 
two prose and two poetry, all centering on sig- 
nificant aspects of the meaning of America: 
"Keystones of Our National Unity" (prose); 
Whitman’s famous poem, “Thou Mother With 
Thy Equal Brood”; “Western Star" (poem); 
and “How I Found America” (prose). These 
choices of core materials are distinguished for 
their literary quality, dramatic unity of theme, 
and interest and significance to the students as 
responsible members of our democracy. They 
are in sharp contrast to the miscellany of dull 
and unimportant scraps which are often pre- 
sented to young people in reading and literature 
tests, for their subject matter brings the stu- 
dent to grips with key issues of our day in 
terms of his own responses to Americanism. 

Here are eloquent statements of our great 
unfinished democratic experiment in discover- 
ing the ability of men of all nations, races, and 
creeds to rule themselves, presented in vivid 
scholarly prose, in sweeping, imaginative 
poetry, and in the passionate outpourings of an 
East Side immigrant girl caught in the tentacles 
of the New York sweatshop system of a gen- 
eration ago. 

The importance of content of this character, 
arranged in a pattern of fundamental meaning 
which is itself a vital part of the daily lives 
of the students, is difficult to overemphasize. 
When test material is chosen and organized 
according to such a standard, tests and the 
whole process of measurement reach an en- 
tirely new conception and make possible great 
qualitative as well as quantitative advances in 
educational achievement. 

The multiple-choice questions that follow 
each of the selections make effective teaching 
use of the content. Many of the ninety exer- 
cises based on the selections "represent prob- 
lem situations involving for their successful 
solution applications of knowledge and the 
functioning of critical reflective thinking." 
Ability to understand relationships and inte- 
grate them in thinking is tested rather than the 
older reliance upon simple recall of facts of no 


relevance to the student. A part of the testing 
provides for comparative thinking on all four 
selections. 

Part II tests literary acquaintance with 60 
multiple-choice exercises based upon literature 
commonly read and studied in elementary and 
high schools. Some are pure memory questions 
of fact, but many depend upon such central 
understanding of the classics as: 


Macbeth deals fundamentally with the 
A. relations between husband and wife s 
B. murder instigated by Lady Macbeth 
C. effect upon man of his ambition to rule 
over his fellows 
D. crueltv of Macbeth 


The authors have also made advances in an- 
ticipating the uses of the test. They urge that 
teachers should “evaluate the (test) exercises 
in relation to the objectives of their own 
courses” and establish norms suitable for their 
own needs. A method for setting up local 
norms, with specific directions for translating 
the test scores into grades is presented, so that 
fair comparisons may be made with the norms 
given in the manual. As the authors point out, 
uncritical acceptance of published norms re- 
sults in an exaggerated estimate of the abilities 
of students in schools where educational stand- 
ards are high and, conversely, depreciation of 
student ability in schools of low scholastic 
status. They note that effective educational 
guidance and placement depend upon care in 
interpretation of part scores in relationship to 
courses “immediately in prospect.” 

Thorough, practical directions and materials 
are provided for administering, machine or 
hand scoring the tests, and interpreting the 
scores on a variety of bases. The norms are 
given in percentile tables so that the standing 
of any student, group, or class may be com- 
pared with others for the parts of the test as 
well as in toto. Such study of relative standing 
makes it possible to evaluate and reorient the 
teaching program to emphasize the objectives 
set up. 

Part I is too rigorous for low ability groups, 
and the test as a whole is not suitable for re- 
tarded readers; but outside these areas it should 
prove of marked value for schools interested in 
developing higher standards of work through 
modern measurement. 

Book II, Composition, is an hour and a half 
test based entirely upon three short papers 
written by high school students who had just 
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taken the reading examination reviewed above. 
This reading furnished the subject matter for 
their composition. In writing their commen- 
taries in these papers the students were guided 
by five key questions, such as “How are the 
ideas of these authors alike? How different?" ; 
but they were instructed to prepare unified, 
well-organized material and to avoid making 
their statements a series of disjointed answers. 
In this they had varying success. 

- In taking this composition test, the student 
first reads each theme and then works out a 
series of multiple-answer exercises which in- 
volve judgments on its organization and style. 
After all three themes have been considered in 
this way, comparisons are made of their rela- 
tive effectiveness in style, organization, natural 
use of experience, and comprehensiveness, As 
there are no established standards for any of 
these items, this section of the test is highly 
subjective and controversial. The foundation 
work for measurement in this field has not yet 
been done, 

The second part of the work is a usage test 
in grammar, punctuation, and word choice 
constructed by a special reprinting of the three 
student themes. Sentences and phrases are re- 
printed in the left-hand column of each page, 
with some part underlined and numbered. In 
the right-hand column, two, three, or four alter- 
native ways of writing the underlined portion 
are presented. The first choice is the orig- 
inal expression of the student, There is origi- 
nality in this effort to test composition ability 
under conditions closely paralleling a natural 
life situation. It has definite advantages over 
the series of disconnected, isolated sentences 
often used for the same purpose. However, it 
should be clearly understood that all such exer- 
cises are concerned primarily with composition 
form. Basically they are proofreading tests and 
do not attempt to measure the power to com- 
pose. Composition power is concerned with 
conceiving and formulating ideas and action, 
with content—of which form is an expression. 
Proofreading tests have their uses, but the 
critical skill required to select preferred usages 
from someone else’s writing can be at best 
only a minor contribution to original composi- 
tion. It has not even been proved that proofing 
skill is an adequate test of ability to use ac- 
cepted forms and usages in writing. 

Considered, however, not as a complete com- 
position but as a proofreading test, these exer- 


cises have definite advantages. They are based 
upon actual student writing and therefore 
present real problems of usage as they occur in 
daily student life rather than teacher-devised 
problems stated in adult language. This is an 
important psychological consideration generally 
neglected in the English classroom. 

Taken as a whole, the major advantage of 
this test is its continuity of significant thought. 
To center a complete test upon the building of 
unity among the diverse peoples of America 
establishes a new high standard and a mile- 
stone in measurement. Tests constructed from 
such an outlook can become an integral part of 
creative teaching and contribute to individual 
and social integration. 


Louis C. Zahner, Head of English Depart- 
ment, Groton School, Groton, Massachusetts. 
It is to the credit of the United States Armed 
Forces Institute that it disclaims any attempt 
to impose standards upon schools and colleges. 
Furthermore, it emphasizes the importance of 
local validation of its tests, even advising sub- 
jective analysis and judgment. This wisely 
moderate and all too rare view of the limita- 
tions of an objective test and of its proper 
function and application becomes even more 
striking when one considers the research and 
expert knowledge that have gone into the con- 
struction of the test and its percentile norms. 

Book I departs from the usual technique of 
presenting many snippets out of context for 
interpretation, and instead bases its items on 
four long and self-sufficient passages, two of 
prose and two of poetry, each developing dif- 
ferent aspects of a common theme. The au- 
thors make full use of the additional testing 
areas opened by this basic organization. 

This part of the test is not entirely free 
from the faults of detail common to its type. 
Some of the keyed responses are too obvious, 
others are debatable, others can be detected by 
a process of elimination. Some keyed and 
wrong responses are ambiguously worded. To 
the usual assumptions of literary values which 
the successful test-taker must accept, moreover, 
are added social and political assumptions hav- 
ing to do with the common theme, American 
Democracy. 

The more general problem raised by this 
type of test is brought no nearer to solution: 
the psychological processes at work in taking 
an objective test in reading are not those called 
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into play in effective reading. It is one thing 
to select a ready-made response; it is quite an- 
other for a reader to ask himself pertinent 
questions and to make his own responses, Close 
correlation between the two, if it exists, has 
not been clearly established and must be ac- 
cepted, if at all, on faith. 

The section on literary acquaintance is like- 
wise open to the common criticisms of this 
type of test, When the items move off the 
ground of simple and relatively insignificant 
fact, they must resort to oversimplified judg- 
ments and stock tags, such as the statement 
that "Macbeth deals fundamentally with the 
effect upon man of his ambition to rule over 
his fellows.” Since some of the unkeyed re- 
sponses in such items are tenable answers, each 
representing a part of a situation too complex 
to be reduced to a single phrase, such an item 
might better be constructed for elimination of 
one wholly impertinent response. 

In items dealing with three or four books 


together, the keyed response can often be found: 


from one book alone. For instance, in an item 
dealing with Jane Addams, Eve Curie, and 
Pasteur three of the four responses can be 
eliminated by anyone who knows only that 
Jane Addams was a social worker. 

Booklet II of this test, on composition, sets 
up three student themes and asks questions 
about them ranging from organizati n to de- 
tails of diction and punctuation. It is a thor- 
ough test of critical judgment applied to writ- 
ing at the high school student's level as a 
writer, Whether a student who can accurately 
identify a possible error that has been isolated 
for his attention or select the best of several 
improvements composed for him could create, 
write, criticize, and revise his own composition 
with equal proficiency remains an open ques- 
tion. In spite of occasional weak spots, how- 
ever, such as inclusion of responses that are 
highly debatable or not mutually exclusive, the 
formal knowledge of the principles of writing 
generally taught at the high school level are 
more ingeniously and fully tested here than in 
most other commercial tests. 

In fact, this series of tests as a whole, if 
taken and interpreted as the authors advise and 
not as “the be-all and the end-all,” should prove 
to be a useful adjunct to any good high school 
or even college freshman course in English. 
The authors have made a notable contribution 
to testing in this field. 
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postpaid; $1 per 100 machine-scorable answer sheets ; 
15¢ per stencil for scoring answer sheets; 45(50) min- 
utes; revised by M. F. Carpenter and D. B. Stuit. 
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Robert C. Pooley, Professor of English, 
The University of Wisconsin, Madison, Wis- 
consin. This is a 4-page test of English skills 
in spelling, punctuation, sentence structure, and 
correct usage. Forms A and B are furnished 
with an answer key for manual scoring. Norms 
are presented in the form of percentiles undif- 
ferentiated for grade levels. Instructions for 
administration are covered by the general man- 
ual for the Iowa Placement Examinations. 

Part I is a test of spelling. A list of 75 words 
is presented, of which 25 are misspelled. Space 
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tory. The marking is-bighly elaborate and time- 
- ie yee coi hence limits the practical utility 
ic 


of the test material. Nevertheless the tésts, first 
published in 1940, have run into four editions. 
Norms are given for the first four subtests 

in 5 of averages of 1,200 pupils. The mean 
scores are provided for each age group 8+ to 
. 14+, thus enabling the scores for individual 
«pupils to be expressed in equivalent ages, which 


$ may, be averaged to yield an "English" age. 


3 ee t H. Thouless, Reader in Educational 
: ji ology, Cambridge University, Cambridge, 
ngland. ‘This test is intended for two pur- 
poses; for diagnosing directions of weakness 
in the use of English and for obtaining an es- 
timate of ability in the use of English. For the 
first purpose it is recommended that the test 
_ should be given without time limit; a time 
limit is used, however, for the second purpose. 
* The parts of the test include English usage, 


"capital letters and punctuation, vocabulary, 


sentence structure, and the writing of an essay. 
Objectivity of marking is attained in all except 
the last of these parts, but the assessment of 


— ability in English can be made without the 
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essay since norms are given separately for each 
part, There is an odd discrepancy between the 
test itself and the handbook of instructions as 
to what is to be the subject of the essay, 
The test items are graded in difficulty so that 
the test can be used for a wide age range 


. (given as 9+ to 16+), Norms are given as 


Average performance for ages from 8+ to 
14+. These norms are stated to be tentative, 
but since they have been derived from 1,260 
- pupils, they should be reasonably reliable. The 
- validity and reliability of the test have been 
determined by the author and are satisfactory. 
It appears to be a well-constructed test which 
should prove useful, 
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English Test, Which in the course of 160 mul- 
tiple-choice items ‘divided into four subtests 
coyers the topics of reading, words, literature, 
and grammar, was published in 1931, Even 
then, it was in all probability a mediocre test. 

Detailed criticism of the items would take 
too long. Many of the reading-comprchension 
items are trivial. Part 2, called Words, con- 
taining both spelling items and items dealing 
with the correct choice of words, is expressed 
in highflown language with many of the defi- 
nitions inexact. The literature items call mainly 
for memory of authors and story details and 
for the identification of quotations. The gram- 
mar items are formal and superficial, many of 
them calling for no more than recognition of 
definitions and identification of parts of speech. 

The manual provided consists of one sheet, 
which contains a description of the test, direc- 
tions for administering it, data on reliability 
and validity, and norms. The directions, com- 
bined with the awkward arrangement of the 
test booklet, are unnecessarily confusing. While 
the "chance-halves" reliability of the total test 
seems satisfactory for both forms (over .90), 
the validity data, consisting of correlations 
with first-semester grades for freshman at the 
University of Oklahoma, are somewhat diffi- 
cult to interpret, The correlation between test 
scores and grades in English is .46 while the 
corresponding correlations with grades in mod- 
ern languages and social science are .53 and 
48, Perhaps the test is a better measure of 
verbal ability than of knowledge of “English.” 
Norms based on 11,632 students from 19 states 
(but apparently all in Oklahoma or Florida 
colleges) are given without distinction as to 
the form of the test. 

A review of a test as old as this one (ap- 
parently no research has been undertaken, nor 
any new forms published, since 1931) would 
scarcely seem necessary but for the fact that 
this test appears in the current test catalogue 
of a reputable company and, unfortunately, is 
representative of a large group of tests which 
continue to be listed in such catalogues even 
though their contents and the methods by which 
they were constructed have long been outmoded. 
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Chester W. Harris, Assistant Professor of 
Education, The University of Chicago, Chicago, 
Illinois. Many of the obvious weaknesses of the 
Barrett-Ryan Literature Test are suggested by 
the copyright date of 1933; tests approximately 
fifteen years old are likely to be inappropriate 
from the standpoint of present-day teaching and 
evaluation. Each form includes 120 multiple- 
choice questions on 35 “classics,” These selec- 
tions are "the ones which were reported most 
frequently by a large number of high school 
English teachers as being studied in their 
classes," This survey must have been made in 
the early 1930's at the latest, if the copyright 
date is accurate, The "classics," as the content 
of these tests defines them, consist primarily of 
nineteenth-century American and English writ- 
ings. Chaucer, Shakespeare, Milton, Bunyan, 
and a few eighteenth-century authors also are 
represented, but only one selection from the 
twentieth century (Frost's "Mending Wall") 
appears. This method of determining the con- 
tent of a test usually results in creating a test 
whose content is inappropriate for use in any 
particular teaching situation. 

The test is designed to measure "understand- 
ing and knowledge of literature." The test items 
indicate that "understanding and knowledge" 
means only the memory of characters’ names, of 
quotations, of who did what, where, and when, 
and of hackneyed pronouncements about form 
or style or “theme.” The test is clearly a quiz, in 
the sense in which present-day radio programs 
use that term. Use of this test as the means of 
evaluating student achievement in literature 
would assume that determining to what extent 
the student remembers the specified points about 
these particular writings gives a valid index of 


oh 


it is lil d 
suggest to teachers and students Men quor, 
tionable interpretations of literature. The trivi 

ity of some of the items is objectionable] the 
student must tell, for example, whether Eyange- 
line traveled on the Thames, Tiber, Amazon, or 
Mississippi River (Item 28, Form A), and 
whether Bryant's water fowl is "on the water, 
in the air, in a tree, or on the ground" (Item. 
110, Form A), The key indicates that King Lear 
is properly regarded as "a foolish old man" 
(Item 55, Form A) and that the theme of 
"Crossing the Bar" is "death" (Item 13, Form 
A). Sensitive students of literature should ob- 
ject strenuously to such interpretations, The 
obvious bad writing in the test is not worthy 
of test authors who are members of a college 
English department. Item 60, Form A, when 
"correctly" answered, reads: "The first episode 
in the story is the dying of a horse"; Item 75, 
Form A, states: "An important element if a_ 
woman disguises as a man." Apparently, the 


"numerous revisions" claimed for the items — 


were made without benefit of first-rate criticism, 


The three forms are advertised as equivalent; — 


yet the only statistical evidence given is one 
correlation (between Form A and Form C) of 
.78 for an unspecified number of college stu- 
dents. On the basis of this evidence, most 
students of measurement would reject the con- 


clusion. Examination of the selections indicates ~ 


that the three forms are far from equivalent in 
content. Sixty-five different selections are in- 
cluded in the forms; of these, thirty are com- 
mon to two or more of the forms, Each form 
therefore includes a set of selections A: 
from that of each of the other forms. The re- — 
ported correlation of .78 supports^the 


tion that familiarity with one set of selections — 
would not be highly related to familiarity with —. 


another set. High reliability (.93 to .94) is 
claimed for each form. Validity is indicated by 
= 
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correlations—again based on unspecified num- 
bers of cases—between test scores and certain 
“estimates” made by teachers. The lack of perti- 
nent information needed to evaluate these cor- 
relations—such as estimates of reliability of the 
teacher judgments, for example—gives one little 
confidence in these numbers as evidence of 
validity. The lack of curricular validity has al- 
ready been emphasized. 
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12, 13-16; 1932-35; 1 form; 3 levels; separate answer 
sheets must be used; $1.50 per 25; 25 per 25 answer 
sheets; 35¢ per specimen set, postpaid; nontimed (35) 
minutes; Herbert A. Carroll; Educational Test Bu- 
reau, Inc. 


ay REFERENCES 
1. Carnot, HxmsERT A, “A Standardized Test of Prose 


Appreciation for Junior High School Pupils.” J Ed Psychol 
M ELM ayes TOS standardized Test of Prose Ap- 
preciation for Senior High School Pupils." J Ed Psychol 23: 
401-10 S '32. * (PA 7:347) 

3. CARROLL, ERBERT A, “A Method of Measuring Prose 
Appreciation.” Engl J 22:184-9 Mr '33. * (PA 7:3561) 

4. CARROLL, Herpert A. "Appreciation of Literature and 
Abstract Intelligence." J Ed Psychol 25:547 Ja '34. * (PA 
8:2659) 

Chester W. Harris, Assistant Professor of 
Education, The University of Chicago, Chicago, 
Illinois. Prospective users of a test should de- 
mand that a test manual present at the least: 
explicit definitions of what the test is expected 
to measure, descriptions of how the test was 
constructed, and relevant data that enable one 
to make judgments of the test as a measuring 
instrument. Carroll is to be commended for 
including these minimum essentials in his man- 
uals. He singles out for measurement the “power 
to differentiate the good from the less good, and 
the less good from the very bad." His hypothe- 
sis—clearly stated—is that this power to differ- 
entiate constitutes a crucial index of the ability 
to appreciate prose ; the term appreciation in the 
title of the test is therefore given a definite 
meaning. (See my review of English: Under- 
standing and Appreciation of Poetry for a com- 
ment on possible meanings of appreciation.) 
The method of constructing the test and the 
sources of the materials used are described, and 
data secured from studies of the test as a meas- 
uring instrument are presented. To censure the 
test because it does not measure all aspects of 
appreciation that can be defined would be de- 
cidedly unfair. Carroll claims only to measure 
one index of appreciation; the score yielded by 
the test is designed to indicate the extent of the 

ent’s agreement, in his ranking of selections, 
with the criterion used. 


An important assumption on which any meas- 
urement of this power to differentiate rests is 
that the terms “good,” “less good,” and “very 
bad” can be so defined as to secure agreement 
among competent judges regarding the proper 
category for each of the prose selections com- 
pared. Carroll’s technique for securing written 
materials that are likely to be judged in this 
consistent fashion is that of drawing on three 
sources that presumably differ widely in literary 
merit for three of the four passages to be 
compared and of using a mutilation as the 
fourth. In this fashion, he has built a number 
of sets of exercises, each consisting of four 
passages and each to be responded to by ranking 
the four passages in order of literary merit. In 
each set, the first choice was drawn from a book 
regarded as of excellent quality, the second from 
books generally regarded as of poor quality, 
the third from the “less literary” magazines, and 
the fourth from mutilations. Data regarding the 
agreement of a group of judges (not named) 
among themselves and with the criterion of 
source are given for the college form. For some 
of the sets, the agreement both among judges 
and between judges and source is quite close; 
for others, the same selection has been placed in 
all four positions of merit by one or more of 
the judges. Carroll's standard for including a 
set is that a consensus of opinion of the judges 
must agree with the ranking given by the source. 
For one of the four passages of a set, this con- 
sensus may represent as few as 50 or as many 
as 100 per cent of the judges. The objectivity of 
the key therefore varies from set to set. 

Reliability estimates for the junior and the 
senior high school forms are given as .70 and 
71; no estimate is given for the college form. 
Estimates of this magnitude indicate an impor- 
tant limitation of the test as an instrument for 
the appraisal of an individual's power to dif- 
ferentiate. The shortness of the passages gives 
rise to a further limitation, The judgment of 
the literary merit of a novel, for example, might 
reasonably consider the development of char- 
acter, the various principles of organization that 
are brought together to form a plot, and the like. 
A short passage of from 100 to 300 words taken 
from that novel is unlikely to provide a basis 
for such judgments. As Carroll acknowledges, 
the basis for judging the literary merit of these 
short passages is likely to be that of stylistic 
characteristics. In interpreting scores on this 
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test, this restriction of the concept of literary 
merit should be taken into account. 

Perhaps the major value of Carroll's test is 
that of a device to stimulate discussions of style, 
of literary merit, and of what constitute de- 
fensible standards by which to judge these. 
Presumably, many teachers are interested in 
helping students to arrive at defensible judg- 
ments of literary works by means of a set of 
principles that can be applied in many situations. 
For them, the final judgment and the extent to 
which it agrees with more “expert” opinions are 
important to know; but it is even more impor- 
tant to know the way in which this final judg- 
ment was arrived at. Carroll’s work is valuable, 
since it leads in the direction of appraising a 
student's judgment in terms of expert judg- 
ment. What is needed—in addition to techni- 
cally better measures of this type—is a tech- 
nique for describing the principles and criteria 
that students use in making such judgments. 


[14] 
Cooperative Literary Acquaintance Test. Grades 
10-16; 1940-41; IBM; Forms Q, R; separate answer 
sheets nced not be used; $2 per 25; 25€ per specimen 
set, postpaid; 40¢ per 25 machine-scorable answer 
sheets; 156 per stencil for scoring answer shects; 
40(45) minutes; Rosa Lee Walston and Edward E. 
Cureton; Cooperative Test Service. 
REFERENCES 

1, Srrover, Ronext B, "A Study of the Sophomore Testing 
Program at the Univ y of Minnesota—Part I." J Appl Psy- 
chol 26:296-307 Je "42. * (PA 16:4183) 


For reviews by Carleton C. Jones, Lou La- 
Brant, Edward S. Noyes, and John H. Thomp- 
son of earlier forms, see 38:970 and 40:1298. 
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X*Cooperative Literary Comprehension and Ap- 
preciation Test. Grades 10-16; 1941-43; IBM; Pro- 
visional Forms R, T ; separate answer sheets need not 
be used; $1.75 per 25; 25¢ per specimen set, postpaid ; 
40& per 25 machine-scorable answer sheets; 15¢ per 
stencil for scoring answer shects; 40(45) minutes; 
Form R: Hyman Eigerman, Mary Willis, and Fred- 
erick B. Davis; Form T: Mary Willis and H. A. 
Domincovich ; Cooperative Test Service. 


REFERENCES 

1. Cuxetox, Eowann E. ion or Guidance? The Re- 

rt of the Eighth Annual National College Sophomore Test 
ing Program April 17 to May 5, 1939." J Er Ed 8:308-40 Mr 
"40. * (PA 1414726) 1 r z 

2. Jonssos, A, P. The Prediction of Scholastic Achieve 
ment for Freshman Engineering Students at Purdue University, 
Purdue alversty, Division of Educational Research, Studies 
in Engineering Education ll. Lafayette, Ind.: the Division, 
May 1942. Pp. 22. Paper. $0.35. * (PA 16:5020) 

$. oxxsow, A. P. The Relationship | of Test Scores to 
Scholastic Achievement for 244 Emgincering Freshmen Enter- 
ing Purdue University in September, 1939. | npublished doc- 
tor's thesis, Purdue University, 1942. 


Holland Roberts, Educational Director, Cali- 


fornia Labor School, Inc., San Francisco, Cali- 
fornia. As the comprehension and appreciation 
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of literature embrace the entire scope of human 
life and thought, measurement in the field offers 
problems of corresponding difficulty, These tests 
are clear, well adapted to the level of secondary 
students, varied in form and subject, and some- 
times lively and humorous. They present, how- 
ever, no new basic approach and have not 
grappled with the underlying problems of ap- 
preciation. 

Form R of the test is made up of fifteen short 
literary selections, half prose and half poetry. 
The prose selections range in length from five 
to fifty lines and the topics vary from the humor- 
ous autobiographical recollections of two pre- 
cocious youngsters in a summer camp to James. 
Fennimore Cooper, a summary of a novel, and 
a short short horror story from the Civil War. 
The poetry is chiefly brief excerpts of five to 
twenty lines taken from a wide variety of 
sources, from the Bible to “Rainbow Round My 
Shoulder,” and spoiled versions of mediocre 
stanzas on miscellaneous aspects of life and 
love. 1 

Form T bears a later date and gives evidence 
of greater experience and talent. Twelve selec- 
tions were used in its construction, five prose 
and seven poetry. Four of the five prose selec- 
tions have a rollicking humor and a marked 
narrative appeal for adolescents. The poetry is 
symbolic and philosophic, and represents sharp 
differences in style, content, and difficulty. The 
excerpts are from the Bible and English and 
American sources, and the subject matter ranges 
from Hebraic apostrophes to God to the pace 
of time in the different ages of man, the puni- 
ness and futility of human life in the struggle 
against the blind forces of the universe, and 
Napoleon's callous and hypocritical slaughter of 
the people in his drive for power. The prose 
selections range from four to seventy-five lines 
and the poetry from four to seventeen. From 
one to ten multiple-choice questions follow each 
selection, a total of about sixty-five for each 
form of the test; the majority of these ques- 
tions center upon the meaning of the lines, and 
the others, on problems of style. 

The two forms of the test are similar in struc- 
ture, and superficial in content, but they are 
markedly different in their essential characteris- 
tic, the quality of the literature which they 
sample. The selections in Form R are on the 
whole inferior and therefore do not provida 1 
basic measure of ability in literary appreci i 
or understanding. A number of them are dull 
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and trivial. Form T offers a larger variety of 
lively, interesting selections, a few of them of 
literary power. The 40 minutes spent in taking 
it can be an enjoyable experience on the level 
of a children's game. 

As a whole this test illustrates both the great 
difficulty of measurement in appreciation and 
understanding on the higher. cultural levels 
and the inability of testmakers to conceive of 
measurement as an integral part of the entire 
process of learning and teaching. Such tests are 
based upon a miscellany of scraps of material, 
some of it with no pretensions to literary qual- 
ity. They attempt to test literary understanding 
and appreciation without providing the student 
with a literary experience of intensity or signifi- 
cance, or integrating literature with life. There 
is no over-all conception or unity and no op- 
portunity for the development of emotion or 
insight into the profound human relationships 
portrayed by a Shakespeare or a Tolstoy. In- 
stead of helping the student grow into an au- 
thor's mood, the test is limited to quickness of 
perception, to surface aptness rather than broad 
grasp and depth of penetration. The conception 
held by Dickens and Hugo, and by Gorky and 
Rolland, of literature as a great force for man's 
betterment is almost entirely lacking. As a 
whole it is difficult to see what advantage there 
would be in administering tests constructed ac- 
cording to such a pattern. The authors offer no 
indication of how they might be used to advance 
a teaching program or for individual guidance 
or progress. 

These tests do offer some technical advances. 
In addition to norms in the form of percentile 
tables, scaled scores are given. Among other 
characteristics these scaled scores provide equal- 
ity of scores throughout the scale-so that the 
scores from a single school system tend to form 
a normal distribution. They also offer a com- 
mon scale for tests, so that comparisons may 
readily be made of the relative standing of an 
individual or a class, not only on standard tests 
but with grades. The question is, of course, 
whether the test measures anything of any 
consequence to either students or teacher. 

s * [ 143 ] 
EXE ste pdf, apt Aa f 


K les 9-12; 1934-44; Forms A, B; 434¢ per test; 
T nutes; G. S. Wykoff; Division of Educa- 
tional erence, Purdue University. 


Chester W. Harris, Assistant Professor of 
Education, University of Chicago, Chicago, 
Illinois. A test title that contains the term 
appreciation is often interpreted somewhat dif- 
ferently by different persons. Some time ago, this 
reviewer surveyed a number of publications in 
order to find out how this term is defined by 
"experts" in the teaching of English. Two major 
kinds of definition were found: (a) a definition 
that specified certain reactions to literature on 
the part of students as the characteristics of 
appreciation and (b) a definition that specified 
certain abilities as the bases or indexes of appre- 
ciation, It is this second kind of definition that 
is pertinent in considering this test. The survey 
indicated that a number of different abilities 
were specified. Abilities that miglit be classified 
under the heading of "comprehension," such as 
the ability to determine word meanings, the 
main thought, the mood, the author's purpose, 
etc., were evident. Others that might be classi- 
fied under the heading of “judgment,” well illus- 
trated by the early work of Abbott and Trabue, 
Speer, and others, also were evident. In addi- 
tion, a number of other abilities, such as those 
Pooley has called the “basic recognitions,” were 
specified. 

This test, Understanding and Appreciation of 
Poetry, prepared by Professor Wykoff of Pur- 
due University, attempts to measure two well- 
defined abilities: comprehending the mood and 
comprehending the main thought of selections 
written in verse. The term "appreciation" in 
the title apparently refers to the items testing 
the student's ability to determine the mood of the 
selections. The limited concept of appreciation, 
when compared with other concepts mentioned 
above, is evident. 

An important consideration in selecting the 
content of a test, such as this one, designed to 
measure certain abilities of comprehension or 
interpretation, is that the literary materials be 
relatively unfamiliar to the student. If they are 
not, then it is always possible that the student 
will be able to answer items correctly on the 
basis of his memory of what was said about the 
poem in class or in a text. Such responses, of 
course, do not provide evidence of the ability 
to make such interpretations himself. The plan 
used in selecting the content is not specifically 
described; one notes, however, that several of 
the thirty-one selections are ones that are often 
interpreted for the student in texts and in class 
discussions. The keying of a test like this also 
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presents problems. Particularly, one might ex- 

pect students of literature to disagree about the 

- sproper term to describe the mood of a poem; 
that one term can adequately describe the mood 
of some of the selections is in itself doubtful. 
Again, information regarding the construction 
of the test is lacking; we are not told what 
judges determined the correct responses, nor 
how the possible responses that are listed were 
secured. It would have been particularly appro- 
priate in a test of this sort to build the objective 
form from responses actually made by students 
in a free-response situation. Other limitations 
might be pointed out. Only a total score is rec- 
ommended; yet the test is clearly designed to 

-measure two abilities. No estimates of reliability 

are given. Norms based on a very small number 
of cases are provided; teachers using the test 
are urged to supply distributions of scores so 
that the norms may be amplified. The kind of 
"norm" that results from data secured in this 
fashion is likely to be unrepresentative of any 
well-defined population. 

The major value of this test is that of sug- 
gesting to teachers that certain abilities in inter- 
preting literature are important achievements. 
Measurement of a broader range of abilities 
than this test provides is needed, and the specific 
techniques need to be experimented with, criti- 

- cized, and improved. However, a direction in 
which we might move—one that points away 
from the reliance on a measure of information 
about literature as the test of English achieve- 
"ment—is indicated here. 


[14] 

*Interpretation of Literary Materials: Tests of 
General Educational Development: College 
Level, Test 4. College; 1944-45; IBM; Form B; 
separate answer sheets must be used; $2 per 25; 4o¢ 
per 25 machine-scorable answer sheets; 15¢ per scor- 
ing key; 256 per specimen set, postpaid; worklimit 
(120) minutes; prepared by the Examinations Staff of 
the United States Armed Forces Institute; published 
by the American Council on Education; distributed by 
the Cooperative Test Service. (Also distributed by 
Science Research Associates: $3 per 25; 65¢ per 25 
machine-scorable answer sheets; 5o¢ per key; so¢ per 
specimen set.) 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 
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XInterpretation of Literary Materials: Tests of 
General Educational Development: High School 
Level, Test 4. High school; 1944-45; IBM; Form 
B; separate answer sheets must be used; $2 per 25; 
40¢ per 25 machine-scorable answer sheets; I5¢ per 
scoring key; 25¢ per specimen set, postpaid ; worklimit 
(120) minutes; prepared by the Examinations Staff of 
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the United States Armed Forces Institute; published 
by the American Council on Education; distributed by 
the Cooperative Test Service. (Also distributed by 
Science Research Associates: $2 per 25; 65¢ per 25 
machine-scorable answer sheets; so£ per key; 50 per 
specimen set.) 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 
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XRigg Poetry Judgment Test. Grades 9-16; 1942; 
Forms 1, 2; 75¢ per 25; 10¢ per manual; 1o¢ per set 
of scoring keys; 15¢ per specimen set, postpaid ; non- 
timed (40-50) minutes; Melvin G. Rigg; Bureau of 
Educational Research and Service, State University of 
Iowa. 


REFERENCES 
1. Yaxnoxoucu, OLLIE Jean. An Attempt to Standardize the 
Rigg Poetry Test for High Schools. Unpublished masters 
thesis, Oklahoma Agricultural and Mechanical, College, 1938. 
2. luco, Mervin G. "Measuring the Ability to Judge 
POR Abstract. Proc. Okla Acad Sci 19:157-8 '39. * (PA 14: 
467. 


John S. Diekhoff, Acting C. hairman, Depart- 
ment of English, Queens College, Flushing, 
New York. We need not waste words on the 
difficulty of measuring “literary taste inthe field 
of poetry,” which (according to the Examiner’s 
Manual) is the purpose of the Rigg test. We 
must regret, however, that in the manual Dr. 
Rigg preferred the word taste to the more pre- 
cise word judgment which he uses in the title 
of his test. 

Each form of the test presents the student 
with forty pairs of brief verse extracts and asks 
him to designate the better of each pair. One 
is a passage from a “recognized” poet, the other 
a paraphrase or parody meant to be inferior. 
None of the extracts is longer than six lines. 

The brevity of the passages, which is said in 
the Examiner’s Manual to be an advantage of 
the test, is also its weakness. It is possible on the 
basis of a few lines of verse to require a student 
to make critical judgments reflecting his knowl- 
edge of such details of composition as concrete- 
ness of diction, propriety of imagery, and cor- 
rectness of prosody; but it is not possible to 
require judgments of the architectonic structure 
of a poem as a whole or of the value of the 
content of a poem. The stress in the Rigg test 
seems to be on prosody. Yet there are much 
more important considerations in judging a 
poem than minutiae of technical skill in versifica- 
tion, and it is whole poems that students should 
learn to judge. We are still waiting, I think, for 
a test of their skill in doing so. ] 

If a test of student taste in or understating 
of such things as prosody and minor imagery is 
desired, the Rigg test is probably satisfactory. 
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The basic assumption that the extracts from 
"recognized" poets are better than the parodies 
of them is sound, in spite of the disagreement 
among experts that Dr. Rigg cites. The superi- 
ority of the better passage is usually clear, but 
not so clear as to be obvious to the undiscrim- 
inating. 


Louis C. Zahner, Head of English Depart- 
ment, Groton School, Groton, Massachusetts. 
This test “is designed to measure literary taste 
in the field of poetry.” It is made up of forty 
items, each consisting of two passages of poetry 
from two to six lines in length. One passage is 
an extract from “a poet of established reputa- 
tion” and the other “a parody written by a 
psychologist who has no standing as a poet, and 
who, moreover, was trying to produce some- 
thing inferior.” The test, in other words, uses 
the same general procedure as that used in the 
Abbott-Trabue Exercises in Judging Poetry, 
except that one parody is used instead of three. 

It is fair to ask any constructor of a test 
exactly what he is attempting to test. What is 
"taste" in poetry? Since the author does not 
answer this question directly, the reviewer is 
thrown for an answer upon the implications of 
the test itself. "Taste" thus defined seems to be 
conformity to some of the more mechanical and 
obvious traditions of poetry as established prior 
to the twentieth century. (The few lines selected 
from later poetry are in the older, established 
tradition.) The test becomes little more than an 
exercise in detecting glaring evidence of dog- 
gerel on the one hand and lush, saccharine dic- 
tion on the other. It is not designed to get at the 
central problem of a good interpretation of 
poetry, without which "taste" can hardly exist 
at'all and with which it can safely be left to 
take care of itself. The "taste" tested here is 
the kind acquired by acceptance of stock judg- 
ments. For anything approaching a defensible 
test on taste a considerable context, preferably 
an entire poem, would be needed as a setting for 
the four or five lines in question. Without an 
adequate context their meaning, their appropri- 
ateness in mood and tone, their “taste,” or any- 
thing else that concerns them cannot be properly 
judged. No context whatever is provided for 
the snippets used in this test. 

One becomes even more skeptical of whether 
the test is examining anything worth having 
when one reads in the Examiner's Manual that 
a certain study showed "that significant gains 


in the ability to recognize good poetry were 
made by high school students as the result of 
a three-day training period." (Italics are the 
reviewer's.) 

The test contains such a preponderance of 
quotations from well-known poems that it be- 
comes a test in memory rather than in judgment 
for any person even moderately well read in 
poetry ; and one is again led to ask how a person 
not even moderately well read in poetry is ex- 
pected by the author to have acquired “taste.” 

Even were this a valid test on theoretical 
grounds, its percentile norms would be open to 
question, since for the three groups for which 
the norms are given, high school, college, and 
adult-expert, they are based on only 804, 572, 
and 100 papers respectively. 

The Rigg test has not improved upon the 
Abbott-Trabue Exercises, its acknowledged 
source, 


[14 ] 
XSurvey Test in American Literature. High school 
and college; 1940; 1 form; $1.60 per 25, postpaid; 
sample test free; 50(55) minutes; edited by Harold 
H. Bixler; Salibelle Royster; Turner E, Smith & Co. 
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XSurvey Test in English Literature. High school 
and college; 1940; 1 form; $1.60 per 25, postpaid; 
sample test free; 50(55) minutes; edited by Harold 
H. Bixler; Elmer R. Smith; Turner E. Smith & Co. 
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*P-L-S Journalism Test. High school and college; 
1944; 1 form; $1.15 per 25, postpaid; 154 per specimen 
set, postpaid; 40(45) minutes; George H. Phillips, 
Harry Levinson, and H. E. Schrammel; Bureau of 
Educational Measurement, Kansas State Teachers 
College of Emporia. 


SPEECH 
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*Guidance Questionnaire for Students of Speech. 
Grades 13-16; 1940; Form C; $2.20 per 25; specimen 
sets not available; 30(35) minutes; Howard Gilkinson 
and Franklin H. Knower; C. H. Stoelting Co. 


REFERENCES 
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4. ALUSOW, Frank TELESPORE, A Validation of the Gilkin- 
son-Knower Guidance Questionnaire on University of Iowa 
Freshman Speech Students, Unpublished master's thesis, State 
University of Iowa. 1941. 
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XSpeech Attitude Scale. Grades 9-16; 1938; Form 
F; $2.20 per 25; specimen sets not available; 30(3: 
minutes; Franklin H. Knower; C. H. Stoelting ë 


REFERENCES 

1. Knower, FmawKLIN H. “A Study of Speech Attitudes 
and Adjustments." Speech Monogr Eod P3 i 
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chological Studies of Individual Differences Among Students 
of Speech. Minneapolis, Minn.: Department of Speech; Uni- 
versity of Minnesota, June 1939. Bp. ii, 196. Paper, mimeo- 
graphed. Gratis, * 
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*Speech Experience Inventory. Grades 9-16; 1937 ; 
Form C; $1.60 per 25; specimen sets not available; 
15(20) minutes; Franklin H. Knower; C. H. Stoelting 
0. 
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of Speech. Minneapolis, Minn.: Department of Speech; Uni- 
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*Kansas Spelling Test. Grades 1-3, 4-6, 7-9; 1941; 
Forms A, B; 3 levels; 75¢ per 25, postpaid; 15¢ per 
specimen set, postpaid; 15(20) minutes; H. E. 
Schrammel, O. M. Rasmussen, Nathan Budd, Wayne 
Gordon, and Fayrene Reiff; Bureau of Educational 
Measurements, Kansas State Teachers College of 
Emporia. 


Henry D. Rinsland, Director, Bureau of Edu- 
cational Research, The University of Oklahoma, 
Norman, Oklahoma. The source of words for 
these tests covering the elementary grades from 
1 to 9 is greatly weakened by the inclusion of 
words from “recognized spelling texts, notably 
the Kansas adopted text by Ayers, Oberholtzer 
and Woody” in addition to words from the 
Buckingham extension of the Ayers Spelling 
Scale, the Iowa Spelling Scale, and the Thorn- 
dike Teacher's Word List (date not given). 
Even these lists are questionable, as the first 
two are very old and the Thorndike Word List 
is primarily a count of words from adult writ- 
ings. There are no data given as to how the 
words were chosen from these lists and books 
or how proper grade placement was made. Since 
Betts, in his Spelling Vocabulary Study (Amer- 
ican Book Co., 1940), has shown that seventeen 
authors-can unanimously agree only on 6.26 per 
cent of a total vocabulary of 8,645 words and 
unanimously agree on the grade placement of 
one word, this reviewer cannot find any grounds 
for the inclusion of words from one spelling 
textbook or several, unless it is to test the words 
in that or those books only on a local basis. 

The test is multiple choice, giving one correct 
spelling with three misspellings. Incorrect spell- 


ings used for multiple choices were obtained 
chiefly by an extended study of spellings, but no 
mention is made of the only publication of such 
misspellings, which is Gates’ List of Spelling 
Difficulties in 3876 Words (Bureau of Publica- 
tions, Teachers College, Columbia University, 
1937). 

The correlation between the multiple-choice 
form and the pronounced form varies from .55 
for grade 3 to .82 for grade 2. Certainly these 
correlations are not high for the same words in 
two different settings and hardly high enough 
to warrant a conclusion that one test is a good 
test of the other. One would expect correlations 
well above .90 in such a situation. The coefficient 
of alienation is .83 for one and .56 for the 
other. In his Statistics in Education and Psy- 
chology, (Longmans, Green and Co., 1946, p. 
336) Garrett says, “For r’s of .8o or less, the 
coefficients of alienation are clearly so large 
that predictions of individual scores based upon 
the regression equation are little better than 
'guesses'." 

The reliability coefficients between Forms A 
and B for all grades under grade 4 range from 
.72 to .86, but in grade 6 the reliability is as 
high as .94. Perhaps the number of words, 
which is only 50, has much to do with this low 
reliability. Fifty words can not be considered 
a very fair sampling for a range of three grades 
above grades 3 or 4; the total number of words 
for grades 1 to 9 is only 150! It is, therefore, 
seriously questioned whether the following 
statement in the manual is borne out by any of 
the published data, “Since the criterion possesses 
such high reliability and since the correlations 
between the criterion and the objective test 
yielded very satisfactory coefficients, the con- 
clusion that the objective test possesses satis- 
factory validity seems justified.” 

With the more recent vocabulary research 
of Rinsland, Basic Vocabulary of Elementary 
School Children (Macmillan Co., 1945) now 
available for grades 1 to 8, in which the sam- 
pling is greater than in other studies and which 
is drawn from children’s own writings, there 
seems to be little gained by the continued use 
of such tests as the one reviewed, which greatly 
lack adequate sampling in the choice of words 
and lack reliability coefficients high enough for 
any individual application. The reviewer is cer- 
tain that no one spelling book or even several 
should be used for any test except one for local 
measurements. 
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Guy M. Wilson, Professor Emeritus of Edu- 
cation, Boston University, Boston, Massachu- 
setts. In establishing validity for this test the 
authors used in its construction, the Bucking- 
ham Extension of the Ayres Scale, the Iowa 
Spelling. Scale, the Thorndike Teacher's Word 
Book, and a number of recognized spelling texts, 
including the state-adopted text. Just how these 
primary and secondary sources were used, we 
are not told; no distribution tables are given. 
The purchaser and user of a test has a right 
to full information on validity. 

The main criticisms from the standpoint of 
validity are that: (a) too much reliance is placed 
upon secondary sources; (b) words quite be- 
yond reasonable spelling expectations are as- 
signed for spelling in the various grades. The 
only primary source used in checking words is 
the Thorndike Teachers Word Book. The 
authors should also have made use of the Com- 
monwealth List at least. 

The neglect of curricular considerations in 
the assignment of high word-value words for 
spelling is a serious criticism. Curricular con- 
siderations should never be subordinated to 
statistical considerations. Valid comparisons 
should be based upon familiar words for which 
perfect spelling is a reasonable expectation. 

The multiple-choice form of the test is ques- 
tionable for grade usage, or as a test of spell- 
ing. Lower-grade pupils are confused by the 
various forms; and merely checking is not 
equivalent to spelling. 

When words are pronounced by the teacher, 
meaning can be checked. If a child does not 
understand a word, that word for him is not 
a proper spelling word and it should be omitted 
from the test. There should be a different word 
list for each grade. Right selection of words is 
more difficult when words are used in three 
grades. 

Spelling for first-grade pupils is of doubtful 
value. If words are assigned to first-graders for 
spelling, they should be taken from the first 
five hundred words in the Thorndike Teacher's 
Word Book. Second- and third-grade pupils 
should not be given words above the first thou- 
sand. Words above the fourth thousand should 
not be assigned for spelling in any grade. 

The time limit on a test is not desirable if 
testing is to be a part of teaching. 

All told, the makers of this test appear to be 
more concerned with statistical results than in 
providing a real aid to a good teaching program. 
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Any revisions undertaken should give more at- 
tention to curriculum and methods of teaching. 
In general, unless it is good teaching, it is not 
a good test. A good test will yield equally valid 
and more helpful statistical data. 


[ 154 ] 
XLincoln Diagnostic Spelling Test. Grades 8-12; 
1941-44 ; Forms 1, 2, 3, 4; 6¢ per test; 1o¢ per specimen 
set; (30) minutes; A. L. Lincoln; Educational Records 
Bureau, 
REFERENCES 


1. TOWNSEND, AGATHA, “A Study of the Lincoln Diagnostic 
Spelling Test,” pp. 49-53. In In Achievement Testing Pro- 
gram in Independent Schools and Supplementary Studies, Edu- 
cational Records Bulletin, No. 38. New York: Educational 
Records Bureau, June 1943. Pp. xiii, 53. Paper, lithotyped. 
$1.50. * (PA 18:315, title only) 


[ 155 ] 
*Lincoln Intermediate Spelling Test. Grades 5-8; 
1947; Forms A, B; 6¢ per test; 10¢ per specimen set; 


(30) minutes; A. L. Lincoln; Educational Records 
Burcau. 


REFERENCES 


1. TOWNSEND, AGATHA. “A Report on the Use of the Lincoln 
Intermediate Spelling Test," pp. 40-8. In 1947 Fall Testing Pro- 
gram in Independent Schools and Swepleninter, Studies, Edu- 
cational Records Bulletin, No. 49. New York: Educational 
poco Bureau, February 1948. Pp. xii, 69. Paper, lithotyped. 

1.50, * 


[ 156 ] 
New Standard High School Spelling Scale. Grades 
7-12; 1925-40; a 64-page book consisting of 64 40-word 
lists to be used for either testing or teaching and an 
alphabetical list of the 2560 words with difficulty 
values in each of grades 9, 10, 11, and 12; 48¢ per copy; 
Ernest P. Simmons and Harold Hench Bixler ; Turner 
E. Smith & Co. 

[ 157 ] 
X*Wellesley Spelling Scale. Grades 9-16; 1944; 
IBM; Forms 1, 2; 2 editions; hand-scoring edition: 
90£ per 25; 25¢ per specimen set, postpaid; machine- 
scorable edition: 4¢ per test; 2¢ per answer sheet; 60e 
per set of scoring stencils; nontimed (30) minutes; 
Thelma G. Alper and Edith B. Mallory; California 
Test Bureau. 


REFERENCES 

1. ALPER, THELMA G. “A Diagnostic Spelling Scale for 
the College Level: Its Construction and Use.” J Ed Psychol 
33:273:90 Ap '42. * (PA 17:932) 

Henry D. Rinsland, Director, Bureau of Edu- 
cational Research, The University of Oklahoma, 
Norman, Oklahoma. The purpose of the scale 
is to measure the student's ability to spell in 
high school, vocational school, and college. The 
authors state that they included only words 
which are frequently found in the written vocab- 
ulary of high school graduates and that these 
words genuinely represent the spelling that the 
student will do in the course of ordinary school 
or business writing. There are no data and no 
research studies quoted to substantiate these 
statements beyond the facts that the words are 
taken from the written themes of 363 fresh- 
men in a required college course in English 
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composition and that a listing of the words most 
frequently misspelled is given. Evidently this 
was done at Wellesley, but it is certainly a 
meager sampling for choosing the spelling 
words to test high schools and colleges the 
country over and does not warrant the attach- 
ment of the term “standardized” to a test of 
such limited curricular validity. 

The test is published for both hand and 
machine scoring. The number of words tested in 
each scale is, in itself, a very skimpy sampling, 
there being only 50 words to cover the range 
from grade 9 to the freshman year in college. 
The norms, in percentiles, themselves are based 
on a very meager sampling; 152 college fresh- 
men ; 242, grade 12; 333, grade 11; 242, grade 
10. Grade 9 percentiles were derived by ex- 
trapolation and a completion form, and there- 
fore lack both validity and reality. Certainly test 
builders should begin to realize the importance 
of these two major sampling factors, which 
affect both the validity and reliability of a test. 
Because of the very wide range of words used 
by high school students and college freshmen, 
the selection of words studied from which some 
will be chosen to go in the test must be exten- 
sive. In applying the averages of any group to 
pupils the country over, as is done in the norms 
of a standardized and published test, test build- 
ers must not throw all principles of reliability 
and sampling to the wind. 

Forms 1 and 2 correlate .86, which is really 
not high for elements as objective as multiple- 
choice items in spelling. The multiple-choice 
form was chosen because of the correlation of 
.75 between recognition scores from Form 1 and 
recall scores obtained by dictating the words in 
Form 2. A validity coefficient of .75 cannot be 
considered high and does not warrant the con- 
clusion that the multiple-choice form is as valid 
as dictation, considered by most test builders 
as a good criterion or the best criterion. Too 
often ease of scoring is taken as the first cri- 
terion of a test: that is a very serious error. 

No information is given as to the source of 
the erroneous spellings used in the multiple 
choices. 


Guy M. Wilson, Professor Emeritus of Edu- 
cation, Boston University, Boston, Massachu- 
setts. The Wellesley Spelling Scales are de- 
signed as aids in selecting the best spellers from 
among those who apply for secretarial jobs. The 
requirements are very different from those 


called for in a scale designed to aid a teaching- 
mastery program in spelling. A selection scale 
can properly rely more upon distribution, can 
use a reasonable proportion of high-value 
words, and can more appropriately use the 
multiple-choice form of test. Recognition of 
misspellings is a skill much needed by secre- 
taries. 

The modest claims of the authors of these 
scales appear to be justified by the product. 
The scales are conveniently arranged. Sentences 
are used to show the meanings of words. Com- 
parative statistics are given. All of these fea- 
tures add to the usefulness of the scales. 
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Clinton General E Test for Bigh 
Schools and Colleges. Grades 9-16; 1936; Forms A, 
B; $1.15 per 25; 10f per specimen set, postpaid; R. J. 
Clinton; Cooperative Book Store, Corvallis, Oregon. 

Harold H. Bixler, Director of Research and 
Guidance, Public Schools, Atlanta, Georgia. 
This test “is designed to measure the extent of 
a person's general reading and understanding 
vocabulary and not his active vocabulary." 

Each form has 150 items of the multiple- 
choice type with four choices for each item. The 
author states that the words are selected from 
words used generally in the curriculum mate- 
rials of high school and college students. They 
were checked by the list appearing in the Thorn- 
dike Teacher's Word Book. There is no time 
limit, but the author states that the average time 
required for college students is 20 minutes and 
for high school students 25 minutes. This would 
seem to be a rather short time for students in 
the lower high school grades in view of the 
number of items. 

The reliability of the test, as determined by 
the use of the Spearman-Brown formula, is .94, 
based on 307 cases. This would seem to be ade- 
quate reliability for individual pupil diagnosis. 

Norms provided are in the form of medians 
for students in grade 9 up through college grades 
in terms of numbers of words correct. The 
norm for high school freshmen, for example, 
is 55 words, and for high school sophomores, 
61 words. This is not a very large increase from 
one grade to the next. The norms are not given 
in terms of grade equivalents. 

The author also provides norms showing 
numbers of words in the student's vocabulary. 
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'Thus, a score of 52 is reported as equivalent 
to a vocabulary of 5,540 words. The norms are 
based on 4,000 cases. 

One of the limitations of the test is that the 
words are presented alone and not in sentences 
or phrases. This lack of context keeps the test 
from being a lifelike situation. In some in- 
stances, the author would do well to revise the 
meanings of words as given in the test. For 
Item 129, sequestrate, the correct meaning is 
given as seize. Normally one would expect set 
apart to be the correct answer, although the 
dictionary does give confiscate as a second 
meaning of this word. 

On the whole, this test should prove to be a 
useful instrument for upper high school and 
college grades. 


[ 159 ] 
Columbia Vocabulary Test. Grades 3-12; 1939; 
IBM; 1 form; $1.10 per 25 machine-scorable test-an- 
swer sheets, with manual and hand-scoring keys; $1 
per set of machine-scoring keys; 35¢ per specimen set, 
postpaid; nontimed (15-20) minutes; Irene Gansl and 
H. E. Garrett; Psychological Corporation. 


REFERENCES 

1, GANSL, Irene. Vocabulary: Its Measurement and Growth. 
Archives of Psychology, No. 236. Washington, D. C.: Ameri- 
can Psychological Association, ips March 1939. Pp. 52. Paper. 
$1.00. * (PA 13:4819) 

2. Bennett, Grorce K., anp RosLow, Sypney. "Extension 
of the Norms of the Columbia Vocabulary Test." J Appl Psy- 
chol Apes F'41. * (PA 15: 3:58) 

3. Rercuarn, Suzanne. Mental Organisation and Age 
Level, Archives of Psychology, No. 295. Washington, D. C.: 
American Psychological Association, Inc., June 1944. Pp. 30. 
Paper. $0.80. * (PA 19:1183) 

4. Sartarn, A. Q. “Predicting Success in a School of 
Nursing.” J Appl Psychol 30:234-40 Je '46. * (PA 20:4350) 

Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 
The Columbia Vocabulary Test is designed as 
a measure of the general vocabulary of pupils 
from grades 3 through 12. It consists of a sample 
of roo words selected from a list of the “words 
found in 22 contemporary measures of vocabu- 
lary," arranged in order of difficulty, presented 
in multiple-choice test form, and printed on a 
single, standard, machine-scored answer sheet. 

Considerable work has gone into the construc- 
tion of this test, and the result is an instrument 
which is probably adequate as a rough measure 
of vocabulary. Printing it directly on an answer 
sheet makes for economy in use of the test and 
it should serve a purpose in survey studies where 
the concern is with group differences only. 

There are, however, a number of limitations 
to the test which should be called to the atten- 
tion of prospective users. The test items are 
printed in 8-point type, which is commonly 
assumed to be too small for use in the lower 


grades. (The generally accepted minimum-size 
type is 14-point for the third grade, and Io- or 
12-point for grades 4 to 6.) What influence this 
small type may have at the lower levels is not 
known nor was it subjected to experiment by the 
testmakers. 

Although elaborate norms have been devel- 
oped for the test, they are actually loca norms 
only, being based upon a sampling of pupils 
from three elementary schools and one high 
school in New York City. No information is 
furnished as to how these schools were selected 
from the many and varied schools in New York ; 
consequently one hesitates to conclude even that 
the norms are representative of this one city. 
Certainly they cannot be accepted for general 
use. Furthermore, the elaborateness of the 
norms (means, standard deviations, and deciles 
are reported for each half grade for the grades 
3 through 12; grade-month means are given for 
grades 3 through 8; and means, standard devia- 
tions, and deciles for ages eight through thir- 
teen) represents overrefinement of the test data. 
This point is well illustrated by the grade-month 
means. Although the standard error of a raw 
score on the test is reported as 3 points, the 
maximum increment of score found for any 
month was only .8 point. In other words, the 
chances are almost ten to one that the differ- 
ences between two succeeding months do not 
represent actual differences in ability, and two 
scores would need to be more than 11 months 
apart before we could interpret them as almost 
certainly not due to chance error! Such over- 
refinements in standard scores are comparable 
to measuring weight by hefting and reporting 
the results to the hundredth part of an ounce. 
They inevitably create an illusion of accuracy 
which may be actually harmful in the hands of 
relatively unskilled users. 

Finally, this test, along with most other tests 
purporting to measure vocabulary, actually 
measures ability to recognize synonyms of test 
words, and, at that, only one synonym (and 
therefore one meaning) for each test word. For 
example, the first word in the test, fast, is to be 
matched with quickly; yet Thorndike's Century 
Senior Dictionary reports a total of thirteen 
meanings for this word: six adjectives, four 
adverbs, two nouns, and one verb. Just what 
can we infer concerning the child’s knowledge 
from a right or wrong response to this test 
situation? The fact is we have little or no in- 
formation concerning the relation between the 
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rather narrow skill measured by current vocabu- 
lary tests and the child's functioning vocabu- 
lary, either "impressive" or "expressive." 


Clifford Woody, Professor of Education, 
University of Michigan, Ann Arbor, Michigan. 
The author of this test has employed more exact- 
ing criteria in selecting the words constituting 
the test than have been applied in choosing the 
words for most tests. The original lists of ap- 
proximately 1,500 words from which the final 
selection was made consisted of the words em- 
ployed in twenty-two widely used vocabulary 
tests designed for use from grade I to the 
college level. These words were then classified 
according to the level at which they appeared 
in Thorndike's Teachers Word Book. From 
these classified lists, 135 words from difficulties 
1to13 were selected. Each of these words was 
then incorporated into a brief sentence or 
phrase to make the use of the word correspond 
as closely as possible to the manner in which 
it would appear in the life process. Five re- 
sponse words were selected for each test item 
from the original list of 1,500 words. Extreme 
caution was employed in selecting these words: 
no response word should have a higher fre- 
quency rating than the test word; all response 
words must be of the same part of speech ; all 
words must fit smoothly into the sentence. Only 
one response must be a synonym of the critical 
word. These precautions were taken in devis- 
ing a preliminary edition of the test. 

On the basis of the results obtained from 
administering the test to 1,041 pupils in grades 
3 to 8 of a public school in Oueens, New York, 
100 words were selected and arranged accord- 
ing to percentage of pupils responding cor- 
rectly. In making the final selection, attention 
was given not only to the percentage of correct 
responses but also to those words having a 
steady increase in successive grades in the per- 
centage of correct responses. Furthermore, 
words whose individual curves of correct re- 
sponses followed that for the test as a whole 
were selected. Thus, it should be clear that ex- 
treme care was exercised in selecting the words 
for the test. 

The final test was standardized by administer- 
ing it to 3,306 pupils in three public schools of 
New York City. The student populations of 
these three schools were considered to be typical 
of the total student population of the United 
States, but only further administration of the 


tests can establish the truth or falsity of this 
assumption. The reviewer feels that better 
standards can be obtained. Standards are avail- 
able for boys and girls for both age and grade. 
The coefficient of reliability determined by the 
Spearman-Brown formula is .94. The standard 
error of measurement of a single score shows 
that the chances are 68 in 100 that a single score 
among the standardization group did not vary 
from the true score by more than 3 points. 
While the test has not been widely used and 
the results therefore have not been correlated 
with those from other standard vocabulary tests, 
the reviewer has no hesitation in recommending 


its use. 
[ 160 ] 

XCooperative Vocabulary Test. Grades 7:16; 1940- 
41; IBM; 2 forms; 2 editions; Frederick B. Davis; 
Cooperative Test Service. 
a) FORMS Q AND R. Separate answer sheets need not be 
used ; $1.75 per 25; 25¢ per specimen set, postpaid ; 60 
per 25 machine-scorable answer sheets; 30¢ per stencil 
for scoring answer sheets; 30(35) minutes. 7 
b) SHORT FORMS QS AND RS. $1.25 per 25 machine- 
scorable test-answer sheets; 25¢ per specimen set, 
postpaid; nontimed (10-15) minutes. 

REFERENCES 


1. Coorerative Test Service. The Cooperative Vocabulary 
Test: Information Concerning Its Construction, Interpretation 
and Use. New York: Cooperative Test Service, 1940. Pp. 2. 
Paper. Gratis. * 


2, StmPson, R. G. “The Vocabulary Sections of the Coopera- 
tive English Tests at the Higher Levels of Difficulty.” J Ed 
Psychol 34:142-51 Mr "43. * (PA 18:311) 

Edgar Dale, Professor of Education, The 
Ohio State University, Columbus, Ohio. The 
Cooperative Vocabulary Test contains 210 items 
subdivided into seven scales of 30 items each. 
The seven scales are equivalent, but the most 
discriminative items are in the first scales, the 
least discriminating in the last scales. 

The final score does not depend upon the 
number of items attempted. The student who 
gets all the items correct on only two of the 
seven scales gets the same scaled score as one 
who gets all items right in three or more of the 
scales. However, the authors suggest that an 
unlimited time and the completion of all the 
scales produce the most reliable results. 

All the words were selected from the Thorn- 
dike Teacher's Word Book and represent “a 
sampling from many subject-matter fields.” 
The data show a varying number of words from 
each of the “thousands” levels of the Thorn- 
dike Teacher's Word Book, and no statement 
is made how the subject-matter fields were 
sampled. 

Although the authors state that the scores 
“indicate the extensiveness and precision of an 


161 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 250 


individual's knowledge of word meanings" and 
that "knowledge of word meanings is an impor- 
tant element in determining scholastic success," 
no correlations are presented between this test 
and indices of scholastic success. 

Percentile norms for each grade level in three 
types of secondary schools and three types of 
colleges are provided and are published in the 
Booklet of Norms. 

Some vocabulary tests include as correct cer- 
tain items which are debatable. No such items 
were found here. However, a few of the choices 
while correct as far as the dictionary was con- 
cerned were not common uses, nor would a more 
suitable definition have made the test more 
difficult. Thus, “relic” is made synonymous 
with “souvenir,” “churl” with “peasant.” 

What does the teacher have who gives this 
test? He will have an indication of whether an 
individual or a class is below, at, or above the 
norms. But he will not know the areas or fields 
of experience in which the student does well or 
poorly. Nor will he have any suggestions on 
ways of using the test to develop vocabulary. 
Nevertheless, the test will be one which the 
students will recognize as dealing with impor- 
tant words, one which has been constructed with 
real care, Further, it is a test which will give, 
in a few minutes, a good indication of level of 
general vocabulary in a college class. 


Henry D. Rinsland, Director, Bureau of Edu- 
cational Research, The University of Oklahoma, 
Norman, Oklahoma. The words in this test rep- 
resent a sampling from many subject-matter 
fields. In the easy items only one response word 
is close in meaning to the word being tested, but 
in the difficult items the response words ap- 
proach closely the meaning of the word being 
tested. Great care has been shown in the fine 
discrimination of meaning. Both the correct 
response and the decoys for every item are in 
the Thorndike list of 20,000 words and are of 
more frequent occurrence than the word being 
tested. Frequency was not taken as an index 
of difficulty ; rather, the subjective judgment of 
the authors was used to supplement the fre- 
quency-level data and to exclude words from 
the responses which appeared to be more diffi- 
cult than the word being tested. 

No coefficients of reliability are given, but 
from experience with other Cooperative tests, 
they are undoubtedly between .90 and .94, suffi- 
ciently high to possess real diagnostic value. 


The experience and reputation of the several 
authors warrant confidence in similar factors 
of the test not substantiated by published data. 


[ 161] 
English Recognition Vocabulary Test. Grades 7- 
16 and adults; 1938; Form 1; $3.75 per 100, postpaid ; 
50$ per specimen set, postpaid; for manual sec Journal 
of Educational Psychology 31:14-38 January 1940; 
(50) minutes; Robert H. Scashore and Lois D. Ecker- 
son; published by Robert H. Seashore, Department of 
Psychology, Northwestern University. 
REFERENCES 

E udo A A Construction, Anatys.s, and Evaluatwn 
ofa Vocabulary Measure. Unpublished master's thesis, Univer- 
AER 1 OBERT H, “The Measurement and Analysis 
of Extent of Vocabulary. SEDE Psychol B 30:709-10 N 
"33. * (PA 8:3720, title on! s 
a pea. 'EORGE D. Tnterrelations of Vocabulary Skills: 
Gommonest: cr xe Meanings. Unpublished master's 

i t ersity, 1939. 
Cree. easton, Koszar H, Further Data on the Measurement 
of jencral num Me Abstract, Psychol B 36:525 
a H t r^ 

oer r^^ Ecxensox, Lois D. "The 
Measurement of Individual Differences in General English 
Vocabularies.” J Ed Psychol 31:14:38 Ja '4o. * (PA 14:3226) 

Pudor a ela Pm to 8 Wey d Are Qon 
Vocabulary o! ildren From 6 to 18 Ye: 
Onda r To ey Abstract. Psychol B 37:581 O i * (PA 

3612, title oi q 
MCA ETN ay KATHERINE, Measurement of the Size of 
General English Vocabulary Through the Elementary Grades 
and High School. Unpublished doctor's thesis, Northwestern 
University, 1940. Skaara of Doctoral Dissertations . . . 

- 300. A 
1940 Pover., Geoxce D. "Interrelations of Vocabulary Skills: 
Comnpnest Versus Multipie Meanings.” J Ed Psychol 32:67- 
. 15: ^ 
UL RES ATUERINE, “Measurement of the Size of 
General English Vocabulary Through the Elementary Grades 
uon cU uet Genetic Psychol Monogr 24:311-45 N '41. 
16:2082 

13. ScHULMAN, Mary Jean, AND Havicuurst, RomERT J. 
“Relations Between Ability and Social Status in a Midwestern 
Community: IV, Size of Vocabulary.” J Ed Psychol 38:437-42 
N 'az. * (PA 22:2064) 

14. SrAsuonE, Rosert H. "How Many Words Do Children 
Know?" Packet: Heath's Serv B El Teach 2:3-17 N '47. * 

15. SEaAsuonE, Rosert H. “The Importance of Vocabulary in 
Learning Language Skills," El Engl 25:137-52-- Mr '48. * 


Paul S. Burnham, Associate Director, Stu- 
dent Appointment Bureau, Yale University, 
New Haven, Connecticut. Copyrighted and pub- 
lished in 1938, this test is a controlled sampling 
of words found in the Funk and Wagnalls New 
Standard Dictionary of the English Language, 
1937 edition. The test is divided into three 
parts. Part I consists of a general recognition 
test involving 173 chosen words, apparently 
arranged in order of difficulty. For each given 
word the student selects one among four choices 
as a synonym related to the commonest meaning 
of the given word. Part II is a list of 158 words 
taken from the dictionary’s supplementary list 
of proper names, geographical locations, and 
rare words. Part IIl isa list of 46 derived terms. 
In both Parts II and III the problem is to give 
the meanings of the words or terms presented. 

The sum of the correct answers can be con- 
verted into an estimate of the total English 
vocabulary which the student probably has at 
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his command. Norms in terms of this total Eng- 
lish vocabulary score are provided for grades 1 
through 12, based on populations varying from 
40 to 116. Norms are also available in terms of 
decile standing among approximately 200 state 
university freshmen and sophomores in educa- 
tional psychology courses. 

Spearman-Brown reliabilities varied from .83 
(unlimited time) to .93 (20 minutes) for Part I. 
Decile ratings applicable to the 20-minute time 
limit are available but the authors recommend 
unlimited time. Meager evidence suggests cor- 
relatiohs of approximately .60 with tests of 
reading comprehension, approximately .50 with 
the Otis Advanced Intelligence Test, and .57 
with the American Council on Education Psy- 
chological Examination, Lack of data makes 
appraisal of this test difficult. 


Edgar Dale, Professor of Education, The 
Ohio State University, Columbus, Ohio. This 
test aims to discover the total number of words 
in the vocabulary of an individual. The test is 
based on a sampling from Funk and Wagnall’s 
Unabridged Dictionary. The test includes 173 
general terms; 158 proper names, geographical 
locations, and rare words ; and 46 derived terms. 
Provision is made for separate estimation of the 
total words in each of these three classes. 

The 173 words in the general list are tested 
by a four-choice multiple-response recognition 
test. The proper names, geographical locations, 
and rare words are tested by defining, identify- 
ing, or using in illustrative sentences. Among the 
words in this group are cit, frimsel, mamoul, 
yoghoort, and many others like them. The 46 
derived terms are tested by asking the student 
to think of or write, if requested, a specific 
instance in which each would be correctly ap- 
plied, so as to describe its real meaning. 

Norms are available for college groups for 
Parts 1 and 2. The groups on which the norms 
are based are not too large—237 Ohio State 
University students and 116 Northwestern un- 
dergraduates. “Part 3 has not yet been ade- 
quately standardized, the norms given being 
based upon only 120 Northwestern freshmen 
and sophomores who employed the self-rating 
criterion. . . ” (8) Preliminary work on stand- 
ardization for grades 1 to 12 of school has been 
done but the numbers tested are still too small to 
be reliable. 

Reliability of the test has been established 
only for Part r for college groups. Correlation 
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between scores on the odd words with the even 
in Part 1 ranged from .83 to .93 for different 
time limits. The highest reliability was for the 
20-minute time limit. 

The authors state that this test was made for 
research purposes and has never been published 
commercially except in the sense that enough 
copies have been printed to supply people who 
wish to purchase them. A new pictorial form 
covering the easier portion of the same word 
list is now being standardized for children in 
the age range of three to ten years. 

This test, of course, should be criticized on 
the basis of what it attempts to do—to secure a 
measure of the total vocabulary of an individual. 
If we accept the three types of test as valid and 
if the sampling is adequate, then we may accept 
a total vocabulary figure secured by this test. 
The writer believes that neither assumption is 
acceptable. While the test may prove to have 
great value as a correlate of other important 
school abilities and thus give us quick significant 
data about mental ability, nevertheless it does 
not give us the really important measures of 
vocabulary relating to specific areas of experi- 
ence. More important than a single measure of 
total vocabulary are measures of specific vocab- 
ularies in science, business, geography, poli- 
tics, music, and technology. Such data would be 
valuable for guidance and remedial work. 

{ 162 ] 
High School Vocabulary Test. 1940; Form A; 50¢ 
per 20; 10¢ per manual; 15¢ per specimen set; 30(35) 
minutes J. W. Chalmers; Institute of Applied Art, 

Harold H. Bixler, Director of Research and 
Guidance, Public Schools, Atlanta, Georgia. 
This test has been devised by a Canadian and 
published by a Canadian company. Norms are 
based on 1,500 intermediate and high school 
students in Alberta schools, grade 7-12 inclu- 
sive. However, there is no good reason why 
the test could not be used in schools in the 
United States. 

The words were chosen from the Thorndike 
Teacher's Word Book of Twenty Thousand 
Words. The author reports that they were chosen 
as follows: 5 words from each 1,000 of the most 
common 4,000; 6 words from each 1,000 of the 
next most common 9,000; 5 words from each 
1,000 of the next most common 4,000; and 2 
words from each r,ooo of the least common 


3,000. 
The validity of the test may be judged from 
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the method of selection of the test words as 
listed above. Also the author reports correla- 
tion coefficients of .91 with the Thorndike Test 
of Word Knowledge and .85 with the vocabu- 
lary section of the New Stanford Achieve- 
ment Test. As to the reliability of the test 
there is some question. The coefficient reported 
by the author is .89, based on 297 students. 

One disadvantage of the test is the fact that 
the words are given alone and not in sentences 
or phrases. However, the selection of words on 
the whole appears to be good. There are 100 
items in each form. In two instances, at least, 
students in the United States would be puzzled 
by the spelling. Item 76 is listed as dyke when 
most of our students would know it as dike. 
ltem 92 is given as silvan, and our students 
would know it as sylvan. There are a few un- 
usual words such as palfrey and jejune. 

Norms are based on 1,939 students. The 
range of cases is from 119 in grade 7 to 350 
in grade ro. The test would have been better 
if norms had been based on more students, and 
particularly students from a much more ex- 
tensive geographical area. 


[ 163 ] 
Inglis Tests of English Vocabulary. Grades 9-16; 
1924-27 ; Forms A, B, C; $1.60 per 30; nontimed (30) 
minutes; Alexander Inglis (Form C completed by 
Ralph W. Walter) ; Ginn & Co. 
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Henry D. Rinsland, Director, Bureau of 
Educational Research, The University of Okla- 
homa, Norman, Oklahoma. Form A was copy- 
righted in 1923, 1924, and 1936; Form B, in 
1924, and Form C, in 1927; therefore, the 
tests are fairly old. The norms are in terms 
of median scores for grades 9-12, college 
freshmen, and college graduates; the variation 
of percentage correct runs from 30 per cent or 
45 words out of a total of 150 words for ninth- 
graders to 86 per cent or 129 words for col- 
lege graduates. 


Little can be found about the steps in valida- 
tion, as the reference in the description of the 
test gives the English Leaflet for October 
1923 as a publication carrying the full account 
of the method employed in the development of 
the test; but this Leaflet cannot be obtained 
either from the English Society in New Eng- 
land, which published it, or from the printer. 
The coefficient of correlation for Form A with 
Form B is .9o. The medians are within one 
point of each other. 

Inglis reported that the words tested repre- 
sented a "true sampling of the field covered by 
the Intelligent General Reader's vocabulary." 
He first made a careful analysis of the non- 
obsolete words other than those belonging to 
our everyday vocabulary of commonest words 
or to our special and technical vocabulary. 
From these he chose several lists of about 
3,000 words each, and from one of these for 
Form A he chose words 1, 17, 33, etc.; for 
Form B, words 2, 18, 34, etc.; and for the 
other forms he evidently intended to sample 
in a similar fashion. 

Nothing is given about the placement of 
these words in the Thorndike 1921 frequency 
count. It is evident that whatever value the 
present test has is based upon the superior 
judgment of the author in selecting words 
which gave fair sampling distributions from 
grade to grade as judged by the spacings of 
the median scores; to his very excellent choice 
of the sample sentences in which each word 
is given; and to the wise choice of the correct 
word as well as the wrong words. 

It is very probable that, if the words tested 
in the three forms were checked against the 
Thorndike-Lorge Teacher's Word Book of 
30,000 Words, a test more valid for these levels 
would be obtained by a thin sampling in the 
lower and the higher levels, say, from the fourth 
to the tenth thousand and from the twentieth to 
the thirtieth thousand; and a richer sampling 
from the middle levels, say, the tenth to the 
twentieth thousand. 


[ 164 ] 
*Kansas Vocabulary Test. Grades 4-8; 1940; 75¢ 
per 25, postpaid; 15¢ per specimen set, postpaid; 30 
(35) minutes; H. E. Schrammel, O. M. Rasmussen, 
Anna Huebert, and Donald J. Tate; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia. 


Harold H. Bixler, Director of Research and 
Guidance, Public Schools, Atlanta, Georgia. 
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This test has 85 items of the multiple-choice 
type. The authors state that the purpose of the 
test is to "measure the pupil's reading vocabu- 
lary and that it is a valuable aid for diagnosing 
reading weaknesses." Since the validity of any 
test is the most important consideration, it is 
well to quote the author's statement: “The 
words comprising the test were selected from 
the vocabulary in a large number of school- 
subject fields and other supplementary read- 
ings of interest to pupils of this school level. 
An attempt was made to select words from the 
various fields in the same proportion as the 
pupils will in general be confronted by them. 
All words were checked against the Thorndike 
Word Lists in order to insure an equitable dis- 
tribution according to difficulty and frequency 
of use.” Inspection of the test indicates that 
the author has included a wide variety of 
words. 

One feature of this test is the fact that the 
words are presented in a sentence or phrase, 
e.g., “sterilize the container.” This context fea- 
ture makes the test more nearly a lifelike situ- 
ation. 

As to reliability there would seem to be 
some question. The authors state that the reli- 
ability coefficients range from .84 to .96, with 
an average of .go. Authors of other vocabulary 
tests have reported reliability coefficients as 
high as .95. The reason, of course, for low 
reliability is the fact that there are only 85 
items in this test. The time limit is 30 minutes. 
It would have been better if more items had 
been included, and the time limit extended to 
a somewhat longer period. 

Percentile norms are provided for midyear 
testing and for end-of-the-year testing for 
grades 4-8 inclusive. The range in number of 
cases is from 543 to 914. No statement is given 
as to the distribution of cases by states or by 
urban and rural areas. However, since the tests 
were used in the 1940 Kansas Every Pupil 
Test Survey, it is assumed that most of the 
cases are from that state. The authors do not 
give norms in terms of grade equivalents. 
Many teachers in these grades prefer grade- 
equivalent scores. 

On the whole, the multiple-choice items 
listed in the test appear to be well chosen. 
However, some choices, intended to be the 
correct answers, are not so good. For instance, 
the expected definition of navigable is given as 
passable. 

> 


By way of summary, the reviewer feels that 
this test will serve a useful purpose. 


[ 165 ] 
Kennon Test of Literary Vocabulary, 1941 Edi- 
tion. Prospective teachers of English in secondary 
schools; 1926-41; Forms I, II; $8.50 per 100; 30¢ per 
specimen set; nontimed (40) minutes; Laura H. V. 
Kennon and Irving Lorge; Bureau of Publications, 
Teachers College, Columbia University. 


REFERENCES 
1. Kennon, Laura Hart Vere. Tests of Literara Vocabular. 


‘or Teachers of English, Columbia University, Teachers Col- 
fs, a atona to Education, No- Zia) Out of pant. n 
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H. H. Remmers, Director, Division of Educa- 
tional Reference, Purdue University, Lafayette, 
Indiana. The purpose of this test in the words 
of the manual is "to arrive at an estimate of 
the control of word knowledge specifically re- 
lated to the interpretation of English and 
American literature" and thus to be of value in 
"selecting for mastery of content . . . students 
who are preparing to teach English in the 
junior or senior high school, and also in coun- 
seling students toward a better orientation in 
their chosen field" on the assumption that 
“one important asset of prospective teachers 
of English is the possession of an extensive 
and special literary vocabulary.” 

Both curricular validation (occurrence of 
words in relevant types of literature and in 
the works of specified authors) and statistical 
validation (correlation of scores of teachers 
and prospective teachers against two older 
forms, item-correlation vs. total score and vari- 
ous tests as well as comprehensive English 
departmental examinations at Teachers Col- 
lege) make the statement of purposes perstta- 
sive and acceptable as having been achieved in 
this test at least for mine-run students and 
literature courses. The average item difficulty 
is optimal—so.6 per cent. Centile norms for 
the equated two forms are provided, based on 
1,091 teachers and prospective teachers. 

To call the correlation of .93 between each 
of the older Forms A and B and their new 
counterparts I and II a “validity coefficient,” 
except in a narrow, statistical and rather un- 
real sense (validity by definition), elicits from 
this reviewer a demurrer re semantic liberties. 
But this is perhaps a very minor point. The 
reliability of each form was found to be .94 
for the maximum range of talent (teachers and 
prospective teachers) investigated, and .97 for 
the average of both forms. It would have been 
desirable to include in the manual the stand- 
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ard error of a score on the test. Administration 
and scoring are simple and efficient, 


[ 166 ] 
Michigan Vocabulary Profile. Grades 9-16; 1937- 
39; IBM; Forms AM, BM; separate answer shcets 
need not be used; $1.90 per 25; 35¢ per specimen 
set, postpaid; $1.10 per 25 machine-scorable answer 
sheets; 40¢ ‘per set of stencils for scoring answer 
sheets ; nontimed (50) minutes; prepared under the 
direction of Edward B. Greene; World Book Co. 
(1939). 
REFERENCES 
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Joseph E. King, Director, Personnel Man- 
agement Consultants, Chicago, Illinois. This 
test measures word knowledge in eight fields of 
information: human relations, commerce, gov- 
ernment, physical science, biological science, 
mathematics, fine arts, and sports. The basic 
factor tapped by the test battery appears to be 
verbal ability, which involves the ability to un- 
derstand words and ideas, is the usual compo- 
nent of verbal intelligence tests, and is meas- 
ured by tests of vocabulary, definitions, and 
same-opposites. The fact that the Michigan 
Vocabulary Profile correlates .61 with the 
American Council on Education Psychological 
Examination and .56 with the Cooperative 
English Test indicates its loading in factor V. 

What is being measured by the Michigan 
Vocabulary Profile is probably general V plus 
a series of specific V’s. The median correlation 
among the eight tests of the battery is .27 for 
a college sophomore group, indicating a good 
degree of independence between the subtests 
for this particular population and thus show- 
ing each score to be valuable in itself as a 
predictor. Since studies indicate a relationship 
between technical vocabulary and preference, 
the test may be considered to measure interest 
in the area. The student’s score is probably 
also influenced by his scholastic background. In 
general, the profile may be said to measure the 
student’s ability to deal with words, and to 
some extent his interest, in broad technical 
areas. In view of this, the contribution of the 


test to a profile of the student’s abilities and 
interests might be questioned. No correlation 
between the profile and standard tests of this 
type are furnished, but from the traits sam- 
pled by the Profile there would appear to be 
considerable overlap. Also, since the Vocabu- 
lary Profile samples word achievement more 
than verbal aptitude, its usefulness for predic- 
tive purposes at the high school level would 
probably not be as great as a measure of every- 
day activities related to a given area. 

The test is composed of eight subtests, is 
administered without time limit (about 50 min- 
utes required), and employs a separate answer 
sheet for both hand- and machine-scored 
forms. Responses are multiple choice, and two 
equivalent forms are available. Items were sc- 
lected by the criterion of internal consistency 
and are arranged in order of difficulty. Equiva- 
lent-form reliabilities range from .78 to .94. 
Raw scores, obtained by stencil or scoring 
machine, are converted to percentile norms. 
Norms are available for 4,677 students from 
ninth grade to college senior. A profile sheet 
is contained in the test booklet to plot the 
student's pattern of technical vocabularies. 
Typical profiles for seven educational and 
three occupational groups are provided. The 
Michigan Vocabulary Profile thus appears to 
meet the desired standards of test construction 
and usage. The profile could be improved by 
allowing conversion to percentile rank to occur 
automatically by plotting raw scores on the 
profile and by adapting the profile for student 
use and self-interpretation. Sound interpreta- 
tion of the student's vocabulary pattern re- 
quires more research as to the profiles of work- 
ers in various scholastic and occupational 
groups. 


For a review by Herbert A. Landry, see 
40:1320. For reviews by John G. Darley, 
Richard Ledgerwood, John M. Stalnaker, M. 
R. Trabue, and Arthur E. Traxler of an earlier 
edition, see 38:1171. 


{ 167 ] 
Survey Test of Vocabulary. Grades 3-13; 1931-33; 
Forms X, Z, X4; $1.25 per 50 copies of Form X or Y, 
postpaid ; $1.75 per 50 copies of Form X4, postpaid; 25¢ 
per specimen set, postpaid ; L. J. O'Rourke; Psycholog- 
ical Institute. 
REFERENCES 
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Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 
This test was developed for use in a nation- 
wide study of vocabulary which was sponsored 
by the Psychological Corporation some 15 
years ago and is now being offered for general 
use by the Psychological Institute. 

The test is presented without a manual. The 
directions for administering and scoring are 
printed on the test sheet itself, and an accom- 
panying mimeographed sheet reports the cor- 
rect answers for the items and mean scores for 
succeeding grades from grade 3 through 12 
(based upon from two to more than three 
thousand children at each grade). 

No information is furnished concerning how 
the test was constructed (the original form 
used in the survey did report that it was based 
upon Thorndike’s Teachers Word Book); 
what its strengths and weaknesses are; what 
its reliability is; or how and when the norms 
were obtained (they are labeled nationwide 
grade norms). Nor are any suggestions offered 
concerning its possible uses. No attention seems 
even to have been given to the matter of econ- 
omy of use. 

The test may have served its original pur- 
pose, but the form in which it now appears 
makes it difficult if not impossible to arrive at 
a reasonable judgment concerning its worth. 
Unless one is willing to accept the test on the 
basis of the prestige of its sponsor or because 
of the author's reputation in the field of meas- 
urements (Dr. O'Rourke served for years as 
the director of research in the U. S. Civil 
Service Commission), there is no intelligent 
method by which one can decide for or against 
its use except to evaluate it experimentally 
oneself. This can only mean that from a prac- 
tical standpoint the usefulness of the test is ex- 
tremely limited. 


Clifford Woody, Professor of Education, 
University of Michigan, Ann Arbor, Michigan. 
This vocabulary test was but one of several 
tests employed in a nationwide effort to deter- 
thine the status of mastery of English usage in 
grades 3 to 13. The vocabulary test consists 
of 100 items of the multiple-choice type. It is 
based upon the Thorndike Teacher's Word 
Book of Twenty Thousand Words—a list in 
which the words are distributed into groups 
according to frequency of usage in the sources 
upon which the word count was based. In 


building the test, five words were selected from 
each of the twenty levels indicated by Thorn- 
dike. Thus, the first group of five words in the 
test represents words in the first thousand’ 
most commonly used words; the second group 
of five words, those in the second thousand 
most commonly used words, etc. Each word 
chosen for the test is in a simple sentence with 
the word to be defined printed in capital let- 
ters. Each sentence is followed by five words 
from which the person taking the test must 
select the one which means the same as the 
word printed in capital letters, Care was taken 
in selecting the five words from which the 
synonym is to be chosen to select all five words 
from the levels of usage which are lower in 
frequency than that on which the word to be 
defined occurred. Thus, if frequency of use 
can be taken as a criterion of difficulty, it can 
be assumed that the word to be defined is al- 
ways more difficult than the response words 
from which the synonym is to be selected. It 
should be pointed out that, while the above 
assumption tends to be true, there are many 
exceptions in which frequency of use is no 
measure of the difficulty in defining a word. 
While O’Rourke’s initial selection of words 
was based upon the level upon which the words 
occurred in the Thorndike List, the final se- 
lection was upon frequency levels and actual 
difficulty determined by the percentages of cor- 
rect responses made by 600 pupils in grades 
3, 5, 7, 9, and 12. Only those words which 
were common to the Thorndike scale and the 
difficulty scale by actual trial were selected. 
Words not meeting the double requirement 
were eliminated. 

The word lists and response words for Forms 
X and X4 are identical. Form X was copy- 
righted in 1931; Form X4, in 1940. Form X 
is printed on both sides of a single page; 
Form X4, as a four-page booklet. The time 
limit for Form X is 20 minutes; for Form X4, 
10 minutes. The instructions for the two forms 
are essentially the same, although those for 
Form X4 are somewhat more extended and 
include three practice exercises not given in 
the instructions for Form X. Both sets of in- 
structions call for recording the response in 
such fashion that scoring with a provided key 
is facilitated. In determining the score on each 
form, each correct response counts one point. 
Norms of achievement are provided for grades 
3 to 13 based on responses from 26,008 pupils, 
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with from two to three thousand per grade. 
The norms provided are for the 20-minute 
test, but no doubt norms for the ro-minute 
test are or will be available soon. Form 2 of 
the tests has been advertised as equivalent to 
Form X, and the author has indicated still 
other forms are being constructed and will be 
available in the near future. The coefficient of 
reliability of the test as provided by the author 
is .98. No coefficients of validity have been 
provided, but the reviewer's experience with 
similar tests gives assurance that one need not 
question the test on that basis. This test has 
been widely used by the United States Em- 
ployment Service and in personnel work in 
many industries. The reviewer has no hesita- 
tion in recommending the use of this test. 


[ 168 ] 
*Vocabulary Test: National Achievement Tests. 
Grades 3-8, 7-12; 1939-43; Forms A, B; 2 levels; 85e 
per 25 tests for grades 3-8; $1.50 per 25 tests for grades 
7-12; 15¢ per specimen set of either level; nontimed 
(r5) minutes; Robert K. Speer and Samuel Smith ; 
Acorn Publishing Co. 


Clifford Woody, Professor of Education, 
University of Michigan, Ann Arbor, Michigan. 
The test for grades 3-8 consists of 50 vocabu- 
lary exercises of the multiple-choice variety 
printed on both sides of a single page; the test 
for grades 7-12 consists of two parts: Part I, 
Word Meanings, consisting of 60 exercises 
like those in the test for grades 3-8, and Part 
II, Word Discrimination, consisting of 20 ex- 
ercises in which the pupil must complete a 
given sentence by selecting the appropriate 
word from the four given words accompanying 
the incomplete sentence. In the first test and 
in Part I of the second test, the exercises em- 
phasize recognition of a synonym for a given 
word; in Part II of the test, the ability to se- 
lect the word that completes correctly the 
meaning of the sentence. 

The reviewer does not have available the 
exact method employed in selecting the words 
for this test, but in the catalogue describing 
the test occur the following statements: (a) 
“For each part of each test, the words used 
have been carefully selected and graded”; (b) 
in each exercise of the vocabulary test, "the 
base word is considered more difficult than the 
synonym given; therefore the pupil's choice 
tests his ability to compare the word with a 
correct word slightly less difficult." Checking 
the list of base or synonym words with the 


levels of frequency of use on the Thorndike 
list of twenty thousand most commonly used 
words revealed little relationship in the order 
of the words appearing in the forms of this 
test and in the Thorndike levels. The catalogue 
indicates that each test has been used experi- 
mentally, has been analyzed by specialists, and 
has been checked against courses of study and 
content of textbooks; but no specific details 
have been given. 

Two characteristics of the test exercises ap- 
peal to the reviewer: (a) All words of the 
sentence are much more simple than the word 
to be defined. (b) The correct. response can 
be marked by reference to a code word, thus 
making an answer form unnecessary. There is 
no time limit for any of the tests, but approxi- 
mately 15 minutes is required. Norms are 
given for grade and age on a month-by-month 
basis. A special set of norms is given for pupils 
with high and low IQ’s. The coefficients of re- 
liability of .94 and .95 are reported for the 
two tests. No coefficients of correlation with 
other vocabulary tests are available. The scor- 
ing by means of the code words eliminates the 
necessity of a scoring key, but it precludes 
allowing the pupils to score their own tests 
unless they should do so from a perfect test 
paper. Otherwise the pupil will have the code 
word and thus invalidate the use of the test 
for future use with other pupils. 


[ 169 ] 
Wide Range Vocabulary Test. Grades 3-16 and 
adults; 1937-45; Form B presents words in order of 
difficulty; Form C presents words alphabetically ; $1.75 
per 25, postpaid; 35¢ per specimen set, postpaid; non- 
timed (10) minutes; C. R. Atwell and F. L. Wells; 
Psychological Corporation. 


REFERENCES 

1. ATWELL, C. R, anp Werts, F. L. “Wide Range Multiple 
Choice Vocabulary "Tests." J Appl Psychol 21:$50-$ O '37. (PA 
12:2148) 

Paul S. Burnham, Associate Director, Stu- 
dent Appointment Bureau, Yale University, 
New Haven, Connecticut. A multiple-choice 
form of vocabulary test, consisting of 100 
words with five potential synonyms for each 
word. There is no time limit, and under normal 
circumstances approximately ro minutes are 
needed. The grade range is grade 3 through 
adult. Two forms claimed to be equivalent are 
available: Form B, copyrighted in 1937, pre- 
sents the words in order of difficulty; Form C, 
copyrighted in 1945, presents words in alpha- 
betical order. Median scores for groups of 
average ages—8 through 21—are given as 
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norms. No information is given about the size 
of population on which these medians were 
based. 

The manual gives no data on reliability of 
either test, their intercorrelation, or correla- 
tion with other well-known measures. Subjec- 
tive impression leads one to believe that this 
test might be useful for a preliminary screen- 
ing particularly if followed by more extensive 
testing of the verbal factor. 


[ 170 ] 


*Word Dexterity Test. Grades 9-13; 1942; Form 
A; $2 per 25, postpaid ; 100 or more, 614¢ per copy, post- 
paid; 25¢ per specimen set, postpaid; nontimed (40- 
45) minutes; Shailer Peterson; the Author, 222 East 
Superior St., Chicago 11, Ill. 


REFERENCES 


1. Pererson, Suatter A. “Teaching the Special Vocabu- 
laries." Engl J, Col Ed 23:53-6 Ja "34. * 

2. Pererson, Suarer, "The Word-Dexterity Test: A Better 
Measure of College Aptitude. ' Ed & Psychol Meas 4:307-13 wW 
"44. * (PA 20:326) 


FINE ARTS 


Reviews Bv Paul R. Farnsworth, William S. Larson, James L. Mursell, Edwin Ziegfeld. 
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*Horn Art Aptitude Inventory: Preliminary 
Form, 1944 Revision. Grades 12-16 and adults; 1939- 
44; 8¢ per test; 50¢ per manual; (50) minutes; Charles 
C. Horn; Office of Educational Research, Rochester 
Institute of Technology, Rochester, N. Y. 


REFERENCES 

r. Horn, CuarLes C., AND SwiTH, Leo F. “The Horn Art 
Aptitude Inventory." J Appl Psychol 29:350-5 O '45. * (PA 
20:857) 

Edwin Ziegfeld, Head, Department of Fine 
-and Industrial Arts, Teachers College, Colum- 
bia University, New York, New York. This is 
a two-part performance test designed to meas- 
ure art aptitude. The first part requires the 
subject to make drawings of lines and shapes 
and has two subparts: A, outline drawings of 
simple suggested objects—square, house, box, 
tree, etc.; and B, outline drawings of a num- 
ber of suggested geometric lines or forms— 
six triangles, a square in which the subject 
places another square, a rectangle which he 
breaks up with two lines, etc. In the second 
half of the test, titled Imagery, certain lines 
are printed in rectangles. These are to act as 
“spring boards” for compositions which are 
drawn in the rectangles. 

For scoring, examples are furnished of ex- 
cellent, average, and poor work against which 
the work of the subjects can be compared. A 
number of factors such as order and clarity of 
thought and presentation are listed and de- 
scribed to serve as guides in rating the tests. 
The author feels that lay persons, by referring 
to the examples, can score the test with accu- 


racy and cites several studies using a small 
number of cases in which tests were scored in- 
dependently by art faculty members and lay 
persons. Correlation coefficients between the 
two séts of scores ranged from .79 to .86. 

Two small validity studies have been con- 
ducted. One compared scores on the test with 
faculty ratings of success of art school grad- 
uates. This yielded a correlation coefficient of 
.53. With high school students a correlation 
coefficient of .66 resulted. Like all art tests, 
correlation with measures of intelligence is low : 
.15 with the American Council on Education 
Psychological Examination. 

The test has merit in that it requires the sub- 
ject to perform several of the activities in- 
volved in art production. There is some ques- 
tion as to whether these activities are broadly 
representative. There is little doubt that the test 
would reveal a certain type of talent in that the 
subject is required to sketch rapidly various‘ 
objects and ideas. If he conforms to tradition 
in his methods of delineation and composition, 
he can be rated with reasonable accuracy. Since 
there are but three steps in the rating scale, 
only rather general differentiations among sub- 
jects can be made, 

Great artists, even at a young age, are usu- 
ally remarkable for their deviations from pre- 
vailing norms in their interpretations of objects, 
in their compositions, in their techniques. There 
is considerable doubt as to how such an indi- 
vidual would fare in this test. Judging original 
creative work by existing accepted examples 
would almost certainly result in the rejection 
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or low rating (particularly by an untrained 
person) of the truly original artist. 


[ 172 ] 

*Meier Art Tests: I, Art Judgment. Grades 7-12; 
1929-40; revision of Meier-Seashore Art Judgment 
Test; 1 form; test booklets may be used repeatedly; 
75¢ per test; 25 or more, 60¢ each; 100 or more answer 
sheets, 2¢ each; 10¢ per manual; $1 per specimen set, 
postpaid ; nontimed (45-60) minutes; Norman Charles 
Meier; Bureau of Educational Research and Service, 
State University of Iowa. 


REFERENCES 

1-15. See 40:1326. 

16. Sroiz, ALaerTa, AND Manue, H. T. "The Art Ability 
of Mexican Children." Sch & Soc 34:379-80 S 12 '31. * (PA 
o: AULKNER, Rav Nzrsow. An Experimental Investiga- 
tion Designed to Develop Tests to Measure Art Understand- 
ing ee carat unpublished doctor's MER University 
ol à, 1937. (Summaries o; . Theses . . ., 1939, 
PP: 5015) iow Rosert S. “An Experimental Analysis of the 
Beery ef padependent Abilities." J Ed Psychol 32:495-512 

19. Barrett, DomorHy M. "Aptitude and Interest Patterns 
of Art Majors in a Liberal Arts College." J Appl Psychol 
29:483-92 D ’45. * (PA 20:1248) 

Edwin Ziegfeld, Head, Department of Fine 
and Industrial Arts, Teachers College, Colum- 
bia University, New York, New York. This is 
a revision of the Meier-Seashore Art Judg- 
ment Test. It contains 100 plates instead of the 
former 125, and the scoring is weighted so 
that the total is the same as in the earlier form. 
This change has, according to the author, re- 
sulted in a significant improvement of the in- 
strument by enhancing high scores and lower- 
ing poor ones. A shorter time is also required 
for administration. The present test is the first 
in a projected battery of three concerned with 
the discovery and measurement of art ability. 
The two which are to follow are on Creative 
Imagination and Aesthetic Perception. 

All plates are in black and white, and each 
consists of two pictures, identical except for 
one portion which has been altered. In taking 
the test, the subject is required to select the 
better of the two. Validation was by opinion 
of experts; the subject therefore pits his judg- 
ment against that of the specialist. Reliability 
of the test is from .70 to the low .8o's. Norms 
are provided in percentiles and quartiles at 
three levels: junior high, senior high, and col- 
lege-adult. 

The author of this test feels that aesthetic 
judgment is one of the most important, if not 
the most important, single factor in artistic 
competence, a trait which this test was de- 
signed to measure. Study of the test reveals 
that of the roo plates, 94 are of paintings (or 
of related graphic media), 3 are of vases and 


urns, 3 are of designs which appear to be 


derived from or suitable to pottery. The meas- 
ure of aesthetic judgment is, therefore, based 
almost entirely upon the graphic arts, and the 
assumption apparently is made that this is also 
a measure of aesthetic judgment in other areas, 
such as sculpture, architecture, etc. Correla- 
tions between scores on the earlier version of 
the test with measures of art judgment in other 
areas are generally low, and the assumption 
that this test is a measure of aesthetic judg- 
ment in all areas is not proved. 

Study of the test also reveals that the dif- 
ference between the two pictures in a plate 
are based upon factors of design; that is, the 
balance has been destroyed, emphasis altered, 
proportions changed, and the rhythm of continu- 
ity broken. To assume that aesthetic judgment 
is based entirely upon the ability to respond to 
design is a misleading oversimplification. There 
are many other factors, such as expression, color 
materials, subject matter, and all their inter- 
relationships, that enter into the making of an 
aesthetic judgment. 

What the test measures, however, it meas- 
ures well; it is the most satisfactory of all the 
art tests that have been constructed. The test 
is easy to administer and score and is reliable; 
the plates are not dated in their appearance. As 
a measure of sensitivity to design, it is helpful 
and useful, and this trait is one that is prob- 


ably the property of all artists. Results of the. 


test, so far as counseling is concerned, must 
be used with caution. The author points out in 
the manual other desirable traits which should 
be possessed by individuals who hope to be a 
success in the field of art. 


For reviews by Paul R. Farnsworth and 
Aulus Ward Saunders, see 40:1326. 
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Selective Art Aptitude Test, Second. Edition. 
Grades 9-16 and adults; 1939-46; 1 form; separate an- 
swer booklets must be used; $1.20 per test ; 32¢ per an- 
swer booklet; 68¢ per examiner's manual and supple- 
ment ; $1.34 per statistical manual ; 6¢ per sheet for any 
one of the three revised scoring sheets of the individual 
pupil record file; (50) minutes; William Harrison 
Varnum; International Textbook Co. 


Edwin Ziegfeld, Head, Department of Fine 
and. Industrial Arts, Teachers College, Colum- 
bia University, New York, New York. De- 
vised for prognoses for professional futures, 
this test is composed of seventeen parts which 
are grouped into seven subtests; these, in turn, 
are arranged in three groups, A, B, and C. 
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Group A is composed of tests concerned with 
form observation and color memory. Groups 
B and C are concerned with the measurement 
of "feeling-tone"; the former, proportions, 
balance and rhythm; the latter, speed in break- 
ing tip forms and in creative imagination. The 
test is scored by subtests and by groups; and 
by observing the interrelationships which exist 
among the scores of the subject on the various 
groups and relating these to the interrelation- 
ships which exist among these same groups 
for professional workers in art, prognoses can 
be made. 

Reliability on the total test by the split-half 
technique is .83, by retesting, .88. The test has 
been validated by administering it to profes- 
sional workers in various fields, such as inte- 
rior decorators, architects, department store 
employees, and commercial artists, and observ- 
ing the patterns of scores made in the various 
parts, subtests, and groups by workers in art 
fields with varying specific duties and abilities. 
No validation data have been published on the 
degree of professional success gained by indi- 
viduals in relation to test scores, but the author 
has indicated that on the basis of available 
data the test appears to have promise. Three 
sets of norms have been established for subtest 
and group scores: for 14- to 18-year-olds who 
have had no art training ; for the same age level 
but with art training; and for adults 18 and 
older who have had art training. Like practically 
all other tests involving art discriminations and 
judgments, the correlation with intelligence 
quotients is low but positive. Varnum reports 
correlations with the Meier-Seashore Art Judg- 
ment Test, on scores from five high school art 
classes, ranging from —.02 to .25. Unlike most 
art tests, there seems to be no significant dif- 
ference between scores made by males and 
females. 

Little general use of this test has been made 
to date. Judging from data collected by the au- 
thor, it gives evidence of promise. In particular, 
the test takes some cognizance of the complex 
nature of art and the fact that many abilities are 
involved. Various scoring patterns, while ap- 
pearing complex, make the instrument a flexi- 
ble one for interpreting scores in relation to 
various art jobs. The highly inartistic appear- 
ance of some of the test items, particularly the 
color plates, is unfortunate. Not enough work, 
however, has been done with this test to date 
to be able to state what it does or does not do, 


but a number of new directions have been de- 
veloped. 


MUSIC 
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Beach Music Test. Grades 4-16; 1920-39; 1 form; 
$1.50 per 25, postpaid; 20¢ per specimen set, postpaid ; 
(40) minutes; Frank A. Beach and H. E. Schrammel; 
Bureau of Educational Measurements, Kansas State 
Teachers College of Emporia. 


James L. Mursell, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. This instrument for the meas- 
urement of musical achievement consists of 
eleven subtests: (a) knowledge of musical sym- 
bols (by recognition from the notation) ; (b) 
recognition of measure (recognizing the number 
of beats in measures played to the subjects) ; (c) 
tone direction and similarity (indicating direc- 
tion and repetition in melodic items played to the 
subjects) ; (d) pitch discrimination (deciding in 
various situations which tone is higher or 
lower, highest or lowest) ; (e) application of 
syllables (indicating correct syllable designa- 
tions for tones heard) ; (f) time values (indi- 
cating the time-placement of designated notes 
or note patterns) ; (g) musical terms and sym- 
bols; (h) correction of notation (from listen- 
ing to what is played and making corrections 
on the test blank) ; (i) syllables and pitch names 
(writing in correct syllable and pitch names 
under the notation); (j) representation of 
pitches (writing down in notation the pitches 
of various notes heard after establishment of 
the tonic); and (k) composers and artists 
(knowledge items). 

Items have been selected on basis of teacher 
opinion. The test has been validated against 
teacher rankings on (a) knowledge of musical 
fundamentals and (b) general musicianship. 
Correlations with musicianship ranged from 
.35 to .92, with a median of .65; with knowl- 
edge of musical fundamentals from .14 to .94, 
with a median of .74. Average split-half relia- 
bility coefficients were .86 for college students, 
.83 for high school pupils. Percentile grade 
norms are presented for each grade from the 
7th to college, inclusive. 

This instrument probably has considerable 
value. Its most commendable feature is that it 
requires several types of significant response 
to actual music, instead of dealing largely or 
exclusively with responses to purely verbal 
situations. 
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Drake Musical Memory Test: A Test of Musical 
Talent. Ages 8 and over; 1934; Forms A, B; 30¢ per 
musical-score booklet of either form; 30¢ per manual ; 
3¢ per score sheet; 6o¢ per specimen set, postpaid ; (25) 
minutes; Raleigh M. Drake; Public School Publishing 
Co. (The test was recorded on phonographic records in 
1942 and is available directly from the author [Kent 
M University, Kent, Ohio] at $6.50 per testing out- 


REFERENCES 
1. Karun, J. E. “Music Ability." Psychometrika 6:61-5 


F '41. * (PA 15:2692) 

2. Lunpin, Rosert W. “The Development and Validation 
of a Set of Musical Ability Tests.” Abstract. Am Psychol 
2:350 Ag '47. * (PA 21:4664, title only) 

William S. Larson, Chairman of the Music 
Education Department, Eastman School of 
Music, The University of Rochester, Roches- 
ter, New York. This test is presented as a test 
of musical talent. It has two forms of approxi- 
mately the same length and difficulty, but the 
author states that to secure increased relia- 
bility and a more accurate measure of individ- 
ual ability in the interest of actual prognosis 
or guidance both forms should be used and the 
sum of the two taken as a final score. Norms 
in percentiles are given for eight different age 
levels from seven to twenty-four for both boys 
and girls for each of the forms and for the 
combined form. 

This, test makes use of twelve melodies, each 
one in turn being of increasing length and dif- 
ficulty. The subject is to determine whether 
the melody, when repeated in various ways, is 
exactly the same (S), whether it is the same 
melody played in a different key (K), whether 
the time has been changed (T), or whether 
any notes have been changed (N). 

The printed test, with instructions for the 
experimenter to play the various examples on 
the piano, is the one originally presented and 
generally used. This type of presentation is 
naturally subject to considerable variation in 
performance. However, this test may now be 
secured in recorded form directly from Pro- 
fessor Drake. The set of records which this 
writer secured within the past year consisted 
of one record for practice examples, and one 
each for Form A and Form B. The recording 
was done on Silvertone record discs, the mate- 
rial of which seems to be of questionable qual- 
ity. Instructions recommend the use of cactus 
needles only, and it is doubtful that the rec- 
ords will provide very many reproductions 
even when considerable care is exercised in 
playing them. The recorded test, however, does 
have the distinct advantage of accurate and 


uniform presentation, and the quality of the 
record when played on a good phonograph is 
fairly satisfactory. 

For reviews by Paul R. Farnsworth, and 
James L. Mursell, and an excerpt from a re- 
view by E. L. Abell, see 38:1083 and 40:1330. 
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Kwalwasser-Dykema Music Tests. Grades 4-16 and 
adults; 1930; 1 form; $2.25 per manual, 100 test 
blanks, and keys; records not available at present; 
(60) minutes; Jacob Kwalwasser and Peter W. Dy- 
kema ; Carl Fischer, Inc. 
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William S. Larson, Chairman of the Music 
Education Department, Eastman School of 
Music, The University of Rochester, Roches- 
ter, New York. The Kwalwasser-Dykema 
Music Tests consist of ten tests recorded on 
five double-faced ten-inch records. The tests 
are (a) Tonal Memory, (b) Quality Discrimi- 
nation, (c) Intensity Discrimination, (d) Feel- 
ing for Tonal Movement, (e) Time Discrimi- 
nation, (f) Rhythm Discrimination, (g) Pitch 
Discrimination, (A) Melodic Taste, (i) Pitch 
Imagery, (j) Rhythm Imagery. 

These tests are generally considered as talent 
tests to be used for prognostic purposes, but the 
authors in the manual accompanying the tests 
state that "Experimental investigations in the 
field of psychology have established the impor- 
tance of these tests as indicative of music talent 
and achievement. They are however intended to 
reveal reliable information for the guidance of 
student, teacher and parent." 

The K-D tests were presented in 1930, a time 
representing about one-half the elapsed period 
between the appearance of the original and the 
revised Seashore tests. The manual does not 
give data on reliability and validity. 

Most of the ten tests have twenty-five or thirty 
examples, the two exceptions being Melodic 
Taste which has ten examples and Pitch which 
has forty examples. A glance at the norms indi- 
cates that in most of these tests not enough 
discriminating items are given in the center of 
the distribution where a concentration of cases 
is usually found. Evidently most of them have 
an undue proportion of difficult items. Accord- 
ing to the norms, a subject can miss half of the 
test items in Pitch Discrimination and have a 
percentile rank of 81. In Tonal Memory twenty 
of the twenty-five examples are used to test the 
upper 50 per cent; and when considering the 
total scores, a subject needs only to discriminate 
one-half of the examples and guess one-half of 
the remainder to have a percentile rank of 100. 

These tests are relatively easy to administer 
and score; the responses for all ten can be re- 
corded on a single answer blank. Such advan- 


tages may partly account for the fact that they 
are used quite extensively by music educators. 
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Seashore Measures oi Musical Talents, Revised 
Edition. Adults, musicians and prospective musicians ; 
1919-39; 1 form; 2 levels; $9 per set of six records, 
manual, and 200 test blanks; 50¢ per manual; 40£ per 
200 test blanks; RCA Victor Division, Radio Corpora- 
tion of America. 
a) SERIES A, Adults. F 14 
b) sERIEs B. Musicians and prospective musicians. 
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named the “measure of intensity” has now 
been given the more psychological title of 
“measure of loudness,” 

Each measure appears in two forms, an 
easier one for crude, group work and a more 
difficult record-disc face for individual and 
more exact testing. The source of stimuli for 
the pitch, loudness, and time tests was a beat- 
frequency oscillator; for the rhythm test, two 
oscillators ; and for the measure of tonal mem- 
ory, a Hammond organ. The differences in 
timbre were effected by reciprocal alterations 
in the intensities of the third and fourth har- 
monics of tones from a generator of the elec- 
trostatic type. In the time test the judgments 
now concern filled time rather than the unfilled 
of the older measure. 

Norms are given for grades 5 and 6, grades 
7 and 8, and for adults, These data were de- 
veloped independently for each form, rather 
than by giving Form A first and then Form B. 
Reliabilities of from .62 to 89 are reported, 
the highest being those for tonal memory, 
pitch, and loudness, No information has ap- 
peared as yet on the interrelationships of the 
several measures, on the relationships between 
the pairs of forms, or on the reliabilities of the 
combined forms of each measure, The reviewer 
has so far accumulated data only on the meas- 
ure of rhythm. The two forms were found to 
have an intercorrelation of .33 while the relia- 
bility of the two forms treated as a single test 
was .72 (stepped up). 

The newer battery has a number of advan- 
tages over the older measures—better record- 
ing, two forms which differ in difficulty, a 
better selection of areas in which to test, and 
more musical-sounding stimuli, The battery is 
so much better in almost every way than its 
chief rivals, the Tilson-Gretsch and the Kwal- 
wasser-Dykema, that music testers should use 
it exclusively in their attempts to screen out 
those unfortunates who will not achieve suc- 
cess in music without enormous effort. 


William S. Larson, Chairman of the Music 
Education Department, Eastman School of 
Music, The University of Rochester, Roches- 
ter, New York, In the revision of the Seashore 
tests five of the six original tests are kept in 
the battery, and a timbre test is substituted for 
the consonance test. The revised edition con- 
sists of two series: Series A, which is intended 
for general or “dragnet” purposes, and Series 
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B, which has more difficult increments for test- 
ing musical groups or individuals. In present- 
ing the two series of the revised tests on six 
double-faced records, the same number as for 
the single series in the original measures, only 
about one-half the number of examples were 
used. For instance, the revised pitch test has 
fifty examples, while the original has one hun- 
dred; the revised rhythm and tonal memory 
tests have thirty examples each, while the 
original tests have fifty each. Series A has 
norms for fifth and sixth grades, seventh and 
eighth grades, and adult; Series B has adult 
norms only, 

An observation of the norms, which are 
given in deciles, causes one to question seri- 
ously whether these tests have enough dis- 
criminating test items, especially in the center 
of the distribution. As an example, in the 
rhythm test, Series B, the adult norms indicate 
that the single test score of 22 covers both the 
fifth and sixth decile ranks, and, judging by 
the table of reliability coefficients given in the 
monograph by the authors, it can be understood 
that a test score could vary at least one or two 
points above or below a true score. A variation 
of one point above and below would cover 
about one-third of the decile ranks in the dis- 
tribution, and a variation of two points would 
cover about half of them, Such possibilities of 
variance in a table of norms do not give great 
assurance for accurate prognostic work with 
individuals, particularly if the theory of spe- 
cifics recommended by the authors is followed. 

The aforementioned monograph states that 
the reliabilities of the revised measures are 
approximately the same as those of the origi- 
nal measures, The section on validation stresses 
the theory of ifics and does not give satis- 
fying information about practical adaptation in 
vocational or avocational guidance in music. 
Nor has such information since come to this 
reviewer's attention, 


William S. Larson, Chairman of the Music 
Education Department, Eastman School oj 
Music, The University of Rochester, Rocher- 
ter, New York, [Review of the original 1919 
edition.] These tests, the first of their kind, 
were presented to the public in recorded form 
nearly thirty years ago. The battery consists of 
the Sense of Pitch, the Sense of Intensity, the 
Sense of Time, the Sense of Consonance, Tonal 
Memory, and the Sense of Rhythm. The ac- 
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and rhythm at the lower grade levels), these 
tests, in the judgment of the writer, remain, 
especially for use by skilled experimenters, the 
best, everything considered, that have been pro- 
duced. The appearance of the revised Seashore 
tests, as well as the unavailability of the old 
battery at most times in recent years because 
of the general curtailment of such items by the 
manufacturers, has in general influenced the 
discontinuance of the original measures in 
music talent testing. 


James L. Mursell, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. The most important changes 
in the 1939 revision of this well-known test 
are as follows: (a) It has been organized into 
two series, each half the original length, Series 
A for “dragnet” purposes, Series B for finer 
measurement. (b) A Timbre Test has been 
substituted for the former Consonance Test. 

Grade norms on substantial populations are 
available. Retest coefficients range as follows: 
Series A, Pitch .79-.88, Loudness .79—88, 
Time .76-.77, Timbre .69-.74, Rhythm .62-.73, 
Tonal Memory .80-.84: Series B, same tests; 
48, 27, 20, 72, 72, 89. The instrument is 
not intended to yield a global score, and it is 
insisted that profiles always be derived and 
used. 

The following evaluative comments seem 
pertinent: (a) Physically the instrument is as 
good as it can be outside the laboratory. (b) 
Organization into two half-length series is ad- 
vantageous. (c) The dropping of the unsatis- 
factory and unreliable Consonance Test is an 
improvement. (d) Reliabilities are not ade- 
quate for discriminating abilities within the 
individual. This would make most derived pro- 


files statistically spurious. (d) On the evidence 
presented on the revision, validity has not been 
established, Since it does not yield a global 
measure of talent it cannot be validated. against 
the ordinary criteria of success in the field. 
The basic assumption is that a person who 
scores high on pitch will control the pitch fac- 
tor well in musical situations. Similar assump- 
tions are made for the other five tests. It is 
assumed, that is, that these six abilities func- 
tion in music as they do in the test. It is said 
that the instrument is not based on a "theo- 
retical" analysis of the musical mind; but the 
basic assumption certainly is a theory about 
the musical mind, and not a scrap of evidence 
is adduced to support it. The claim is that 
pitch discrimination, loudness discrimination, 
etc., are identifiable and measurable abilities. 
No one can deny this. But the crucial question 
is whether they operate in music as they do in 
the highly isolated test situation. No answer at 
all based on facts is forthcoming. Apparently 
one is invited to take it on faith. Yet it scems 
decidedly improbable. It has been shown again 
and again in the history of mental testing that 
isolated sensory and perceptual abilities do not 
indicate performance in complex situations in- 
volving context and meaning. The instrument 
is a reversion to a type of psychophysical and 
sensory testing that belongs to the infancy of 
mental measurement, and has repeatedly been 
proved worthless as an index to higher mental 
abilities. That it measures pitch discrimination, 
loudness discrimination, immediate memory 
for tonal nonsense-items, etc., is undeniable. 
That it measures anything that by any stretch 
of the imagination can be called musical talent 
is highly questionable and entirely unproved. 


N 
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Reviews nv Frederick B. Agard, H. E. Brogden, Nelson Brooks, Harold B. Dunkel, Elton 
Hocking, Joseph F. Jackson, Walter V. Kaulfers, John H. Meyer, C. W. Odell, Harry J. Russell, 
Herbert Schueler, Geraldine Spaulding, Hasel M. Toliver, Clarence E. Turner, Roland Vinette. 
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Iowa Placement Examinations: Foreign Lan- 
guage Aptitude. Grades 12-13; 1925-44; IBM; 2 edi- 
tions; G. D. Stoddard and G. E. Vander Beke; Bureau 
of Educational Research and Service, State University 
of Iowa. 
4) SERIES FAI, REVISED. 1925; Forms A, B ; $4 per 100; 
50£ per specimen set including the other 10 tests in the 
series, postpaid; 45(50) minutes; prepared under the 
un of C. E. Seashore and G. M. Ruch. 
b) NEW SERIES FA-2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM ; Fox M separate an- 
swer sheets need not be used; $1.15 per 25; 30¢ per 
specimen set including the other 4 tests in the series, 
postpaid; $1.50 per 100 machine-scorable answer 
sheets; ISé per stencil for scoring answer sheets; 
45(50) minutes; revised by Grace Cochran, J. R. Niel- 
son, and D. B. Stuit. 
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H. E. Brogden, Research Psychologist, Per- 
sonnel Research Section, The Adjutant Gen- 
eral's Office, Washington, D. C. The Foreign 
Language Aptitude test of the Iowa Placement 
Examinations appears to be a well-planned and 
competently constructed test. Earlier forms, 
Series FAL, include six subtests concerned with 
facility in translation from singular to plural 
forms, in changing the tense of verbs, and 
forming nouns from verbs—all in the English 
language—and three subtests concerned with 
facility in guessing the meaning of Esperanto 
words, applying Esperanto grammar rules, and 
matching a series of sentences in Esperanto with 
corresponding sentences in English having dif- 
ferent ordering of the component words or 
phrases. In Form M, a recent, machine-scored 


edition, the student is tested by his ability to 
deduce the meaning of italicized words in sen- 
tences written in Esperanto, to translate sen- 
tences in a new language, and to answer cor- 
rectly or solve correctly questions and problems 
in English grammar. 

Validity of the earlier forms in predicting 
first-semester foreign-language grades is re- 
parted as .50 and reliability as .97. Form M is 
reported'to have validities of .67,%65, and .69 
(N’s not given) for predicting, respectively, 
first-semester grades in French, German, and 


_ Spanish. The reliability is given as .979. Per- 


centile norms, based on something over 1,200 
cases are given. 

While it would be desirable to base the norms 
on more than 1,200 cases, Form M, at least, ap- 
pears quite adequate for its intended use. More 
information than is usual is given on validity 
and reliability. The validities reported are satis- 
factorily, even surprisingly, high. 


[ 179] 
Language Aptitude Test: George Washington 


University Series. Grades 9-16; 19290; Form 1; 
$7.50 per 100; 10€ per specimen set, Pu 36 (45) 
minutes; T. Hunt, F. C. Wallace, S. Doran, K. C. 
Buynitzky, and R. E. Schwarz; Center for, Psycho- 
logical Service, George Washington University. 


H. E. Brogden, Research Psychologist, Per- 
sonnel Research Section, The Adjutant Gen- 
eral’s Office, Washington, D. C. The Language 
Aptitude Test consists of ten subtests. Four of 
these might be described as artificial language 
tests. These include Meaning of Prepositions, 
Memory for Foreign Language, Use of Pre- 
fixes and Suffixes, and Use of Language Rules. 
Two are verbal intelligence tests, namely, Com- 
prehension and Vocabulary. Two tests which 
appear to be grammar achievement tests—Eng- 
lish Grammar and Following Directions for 
Word Order—together with two tests entitled 
Recognition of Similarities in Sound and 
Knowledge of Accent complete the listing. There 
are 200 items in all. 

Apparently no serious attempt has been made 
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to achieve ease in scoring. Scoring keys are not © 


provided, and the format of the test appears to 
be such that keys cannot be simply constructed 
for rapid hand scoring. Actually there seems no 
reason why a test of this sort—if it is to be used 
extensively—should not be adapted for machine 
scoring. 

: A mimeographed sheet accompanying the test 
gives no information concerning the validity or 
reliability of the test. Although the test was pub- 
lished in 1929, only medians and quartile points 
are given as provisional norms. The number of 
cases employed in determining these is not 
given. 

Judged from the content, the validity of the 
test should be high. Artificial language and 
verbal ability are known to be efficient predic- 
tors of achievement in foreign languages. The 
effect of use of several varieties of unorthodox 
structure in the artificial language subtests is 
difficult to judge, although the subjective reac- 
tion of the reviewer to these tests is favorable. 
The two grammar tests and the test entitled 
Recognition of Similarities in Sound also im- 
press the reviewer favorably. A seemingly desir- 
able characteristic of the memory and compre- 
hension tests is the time interval between the 
study of the memory and comprehension mate- 
rial and the administration of the actual test. A 
sheet containing an. English passage—for the 
reading comprehension test—and the vocabulary 
and the prefixes and suffixes for two of the arti- 
ficial language tests is studied prior to the ad- 
ministration of the test proper. 

Since the test contains 200 items, the reliabil- 
ity of the total score is very probably quite 
adequate. It is doubtful if many of the subtests 
are sufficiently reliable so that the subscores will 
be of use in profiles or for differential diag- 
nosis. 

It would be interesting to have available the 
standard deviations of the various subtest scores 
and the distribution of the difficulties of their 
component items. The reviewer would guess that 
many of the tests would be found to be very 
much easier than others. Because of this sus- 
pected variation in difficulty, suspected varia- 
tion in reliability of the subtest content, and the 
evident variation in length of subtests, it is prob- 
able that the actual weighting of the subtests 
scores is far from equal and may be at variance 
with their intended and their optimal weighting. 

While the test impresses the reviewer favor- 
ably and is judged by him to be valid for pre- 


dicting achievement in foreign languages, it is 
still difficult to recommend it without adequate 
information as to its validity and reliability. One 
of the most unfortunate aspects of the testing 
movement is the tendency to write tests in large 
numbers—expending considerable ingenuity in 
the process very often—without adequate deter- 
mination of validity, reliability, and proper 
norms. Since the latter steps are the more labori- 
ous and possibly the more important aspects of 
test construction, it would seem—especially in 
the case of a commercial test—that adequate in- 
formation in these respects would be a primary 
requirement. Otherwise it can only be said that 
another test has been added to the tremendous 
number already available. 


FRENCH 
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*[Cooperative French Comprehension Test.] 1-2 
years high school or 1 year college, more than 2 years 
high school or more than 1 year college; 1942-47; this 
test is, as the booklet title indicates, Part I of the 
Cooperative French Test: Lower and Higher Levels; 
IBM; Forms S, X; 2 levels; separate answer sheets 
need not be used ; $1.75 per 25; 25¢ per specimen set of 
either level, postpaid; 40£ per 25 machine-scorable an- 
swer sheets ; 15¢ per stencil for scoring answer shects ; 
40(45) minutes; Geraldine Spaulding, Laura Towne, 
and Sarah Wolfson Lorge; Cooperative Test Service. 
@) LOWER LEVEL. 1-2 years high school or 1 year college. 
b) HIGHER LEVEL, More than 2 years high school or 
more than 1 year college. 


Joseph F. Jackson, Professor of French, 
University of Illinois, Urbana, Illinois. [Review 
of Lower Level Form S.] This examination 
(40 minutes) is planned to test comprehension 
of isolated words, single sentences, and rather 
brief connected passages. Tt is divided into two 
sections: Section I (15 minutes), 50 words and 
IO sentences, and Section II (25 minutes), 8 
connected passages with a total of 45 state- 
ments to be completed correctly. All items are 
of the multiple-choice type, with five options 
for each. The possible equivalents for the single 
words are given in English. The rest is entirely 
in French, thus constituting a double compre- 
hension test, both of the original and of the 
paraphrases or completions. | 

The test is comprehensive and well designed 
to explore a wide range and variety of vocabu- 
lary. The isolated words include some false cog- 
nates and near homonyms which should trap the 
unwary but which should give no real trouble to 
the careful and well-prepared student. The sen- 
tences of Section I involve so-called “idiomatic” 
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expressions, The connected passages of Section 
II embrace a diversity of situations and of per- 
tinent vocabulary. 

On the whole, the test is carefully designed 
and executed. The proposed equivalents are 
good, with the possible exception of Items 5r 
and 94. There are a few misprints which should 
be corrected in Items 53 and 77 and in the sec- 
ond passage, page 6, line 6. 

Regarding the large number of items and 
choices for the relatively short time indicated 
for the test, it should be pointed out that the 
examiners, seeking a wide spread in results for 
differentiation, do not expect the examination 
to be completed in the given period. Thus facility 
is measured and rewarded along with accuracy. 

Aside from the few minor defects noted 
above, this examination is to be recommended 
for general purposes. 


Joseph F. Jackson, Professor of French, Uni- 
versity of Illinois, Urbana, Illinois. [Review of 
Higher Level Form S.] This test is constructed 
on the same pattern as Form S, Lower Level; 
it consists of isolated words for recognition of 
equivalents in English, idiomatic expressions in 
complete sentences with possible equivalents in 
French, and short passages, each having three 
to six statements to be completed or questions 
to be answered, all in French. For each of the 
105 items involved there are five possible alter- 
nate choices. 

There is a good spread and variety of isolated 
words tested in Section I. A possible criticism 
is that over 80 per cent are nouns and verbs, 
with the former far exceeding the latter. Granted 
that nouns and verbs constitute the principal 
working stock in language, it would seem advis- 
able to give more attention to other parts of 
speech than we find here and to have such ele- 
ments as conjunctions and adverbial phrases at 
least represented. The locutions included in the 
complete sentences are well chosen. For a matter 
of detail, normal tense sequence is violated in 
Item 59, choices 3 and 4. Although this does not 
affect the point specifically tested, such question- 
able models should be avoided. 

The second section is consistently good, with 
two minor exceptions. Item 80 depends largely 
on the meaning of a single word, which is rather 
uncommon in itself and which is not used in the 
text. In general, it is better not to have evidence 
of comprehension of a passage hinge on the 
understanding of one word ; this is particularly 


frue in this case, as the item is designed to test 
comprehension of the general subject of the 
entire passage given. The second exception con- 
cerns the eighth passage (p. 8), which offers 
a casse-tête in the sequence of events rather than 
a clear test of comprehension of French. The 
reviewer cheerfully admits the validity of test- 
ing intelligence along with verbal knowledge, but 
undue complications or confusions are likely to 
obscure both types of ability. In any case, the 
unraveling of involved situations demands an 
amount of time disproportionate to the particu- 
lar end in view. 

Taken all in all, this is a good test in its plan- 
ning, execution, and validity. 


Clarence E. Turner, Associate Professor of 
the Romance Languages, Rutgers University, 
New Brunswick, New Jersey. [Review of 
Lower and Higher Levels, Form S.] Specimen 
sets of these tests do not include norm tables 
and related data. The scoring key gives parallel 
columns of raw scores and “tentative” scaled 
scores. The accompanying booklet does not dis- 
cuss the construction and validation of particu- 
lar tests. 

The tests follow the same pattern at both 
levels. Section I calls for the choice of the best 
of five English translations for each of fifty 
French words. Roughly half of the items are 
nouns, and one-third are infinitives, Ten items 
follow which call for the best from among five 
French paraphrases of French sentences con- 
taining crucial locutions. Section II, of forty- 
five items, calls for the best of five French 
sentence-completions based on a series of short 
prose paragraphs in French. 

The tests have many merits. The reading pas- 
sages in Section IT are interesting and varied in 
style and subject. Real ingenuity has been shown 
throughout in the choice of distractors, and in 
their combination so as to throw specific points 
into relief. In Section I, the five possible re- 
sponses typically include one false cognate, one 
translation of a superficially similar French 
word, and vague echoes of other foreign lan- 
guages. Not to any objectionable degree does 
exact comprehension of the French text fail to 
furnish all that is required for selection of the 
right answer. Wild guessing is penalized by the 
subtraction of a proportion of wrong answers 
from the total of correct ones. Subtle manage- 
ment of the difficulties promises discriminatory 
power for the tests at all levels of ability. 
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Two criticisms should be made. First, com- 
prehension of the force of moods and tenses of 
the verb is not sufficiently tested. Here is, for 
many students, the principal obstacle to the un- 
derstanding of written French. Second, fifty 
items, or just short of half the test, combine two 
assailable features: they involve English, and 
they test words out of context. This is men- 
tioned because of the frequent and weighty ob- 
jections which teachers have made to such prac- 
tices. To the writer, they seem defensible. Even 
the best student, in building his French vocabu- 
lary, has got to do more than simply read and 
understand : he must use some form of memory 
drill, to the end that one day he may, like the 
cultivated native, understand any word in any 
context. Such drill can scarcely take any prac- 
tical form other than the memorizing of words 
out of context in terms of their English equiva- 
lents. A practice which, with all its pitfalls, can 
scarcely be avoided by the learner would seem 
therefore to be admissible in a test, though the 
proportion of items involving it ought perhaps 
to be reduced. 

A final reservation, applicable to all reading 
comprehension tests in foreign languages, must 
be made. Given present trends in the curricu- 
lum, any such test will need to be combined with 
other tests having oral-aural features. 


For reviews by John H. Meyer and Roland 
Vinette of the Cooperative French Test: 
Lower and Higher Levels, see 182. 


[ 181] 
Cooperative French Test: Elementary and Ad- 
vanced Forms. 1-3 semesters high school or 1 semes- 
ter college, more than 2 years high school or at least 
1 year college; 1939-41; IBM; Forms P, Q, R; 2 lev- 
els; separate answer sheets need not be used; $1.75 per 
25 copies of Elementary Form; $2 per 25 copies of 
Advanced Form; 25¢ per specimen set of either level, 
postpaid; 40¢ per 25 machine-scorable answer sheets; 
156 per stencil for scoring answer sheets; 40(45) min- 
utes; Cooperative Test Service, 
a) ELEMENTARY FORM. I-3 semesters high school or 
I semester college; Jacob Greenberg and Geraldine 
Spaulding. 
b) ADVANCED FORM. At least 2 years high school or 
1 year college; Geraldine Spaulding and Paul Vaillant, 


REFERENCES 
1-4. See 40:1349. 
,,5. SMITH, FnANCIS Prescott. “The Use of Standardized Ob- 
jective Tests for Sectioning French Courses According to Stu- 
dent Ability." Mod Lang J 26:123-32 F '42. * 
,, 6, SMITH, FRANCIS PRESCOTT, AND CAMPBELL, HELEN S. “Ob- 
jective Achievement Testing in French: Recognition Versus 
Recall Tests." Mod Lang J 26:192-8 Mr '42. * 

7. Traxter, ArTHUR E. “The Relation of Vocabulary and 
Grammar to Reading Achievement in Latin, French, and Span- 
ish,” pp. 61-5. In 1947 Achievement Testing Program in Inde- 
pendent Schools and Supplementary Studies. Educational Rec- 
ords Bulletin, No. 48. New York: Educational Records Bureau, 
June 1947. Pp. xii, 66. Paper, lithotyped. $2.00, * (PA 22:449) 


For reviews by Nelson Brooks, C. E. Ficken, 
Harry. Heller, Warren S. Holmes, Joseph F. 
Jackson, Walter V. Kaulfers, and James B. 
Tharp, see 38:985, 38:986, 40:1349, and 
40:1350. 

[ 182 ] ` 
XCooperative French Test: Lower and Higher 
Levels. 1-2 years high school or 1 year college, more 
than 2 years high school or more than 1 year college; 
1942-47 ; IBM ; Forms S, X ; 2 levels; separate answer 
sheets need not be used; $2 per 25; 25¢ per specimen 
set of either level, postpaid; 60¢ per 25 machine-scor- 
able answer sheets; 30f per stencil for scoring answer 
sheets; 80(85) minutes; Geraldine Spaulding, Laura 
Towne, and Sarah Wolfson Lorge; Cooperative Test 
Service. e : 
a) LOWER LEVEL. 1-2 years high school or 1 year college. 
b) HIGHER LeveL. More than 2 years high school or 
more than 1 year college. 


REFERENCES 
1. Traxler, Artur E. "Some Data on the Results of the 


aerae, es a Li Ad Ere Tibia 
Program in Independent Schools and Supplementary Studies, 
Cer ieu au AE Paper. ithotypet: 
$1.50. * 

John H. Meyer, Former Assistant Examiner, 
Board of Examinations, The University of 
Chicago, Chicago, Illinois. [Review of Form S.] 
The older forms of this test have been so long 
and widely used that the characteristic virtues 
and limitations of the series are by now a famil- 
iar story. The S forms (80 minutes) do, how- 
ever, include a new feature: a 15-minute Part 
III labeled, somewhat ambitiously, Civilization. 
Grammar (Part II) takes 25 minutes; Part I, 
rather vaguely entitled Comprehension, is split 
into a 15-minute vocabulary section and a 25- 
minute reading comprehension section. Parts II 
and III are thus designed exactly to balance the 
two sections of Part I. j 

Vocabulary (Items 1 to 50) is tested not in 
context (i. e., specific words in complete sen- 
tences) but by the traditional and somewhat 
artificial device of a succession of isolated 
words. In the Lower Level test, the parts of 
speech are represented as follows: adverbs 5, 
adjectives 7, nouns 23, verbs 15; in the Upper 
Level: prepositions 1, adverbs 4, adjectives 3. 
nouns 27, and verbs 15. These proportions seem 
slightly faulty in view of the trouble which 
most students encounter in grappling with (a) 
verbs and (b) the “little words” (conjunctions, 
prepositions, etc.). 

Items 51 to 60 sample (in context) the stu- 
dent’s knowledge of common French idioms. 
But their position, at the end of the vocabulary 
section, means that many students either will 


i 
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not reach them at all or will be able to give 
them only very hurried attention. 

The content of the passages in the reading 
section, on which Items 61 to 105 are built, 
seems a trifle thin, elementary, and even trivial. 
"This is particularly true of the selections in the 
Lower Level form, which are, moreover, writ- 
ten in a rather wilted and limping French. The 
items suffer accordingly and betray, moreover, 
from time to time, a certain clumsiness of form 
(for an acute case of tense trouble see Item 67, 
Higher Level). One feels, therefore, that, at 
both levels: (a) the reading burden might have 
been lightened by the use of fewer passages 
(five or six, for example, instead of eight or 
nine) ; (b) choice of more significant and inter- 
esting material, and slightly longer passages, 
would have permitted the writing of better (and 
at least equally reliable) items; and (c) the 
prose style of certain of the passages might well 
have received first aid within the prescribed 
vocabulary limits. 

The 50 grammar items, while necessarily 
easy, in view of the constantly diminishing em- 
phasis on grammar in present-day teaching, 
offer a fair sampling of the indispensable mini- 
mum of grammatical knowledge. Items 1 
(Lower Level) and 42 (Higher Level) might 
perhaps have been more happily placed in the 
idiom section; Items 19 and 18 in the Lower 
Level duplicate to some extent the work of 
Items 4 and 5 in the same test. About one-third 
of the points tested at the lower level are re- 
tested at the higher level, but this proportion 
does not seem too great in view of the fact that 
so large a portion of second-year grammatical 
teaching is necessarily devoted to a review of 
fundamentals. It is true that a larger sampling 
of verb forms (and especially those of the most 
common irregular verbs) might have been in- 
cluded, although, strictly speaking, the verbs 
might better be tested under the heading of 
vocabulary. Irregular verbs, at any rate, seem 
inadequately represented in the test. 

The last 50 items (on Civilization), appar- 
ently chosen in accordance with no well-defined 
or logical plan, are not, generally speaking, im- 
pressively significant. The effect is one of scrap- 
piness—of stray facts arbitrarily coordinated by 
an "Information, please!" technique. At best 
they offer a few vague tags and set formulas. 
The cultural content of elementary French 
courses in the United States varies widely and 
is largely dependent upon such factors as (a) 


the personal culture and taste of the individual 
teacher, and (b) the scope and approach of the 
individual textbook. In the absence of objec- 
tive criteria governing the selection and arrange- 
ment of cultural material, the validity of "civi- 
lization” items remains dubious. Responses to 
such items are, therefore, determined largely 
and almost inevitably by chance. 

To enlarge upon the advantages and merits of 
the Cooperative French Tests would be almost 
an impertinence. In the case of so popular and 
long-established a series they may be taken for 
granted, The foregoing criticisms, it was felt, 
might prove useful. It is with that intention that 
they are offered. 


Roland Vinette, Professor of Measurement 
and Testing, Institute of Psychology, University 
of Montreal, Montreal, Canada, [Review of 
Form S.] Each of these tests consists of three 
parts: I, Comprehension, which has two sec- 
tions; II, Grammar; and III, Civilization. Part 
I, Comprehension, Section 1, contains first 50 
French words, each of which is followed by 
five English words or phrases. The subject must 
decide which one of the English words or 
phrases is the best translation for the French 
word. We have compared the 50 French words 
in each test with an unpublished list of the 6,000 
words most used by French-Canadian pupils 
and found the following distribution: for the 
Lower Level Test, so per cent of the Frencl. 
words fall in the first thousand, 30 per cent in 
the second thousand, and the rest in the third 
and the fourth thousands; for the Higher Level 
Test, about 30 per cent in the first thousand, 
about 15 per cent in each of the second, third, 
fourth, and fifth thousands. That is a fairly 
good proof of validity. 

Section 1 also includes ro French sentences, 
each of which is followed by five other French 
sentences, The subject must decide which of 
these five choices best explains the meaning of 
the first sentence. To make his choice, the sub- 
ject must have an exact understanding not only 
of the French vocabulary but also of the French 
sentence. 

Section 2 consists of French passages fol- 
lowed by several questions. To answer these 
questions correctly, the subject must show an 
understanding of the paragraph as a whole. 

Part II, Grammar, contains 50 questions. 
Each question consists of an English sentence 
followed by a translation in which there is a 
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blank to be filled.by one of the five words or 
phrases suggested. To fill in the blank correctly, 
the subject must know the appropriate rule in 
French grammar. Rules concerning pronouns 
and verbs are the most employed in the test; 
those concerning nouns and adjectives are rather 
neglected. However, those used are among the 
most important in the French language and 
among the most difficult for the English speak- 
ing person to apply correctly. 

Part III, Civilization, consists of 50 questions 
about great French men (authors, politicians, 
etc.); and French monuments, geographical 

- facts, history, and usages. 
> ‘Norms are provided for each part separately, 
Š ` for Parts I and II together, and for the total 
score. Data about reliability and validity are 
omitted. 

As a whole, each of these tests represents an 
. interesting device for measuring the student's 

level in French. 


» For reviews by Joseph F. Jackson and Clar- 
ence E. Turner of Part I of this test, see 180. 


[ 183 ] 
XExamination in French Grammar. 1-2 years high 
ool or 1 year college, 274 years high school or 2 
ears college; 1944-45; IBM; 1 form (usually called 
orm B); 2 levels; separate answer sheets must be 
used ; $1.75 per 25 of either level; 40¢ per 25 machine- 
scorable answer sheets for either level; 15 per scoring 
key for either level; 25¢ per specimen set of either 
level, postpaid; 40(45) minutes; prepared by the Ex- 
aminations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates: $2 per 25 
of either level; 65¢ per 25 machine-scorable answer 
pein for either level; 50¢ per key; 5o¢ per specimen 
y set. 
~= -> 8) LOWER LEVEL. 1944; 1-2 years high school or 1 year 
. college; Form LFG-1-B-4. 
* `b) UPPER LEVEL. 1945 ; 234 years high school or 2 years 
Syst college; Form UFG-1-B-4. 


-Nelson Brooks, Instructor in French, West- 
over School, Middlebury, Connecticut. For some 
reason, the fixed-response type of test turns out 
to be less dependable in the field of grammar 
than in either vocabulary or comprehension. Un- 
fortunately the authors of the present tests seem 
not to have understood even the basic problems 
involved; much less have they contributed any- 
thing of value to the tests of this kind already 
available. There are some satisfactory items, 
such as Items 6 and 8, Lower Level, and Items 
34 and 56, Upper Level; the series is at its 
worst in Item 39, Lower Level or Item 48, 
Upper Level, where the incorrect French of the 


distractors is lamentably bad. Of the 120 gram- 
mar items offered, at least one-third must be 
rejected as unsatisfactory for one reason or 
another. Sometimes there are not five choices in 
the answers (this is true in I2 cases; in Items 18 
and 41, Lower Level, thereare only two) ; some- 
times the French in the question is dubious 
e.g., Item 58, Lower Level; often there is the 
sad business of putting incorrect French in the 
answers (there are 16 examples of this, in addi- 
tion to the sample items). Again, the question is 
not one of grammar but vocabulary (e.g., Item 
27, Lower Level), or the distractors are not 
likely enough (e.g, Item 60, Lower Level, 
where the most obvious wrong answer, @ 
demeuré, is overlooked), or there are two pos- 


sible right answers (e.g., Item 47, Lower Level), . 


or the problem is simply too far-fetched, as in 
Item 13, Lower Level, in which the manipula- 
tion of “ ‘My husband does not understand me,’ 
she sobbed. ‘He always keeps me from doing 
as I please." " in order to test for the use of the 
preposition de seems like serving a very small 
fruit in a very large bowl. 

Of these deficiencies, the two major ones are 
the lack of likelihood in the distractors, or 
their absence, and the use of incorrect French. 
in the answers. The first of these is very seri- 
ous for the worth of the test. Unless there is 
an obvious degree of plausibility in the wrong 
answer, it might as well be an ampersand ora 
Rorschach blot, and chance becomes a domi- 
nant factor. Of course, the tests will give im- 
pressive statistics, but let us not forget that it 
is possible to distinguish between students by 
dealing them cards from a deck. The point 
here is, have they been distinguished in accord- 
ance with their knowledge of French grammar? 
The authors overlook the fact that the student 
has a method in his wrongness, and seem con- 
tent to suggest for his logically right but gram- 
matically wrong answer sheer nonsense or noth- 
ing at all. 

Equally serious is the use of incorrect French 
in the wrong answers. It is painful indeed to 
see such monstrosities in print and it is little 
wonder that many people, teachers and stu- 
dents alike, receive a bad impression of this 
kind of test when they read them. To offer 
the student a puzzle with a part missing and 
ask him to choose which of several pieces fits 
the missing part seems natural and reasonable 
enough. To offer him some of these pieces 
forced violently and unnaturally together is to 
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offend his intelligence and to violate the-first 
principles of teaching. It is one thing: for the 
student to make errors in his attempts to mas- 
ter syntax; it is quite another for the teacher 
or the testmaker to do violence to language in 
an attempt to imitate the struggles of the stu- 
dent. It is the equivalent of baby talk and is 
of profit to no one. Furthermore, it is quite 
unnecessary. If a given number of acceptable 
items of this kind can be devised, twice that 
number can be devised, provided the testmak- 
ers understand what they are about and will 


` « firmly and patiently reject all but suitable mate- 


rial, Lef'those who use this form in making 
tests be aware of its possibilities and of its 
absurdities, and not give the method a bad 
name for not having themselves measured up 
to its exacting standards. 


[ 184 ] 

*Examination in French Reading Comprehension. 
1-2 years high school or 1 year college, 214 years high 
school or 2 years college; 1944-45; IBM ; 1 form (usu- 
ally called Form B) ; 2 levels; separate answer shcets 
must be used; $1.75 per 25 of either level; 4o¢ per 25 
machine-scorable answer sheets for either level; 15¢ 
per scoring key for either level ; 25¢ per specimen set of 
cither level, postpaid; 50(55) minutes; prepared by the 
Examinations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
seation ; distributed by Cooperative Test Service. (Also 
distributed bv Science Research Associates: $2 per 25 
of either levels; 65¢ per 25 machine-scorable answer 
p for either level; 5o¢ per key; 50¢ per specimen 
set. 

a) LOWER LEVEL. 1044; 1-2 years high school or 1 year 
college; Form LFR-1-B-4. 

b) UPPER LEVEL. 1945; 214 years high school or 2 years 
college; Form UFR-1-B-4. 


Joseph F. Jackson, Professor of French, 
University of Illinois, Urbana, Illinois. [Re- 
“view of Lower Level.] In attempting to eval- 
uate this test, it is well to have in mind the 
general conditions and purposes determining 
its conception. According to the Manual, the 
.original tests were intended "for use in re- 
porting the educational achievements and status 
of service personnel to the educational insti- 
tutions from which these men and women may 
wish to secure credit." Objectives were de- 
scribed by examiners, specialists, and teachers; 
materials were collected accordingly and they 
were pretested in schools or colleges, some- 
times with adult groups; then revisions were 
made on the basis of results recorded. The 
ensuing revised examination forms were then 
submitted to one or more critics designated by 
professional organizations. According to these 
criticisms (for comprehensiveness, accuracy, 


and validity), the examination forms were sub- 
ject to still further revision and to additional 
pretesting, if possible. In this process there 
was sufficient material for two forms, one for 
the Armed Forces Institute, the.other for civil- 
ian use in schools and colleges. this sec- 
ond or alternate form that is reviewed here. 
Specifically, the examination consists of seven 
French passages, 8 to 20 lines each, with a 
total of 47 completion-type statements in French 
to test comprehension. The completions are 
multiple choice. Choices are limited to three, 
reducing the number and diversity of alterna- 
tives found in other similar tests. ut 
The examination is less carefully executed 
than would be suggested by the procedures in- 
dicated for it in the Manual. At least six of 
the seven passages, depending on irony, sub- 


PS 


tlety, or knowledge of politics, demand a ma- - 


turity of judgment which cannot be assumed 
in younger students. Thus, the actual compre- 
hension of French may not be adequately 
tested, and the real validity of the examination 
may well be questioned. For example, the first 
passage, a verse selection, depends for its effect 
on the attitude of the poet. This is nowhere 
stated, it is only implied, particularly for those 
who happen to know the poet. A student might 


comprehend the French perfectly (although ` 


part of the sense derives. from a Spanish word, 
the meaning of which is not given) without 
grasping the unexpressed emotional tone. Simi- 
larly, Section II depends for full comprehen- 
sion on knowledge of court gallantry and in- 
trigues, religious attitudes, irony and personal 
relations incompletely explained in the selected 


context. A fuller acquaintance with the author — 
and with his entire work would supplement - 


understanding of the French text itself. Sec- 
tion IV depends on irony for its effect. Sec- 
tions V and VI presuppose some knowledge of 
“useful relations” and of political science and 
political attitudes. Section VII, a Merovingian 
scene, is remote and obscure for the uniniti- 
ated; its effect is achieved by indirection rather 
than by the precise meaning of words. 

The authors state that the educational objec- 
tives “are defined as clearly as possible in 
terms of behavior; that is, a student should be 
able to do definite things which are indicative 
of his attainment of each objective.” We should 
suggest that the student’s behavior is condi- 
tioned to a large extent by his experience. In 
comprehension passages material should be 
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considered not only for-its linguistic difficulty 


but for its content in relation to the student's. 


probable experience—social, intellectual, and 
emotional. The fact that the original USAFI 
tests were intended for scattered age groups 
may have something to do with this problem. 
It is surprising, however, that the difficulties 
indicated were not revealed in pretesting, par- 
ticularly with high school students. 

Beside this general question of appropriate- 
ness or validity of the passages chosen for 
comprehension, there are matters of detail 
which could be improved. Items 4, 8, and 15 do 
not represent a clear choice. In Item 28C, an 
intensification has been added to the original; 
this might be misleading. Without attempting 
to proofread the examination, which is a sec- 
ond printing, the reviewer has picked up some 
unfortunate misprints: la père (p. 2) ; le com- 
tesse (p. 3); soit for soir, Théatre (p. 4); 
ce for ces (p. 7); médicins and (la reine) 
étant allé (p. 8). Finally, the examiners offer 
(p. 1) an example for the students in which 
Vinsouciance is gratuitously translated as “‘care- 
lessness." Surely in the context it means rather 
“unconcern” or “lack of concern." 

Considering the procedures and purposes of 
this test, the performance fails to live up to its 
promise. 


Joseph F. Jackson, Professor of French, 
University of Illinois, Urbana, Illinois. [Re- 
view of Upper Level.] For the general make-up 
of this examination and for considerations re- 
garding its purpose and procedure, see the 
above review of the Lower Level. Form UFR 
contains six passages, with 44 completion-type 
statements in French, with three choices for 

~ each. 

While the Upper Level Form would ordi- 
narily be addressed to older students, a differ- 
ence of one or two years in age would scarcely 
compensate for the added difficulty in content 
of the Upper Level examination. 

Section I, in verse, is a particularly complex 
dramatic scene, the partial resolving of a real 
imbroglio. The number of characters and the in- 
volved relations and implications make the scene 
very confusing. Without referring to the origi- 
nal context, the reviewer had difficulty with 
two of the five items for comprehension. Sec- 
tion II involves Roman military and political 
history. Section IV is a rather remote social 
situation, strained in its humor and effect. Sec- 
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tion V is heavily historical and political. It is 
extremely perplexing in the. number of per- 
sonages involved, in their relationships and 
interests, in the factions and intrigues repre- 
sented, Section VI deals with the press and 
with the formation of public opinion. It is de- 
liberately paradoxical and in part quite obscure, 
(I have reread this passage several times with 
diminishing returns in trying to find apparent 
sense in it.) 

It is important to test intelligence as well as 
verbal ability but, unless the material itself is 
readily comprehensible to the student, the par- 
ticular purpose of testing the accuracy and 
scope of his understanding of the foreign lan- 
guage is defeated. At the same time, the range 
of vocabulary tested is limited by insistence on 
particular types of subjects. In this case, his- 
tory, politics, public opinion, and the press con- 
stitute half the content of the examination. 
Surely there could and should be more diversity 
of subject matter, related more closely to the 
student’s field of experience. A conceivable 
desire to include in examinations for the armed 
forces material of a social, political, and his- 
torical nature has no valid relation to civilian 
tests. The mere reference here to armed forces 
and to civilians suggests the source of the diffi- 
culty in attempting to evaluate such tests for 
ordinary purposes. : 

Apart from the question of suitability and 
comprehensiveness of material for general use, 
the examination is quite well constructed. In 
addition to the two items mentioned earlier for 
Section I, only Item 41 is not clear-cut. All 
the others are well confined to one reasonable 
choice. The test seems to have been carefully 
prepared. Two questions of detail arise: page 3, 
quelques for “some”; page 4, “chilly” for 
frileux. This last appears to be a contresens in 
the context. Might it not rather be “sensitive to - 
the cold" ? 

With the reservations noted, which should 
not affect older students unduly, the examina- 
tion is good of its kind. 


Clarence E. Turner, Associate Professor of 
the Romance Languages, Rutgers University, 
New Brunswick, New Jersey. The USAFI 
booklet accompanying these and the other aca- 
demic-subject tests states, without attempting 
proof, that the present forms are “equivalent 
to” those prepared for the testing of service 
personnel. Their use is recommended in order 


* 


273 | 


to evaluate justly the educational experiences 
of the veteran. The development of local norms 
by particular educational institutions is quite 
rightly urged. The separate norm tables ac- 
companying these tests give parallel columns 
of raw scores and percentile ratings together 
with the number of cases (for Form A only, 
ranging from 103 to 810) on which the norms 
for each class level are based. The booklet dis- 
cusses the construction of the tests in the most 
general terms, and one wishes one might know 
for these particular tests what the defined ob- 
jectives were. * 

At both levels, the tests consist of forty-odd 
triple-choice completions in French based on 
six French passages which include some poetry. 
The distractors vary from the ingenious to the 
quite obvious and do not as a rule contain a 
core of elements in common with the best an- 
swer so as to throw a point into relief. A good 
deal is tested besides comprehension of French, 
notably intelligence, logic, and the ability to 
give close attention to something not very in- 
teresting. We are told that it is hard to decide 
whether cats, when tested for intelligence, are 
stupid or just bored, and there are moments 
when these tests threaten to create the same 
problem. At bottom, what is being tested here 
is the ability to recognize and match two dif- 
ferent ways of saying the same thing, and the 
authors have devised one or two clever traps 
for those who match on an uncomprehending 
mechanical basis. If the directions called for 
the best answer rather than for "the correct" 
one, the student's perplexity would be dimin- 
ished at some points. 

While it is necessary to grant without ques- 
tion the usefulness of the tests for the purpose 
we have cited from the booklet, doubts arise 
as to their wider value. Their claims to supe- 
riority in the field would be that they avoid 
the use of English (save for rare bracketed 
explanations) and that they test no item out 
of context. But even the most perfect reading 
comprehension test would no longer reflect to 
a useful degree the prevailing curriculum, now 
that the teaching profession seems to be aban- 
doning the notion (entertained for the quarter 
century between two global wars!) that Ameri- 
cans need to know no more of foreign languages 
than how to read them. 

[185 ] 


XExamination in French Vocabulary. 1-2 years 
high school or 1 year college, 234 years high school or 
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2 years college; 1944-45; IBM; 1 form (usually called 
Form B); 2 levels; separate answer sheets must 
used; $1.75 per 25 of either level ; 4o¢ per 25 machine- 
scorable answer sheets for either level ; 15¢ per scoring 
key for either level; 25¢ per specimen set of either 
level, postpaid; 40(45) minutes; prepared by the Ex- 
aminations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation ; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates: $2 per 25 
of either level; 65¢ per machine-scorable answer sheets 
for either level; 50¢ per key; 50f per specimen set.) 

a) LOWER LEVEL. 1944; 1-2 years high school or 1 year 
college; Form LFV-1-B-4. 

b) UPPER LEVEL. 1945 ; 24 years high school or 2 years 
college; Form UFV-1-B-4. 


Nelson Brooks, Instructor in French, West- 
over School, Middlebury, Connecticut. There 
is much of value in these two tests, yet one 
cannot examine them carefully without con- 
cluding that they have been based on a doubt- 
ful assumption and constructed in the wrong 
way. The authors seem to feel that all vocabu- 
lary should be tested for in context, although 
the test itself proves this to be wrong. They 
also seem to assume that a suitable context is 
any complete sentence that contains the ex- 
pression to be tested, which is debatable in- 
deed. The series is at its best in such items as 
41 and 61, Upper Level, in which the vocabu- 
lary expression is well integrated with the 
entire sentence. However, in such an item as 46, 
Lower Level (Réfléchissez bien avant de 
choisir votre métier), the sentence adds ex- 
actly nothing and is merely a waste of time. In 
Item 7, Lower Level, dealing with the word 
malgré, the sentence, whose English equivalent 
is “She is old in spite of her blond horses,” 
becomes a definite liability. The misprint here 
is bad enough, although the student can prob- 
ably correct it if the testmakers could not (even 
in a second printing), but it is less serious 
than the use of a sentence so obviously car- 
pentered for a test. To have anything like its 
proper validity, the context must be a sentence 
taken from something that a writer has writ- 
ten for some purpose other than to make a 
test. Not only is the student entitled to deal 
with sentences that have the tang of plausi- 
bility rather than the woodenness of makeshift, 
but the test will prove far more if he does.’ 

A context has been supplied in all 160 items 
of the two tests, yet the results seem to sub- 
stantiate the case for testing many of them by 
themselves. There should be, of course, no 
doubt about the value of testing isolated words. 
Of two lists of, say, 50 words selected within 
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the same limits in the frequency list, the stu- 
dent will always make approximately the same 
score on the second as on the first, as any 
teacher can easily prove for himself. This 
would hardly be true if it were not the meas- 
urement of something very definite indeed and 
therefore worth measuring. 

"Testing for words in context, though by no 
means the only or the most efficient method 
of measuring vocabulary, can be very fruitful 
if the words and the context are rightly chosen. 
In not more than half of the items in these 
tests can it be said that the context limits or 
complements the test word in terms of the 
equivalents offered. In the other half, the word 
might as well, or might better, be set down 
opposite its five possible meanings, in which 
form it can be handled twice as fast. In the 
forty minutes allowed for one test, the student 
could then handle 120 items instead of 80 and 
would be measured that much more accurately. 
If the test for vocabulary in context is to be 
fully exploited, the equivalents should all be 
right translations of the single word, only one 
of which is right for the particular context. 
There is no example of this throughout the 
two tests, 

It is, of course, much more difficult to devise 
distractors for single words than for words in 
context, and it is significant that in these tests 
the majority of wrong answers are suggested 
not by the word in question but by the rest of 
the sentence. Yet this can be done, and anyone 
in search of likely distractors has only to ask 
a group of students to write what they think 
the foreign word means in English, and he 
will get a crop that will be highly useful and 
often surprising. He will discover that in every 
case there is a logical, even though erroneous, 
connection in the student’s mind, for logic can 
be very misleading when. it comes to handling 
verbal symbols. It is in this region of false 
analogies that the author must look for his dis- 
tractors if he is to duplicate the processes of 
the student’s mind. 

To be specific, take the sample item on the 
first page of the tests. As equivalents for 
travail in the sentence “Ce travail est difficile,” 
the authors suggest game, work, chair, book, 
bed. Three of these are acceptable, work being 
right and game and book, though in no way 
resembling fravail, at least being among things 
that are sometimes difficult. Chair and bed are 
worthless, for they are in no way connected 


with travail, and, whatever else they are, they 
simply aren't difficult. A far more likely list 
of wrong answers would be trial, journey, trail, 
crossing, all of which may be difficult and all 
of which are enough like travail directly or 
through synonyms to distract the student who 
does not immediately respond to the combina- 
tion travail—work and thus to separate him 
from those who do. If half the items in these 
tests were to be remade along these lines, the 
series might become a useful, even accurate, 
instrument. 
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*French Grammar Test: Dominion Tests. Grades 
9-10; 1940-41; Forms A, B ; 65¢ per 25; 306 per speci- 
men set; specimen set must be purchased to obtain the 
manual; 27(35) minutes; prepared by the Department 
of Educational Rescarch, Ontario College of Educa- 
fion, University of Toronto; Vocational Guidance 
entre, 


John H. Meyer, Former Assistant Examiner, 
Board of Examinations, The University of Chi- 
cago, Chicago, Illinois. On the basis of a detailed 
analysis of the major points covered by the 
two forms of this test, this reviewer questions 
the accuracy of the statement in the test man- 
ual that the difference between Forms A and 
B is “probably small.” The differences between 
the two parts of the test in both A and B are 
purely formal. Both within and between forms, 
there is considerable overlapping. For flagrant 
instances of redupiication, see especially: Item 
49 in both forms (identical item) ; Items 77 and 
80, 79 and 82, in Form A ; 7o and 74 in Form B. 
For examples of items which test more than 
one point, see especially: Form A, Items 69, 
70, 76; 77, 78, 8o, 81; Form B, Items 72, 75, 
77, 78, 82. 

The material presentation of the test (espe- 


cially Part I) is clumsy, confusing for the 


student, and arduous for the scorer. The man- 
ual admits that "the pupils occasionally misun- 
derstand the directions," which in themselves 
are explicit enough. The items in Part I are 
presented in groups of two to four with a 
common set of five possible completions, of 
which the student is to choose one. Actually, 
in many cases, he does nothing of the sort. In 
answering Item 1, Form A (a typical case), 
he (a) distinguishes between the conjunctive 
personal pronoun and the definite article, elimi- 
nating three possible responses, and (5) makes 
a true-false choice between the two remaining 
responses. 
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Any attempt at tightening up, correcting, 
and improving this test would mean practically 
rewriting it. 


[ 187 ] 
*French Reading: Dominion Tests. Grade 10; 
1940-41; Forms A, B; 65¢ per 25; 30¢ per specimen 
set; specimen set must be purchased to obtain the man- 
ual; 3o(35) minutes; prepared by the Department of 
Educational Research, Ontario College of Education, 
University of Toronto; Vocational Guidance Centrc. 


Geraldine Spaulding, Editor, National 
Teacher Examinations, New York, New York. 
This is a reading test designed for use in Cana- 
dian high schools, and recommended for grade 
10 only. Each form contains eight very simple 
anecdotes, with three to five questions on each. 
The passages are well selected and furnish 
appropriate material for reading at a very ele- 
mentary level. There is very little use of past 
tenses, and there are no subjunctives. 

The questions are in French, and are to be 
answered in English. This arrangement has 
the advantage that French questions tend to 
"give away" the meaning of the passage much 
less than do English questions. Some of the 
questions leave room for a good deal of varia- 
tion in the responses (thus complicating the 
scoring), but it is difficult to phrase questions 
that will avoid such variation and still test 
more than the comprehension of details. On 
the whole, the questions achieve a very satisfac- 
tory compromise between the need for mini- 
mizing variation in correct responses, and the 
need for testing comprehension of a sustained 
thought. 

A booklet furnished with the tests contains 
very explicit instructions for administering and 
scoring, keys, percentile norms (Canadian), 
reliability data, and (addressed to the expert 
in statistics) measures of internal consistency. 


[ 188 ] 

*French Vocabulary Test: Dominion Tests. 
Grades 9-10; 1940-41 ; Forms A, B ; 65¢ per 25; 30€ per 
specimen set; specimen set must be purchased to ob- 
tain the manual; 30(35) minutes; prepared by the De- 
partment of Educational Research, Ontario College of 
Education, University of Toronto; Vocational Guid- 
ance Centre. 


Roland Vinette, Professor of Measurement 
and Testing, Institute of Psychology, Univer- 
sity of. Montreal, Montreal, Canada. Each of 
these vocabulary tests contains 100 questions. 
Each question consists of a French word fol- 
lowed by five English words or phrases. The 
subject must decide which one of the English 


words or phrases is the best translation for the 
French word. 

A careful comparison of these 200 French 
words (100 for each form) with an unpub- 
lished list of the 6,000 words most used by 
French-Canadian pupils reveals that about 50 
per cent of the 200 words fall among the first 
thousand. The remaining 50 per cent are dis- 
tributed from the second to the fifth thou- 
sand, This proves the validity of these tests as 
a measure of knowledge in the more common 
French vocabulary. 

These tests are also reliable and easy to give. 
However, a more practical key, directly ad- 
justable to the test, would be desirable; at 
present the examiner has to make the key from 
answers listed in the manual. This minor dis- 
advantage should not reduce the value oí the 
test. 
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Iowa Placement Examinations: French Training: 
Series FT1, Revised. Grades 12-13; 1924-26; Forms 
A, B; $4 per 100; 50¢ per specimen set including the 
other 10 tests in the series, postpaid; rM 50) minutes; 
prepared under the direction of C. Seashore and 
G. M Ruch; C. E. Young, G. E. Vander Beke, and 
G. D. Stoddard; Bureau of Educational Research 
and Service, State University of Iowa. 


REFERENCES 
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the Iowa Placement Examinations, University of Iowa Studies 
in Education, Vol. s, No. 6. Iowa City, Iowa: the University. 
1930. Pp. 116. Paper. $1.00. * (PA 5:799) 

Geraldine Spaulding, Editor, National 
Teacher Examinations, New York, New York. 
This test is quite old, Form A being dated 1925 
and Form B, 1926, The norms which are fur- 
nished presumably date back to the same period 
and are of questionable value after twenty years. 
Since only one percentile table is supplied for 
each form (apparently based on students with 
one year of high school study), no basis is 
furnished for the interpretation of results for 
students with other amounts of study. 

With respect to content, Form B is some- 
what superior to Form A. In the vocabulary 
part of Form A, at least a fourth of the words 
are almost identical in English, while Form B 
contains no particularly obvious cognates. The 
frequency range of the words in this part is 
quite wide, covering the entire range of the 
French Word Book. Part 2, on grammar and 
idiom, offers stereotyped problems, primarily 
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on very elementary grammar points. Part 3, 
on verb forms, is rather heavily weighted with 
uses of the subjunctive. The passages pre- 
sented for reading in Part 4 appear pretty diffi- 
cult for students with one year of high school 
study. 

The techniques used include both multiple 
choice and completion. However, the advan- 
tages of each technique are not well utilized. 
Teachers often disapprove the presentation of 
unidiomatic constructions and incorrect or non- 
existent forms, which is practically unavoid- 
able in multiple-choice items on inflected forms 
and syntax. In this test, the only multiple- 
choice part is the one on verb forms, where 
this disadvantage is most obvious. And this 
same defect is introduced unnecessarily in Part 
2, where incorrect sentences are to be corrected, 
in free-response items. Multiple-choice items 
might better have been used for the vocabulary 
and reading (where acceptable answers in free- 
response form are likely to include more vari- 
ants), and completion items for the verb forms. 

Since three of the four parts consist of com- 
pletion items, the scoring of the test is more 
time-consuming and less objective than the 
scoring of a straight multiple-choice test, and 
some knowledge of French on the part of the 
scorer is advisable. - 


GERMAN 
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Cooperative German Test: Advanced Form. At 
least 4 semesters high school or 1 year college; 1938- 
40; Forms O, P, Q; separate answer sheets need not be 
used; $2 per 25; 25¢ per specimen set, postpaid; 40 
per 25 machinc-scorable answer sheets; 15¢ per stencil 
for scoring answer sheets; 40(45) minutes; Miriam 
Van Dyck Hespclt, E. Herman Hespelt, and Geraldine 
Spaulding; Cooperative Test Service. 


REFERENCES 

1-4. See 40:1360. 

Harold B. Dunkel, Examiner, Board of Ex- 
aminations, The University of Chicago, Chi- 
cago, Illinois. [Review of Advanced Forms O, 
P, and Q.] The existence of several compar- 
able forms, the availability of norms, and the 
general care with which the materials for ad- 
ministration and for scoring have been pre- 
pared, all give these tests (like other products 
of the Cooperative Test Service) a consider- 
able handicap over other competitors. Fortu- 
nately the merits of the tests themselves justify 
this additional apparatus. 

The reliabilities of the parts and of the total 


tests are not reported, but the standard errors 
of measurement are plotted beside the scale of 
scores. It is unfortunate, however, that the 
directions do not give a simple explanation of 
what these lines mean, one that would be in- 
telligible and serviceable to the average lan- 
guage teacher, who is unacquainted with statis- 
tics. What “error” means in the context of 
deciding whether to give Willie Johnson a D 
or an F is something that every teacher should 
know; and a better understanding of this con- 
cept would do much to correct the misappre- 
hensions of those who believe either that test 
scores possess the infallibility of the Holy 
Writ or that they are the invention of the Devil. 
The present effort to report standard errors is 
undoubtedly a step in the right direction, but 
I doubt whether it goes far enough to accom- 
plish its purpose. 

The tests cover reading, vocabulary (with the 
words in isolation), and grammar. Though this 
last part demands only the recognition of the 
correct structure, it demands a more active 
manipulation of the language than does read- 
ing. As a result, the teacher will probably get 
most information not from the total score but 
from the scores on those parts most relevant to 
the objectives he stresses. This flexibility of 
the tests for varied teaching procedures is one 
of their strong points. 

A feature of the tests which many teachers 
like is that the multiple-choice answers are all 
in German. Thus in the reading section, the 
student exhibits his comprehension by correctly 
completing the sentence which forms the item 
or, in the few longer items, by completing state- 
ments about a short passage. Similarly in the 
vocabulary section he selects that one, of the 
five German words offered, which is most 
closely connected with the German word tested. 
As usual, however, this procedure is attended 
by some difficulties. The answer must some- 
times be apparently more difficult than the ques- 
tion (e.g., Item 3, Part II, Form Q, erblicken 
the answer for sehen) and cognates must be 
used freely (Energie—Kraft, 21, II, O). 
Sometimes the distractors are so farfetched in 
difficulty (Lerche as a possible answer for 
Ding, 6, II, P) or sense (Response 2, 35, I, O) 
that they can scarcely perform their function. 

But these and others like them are relatively 
minor matters and involve a negligible fraction 
of the almost 400 items in the three forms re- 
viewed. Of the standardized tests for the abili- 
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ties covered, therefore, these tests remain the 
most useful and usable. 


For reviews by C. H. Handschin and Curtis 
C. D. Vail, see 38:1000 and 40:1360. 
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X*Examination in German Grammar: Lower Lev- 
el. 1-2 years high school or 1 year college; 1945; IBM; 
Form LGG-1-B-4 (usually called Form B) ; separate 
answer sheets must be used; $1.75 per 25; 40€ per 25 
machine-scorable answer sheets; 15€ per scoring key; 
25€ per specimen set, postpaid; 60(65) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates: $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; Sof per key; 50¢ per specimen set.) 


Herbert Schueler, Assistant Professor of 
Education, Queens College, Flushing, New 
York. This test of German grammar uses the 
familiar device of translation from English 
into German as the basis for the test items. 
Each English sentence is followed by a stem 
giving a partial translation (except in the case 
of very short items) followed by five alter- 
native completions from which the student is 
to choose the correct one. It is a test of eighty- 
five items with a time allotment of sixty min- 
utes, which makes it much longer than the 
grammar sections of both the American Coun- 
cil Alpha German Test and the Cooperative 
German Tests. 

Since no lists of syntax frequency are avail- 
able for comparison, any estimate of the ade- 
quacy of the sampling of grammar items tested 
will be largely subjective. About half of the 
items tested involve case inflections, about a 
third involve forms of verbs. Only five items 
involve a problem of word order. No subjunc- 
tive forms are tested directly. In the main, the 
sentences offered are quite short, their form 
colloquial, and their subject matter practical, 
with a minimum of the bookishness and old- 
fashioned style that so often afflict grammar 
textbooks and tests. 

The norm tables are even less adequate than 
those supplied with the USAFI vocabulary and 
reading tests. Less than three hundred cases 
and only a dozen schools are involved, and only 
students with a background of one year of 
high school and two years of high school were 
tested. For this reason, and for the reason that 
standards for the amount of grammar to be 
taught at the various levels vary so greatly 
from school to school and school system to 


school system, it is imperative that every school 
expecting to use this test establish its own 
norms. It will be necessary, in addition, for 
every school expecting to use this test for grade 
placement of students seeking credit for work 
taken in the Armed Forces to compare its 
course of study in German with the range 
of items in structure tested here, to help de- 
termine its validity for use in each particular 
situation, A school using the reading method 
will obviously find less use for a test of this 
kind, especially in the early terms of the sub- 
ject, than a school relying on the grammar- 
translation method. No one-year college norms 
are available for this test. Since the test is 
quite difficult for the one-year and two-year 
high school groups tested (one-year group, 
76th percentile, 44 out of a possible 84; two- 
year group, 71st percentile, 56 out of a pos- 
sible 84), some norms for more advanced 
levels, both high school and college, would have 
been helpful. The results obtained for this test 
should be used cautiously, pending the publi- 
cation of more extensive norms. In any case, 
because of the variability in the amount and 
extent of the teaching of structure in various 
schools and school systems, its use as a basis 
for individual grade placement and the assign- 
ing of individual letter grades for work taken 
in the armed forces is less to be recommended 
than its use as a means of individual diagnosis 
and group comparisons. 
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XExamination in German Reading Comprehen- 
sion: Lower Level. 1-2 years high school or 1 year 
college; 1945; IBM; Form LGR-1-B-4 (usually called 
Form B) ; separate answer sheets must be used; $1.75 
per 25; 40€ ues 25 machine-scorable answer sheets; 15 
per scoring key ; 25¢ per specimen set, postpaid; 50(55 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Sci- 
ence Research Associates: $2 per 25; 65¢ per 25 ma- 
chine-scorable answer sheets; sog per key; 50¢ per 
specimen set.) 


Herbert Schueler, Assistant Professor of 
Education, Queens College, Flushing, New 
Vork. This test of reading comprehension con- 
sists of six reading selections (from about 200 
to 300 words in length), each followed by a 
set of four-alternative multiple-choice comple- 
tion exercises in German. Of the six selections, 
one is an excerpt from a poem by Conrad Fer- 
dinand Meyer, two are narrative, two are ex- 
pository, and one is a short dialogue. All of 
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them are obviously literary in flavor and repre- 
sent a rather one-sided choice of reading mate- 
. rial designed to test reading comprehension. 
One is reminded too much of the foreign- 
language classroom in which the newspaper, 
the popular magazine, the science journal, and 
the cookbook never play a part in the develop- 
ment of reading skill. This restriction to the 
"literary" is a definite limitation on the valid- 
ity of this test. 

"This reviewer does not feel that the best 
way to test reading comprehension in a test of 
this kind is to use the foreign language in the 
multiple-choice completions. There are at least 
three serious disadvantages to the use of the 
foreign language in the test items: (a) It is 
difficult to construct alternatives in which the 
correct answer cannot be taken directly from 
the test. For instance, if in the text you have 
the sentence, “Ich hatte den Eindruck . . . als 
fügte sie Worte hinzu" and then have as one 
of the choices in the test—'"Der Held hatte den 
Eindruck, dass Mademoiselle im Lesen Worte 
hinzufügte," a student who has no idea of 
what the key word "hinzufügen" means can 
get the correct answer by a simple process of 
matching. (This example was taken from the 
test.) (b) If, however, the test constructor is 
careful to avoid such direct matching possi- 
bilities, he is in serious danger of yielding to 
the temptation of constructing items that in- 
volve something much more than the ability to 
comprehend exactly what a paragraph says. 
For instance, in a short dialogue between a 
man and a woman about life, love, and the 
relations between a woman who wishes to give 
herself freely to a man and a man who rejects 
her open devotion because he is concerned 
about his financial ability to found a family, 
the student is asked to consider whether this 
story is laid during a time when young people 
generally could look forward with confidence 
into the future, or when they felt like starting 
a revolution, or when they had little hope for 
the future, or when they didn't worry too much 
about any of those things. The reading selec- 
tion may give clues, to be sure, as to what 
might conceivably have been the most probable 
era in which the story was played, but there is 
little direct evidence in the material given to 
justify confidence in any speculation as to the 
social atmosphere of the time when it occurred. 
For this selection in the test, there are two 
such questions out of the total of seven, each 


of which requires either a knowledge of what 
went on before or pure speculation based on 
clues given in the selection, which make these 
items more a test of intelligence and imagina- 
tion and less of reading comprehension. In an- 
other selection, the student is asked to judge 
whether the selection is written for the pur- 
pose of teaching, amusing, or angering the 
reader, which involves a skill of judging liter- 
ary tone not necessarily associated with read- 
ing ability. (c) A third disadvantage is, that 
with the test item in the foreign language, you 
are posing the problem of comprehending the 
question as well as the paragraph on which it 
is based. If key words in the test item intro- 
duce additional difficulties for the student, for 
instance, then a student comprehending the 
paragraph may nevertheless get the wrong an- 
swer because he did not comprehend the ques- 
tion. In one item on this test the student sees 
the following alternative: “Diese Geschichte 
spielt in einer Zeit, in der junge Menschen mit 
Zuversicht in die Zukunft sehen." The com- 
prehension of the sentence hinges on the word 
Zuversicht; if the student doesn't know the 
meaning, his judgment is based not on his com- 
prehension of the original reading selection but 
on his knowledge of the meaning of one word 
in the test item. There are many such examples 
throughout the test, which seriously limit the 
test's usability, especially if discriminating meas- 
ures of the lower levels of reading ability are 
sought. It would have been much better had 
the test constructors used English instead of 
German in the test items. In that way many of 
the pitfalls mentioned above could have been 
avoided rather easily, and the problem of read- 
ing comprehension could have been narrowed 
down much more effectively to the actual read- 
ing selection. 

This test, and the tests of German vocabulary 
and grammar reviewed below, were designed 
to offer a basis for evaluating the work done 
in the subject by returned servicemen in the 
armed forces for the purpose of securing credit 
and grade placement in civilian schools. A table 
of percentile norms for one year of high school, 
two years of high school, and one year of col- 
lege are furnished with the test. Unfortunately, 
these norm tables, especially for the high school 
levels, are based on too few cases to be used 
with any degree of confidence. The one-year 
high school norms were secured from testing 
only 128 students in seven schools, a sample 
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muchstoo small to form the basis for any re- 
liable comparisons. It is imperative, therefore, 
as the manual accompanying the test takes 
great pains to point out, that each school se- 
cure its own norms based on the administration 
of these tests to its own students, before any 
attempt is made to use the test as a basis for 
assigning credit to returned servicemen. How- 
ever, because of the limitations of this test as 
a test of reading ability, as mentioned above, 
this reviewer would recommend use of such 
tried tests as the Cooperative German Tests, 
if not as a replacement for, then at least as a 
supplement to, this USAFI test. 
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XExamination in German Vocabulary: Lower 
Level. 1-2 years high school or 1 year college; 1945; 
IBM; Form LGV-1-B-4 (usually called Form B); 
separate answer sheets must be used; $1.75 per 25; 40€ 
per 25 machine-scorable answer sheets; 15¢ per scoring 
key; 25¢ per specimen set, postpaid; 45(50) minutes; 
prepared by the Examinations Staff of the United 
States Armed Forces Institute ; published by the Amer- 
ican Council on Education ; distributed by Cooperative 
Test Service. (Also distributed by Science Research 
Associates : $2 per 25; 65¢ per 25 machine-scorable an- 
swer sheets; 50¢ per key; 50¢ per specimen set.) 


Herbert Schueler, Assistant Professor of 
Education, Queens College, Flushing, New 
York. This 45-minute vocabulary test contains 
75 five-alternative multiple-choice items. Each 
item consists of a short German sentence con- 
taining the word or expression to be tested in 
boldface type. The five alternative responses 
are in English and include the correct transla- 
tion of the expression to be tested. Incorporat- 
ing the expression to be tested in a sentence is 
an improvement over the practice followed both 
in the American Council Alpha German Test 
and in the Cooperative German Tests of giving 
in the stem just the words to be tested. Even 
though other variables are introduced by this 
expansion of the stem, especially if the rest of 
the sentence contains other words of equal or 
greater difficulty to the candidate, these addi- 
tional variables are more than compensated for 
by the advantages gained by placing the ex- 
pression in natural semantic surroundings. The 
question of whether to use English or German 
in the alternatives is one of those oft-debated 
questions that probably will never be settled to 
the satisfaction of foreign-language teachers. 
By placing the alternatives in English, a wider 
sampling of vocabulary may be secured, espe- 
cially among those expressions of greatest fre- 
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quency and therefore of greatest importance. 
Use of alternatives in the foreign language, 
however, involves testing by means of syno-_ 
nyms, which in the case of those words of 
greatest frequency, are almost invariably of 
lower frequency and therefore less adapted to 
test the candidates’ knowledge of basic vocabu- 
lary. For this reason the form of item followed 
in this test seems to this reviewer to be the 
best of those so far used. 

The choice of expressions to be tested is, in 
the main, quite good and is well distributed 
over the frequency ranges, as revealed by a 
check with the Eaton Semantic Frequency List 
(which in German is based on Kaeding). About 
one-third of the expressions tested are verbs 
or verb phrases, about one-fifth nouns or noun 
phrases, and another fifth adjectival and ad- 
verbial modifiers. One weakness of the test's 
coverage of verbs is that the seven strong (ir- 
regular) verbs tested all appear in the imper- 
fect tense. The German sentences of the stems 
only occasionally have an armed forces flavor 
and are, with some exceptions, simple and col- 
loquial in style. 

Because of its length and the frequency 
range of the vocabulary tested, this test should 
offer a useful measure of vocabulary range of 
civilian students. If used as a measure for 
placing students seeking credit for study with 
the armed forces into grade levels in foreign- 
language classes, it must be used cautiously. 
The norm tables provided with the test rep- 
resent, especially in the case of the norms for 
the one-year and the two-year high school 
levels, a sample that is much too small. Less 
than three hundred students and fourteen 
schools are involved, hardly a basis for accu- 
rate comparisons and placement. As the very 
helpful manual accompanying this test points 
out, it will be necessary for schools expecting 
to use these tests for placement purposes to 
determine their own local norms first, and if 
they are found to deviate significantly from the 
published norms, to use the former as a basis 
for comparison. This determining of local 
norms would be necessary even with adequate 
national norms, since the content of foreign- 
language courses is so highly variable from one 
institution to the next. In any case, the use of 
this test for diagnostic and guidance purposes, 
as suggested by the manual, may represent a 
clue to its best usefulness. 
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[ 194 ] 
Lundeberg-Tharp Audition Test in German. High 
school and college; 1929; Forms A, B; $2 per 100; 10€ 
-per specimen set; Olay K. Lundeberg and James B. 
Tharp assisted by C. A. Williams; James B. Tharp, 
pause of Education, Ohio State University, Colum- 
us, Ohio. 


Harold B. Dunkel, Examiner, Board of Ex- 
aminations, The University of Chicago, Chi- 
cago, Illinois. These are the pioneering efforts 
to measure the aural comprehension of German 
and are probably the most widely used tests in 
this general area at present. The two forms are 
parallel and generally comparable. In the 50 
items of Part I, Phonetic Accuracy, the stu- 
dent selects, from four German words printed 
on the answer sheet, the one read by the ex- 
aminer, In Part II, Completion Series, the 
student writes, in English or German, the 
word which best completes each of 25 incom- 
plete sentences which are read to him one by 
one. In Part III he writes the object or idea 
defined by a short German sentence. Instruc- 
tions for reading and administering are given, 
as well as norms for high school semesters, 
though the size and nature of the groups used 
in standardizing are not reported. No data on 
reliability or on the intercorrelations between 
the parts are given, but the reviewer believes 
Írom personal experience with these and simi- 
lar materials that the reliabilities are probably 
sufficiently high. 

Reviewers of tests, like the reviewers of an- 
thologies, are often critical merely because the 
authors did not select the same material the 
reviewers would have chosen. And certainly it 
is unfair to ask a test to measure everything 
in forty minutes. With this disclaimer, I men- 
tion two points. In exercises like Part I, “pho- 
netic accuracy” is, for some languages, only 
part of the skill demanded; the student also 
must be able to translate the sounds he hears 
into the conventional spelling of that language. 
In these cases, the test is as much a measure 
of “orthographic” as of “phonetic” accuracy. 
The former is an important skill and, though 
it should not be confused with aural proficiency, 
is probably worth measuring in some aural 
tests. In a language having such relatively 
phonetic spelling as does German, however, 
the task is quite simple (as the norms for these 
tests show); and once the student is past the 
very elementary stages, the test may reflect 
primarily the artificial situation produced by 


the words being isolated from context or the 
failure of the reader to make his phonetic dis- 
tinctions perfectly clear. Some of us, conse- 
quently, would prefer to see Part I for German 
replaced by a measure of ability to comprehend 
longer continuous passages. This ability is not 
highly correlated with the ability measured by 
the completions and definitions; the beginner 
can often cope with single sentences but gets 
swamped by the longer units he is certain to 
meet in conversations, movies, and radio broad- 
casts. Teachers using the present form can 
easily supplement it in this regard. 

The major problem of standardizing aural 
tests is, of course, that of standardizing their 
oral administration. In the directions the au- 
thors have done what they can to obtain com- 
parable situations when the tests must be read 
by many different readers under different cir- 
cumstances. But that variable remains and 
should be remembered by teachers in interpret- 
ing the norms. 

Some minor infelicities (e.g., the misprint 
in Item 29, Part I, Form A and Item 15, Part 
I, Form B or the keying of Item 18, Part III, 
Form B) can be easily cleared up in revision 
and need not affect the present utility of the 
tests. With increased interest in the aural-oral 
skills, teachers will (or should) want to know 
how their classes stack up. This test, within 
its necessary limitations, can be recommended 
to them for this purpose. 


HEBREW 


[ 195 ] 
*Group Test in Siddur Reading. 1043-47 ; 1 form; 
5¢ per test; 15¢ per sample test; Noah Nardi; Jewish 
Education Committee of New York, Inc. 


[ 196 ] 
*Hebrew Aptitude Test. Ages 8-17 and adults; 
1946; 1 form; 5¢ per test; 15¢ per sample test; 48(60) 
minutes; Noah Nardi; Jewish Education Committee 
of New York, Inc. 
REFERENCES 


1. Narni, Noan. “A Test to Measure Aptitude in the Hebrew 
Language." J Ed Psychol 38:167-76 Mr '47. * (PA 22:537) 


[ 197 ] 
xHebrew Intermediate Test. 1046; Forms A, B; 
5€ per test; 15¢ per sample test; Noah Nardi; Jewish 
Education Committee of New York, Inc. 


[ 198 ] 
X*Hebrew Primary Test. First 2 years of a Talmud 
Torah curriculum ; 1943; Forms A, B ; sé per test; 15¢ 
per sample test; (60) minutes; Noah Nardi; Jewish 
Education Committee of New York, Inc. 
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[ 199 ] 

*Cooperative Italian Test. At least 2 semesters; 
1947 ; revision of Experimental Forms Q and S; IBM; 
Form X (experimental Forms Q and S are available 
on order for large quantities) ; separate answer sheets 
need not be used; $1.75 per 25; 25¢ per specimen set, 
postpaid; 4o¢ per 25 machine-scorable answer sheets ; 
15¢ per stencil for scoring answer sheets; Form X: 
40(45) minutes; Forms Q and S: 70(75) minutes; 
Peter Riccio and Anthony Cuffari; Cooperative Test 
Service. 

Elton Hocking, Professor of Modern Lan- 
guages and Head of the Department, Purdue 
University, Lafayette, Indiana. These tests are 
in the familiar pattern of the Cooperative tests 
in foreign languages, scorable by key or machine. 
However, no information is supplied concerning 
construction, validation, norms, etc. 

Of the seventy items in Part I of Form Q, 
27 per cent are based on four anecdotes aver- 
aging ten lines each, The correct responses are 
frequently mere synonyms or’ paraphrases of 
expressions in the anecdote, and they are often 
located for the student by the key statement 
which quotes the anecdote almost verbatim. 
This is valid testing, but it is usually classified 
as vocabulary. The only essential difference 
between such items and those of Part II (Vo- 
cabulary) is that the latter lists the vocabulary 
equivalents in English. Eleven items in Part I 
are concerned with matters of geography, his- 
tory, music, etc. Such questions would perhaps 
be justified in a test of “Italian culture,” but 
they have no place in a test of reading compre- 
hension. This tendency to stray from the pur- 
poses of the test is found also in connection 
with the anecdotes, where some of the “cor- 
rect” responses (Items 57, 58, 61) are based 
not on the facts of the narration but on infer- 
ences which the authors have drawn from 
those facts. If the student draws another in- 
ference, his response is “wrong.” This tend- 
ency reaches the point of absurdity in Item 56, 
which requires the student to pass judgment on 
the anecdote in question. If he judges it to be 
“silly,” his response is “wrong” ; the only “cor- 
rect” response is “amusing.” Of the remaining 
40 items in Part I, the majority should be clas- 
sified as vocabulary rather than reading, for 
the statements (averaging less than one line in 
length) are so brief that the student can only 
compare the key word with the five responses. 
These words, in turn, are too frequently con- 
cerned with food and drink and “daily life” 


vocabulary in general; a student with a fair 
reading knowledge in the Italian classics might 
conceivably be embarrassed by these bread-and- 
butter details. His knowledge of constructions 
and idioms, his ability to deduce meanings from 
the context will not help him here. 

Parts II and III of Form Q show a better 
balance of materials and a good spread in de- 
gree of difficulty. However, they seem not to 
have been checked against Part I for duplica- 
tions, and the whole test shows lapses in tech- 
nique, such as wrong responses which are obvi- 
ous for external reasons and the use, in Part 
III, of only the blank-filling technique. Unfor- 
tunately, some constructions, such as the use 
or omission of the article, do not lend them- 
selves to this device. Again, mere blank-filling 
is too cumbersome to handle the delicate ques- 
tions of word order or correlative constructions. 
Such difficulties could have been overcome by 
a series of items using complete responses and, 
with the fill-ins, the “No additional word 
needed” response. 

In general, the above comments apply also 
to Form S, except that in Part I the statements 
are even shorter, consisting frequently of only 
subject and verb; “cultural ” items have risen 
to 28 per cent; the anecdotes and exercises on 
them are somewhat better. Part III gives verb 
forms complete, and the number of such items 
has risen to 40 per cent. Although the total 
number of items in Form S is 223, and in 
Form Q 245, the same working time (70 min- 
utes) is specified for each. 

These tests lack the expert workmanship 
which generally characterizes the Cooperative 
tests in foreign languages. This is unfortunate 
because these are the only tests of this type yet 
available for Italian. However, the 1947 cata- 
log announces Form X, "made up of items se- 
lected and revised from the two experimental 
forms." No doubt this will be a more useful 


test. 


[ 200 ] 

XExamination in Italian Grammar: Lower Level. 
1-2 years high school or 1 year college; 1945; IBM; 
Form LIG-1-B-4 (usually called Form B) ; no norms 
available; separate answer sheets must be used; $1.75 
per 25; 40f per 25 machine-scorable answer sheets ; 15¢ 
per scoring key ; 25€ per specimen set, postpaid; 50(55) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
Kork, answer sheets; 50¢ per key; 50¢ per specimen 
set. 
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XExamination in Italian Reading Comprehen- 
sion: Lower Level. 1-2 years high school or 1 year 
college; 1945; IBM; Form LIR-1- -4 (usually called 
Form B) ; no norms available ; separate answer sheets 
must be used; $1.75 per 25; 4o¢ per 25 machine- 
Scorable answer Sheets; 15¢ per scoring key; 25¢ per 
Specimen set, postpaid; 50(55) minutes; prepared by 
the Examinations Staff of the United States Armed 
Forces Institute; published by the American Council 
on Education; distributed by Cooperative Test Serv- 
ice. (Also distributed by Science Research Associates: 
$2 per 25; 65¢ per 25 machine-scorable answer sheets; 
50¢ per key; 50¢ per specimen set.) 


[ 202 ] 

XExamination in Italian Vocabulary: Lower Lev- 
el. E high school or 1 year college; 1945; IBM; 
Form IV-1-B-4 (usually called Form B); no norms 
available; separate answer sheets must be used; $1.75 
per 25; 40¢ fe 25 machine-scorable answer sheets ; 15¢ 
per scoring key ; 25¢ per specimen set, postpaid; 50(55) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service, (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
rae answer sheets; 50¢ per key ; 50¢ per specimen 
set, 
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*[ Cooperative Latin Comprehension Test.] 1-2 
jus high school or 1 year college, more than 2 years 

igh school or more than 1 year college; 1942; this test 
is, as the booklet title indicates, Part 1 of the Coopera- 
live Latin Test: Lower and Higher Levels; IBM; 
Form S; 2 levels; separate answer sheets need not be 
used ; $1.75 per 25; 25¢ per specimen sct of either level, 
Postpaid; 40¢ per 25 machine-scorable answer sheets ; 
15¢ per stencil for scoring answer sheets ; 40(45) min- 
utes; Harold V. King and Geraldine Spaulding. Coop- 
erative Test Service. 
4) LOWER LEVEL, I-2 years high school or 1 year college. 
b) micer LEVEL. More than 2 years high school or 
more than 1 year college. 


Hazel M. Toliver, Instructor in Latin, Indi- 
ana University, Bloomington, Indiana, With 
these tests is furnished a pamphlet containing 
an abundance of material on administering and 
scoring the tests and on tabulating and inter- 
preting the results. Tables of norms and percen- 
tile scores based on the scores of a large number 
of students are supplied. 

Both tests are unusually accurate and well 
edited. The only error seems to be in the key for 
Item 14 in Section 2 of the Higher Level test. 

Each test contains three divisions of multiple- 
choice questions: a vocabulary section of sixty 
words ; ten Latin sentences to be completed ; and 
Latin passages (five in the Lower Level test and 
seven in the Higher Level) upon which are 
based statements to be completed. 

The vocabulary section is useful in checking 


the student’s knowledge of Latin words, al- 
though isolated words do not measure real un- 
derstanding of words in context. 

The Latin sentences to be completed offer a 
fair representation of typical Latin idioms and 
constructions and should, to some extent, meas- 
ure general knowledge of grammar and syn- 
tax. Actually, however, the lists of words and 
phrases from which the student must complete 
the sentences are such that the choice depends 
much more upon the actual meanings involved 
than upon any grammatical or syntactical differ- 
ences among the answers. Hence, this division 
tests vocabulary and comprehension more effec- 
tively than it does grammar and syntax. Perhaps 
the test would be better if this division were 
lengthened and the vocabulary section shortened 
or eliminated. 

The Latin passages in the third division be- 
come progressively more difficult and are well 
calculated to test the comprehension ability of 
even the best students. The statements based 
upon the passages involve both factual and 
thought answers. In some cases, because the 
statements fail to cover the material sufficiently, 
a student whose understanding of the whole 
passage was rather hazy could answer most of 
the questions. Since such passages must be taken 
out of context and usually offer some difficulty 
because of this fact, this section might be im- 
proved by reducing the number of passages, 
lengthening those retained, and supplying more 
questions—preferably such as would require 
real insight on the student’s part. 

On the whole, these tests could be expected 
to measure with reasonable accuracy what they 
claim to measure—ability to comprehend the 
thought contained in the Latin, 
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Cooperative Latin Test: Elementary and Ad- 
vanced Forms. 1-3 semesters high school or 1 semes- 
ter college, at least 2 years high school or 1 year 
college; 1939-41 ; IBM ; Forms P, Q, R; 2 levels; sepa- 
rate answer sheets need not be used; $1.75 per 25; 25¢ 
per specimen set of either level, postpaid; 4o¢ per 
25 machine-scorable answer sheets; 15¢ per stencil for 
scoring answer sheets; 40(45) minutes; Form P: 
John C. Kirtland; Forms Q and R: George A. Land; 
Cooperative Test Service. 
4) ELEMENTARY FORM. I-3 semesters high school or 
I semester college. 
b) ADVANCED FORM. At least 2 years high school or 
1 year college. 

REFERENCES 


L3 Mie AxrHux E, “The Relation of Vocabulary and 
Grammar to Reading Achievement in Latin, French, and Span- 
ish," pp. 61-5. In 1947 Achievement Inde- 
pendent Schools and Suppl, n Educational Rec- 
ords Bulletin, No. 48. New York: Educational Records Bureau, 
June 1947. Pp. xii, 66. Paper, lithotyped. $2.00. * (PA 22:449) 


Tenino Program in 
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For reviews by S. D. Adkins, Harold B. 
Dunkel, John Flagg Gummere, and Norman T. 
Pratt, Jr., see 38:1064, 38:1065, and 40:1365. 
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*Cooperative Latin Test: Lower and Higher 
Levels. 1-2 years high school or 1 year college, more 
than 2 years high school or more than 1 year college; 
1942; IBM; Form S; 2 levels; separate answer sheets 
need not be used; $2 per 25; 25¢ per specimen set of 
either level, postpaid ; 60¢ per 25 machine-scorable an- 
swer sheets; 30¢ E stencil for scoring answer sheets ; 
80(85) minutes; Harold V. King and Geraldine Spaul- 
ding; Cooperative Test Service. 

a) LOWER LEVEL. I-2 years high school or 1 year college. 
b) HIGHER LEVEL. More than 2 years high school or 
more than 1 year college. 


REFERENCES 


Cpapereye Tests s „French, in, 
athematics, Form . $59. hievi d 
Program in "Independent. «pM A slemonsery dies 
Educational Records Bulletin, No, 36, New York; 

Records Bureau, June 1942. Pp. E 

1.80. 


C. W. Odell, Professor of Education, Univer- 
sity of Illinois, Urbana, Illinois. Although these 
tests probably deserve rank as the best tests deal- 
ing with the phases of Latin which they cover, 
they are still subject to a number of adverse 
criticisms, Each consists of three parts, one of 
which is subdivided. Part I, Comprehension, 
calls for selecting the proper one of five English 
words for each of a number of Latin words, of 
five Latin words or phrases to complete each of 
a number of incomplete Latin statements, and 
of English words, phrases, or clauses to com- 
plete statements about Latin passages which 
have been read. Part II, Grammar, calls for a 
selection of proper endings and forms to com- 
plete Latin sentences for which the English 
translations are given. Part III, Civilization, is 
likewise in multiple-answer form and deals with 
varied aspects of Roman history, society, litera- 
ture, private life, and so forth. The general form 
and setup of the tests is good. The fact that the 
answers are to be placed in parentheses instead 
of blanks makes scoring somewhat slower. The 
directions for giving and scoring, the norms, 
and other supplementary materials likewise ap- 
pear to be satisfactory. 

The content appears to be based almost en- 
tirely on the traditional sequence of Caesar, 
Cicero, and Virgil, so that the tests are not well 
adapted for use in schools following the modern 
trend in the selection of content. In Part I, 
Section 1, it would be desirable to have all of 
each group of five suggested answers the same 
part of speech. As they are, alert pupils may 
derive some clues as to which ones to eliminate 
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from consideration. In the Lower Level, Part I, 
Section 2, Item 15, ten days are referred to as 
“a little more than a week" ; in Item 29, “to be” 
should be “to have been" ; in Item 36, Choice 4 
seems a correct answer as well as 5. In Part II, 
Item 28, "his" is in italics for no apparent rea- 
son. In Part ILI, Item 20, the correct answer is 
that clocks were unfamiliar to the Romans, a 
statement which is not true unless limited to 
modern types of clocks. In the Higher Level, 
Part II, Item 20, “these” is translated “illae” 
rather than “hae”; in Items 22 and 27 similar 
constructions are differently translated, that 
used in Item 27 being poor usage if not abso- 
lutely incorrect. 

The third parts of the two levels contain five 
elements that are exactly the same and two 
others almost so. Moreover, there are several 
items in this part of both levels that call for 
knowledge that probably would have been ac- 
quired only by reading specific sections of Latin 
authors that might easily not be included in a 
high school course. 

The form “Antonius” rather than “Anthony” 
is used, but inconsistently “Pompey” rather 
than “Pompeius” and “Octavian” rather than 
“Octavianus.” 

[ 206 ] 
Kansas First Year Latin Test. First, second semes- 
ters high school; 1996; a foci eed) CAU. 
Pin Mir Alice Seller, ich "Bellinger, and H. E. 
Schrammel; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. 
d) TEST I, FORMS A AND B, First semester. 
b) TEST V, FORMS C AND D. Second semester. 

Hazel M. Toliver, Instructor in Latin, Indi- 
ana University, Bloomington, Indiana, The four 
forms of this test are similar in structure and 
also in the amazing number of errors which they 
contain. Misprints include prevénit and thon- 
sand in Form A, dabés and lead in Form B, 
adivisse and urnos in Form C, and appelavit and 
miitibus in Form D. Mistakes and omissions in 
the markings of long vowels are inexcusably 
frequent. Other errors are the incorrect syllable 
division of comparübüs in Form A and King 
Philippi in Form C, and the illogical tense of 
respondet in the reading passage of Form A . 
and of exclamat in Form B. 

Part I of each form contains a Latin passage 
with true-false statements based upon it. The 
value of such questions depends upon careful 
selection of material and carefully worded ques- 
tions requiring real insight on the student's part. 
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Yet Form A offers a story commonly read in 
grade school. Most students could probably 
answer at least 6 of the 15 questions without 
having looked at the Latin. In Form C the selec- 
tion is none too clear ; the first true-false state- 
ment is not completely covered in the passage; 
the fifth is couched in extremely odd English ; 
the ninth is puzzling and cannot be answered 
from the Latin ; and the truth or falsity of the 
eleventh and sixteenth statements is not clearly 
ascertainable from the Latin passage. This part 
of the test is better handled on Forms B and D, 
although there might be some question about 
the material used on Form D, since many 
first-year Latin textbooks contain accounts of 
Aeneas's adventures. 

Part II on Forms C and D, designed to test 
knowledge of forms and syntax, is poor because 
only one true-false explanation for the construc- 
tion of each of several words and phrases from 
the reading passage is offered. Part II on Forms 
A and B, with multiple-choice questions with 
the same objective, is more effective. Item 22 
on Form C explains sub dicióne as "ablative 
with preposition, sub, to indicate place where 
something is carried on." Since no actual place 
is involved, such a true-false statement could 
be confusing for the student. For Item 27 the 
key is wrong. A practice pedagogically objec- 
tionable is followed in Part II of Forms A and 
Band in Part III of Form C, where nonexistent 
forms of Latin words are presented as possible 
answers. 

Part III on Forms C and D tests knowledge 
of certain elementary principles of grammar and 
syntax, On Form C some five of the rules listed 
are hardly appropriate because many first-year 
students never hear of them. On Form D this 
part is badly arranged, and the scoring is made 
awkward by the fact that the number arrange- 
ment on the key does not correspond to that on 
the test. In Item 33 the rule itself supplies the 
correct answer. 

Part V on Forms C and D and Part III on 
Form B, word-study questions, merely list the 
English derivatives. Part III on Form A is 
better because the English words appear in 
sentences. There are two blunders in definition: 
impecunious is defined as “lack of money" and 
amiable as “affectionate.” 

All four forms of this test have too many 
yes-or-no questions, which invite guessing and 
reduce the effectiveness of the test. No penalty 
for guessing is provided in the scoring, and the 
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questions are not weighted properly since the 
yes-or-no type has the same value as the multi- 
ple-choice. Furthermore, the proportion of ques- 
tions testing language elements compared with 
those on background and derivatives is much 
too small. 

Although there is some good testing material 
in these tests, this reviewer could not honestly 
recommend them, in their present form, to any 
teacher who desired effective and accurate meas- 
uring instruments. 


For review by John Flagg Gummere, see 
40:1368. 
[ 207 ] 
Orleans-Solomon Latin Prognosis Test. Grades 9- 
16; 1926; Form A ; $1.70 per 25; 35 per specimen set, 
postpaid; 50(55) minutes; Jacob S. Orleans and 
Michael Solomon ; World Book Co. 


C. W. Odell, Professor of Education, Univer- 
sity of Illinois, Urbana, Illinois. Although it has 
been more than 20 years since this test appeared, 
it still merits use for the prediction of probable 
success in Latin courses. The test consists 
mainly of a series of short lessons to be studied 
followed by tests over their content, It is there- 
fore valid only for those who have never studied 
Latin previously. Its use indicates that a co- 
efficient of correlation of .70 or higher can 
usually be secured between the scores on this 
test and success in beginning Latin courses. 

The scoring provisions are one of the points 
of weakness, The form of answer called for in 
some cases is such as to allow some element of 
subjectivity in scoring, but it is doubtful whether 
this can be avoided without weakening the func- 
tioning of the test. In some of the tests only 
certain aspects of the answers are scored, and 
others, even if incorrect, do not count. It is at 
least doubtful whether this is better than scoring 
on all aspects of the answers. The scoring key 
for Tests 8 and 9 is not sufficiently specific. For 
example, for “puella” it is not clear whether to 
be considered correct, the answer must be "the 
girl,” as given in the key, or whether "girl" or 
even "girls" should be accepted. The maximum 
score on the test is only 222, but the form for 
tabulation goes up to 242. Moreover, all the in- 
tervals provided therein are not the same, thus 
violating good statistical practice. 

Tn several instances there appears to have been 
carelessness in wording. Thus, in Lesson 2, the 
practice as to whether or not an article is used 
before a Latin noun is not consistent. In Test 2, 
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(b) and (e) contain “if,” which it would have 
been better to omit. Lessons 3 and 4 are so 
nearly repetitions that they might well have 
been combined. Lesson 4 and some others there- 
after close with the statement, “Think what this 
lesson teaches you.” It would seem better to 
have followed this with more specific statements 
or reminders. 

Some of the suggested answers to Test 1 are 
probably not commonly known to ninth grade 
pupils. Examples are “equitant,” “equitation,” 
“militate,” “matriarch.” Latin words for which 
English derivatives would be more commonly 
known might well have been chosen. Test 4 con- 
tains the sentence, “The eagle destroys the 
land,” which is utter nonsense insofar as its 
meaning is concerned. 

Other points about which questions might 
well be raised are the desirability of having 
pupils tear off the list of study words rather 
than of merely placing it at the first and not 
allowing them to turn back; the reason why 
Lessons 3 and 4 and the following tests, and 
also Tests 8 and 9, are not united into one since 
they are very similar ; and the practice of trans- 
lating first person singular, present active indica- 
tive verbs by infinitives, for example, “narro— 
to tell.” 
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XExamination in Spanish Grammar: Lower Lev- 
el. 1-2 years high school or 1 year college; 1944; IBM; 
Form LSG-1-B-4 (usually called Form B); separate 
answer sheets must be used; $1.75 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
25¢ per specimen set, postpaid; 40(45) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education ; distributed by Cooperative Test 
Service. (Also distributed by Science Research Associ- 
ates: $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; 50¢ per key; 50£ per specimen set.) 


Frederick B. Agard, Associate Professor of 
Modern Languages, Cornell University, Ithaca, 
New York. This forty-minute test of ele- 
mentary Spanish grammar, now available in 
the civilian language-teaching field, employs 
the established multiple-choice technique for 
objective tests of the achievement type. The 
composition of its sixty items is the same as 
that used in the grammar sections of the Co- 
operative Test Service and the College Entrance 
Examination Board tests. In each item, a short 
English sentence is followed by its Spanish 
equivalent in which one word or phrase, em- 


bodying the grammatical point to be tested, is 
omitted. Below the pair of sentences appear two, 
three, four, or five (in most items five) words 
or phrases, one of which is the grammatically 
correct missing element of the Spanish version. 
The student indicates his answer on a separate 
sheet (which may be machine scored). The dis- 
tractors are all composed of real Spanish forms; 
in most cases they are merely syntactically incor- 
rect within the given sentence, and in a few 
items they offer meaningless sequences of forms 
(e.g., soy yendo a abrir). 

Of the sixty points of syntax tested, fifty-two 
have a range of 35 or higher in Keniston’s Span- 
ish Syntax List and are thus to be regarded as 
having the same relative importance as the first 
500 words in Buchanan's Graded Spanish Word 
Book or Keniston's Standard List of Spanish 
Words and Idioms. Nearly half of the fifty-two 
have a range of 55 or above, and eighteen of 
these are in the top range of 60 or in the assured 
high frequencies which were not counted, 
Within the high ranges a considerable weight is 
given to the handling of personal pronouns and 
possessive forms; some teachers may well feel 
that this occurs at the expense of verb-form 
treatment (tense, person, number). In the 
ranges below 35, we find the following eight 
items: contigo (Item 22), ser in impersonal ex- 
pression of time (Item 23), relative a quienes 
object of verb (Item 23), possessive su propio 
(Item 33), subjunctive after quienquiera que 
(Item 12), imperfect subjunctive after si in 
conditional sentence (Item 22), interrogative 
cuál before a de-phrase (Item 6), and acaba de 
plus infinitive (Item 21). Since the range goes 
as low as 14 before passing the 2,000-mark in 
vocabulary, most teachers will find no objection 
to these eight items which, moreover, are dealt 
with in most elementary grammars. For exam- 
ple, the use of cudl before a de-phrase (range 
of 6) merges, at the pedagogical level, with the 
generalized use of that interrogative form. Ac- 
tually, only the one item quienquiera que plus 
subjunctive seems to fall outside the reasonable 
limits of grammatical testing at the level set by 
this test. 

There is one disadvantage connected with the 
scoring. Since the number of answer choices 
varies from two to five, there can be no practi- 
cable formula for subtracting a fraction of 
wrong answers from the rights and thus com- 
pensating for guesswork. Except for this draw- 
back, which not all teachers will consider im- 
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portant, this test seems to be as good as, but no 
better than, competing tests of Spanish grammar 
at the basic level. 


[ 209 ] 


XExamination in Spanish Reading Comprehen- 
sion: Lower Level. 1-2 years high school or 1 year 
college; 1944; IBM; Form LSR-1-B-4 (usually called 
Form B) ; separate answer sheets must be used; $1.75 
per 25; 40f per 25 machine-scorable answer sheets; 
15¢ per scoring key; 25¢ per specimen set, postpaid; 
40(45) minutes; prepared by the Examinations Staff 
of the United States Armed Forces Institute; pub- 
lished by the American Council on Education ; distrib- 
uted by Cooperative Test Service. (Also distributed by 
Science Research Associates: $2 per 25; 65¢ per 25 
machine-scorable answer sheets; soé per key; so¢ per 
specimen set.) 


Harry J. Russell, Associate Professor of Ro- 
manic Languages, Miami University, Oxford, 
Ohio. The test technique employed in the con- 
struction of the examination is to present pas- 
sages of varying length, both prose and verse, 
upon which from four to seven incomplete 
Spanish statements are based, with three Span- 
ish completions furnished for each statement. 
The procedures used in constructing the test are 
not too clearly defined except in the rather gen- 
eral statements furnished in the manual. 

The test items are, generally speaking, fairly 
normal reading situations. The examination, 
however, seems to have the following defects: 
(a) The level measured is not too clearly de- 
fined. (b) The vocabulary difficulty is not care- 
fully controlled. (c) The cognates that were 
probably assumed by the test technicians to be 
"gift words" are frequently not known to the 
students for whom the test was designed. (d) 
The reading materials are not arranged in the 
order of progressive difficulty. (e) The reading 
passages are made up of prose as well as verse 
and are of a decidedly unequal length, and com- 
prehension of them is tested by many or few 
test items, depending upon whims of the test 
technicians that seem to have no relationship to 
either their length or their difficulty. (f) The 
time allowed for the completion of the test is 
inadequate. 

For some time it has been rather generally 
accepted that the maximum foreign-language 
vocabulary of a one-year high school student is 
1,000 words; that of a two-year high school, or 
a one-year college student, is 2,500 words. If 
this assumption is correct, there are 53 words, 
or 6.6 per cent, in this test that are beyond the 
two-year high school level, and 70 words, or 
7-4 per cent, that are beyond the one-year high 


school level. The burden of unknown words 
as compared with known ones is entirely too 
high in both cases for the levels specified. It 
would seem, therefore, that the term "Lower 
Level" is erroneous as applied to this test. 

The vocabulary difficulty of the reading pas- 
sages varies considerably. Passage VI, for in- 
stance, contains only two words that would not 
be known to a two-year high school student, 
while II, containing only ten more running 
words than VI, has seven such words. The most 
difficult selection in terms of known to unknown 
words is III rather than the last passage in the 
test. It ought to be possible to assume that a 
reading test is arranged in progressive order of 
difficulty, that is, graduated from easiest to 
hardest in terms of test items. 

Words that look like English, and that have 
the same meaning as their English counterpart, 
are only considered to be cognates when they 
are known to the person being tested. A series 
of tests designed by E. I. Dale of Ohio State 
University to test the English vocabulary knowl- 
edge of juniors and seniors in high school and 
freshmen in college proves that such words as 
“eminent,” “opulence,” “preponderance,” “pre- 
sumptious,” etc., all used in this test, are not 
likely to be known by this age student. They 
must, therefore, be considered “burden words” 
rather than “recognizable cognates.” 

It is questionable whether the inclusion of 
poetry in a “lower level” test is either legitimate 
or, wise, especially when that verse is as highly 
philosophical as Passage V is and when so 
much depends upon the student being able to 
comprehend it (17.5 per cent of the test items 
in this examination are based on this passage). 

Passage I contains 205 running words; VII, 
133, IIT, 188, and V, 127. The average test 
technician would go from simple to complex 
rather than reverse the process. The passage that 
contains 205 words is tested by 6 test items, 
while the one that contains 127 words has 7 test 
items, There seems to be no correlation between 
the length and difficulty of the passage to the 
number of test items based on it. 

There are 1,057 running words in the seven 
passages of this examination. One is considered 
to be a slow reader, and therefore inefficient, 
who reads at a rate of less than 250 words per 
minute. At that rate the students should be able 
to read all of the test passages in less than five 
minutes. Tt seems strange to allow 45 minutes 
to decipher 40 test items based on passages that 
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should take no longer than five minutes to read. 

Only three completion items are furnished 
with each incomplete statement in this test. The 
penalty for guessing (a penalty that should be 
imposed for all multiple-choice type examina- 
tions) when only 3 completion items are fur- 
nished is very heavy. It would seem that the 
normal of five completion items should be fur- 
nished with each incomplete statement in order 
to diminish the penalty imposed. 

The correct answer to a majority of the test 
items is almost a repetition of the exact language 
used in the passage being tested. A clever stu- 
dent need only match up these words to get the 
correct answer. There is no way of ascertain- 
ing whether he actually comprehended what he 
read or was merely astute in matching. This is a 
departure from the accepted technique which, it 
would seem, is inadvisable. 


[ 210 ] 
*Examination in Spanish Vocabulary: Lower 
Level. 1-2 years high school or 1 year college; 1944; 
IPM; Form LSV-1-B-4 (usually called Form B) ; sep- 
arate answer sheets must be used; $1.75 per 25; 40€ 
per 25 machine-scorable answer sheets ; 15¢ per scoring 
key; 25¢ per specimen set, postpaid; 40(45) minutes; 
prepared by the Examinations Staff of the United 
States Armed Forces Institute ; published by the Amer- 
ican Council on Education; distributed by Cooperative 
Test Service. (Also distributed by Science Research 
Associates : $2 per 25; 65¢ per 25 machine-scorable an- 
swer sheets ; so per key ; 50é per specimen set.) 

[211 ] 
Lundeberg-Tharp Audition Test in Spanish. High 
school and college; 1929; Forms A, B ; $2 per 100; 10€ 
per specimen set; (30) minutes; Olav K. Lundeberg 
ard James B. Tharp; James B. Tharp. Collége of Edu- 
cation, Ohio State University, Columbus, Ohio. 


Frederick B. Agard, Associate Professor of 
Modern Languages, Cornell University, Ithaca, 
New York. This is a time-tested instrument 
for measuring Spanish auditory apprehension, 
constructed by qualified experts in the field 
of testing. Accompanying norm tables, com- 
piled for each form separately, are furnished 
for two or four high school semesters; for 
two or four college semesters ; or for one, two, 
or three college quarters. The test has three 
parts: I, Phonetic Accuracy; II, Completion 
Series ; III, Definition Series. 

The spoken material in Part I consists of 
fifty utterances, ranging progressively from two 
through seven syllables each. Corresponding to 
each spoken item (uttered twice in succession), 
there are printed on the test paper four partially 
homophonous words or expressions, one of 
which is identical to the spoken stimulus. The 


student is to determine which one of the four 
was pronounced. The choices are all real Spanish 
but are mere segments of hypothetical speech 
with no importance attached to their meaning ; 
the student does not need to know their mean- 
ing, but merely to apprehend accurately their 
phonetic pattern; in addition, he needs to use 
an exact knowledge of the written representa- 
tion of Spanish sounds, in order to distinguish 
in writing the heard utterance from its partial 
homophones. 

In Part II the examiner reads 25 incomplete 
sentences in Spanish. After listening to a sen- 
tence once only, the student is to write down 
in English (or in Spanish if he prefers) a word 
or phrase which will reasonably complete the 
unfinished spoken sentence. A key of acceptable 
answers is supplied with the test. 

The vocabulary employed in the incomplete 
sentences is carefully controlled. Six words in 
Form A and seven in Form B are not listed in 
the Keniston Standard List of Spanish Words 
and Idioms. These few, most of which have 
English cognates (eg, invención, alfabeto, 
aritmético), are all understandable within their 
context or are unessential to comprehension of 
the sentence as a whole; in one item the key 
word is pizarra, a class-room term which can be 
assumed to be known by most students. 

In Part III, the examiner reads 25 statements 
in Spanish, each defining some object, person, 
or action. After listening to a definition once 
only, the student identifies what has been de- 
fined by writing down the English (or Span- 
ish) word for it. A key of acceptable answers is 
supplied. 

The vocabulary of the Definition Series, 
slightly more difficult than that of the comple- 
tion items, is no less rigidly controlled. Sixteen 
words in Form A and 21 in Form B are not in 
the Standard List; most of these are cognates 
of English words, and all are either deducible 
from context or are nonessential elements of the 
sentence, with the sole exception of muñeca, a 
key word in one item. The sentence structure 
and style, while uncomplicated, has the inevi- 
table dictionary flavor of definitions and thus 
confronts the student with turns of phrase not 
commonly met with in the spoken language un- 
der normal conditions of its use. 

As a whole the Lundeberg-Tharp Test repre- 
sents a valuable pioneer contribution to the field 
of aural testing. It succeeds in eliminating the 
variables of unknown vocabulary and of knowl- 
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edge any more specialized than the common 
fund possessed by all people as social beings. 
It is true that Part I does not measure ability 
to apprehend meaningful speech; and it is 
doubtful that Parts II and III measure differing 
degrees of aural skill, since the utterances are 
of equal length in both parts. The test does not, 
for example, measure the ability to apprehend 
sustained discourse such as conversations or 
anecdotes of fifty or a hundred words. I have 
come to believe that for students completing one 
or two years of Spanish the ideal aural test, 
which to my limited knowledge has yet to be 
built, will measure the ability to apprehend 
utterances of varying length, graduated from 
Buenos días and ¿4 dónde va Vd.? to a five- 
minute conversation or talk, Rather than in- 
complete sentences and definitions (which are 
not normal forms of speech) the student ought 
to be tested with realistic “slices of speech” 
similar to utterances he would hear in real con- 
versation and would be expected to react to 
by speech or action. In the test situation we 
would then provide for a “substitute reaction” 
in the form of choosing among different pos- 
sibilities a reply, or an act, or a restatement of 
content. 


Walter V. Kaulfers, Associate Professor of 
Education, Stanford University, Stanford, 
California. The reliability coefficients of .92 
and .92 for Forms A and B, respectively, of 
this relatively short and easy-to-administer 
aural-comprehension test indicate that it de- 
serves publication in a more attractive form 
than the present “tentative edition” affords. In 
case the publication of a more finished edition 
is contemplated, the utility of the test could be 
improved by substituting a strictly objective 
multiple-choice form of response in Parts II 
(completion of sentences dictated in Spanish) 
and ITI (identifying descriptive definitions) for 
the handwritten (Spanish or English) response 
that is now required of students. As Parts II 
and III now stand, only a corrector with a 
knowledge of both Spanish and English (and 
his own notions of fairness in accepting or re- 
jecting answers) can undertake to score the test 
—and at a considerable expenditure of time in 
the case of large groups. It is unfortunate that 
the multiple-choice form of response used in 
Part I (identifying the printed form of a word, 
in a group of four words, from hearing the key 


word pronounced twice) could not have been 
adapted for use in Parts II and'III. 

In revising the tentative edition for final pub- 
lication, the authors may also wish to consider 
the possibility of making the test far more ob- 
jective and valid by providing for its (optional) 
administration entirely via records on tone-con- 
trol talking machines, capitalizing on different 
native speakers. As the test now stands, it is 
likely to guarantee only a rough measure of how 
well the students can understand their own 
teacher. The norms do not indicate whose 
voice(s) the experimental groups heard during 
the tryouts. For this reason, intergroup compari- 
sons based on the test would be difficult to 
interpret. 

Needless to add, any test that requires ability 
to handle printed words (as in Part I) measures 
more than just aural comprehension. Any of 
the thousands of Spanish-speaking natives who 
understand Spanish without serious difficulty, 
but who have never learned to read or write, 
would fail every part of the present edition. If 
it is to have other than intramural scholastic 
interest, the final revision might well capitalize 
on some of the practical devices used in 
machine-scorable group performance scales. As 
the first venture in aural-comprehension testing 
to appear on the foreign language horizon, the 
Lundeberg-Tharp Audition Test deserves credit 
as a promising pioneering start toward a more 
finished product. It is doubtful, however, if an 
aural-comprehension test useful for purposes 
of vocational placement can be developed with- 
out the aid of a substantial grant from some 
educational or government foundation to finance 
the cost of deriving and validating norms de- 
fined in operational terms. 


Mod Lang J 28:699 D '44. Winthrop H. 
Rice. This audition test in Spanish is made up 
of three parts; part I, “Phonetic Accuracy,” 50 
multiple choice items wherein the student indi- 
cates which of four words has been pronounced 
by the examiner; part II, “Completion Series" 
(25 items), wherein the examiner reads a state- 
ment lacking the final word which the student 
is to supply in either Spanish or English (as 
determined by the examiner) ; part III, “Defi- 
nition Series" (25 items), wherein the examiner 
reads a definition and the student sets down. 
in Spanish or English, the name of the thing 
defined. * In general, the construction of the 
test is excellent. The selection of words for 
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the multiple choice part has been done with 
great care and, one might add, ingenuity. The 
elements are sufficiently different so that a 
moderately well-trained ear can grasp the dif- 
ferences yet they are sufficiently similar to per- 
mit of mistakes through careless listening or 
poor training. The other two parts are con- 
structed with similar care. There are, however, 
one or two questions which might be raised. 
First, one wonders why the single words of 
part I are read twice while the long, and some 
times involved, sentences of parts II and III 
are to be read only once. This seems to put the 
student at some disadvantage in the latter parts 
where not only sound but comprehension of 
vocabulary and sentence structure are involved. 
For example, it seems a bit strong to read only 
once, and at normal speed, such a definition as 
“Instrumento de hierro con anzuelo doble que 
está afirmado al extremo de un cable y que se 
arroja al mar para asegurar y detener las em- 
barcaciones," while such words as “zozobras” 
and "jayán" (to pick two of the most difficult) 
are read twice. The same remark might apply 
to such a completion question as "Si se saca 
al pez de su ambiente natural, se muere dentro 
de pocos minutos porque no puede respirar 
fuera del. . . .” It would be the suggestion of 
this reviewer that the number of readings for 
the three parts be changed to one for the first 
and two for each of the others. Such a change 
would probably bring the level of the norms 
for the different parts more in line with one 
another. The norms for fourth semester high 
school are given as 45.7 (out of 50) for part 
I; 11.2 (out of 25) for part II; and 6.2 (out 
of 25) for part III. It will be noted also that 
the norms are considerably lower on the parts 
where more than pure audition is involved. 
Three remarks concerning form B: 1) in part 
I, number 5, the key word is pazo; the four 
possibilities are paso, vaso, pazo and pazco. In 
classes using the seseo, this would allow two 
correct answers, and a change seems indicated ; 
2) in number 47, the key nos esperad is gram- 
matically incorrect; 3) in part II, statement 11 
(La leche es blanca pero el vino es general- 
mente . . .) which expects the answer red or 
rojo is open to question on the basis of fact. 
In spite of these minor difficulties, the test re- 
mains one of the best of its kind available to 
teachers. Statistical data show good reliability 
and validity which were reflected in sample 
testings made at this reviewer's institution, 


where several classes of different levels of ex- 
perience and of ability definitely showed these 
differences in their scores on the test. 


[212 ] 
Iowa Placement Examinations: Spanish Train- 
ing: Series ST1, Revised. Grades 12-13; 1024-26 ; 
Forms A, B; $4 per 100; 50¢ per specimen set, includ- 
ing the other 10 tests in the series, postpaid; 43(50) 
minutes; prepared under the direction of C. E. Sea- 
shore and G. M. Ruch; G. E. Vander Beke and G. D. 
Stoddard; Bureau of Educational Research and Serv- 
ice, State University of Iowa. 
REFERENCES 
1. Sropparp, Georce Dinsmore. Iowa Placement Examina- 


tions. University of Iowa, Studies in Education, Vol. 3, No. 2, 


Towa City, Iowa: the University, 1925. Pp. 103. Paper. $1.00. * 

2. MILLER, LAWRENCE WILLIAM. An Experimental Study of 
the Iowa Placement Examinations. University of Iowa Studies 
in Education, Vol. s, No. 6. Iowa City, Iowa: the University, 
1930. Pp. 116. Paper. $1.00. * (PA 5:799) 


Harry J. Russell, Associate Professor of Ro- 
manic Languages, Miami University, Oxford, 
Ohio. The examinations consist of Part 1, a 
vocabulary test of 50 items; Part 2, a grammar 
test of 40 items; Part 3, a verb test of 40 items ; 
and Part 4, a reading test containing three 
paragraphs on which are based 20 questions. 
The technique employed in testing the vocabu- 
lary is to give the Spanish word and then list 
five possible English translations, one of which 
is correct. The grammar technique is to give a 
Spanish sentence containing one grammatical 
error. The student is to underline this error and 
then write the correct word on a dotted line to 
the right. The verbs are tested by sentences in 
which the verbs occur in the infinitive form 
inclosed in parentheses, The student is to write 
each verb in the correct tense and person re- 
quired by the remainder of the sentence. The 
reading paragraphs are tested by English ques- 
tions to be answered briefly in English. 

There is no way of determining what the 
basis of selection of the vocabulary items com- 
prising the vocabulary test might have been be- 
cause a statement clarifying this point is not 
contained in the manual, nor was it possible for 
the reviewer to find it in any of the literature 
on the subject which was consulted. The follow- 
ing analysis of this test, however, may be of 
some value to the reader. If we are justified in 
assuming that the first-year high school stu- 
dent, or the first-semester college student, has 
a vocabulary of approximately 1,000 most com- 
mon Spanish words, this section of the examina- 
tion is entirely too difficult to test this level of 
instruction. The vocabulary in Form B con- 
tains 27 words, or 54 per cent, that would not 
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be known to first-year high school students. 
There are 10 words, or 20 per cent, that would 
not be known to second-year high school, or 
first-year college students, provided that it is 
safe to assume that at this level of instruction 
they would have a vocabulary consisting of the 
2,500 most common Spanish words. Form A is 
much better on this score since only 14, or 28 
per cent, of the words tested would be beyond 
the 1,000-word level; and 4, or 8 per cent, be- 
yond the 2,500-word level. The technique em- 
ployed in testing these words is a well-established 
one and, therefore, entirely acceptable. 

The grammar test was constructed without 
the benefit of The Spanish Syntax List and is, 
therefore, a random choice of the grammatical 
phenomena to be tested. Considerable stress is 
placed on phenomena that are now known to 
be obscure or of decidedly low frequency. The 
technique of writing grammatically incorrect 
sentences to test the student’s acumen in recog- 
nizing the errors has been seriously questioned 
by psychologists. This test is, of course, not 
the least bit objective, since all of the informa- 
tion required is active and to be written in by 
the student. The same criticism is registered 
concerning Part 3 of the test since the student 
is supposed to write the correct form of the 
infinitive in the space provided for that pur- 
pose and also for Part 4 since he is to answer 
English questions based on the reading of pas- 
sages briefly in English. 

An analysis of the reading passages used in 
Part 4 reveals that there are 34 words that would 
not be known to a first-year high school student. 
The first paragraph, the most difficult of the 
three contained in the test, averages one new 
word to 6 known ones. The minimum of one un- 
known word to from 50 to 70 known ones is a 
commonly accepted technique. Any reading pas- 
sage that is more heavily weighted with un- 
known words than that is considered unreadable. 
The research of numerous investigators estab- 
lishes the veracity of this statement; conse- 
quently, it is doubtful whether the reading sec- 
tion of this examination is a valid test for the 
instruction level stated above. 

These “placement tests” are a curious mix- 
ture of objective, passive and subjective, active 
techniques, The amount of time that one would 
have to spend in grading the subjective, active 
parts of the test could hardly be justified when 
one considers that there are several reputable 
standardized objective tests available that would 


be far more satisfactory and that would require 
only a fraction of the time for grading. 
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XFurness Test of bs CH ian asia in Span- 
ish. Grades 9-16; 1945; Forms A, B; $2.00 per 25; 10¢ 
per sample copy; (45) minutes; Edna Lue Furness; 
Banks Upshaw & Co. 

Frederick B. Agard, Associate Professor 
of Modern Languages, Cornell University, 
Ithaca, New York. This recently published 
test of aural skill in Spanish is, as far as one 
can judge, intended for first- or second-year 
students of the language. Available in two forms 
(A and B) seemingly intended to be equivalent 
in level of difficulty, the test consists of three 
parts: I, Vocabulary (10 minutes); II, Com- 
pletion (15 minutes); III, Identification (15 
minutes). The technique is that of multiple- 
choice response throughout the three parts, 
which may profitably be discussed separately. 

PART I, VOCABULARY. The spoken material 
consists of 50 isolated words. The examiner is 
directed to utter the stimulus twice in succes- 
sion. The student is to select his answer from 
five printed Spanish words (occasionally 
phrases), one of which is closely associated in 
meaning with the word pronounced. 

The purpose of this section is to measure skill 
in aurally apprehending single lexical units pre- 
sented in their conventional dictionary form 
(verbs, for instance, are uniformly given as 
infinitives). This objective seems open to ques- 
tion on the ground that such units do not con- 
stitute speech; in conversational situations lan- 
guage is not transmitted in this fragmentary 
form, as single words devoid of context. How- 
ever, admitting for the moment the desirability 
of aural vocabulary testing, one may legitimately 
inquire about the basis for selecting the words. 
All 50 are "basic" in that they fall within the 
first four groups (words 1 to 2,000) of the 
Keniston Standard List. Distribution according 
to frequency varies in the respective forms, how- 
ever. Form A contains 14 words from 1 to 500, 
17 from 5or to 1,000, 10 from 1,001 to 1,500, 
and 9 from 1,501 to 2,000. Form B, considerably 
easier from the point of view of frequency, has 
34 from 1 to 500, 15 from 501 to 1,000, I be- 
tween I,00I and 1,500, and none above 1,500. 
Within the battery of 250 answer choices (five 
per item), a good 8o per cent of the words are 
distributed evenly enough through the first four 
groups, with the addition of some Io per cent 
of derivatives (words 2,001 to 3,060) and an- 
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other 10 per cent (roughly) not found in the 
Standard List at all. Of these last, a majority 
are obvious cognates of English words; but it 
is difficult to justify from the student's point of 
view such ponderous items as auxiliar, inter- 
cesor, antepasado, or astro luminoso as correct 
answers (corresponding to the spoken items 
ayudar, abogado, abuelo, sol) or such compara- 
tive strangers as encarnar, jarabe, consuelo, 
guisar, agarrar even as distractor-items. The 
form ayanzar (Form A, Item 1r, Choice 5) is 
seemingly a misprint for avanzar, as is carcel 
(Form B, Item 1o, Choice 1) for cárcel. 

$ Apart from the few questionable word-selec- 
tions mentioned, this part has a serious flaw in 
construction. The make-up of a large number of 
the items is such that the correct answer can be 
spotted at a glance, simply because some or all 
of the distractors are partially homophonous 
with the spoken stimulus while the correct choice 
is entirely different in form and in some cases 
appears as an obvious paraphrase: e.g., spoken 
item ojo, answer-choices olor, odio, oreja, ola, 
visión; spoken item prometer, answer-choices 
dar mutua (!) palabra, pronunciar, proponer, 
pronto, estar. Tt seems vain to pretend that even 
an average student would not discover, after 
working through the first few items, that he 
could deduce the answers without even listening 
to the spoken stimuli. It may be that the author 
became aware of the structural weakness in this 
part of the test before building Form B, where 
in many items there is no giveaway phonetic 
similarity between distractors and stimulus and 
where correct answering depends, as it should, 
wholly on apprehension of the utterance. 

PART II, COMPLETION. In this section the 
examiner reads 25 incomplete sentences in Span- 
ish. After listening to a sentence twice, the 
student is to select one of five possible comple- 
tions printed in the folder. 

Correct answering of these completion items 
depends on two factors in addition to appre- 
hension of the utterance: (a) possession of the 
vocabulary used in both utterance and answer 
choices; (b) in many instances, a rudimentary 
knowledge of geography and cultural history, 
especially of the Western Hemisphere. As for 
the lexical factor, the utterances themselves 
range rather freely outside the first four Stand- 
ard List groups, particularly in the use of cog- 
nates such as eclesiástico, unidad monetaria, 
alfabeto, independencia, and numerous geo- 
graphical proper names. Reading a cognate is 


rarely a problem, but the student who knows 
well enough what a monetary unit is, may find 
his ear quite unresponsive to the sounds that 
make up unidad monetaria when he hears them 
for the first time. Nevertheless, it is fair to say 
that in substantially all these items apprehen- 
sion of the cognates is facilitated by their con- 
text; they are not "key words," as are cueca and 
jarabe in one item whose value depends wholly 
on the knowledge of at least one of these two 
unusual words. Also the derived word azuca- 
rera, essential to one item, seems of doubtful 
fairness; and among the answer choices, it is 
disconcerting to find such words as vocales, 
cas.'ela, férreo, licenciado, prelado, dramaturgo, 
valses. Furthermore, the. factual knowledge re- 
quired to answer some of the items, e.g., that 
Velázquez was a Spanish artist or that Brazil- 
ians alone in America speak Portuguese, invali- 
dates the section as an exact measure of aural 
apprehension. For if a student fails to answer, 
is it because his ear did not register the speech 
utterance (which is what we would want to 
know) or simply because he didn’t happen to 
know who Velázquez was? 

PART III, IDENTIFICATION. In this part the 
examiner reads 25 statements in Spanish, each 
defining some object, person, or action. After 
listening to a definition twice, the student is to 
identify what has been defined by selecting one 
of five possibilities. Example: spoken definition 
Producto útil que se encuentra en las minas; 
answer choices el máiz, la hierba, el guante, el 
carbón, la torre. This section is presumably in- 
tended to be harder than Part II, but the utter- 
ances are not perceptibly longer; and therefore 
one cannot see that it measures any different 
skill, or degree of skill, than the preceding sec- 
tion. Furthermore, the composition being essen- 
tially the same as that of the completion sec- 
tion, the same requirements are present in 
addition to mere apprehension of the statements 
uttered: vocabulary knowledge and, in some 
items, factual knowledge. The style of these 
utterances is ponderous: it is a dictionary style, 
quite dissimilar to the normal arrangement of 
spoken idiom. Words outside the Standard List 
1 to 2.000 set are more frequent here than in 
Part II; apart from numerous cognates we find 
encuadernades (misprint for encuadernados), 
platicar, predicador, limosna, puntero, frondoso 
(all however deducible from context or not 
essential to the item) and, as key words, de- 
porte, joya, uva, grosella, arbolado. Among the 
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answer choices are salsa, mestizo, codo, olfato, 
casimir, lienzo, volcar, mensajero, vara, rienda, 
reja. Certain of the items also depend, as in 
Part II, on knowledge of special facts. 
Adequate “reception” or “comprehension,” 
which constitutes one-half of the act of speech 
communication, is composed of at least three 
elements: (a) auditory apprehension of the 
spoken utterance and transmission of the acous- 
tic image to the brain for the purpose of physi- 
cal or linguistic response; (b) knowledge of the 
meaning to be attached to the acoustic image; 
(c) awareness of certain facts or assumptions 
bearing on the subject of discourse but not 
uttered in the actual speech situation, Success 
on Parts II and III of the Furness Test depends 
in varying degree on all three of these factors. 
"Teachers who expect their students' recognition 
vocabulary, both auditory and visual, to extend 
somewhat beyond the 2,000 words of the Stand- 
ard List and who likewise expect them to con- 
trol certain semispecialized facts as part of their 
Spanish-language training will find Form B of 
the Furness test usable. Many teachers, on the 
other hand, in order to evaluate precisely their 
students’ skill in using Spanish and to ascertain 
their individual weaknesses, will feel it desirable 
to isolate the factor of auditory apprehension. 
A test devised for this purpose must eliminate 
other variables (a) by using only words which 
the student can with fair certainty be assumed 
to know and (b) by setting no further obstacles 
to correct response, such as answer choices 
which draw on reading skill or vocabulary con- 
trol in the language or which require knowledge 
of particular facts. Carefully planned tests of 
this type already exist, such as the University 
of Chicago Language Investigation Tests of 
Aural Comprehension (available on two levels 
in French, German, and Russian as well as 
Spanish); the Spanish Aural Comprehension 
Test prepared by the Spanish staff of Stephens 
College, Columbia, Missouri ; or the Lundeberg- 
Tharp Audition Tests. An advantage of the 
Chicago and Stephens tests is that the spoken 
material is recorded on phonograph records, 
thus assuring provision of native speech at 
standardized speed. 


Walter V. Kaulfers, Associate Professor of 
Education, Stanford University, Stanford, 
California, This attractively printed, easily 
scorable, 40-minute, multiple-choice dictation 
test should appeal to teachers who want a 


measure of aural comprehension that is easy 
to administer and readily scorable from a key. 
Since only right answers are counted, the dan- 
ger of clerical error is reduced without injury to 
the value of the test in providing a percentile 
ranking of students. Users of the test, however, 
should realize that its scores are destined to be 
more than a measure of aural comprehension. 
Since the examinees are required to choose right 
Spanish answers (words or phrases) from 
among a group of five printed responses, the 
examination tests a considerable amount of 
what might more properly be termed audio- 
visual-association and phrase-reading ability in 
Spanish. Part I, Vocabulary, for example, re- 
quires the examinee to find among the words 
apoyer, atacar, auxiliar, ayer, ayanzar (?) the 
one that “most nearly corresponds in meaning 
to” the key ‘word ayudar (to be pronounced 
twice by the examiner). In Part II, Completion, 
the examinee is asked to complete statements 
like “Cervantes es famoso por ser . . .” by choos- 
ing the right answer from among (1) un cura 
italiano (2) un abogado francés (3) un autor 
espafiol (4) un maestro chileno (5) un doctor 
argentino. Exactly the same technique is used in 
Part III, Identification. 

Since this test, like the Lundeberg-Tharp 
Audition Test in Spanish, is a pioneer in the 
foreign-language field, the suggestions for im- 
provements in subsequent forms of that test 
apply with equal relevance to the present edition. 
Inasmuch as the English directions in the test 
show that it is primarily designed for English- 
speaking students of Spanish, it would seem 
that all the printed multiple-choice responses 
could well be English words of high frequency, 
rather than a mixture of Spanish words of both 
common and, too often, exceedingly rare fre- 
quency. 

If a teacher does a great deal of vocabulary 
work with synonyms in Spanish in elementary 
and intermediate as well as advanced classes, 
his first-year students should probably have no 
difficulty in associating the dictated key word 
ayudar with its nearest given equivalent auxiliar 
(Item 11, Part I, Form A). If, however, such 
work is reserved for second or third year courses, 
this item would merely disqualify students with- 
out giving them any opportunity to demon- 
strate their ability to understand words of much 
higher frequency than auxiliar that have formed 
the basis of most of their work in elementary 
and intermediate courses. In its present form, 
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Part II would seem to be much fairer to college 
students taught by the synonym method than to 
freshmen or sophomores in most of our high 
schools. 

Whether any aural comprehension test should 
include cultural information, no matter how 
simple, or how well "anybody of average intel- 
ligence ought to know it"—to quote a common 
remark—is still questionable; for, while it is 
unlikely that a student will attend a Spanish 
class for any length of time without hearing 
of Cervantes, it is not at all certain that he 
will necessarily remember the author of Don 
Quixote clearly at the time of the examina- 
tion, no matter how well he might be able 
to comprehend Spanish. If the use of English, 
instead of Spanish, for the printed multiple 
choices does not seem desirable, the use of 
techniques similar to those suggested for the 
Lundeberg-Tharp Audition Test would, per- 
haps, be even more satisfactory. In its present 
form, few high school teachers of Spanish will 
want to administer the test without first mak- 
ing a careful study of the individual items in 
a specimen set to see whether the students 
have been exposed to the vocabulary included 
among the acceptable choices. 

No information is supplied in the manual 
about the reliability of the test and the extent 
to which an effort was made to scale the indi- 


vidual items on the basis of their actual diffi- 
culty as shown by the number of examinees 
missing them during experimental tryouts. A 
tentative table of norms is given for students 
who have completed two or more semesters of 
Spanish in high school or college. Since no 
mention is made of the number of cases on 
which the norms are based, however, teachers 
will hardly be justified in drawing serious con- 
clusions from a comparison of the perform- 
ance of their own students with the averages 
reported in the Examiner's Manual. If the test, 
in its present form, is administered by a teacher 
to his own students, it can hardly afford more 
than a rough measure of how well they can 
understand him when he dictates items to be 
identified among a printed list consisting of 
both oft-used and rather-seldom-used words. 

How well an examinee can understand the 
oral Spanish most likely to be encountered in 
routine travel situations abroad, in telephone 
conversations dealing with the routine business 
of daily life, in radio news commentaries, or 
in formal lectures, would not be evident from 
the scores of either the present test, or of the 
Lundberg-Tharp Audition Test in Spanish, 
except by very indirect inferential guessing. In 
their present forms, both are primarily intra- 
mural scholastic examinations—for “marking 
purposes only.” 


INTELLIGENCE 


Reviews sy Anne Anastasi, James M. Anderson, Brent Baxter, Nancy Bayley, Marion A. 
Bills, Harold H. Bixler, E. J. G. Bradford, H. E. Brogden, Emily T. Burr, Cyril Burt, Psyche 
Cattell, Charles N. Cofer, John Cohen, W. D. Commins, Raleigh M. Drake, W. G. Emmett, 
George A. Ferguson, John C. Flanagan, Charles D. Flory, Henry E. Garrett, Florence L. 


Goodenough, J. P. Guilford, James R. Hobson, Car 
Kogan, Frederic Kuder, Charles R. Langmuir, Irving Lorge, Stanley S. Marzolf, L. S. 


Carl I. Hovland, Grace H. Kent, Kate Dn 
en- 


, Walter C. Shipley, Patrick Slater, Florence M. Teagarden, Robert L. Thorndike, Wil- 
fiam W. Turnbull, jo I Tyler, Ruth W. Washburn, Robert I. Watson, David Wechsler, 
Beth L. Wellman, F. L. Wells, Carroll A. Whitmer, D. A. Worcester. 
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X*Academic Aptitude Test: Non-Verbal Intelli- 
gence. Grades 7-16 and adults; 1944; 1 form; $2.50 
per 25; 10¢ per manual; so¢ per specimen set; 26(35) 
minutes; Andrew Kobal, J. Wayne Wrightstone, and 
Karl R. Kunze; Acorn Publishing Co. 


[215 ] 
XAcademic Aptitude Test: Verbal Intelligence. 
Grades 7-16 and adults; 1943-45; 1 form; $2.50 per 


25; I0£ per manual ; 50¢ per specimen set; 40(45) min- 
utes; Andrew Kobal, J. Wayne Wrightstone, and Karl 
R. Kunze; Acorn Publishing Co. 


Marion A. Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Connecti- 
cut. This test is made up of three time-limit 
subtests which are set up to measure separately 
(a) information, (b) mental alertness, (c) 
comprehension of relations. Its purpose is to 
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differentiate between those persons capable of 
continuing with academic work and those not 
capable. 

Norms for the various school grades have 
been carefully set up by adequate testing. 
Grades correspond closely with individual tests 
of mental alertness. Tests on a small group of 
employed persons show about the expected cor* 
respondence with position and employee rat- 
ings. Suggestions for interpretation are clearly 
given. 

The advantages of the test are that the sub- 
tests may be scored separately, and the test is 
difficult enough to differentiate at the higher 
levels of ability. A disadvantage of the test is 
that the individual timing of subsections makes 
it necessary for a person to be in attendance 
during the entire time of the test. 


[ 216 ] 
*Adaptability Test. Grades 9-12 and adults; 1942- 
43; preliminary manual; Forms A, B; $1.85 per 25; 
50 per specimen set, postpaid ; 15(20) minutes ; Joseph 
Tiffin and C. H. Lawshe; Science Research Associates. 
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Anne Anastasi, Associate Professor of Psy- 
chology, Fordham University, New York, New 
York. Designed with special reference to its 
use as an employment aid, this test attempts 
to furnish a quick estimate of "general mental 
ability" by means of a wide variety of items. 
It is self-administering and consists of 35 items 
in spiral omnibus arrangement. In form, the 
items are either of the multiple-choice variety, 
or they are questions which can be answered 
with a letter or number. In content, they are 
predominantly verbal and cover an exception- 
ally heterogeneous sampling of familiar types. 
The test is hand-scored from a key. Two 
equivalent forms, A and B, are available, Per- 
centile and standard score norms are given for 
four samplings (612 industrial female appli- 
cants, 200 Navy electrical trainees, 86 clerical 
employees, and 43 Purdue seniors) as well as 
for all four combined. The manual is in pre- 
liminary mimeographed form and contains a 
number of printing errors, including the omis- 


sion of words, numbers, and one entire graph! 
Lj 


The 70 items comprising Forms A and B 
were chosen from an original compilation of 
120, 60 of which were administered to Navy 
trainees and 60 to engineering students. Item 
analyses were made against total scores, only 
those items being kept which yielded a differ- 
ence of I c or more between the performance 
of the best quarter and the poorest quarter of 
the group. Reliability coefficients, computed on 
the same four groups employed for the norms 
(cf. above), range from .60 to .go by the split- 
half method and from .79 to .93 by the retest 
method. Validity was checked against super- 
visors' ratings for the clerical employees and 
against mean grades for the Navy trainees. In 
both groups, reliable differences in mean test 
scores were found between the contrasted 
criterion subgroups. 

The Adaptability Test illustrates the value 
of item analysis in yielding a short test, the re- 
liability and validity of which may be as high 
as, or even higher than, those of a longer test. 
Since, however, the strength of such a test de- 
rives so largely from the definitiveness of the 
original item analysis, it is regrettable that 
such a restricted sampling was employed for 
this purpose (i.e., Navy electrical trainees and 
engineering students, N not stated). The dis- 
criminative value of specific items may be quite 
different even for the remaining types of sub- 
jects included in the norms, not to mention 
other types of workers to whom the test may 
be applied. It would seem, furthermore, that a 
slightly greater number of items would have 
been desirable, especially in view of the ex- 
cessive heterogeneity of content. The test con- 
tains, for example, only one or two items of 
certain types. In discussing the interpretation 
of scores, the authors rightly stress the need 
for finding anew, within each industrial con- 
cern, the "effective ranges" of scores for spe- 
cific job types. They appear, however, to lay 
undue emphasis upon a hierarchy of jobs in 
terms of scores on this test and do not call 
sufficient attention to the importance of special 
aptitudes in the higher level jobs of a pre- 
dominantly nonverbal nature. For example, 
office clerks and skilled tradesmen are listed 
within the same score range without refer- 
ence to the fact that a low score on this par- 
ticular test is a more serious occupational 
handicap for the clerk than for the skilled 
tradesman. The chief advantages of this test 
are its brevity and good "face validity" in a 
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plant situation, many items being expressed in 
terms relevant to an industrial environment. 


Marion A. Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Connecti- 
cut. This 15-minute test consists of 35 items 
on vocabulary, arithmetic, and logic arranged 
in order of increasing difficulty. Experimenta- 
tion by the authors seems to have proved that 
the test will differentiate between clerical work- 
ers who should be employed for complicated 
work and those who should be employed for 
simple routine work only. The advantages of 
the test are that it is simple, short, and easily 
administered. The disadvantages are: (a) Sep- 
arate scores on vocabulary, arithmetic, and 
logic cannot be used. This is, of course, an 
inherent defect in all time-limit spiral tests. 
(b) Some of the questions seem to be of the 
puzzle type rather than straight arithmetic or 
reasoning and might therefore be resented. 


[ 217] 
American Council on Education Psychological 
Examination for College Freshmen. 1924-47; new 
form issued annually; IBM; separate answer shcets 
must be used; $2 per 25 tests; 50¢ per 25 machine- 
scorable answer sheets; 50¢ per specimen set; 38(65) 
minutes; L. L. Thurstone and Thelma Gwinn Thur- 
stone; Educational Testing Service. 
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ica, Washington, D. C. This is perhaps the 
test that one is likely to recommend to anyone 
who is looking for a “good” intelligence test 
to give to a group of college freshmen. It may 
be administered in an hour’s time and may be 
scored conveniently through the use of sten- 
cils or by machine. Another feature that is likely 
to appeal to the general user of tests is to be 
found in its wide use throughout the country. 
Although the data on the norms and the rela- 
tive ranking of different types of colleges are 
not available for each yearly edition until well 
into the school year, the authors always try 
beforehand to make the scores experimentally 
equivalent. The large list of colleges submit- 
ting test records to the authors for the pur- 
pose of setting up norms is made up, for the 
great part, of smaller institutions; and while 
the range of ability in these is considerable, it 
would be interesting to have many of the larger 
institutions also represented. Each test is pre- 
ceded by a practice exercise of the same kind. 
This is not only desirable on general grounds 
but is definitely reassuring to the giver of the 
test if he has doubts about the homogeneity of 
his group or about their being “test-wise.” The 
division of the score into a Q (quantity or 
number) partial score and an L (language or 
verbal) score, with appropriate norms, is a 
refinement that may be useful on occasion but 
needs considerable study before its full signi- 
ficance is apparent. 

The later yearly editions have improved in 
general over the early ones in a number of 
mechanical features as well as in the dropping 
of the artificial language test. It would seem, 
however, that the study of the individual items 
might be carried further. This might be in the 
direction of the “mental functions” that are 
supposedly tested by the items constituting 
each subtest. The psychologist would like the 
test material to be homogeneous in this respect 
and is not always satisfied with the exclusive 
use of a “factor analysis” approach and the 
disregarding of some kind of “qualitative” 
analysis. Thus, some items in the completion 
test of the present edition seem to plumb one’s 
familiarity with relatively uncommon words, 
as “gill” and “gobbler,” while other items 
seem aimed more at an understanding of 
the object whose name is sought. This is not 
the kind of thing that is seriously reflected in the 
objectively determined validity of a mental 
test, particularly if used for educational pur- 


poses; but because of the wide general ac- 
ceptability of the present test, it might become 
the basis for still further “psychological” im- 
provement. 


J. P. Guilford, Professor of Psychology, 
The University of Southern California, Los 
Angeles, California. This is a member of a 
series of examinations of like content that have 
been designed for measuring aptitude at the 
college level. It is evidently constructed with 
the usual high quality of workmanship ex- 
hibited by all forms of the series. 

Its predecessors have commonly shown 
somewhat lower validities for the prediction 
of over-all academic achievement than its chief 
rival, the Ohio State University Psychological 
Test. In this fact there is comparative strength 
as well as weakness. By incorporating a wider 
variety of functional content, the ACE test 
covers a somewhat more diverse list of abilities. 
This avoids the overweighting of one funda- 
mental resource (verbal ability) to the neglect 
of others that may be relatively more impor- 
tant in specialized curricula and courses, It is 
time that the specific aptitudes, in terms of 
distinct abilities, become much more generally 
recognized in various academic areas and re- 
ceive appropriate weights. Academic-aptitude 
tests should not be restricted to verbal ability, 
though this may be central, but should be ex- 
tended to cover all significant qualities. This 
goal requires optimal assessment of achieve- 
ment in each type of course, aimed at genuine 
objectives in those courses. Too often, final 
grades and achievement examinations reflect 
a preponderance of verbal variance when the 
objectives of training really do not. A vicious 
circle of circumstances has brought to the fore 
measurements of verbal talent in both aptitude 
and achievement tests. ` 

This line of thought leads directly to a de- 
mand for a more comprehensive battery of 
“primary-ability” tests. The authors of the ex- 
amination under discussion are known to be 
most sympathetic to this point of view. We are 
therefore led to the conclusion that the present 
form of the examination is some kind of a 
compromise with popular demand. To the re- 
viewer it is an unsatisfactory compromise, for 
the reasons that follow. 

The tests in the “linguistic” part of the ex- 
amination seem to be most heavily saturated 
with verbal-factor variance. The “quantitative” 
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part is a conglomerate factorially. Besides meas- 
uring numerical facility, this part also prob- 
ably measures three kinds of reasoning as well 
as other factors to a small degree. It probably 
would select students with good reasoning 
abilities, but validity studies might well fail to 
show this fact because of the manner in which 
“achievement is assessed. 

It should be unnecessary, in view of the last 
paragraph, to use six parts in order to derive 
two part scores. A vocabulary test would suf- 
fice for the linguistic or verbal score. The num- 
ber-series test could be dispensed with in the 
“quantitative” part, since it probably adds noth- 
ing unique in the way of factors. 

Each of the six parts is separately timed. It 
is the reviewer’s belief, without any definitive 
proof for it, that the factors measured, except 
for the numerical, should be assessed by means 
of power tests. 

The norms given for these examinations are 
very extensive, being obtained on a nation-wide 
scale and covering different types of colleges. 
The smaller institution is probably overweighted 
in this. 

The authors wisely warn against attempts to 
make mental-age and IQ equivalents out of 
the total scores. 


For reviews by Anne Anastasi, Jack W. 
Dunlap, David Segel, and Robert L. Thorn- 
dike of earlier forms, see 38:1037 and 40:1377. 
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American Council on Education Psychological 
Examination for High School Students. Grades 
9-12; 1933-47 ; new form issued annually ; IBM; sepa- 
rate answer sheets must be used; $2 per 25 tests; 50¢ 
per 25 machine-scorable answer sheets; 50¢ per speci- 
men set; 35(65) minutes; L. L. Thurstone and Thelma 
Gwinn Thurstone; Educational Testing Service. 
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Carl I. Hovland, Professor of Psychology 
and Chairman of the Department, Yale Uni- 
versity, New Haven, Connecticut. The Thur- 
stones have continued their important contribu- 
tion to scholastic aptitude testing at the high 
school level with the 1946 edition of the Ameri- 
can Council on Education Psychological Exami- 
nation for High School Students. 

In the current edition only one form has 
been prepared, the answer sheet for which can 
be either hand- or machine-scored. The two 
forms, one for machine-scoring and one for 
hand-scoring, used in previous years caused 
difficulties with respect to the comparability of 
norms for the two forms. For hand-scoring, a 
time-saving scoring board is described, manu- 
factured, and distributed by C. H. Stoelting. 

Four types of material are used: (a) same- 
opposite, (b) completion, (c) arithmetic rea- 
soning, and (d) number series. The scores for 
the first two can be combined to obtain an 
L-score, indicative of linguistic ability, and the 
latter two, to give a Q-score for quantitative 
ability. It is suggested that this breakdown into 
two factors may be useful in counseling. 

The yearly revision of the tests makes it dif- 
ficult to prepare a maximally useful accompany- 
ing manual. Reliabilities and norms cannot be 
provided at the time the manual is prepared. 
This may cause considerable inconvenience to 
small high schools using the tests. Preliminary 
norms are prepared as soon as cooperating 
schools send in their data, but heavy bias in 
selection operates as to which schools submit 
their data. In recent editions this bias was re- 
duced by getting a somewhat more complete 
sample of a smaller universe, specifically the 
state of Illinois. But even here the authors 
state: "There is also evidence that the schools 
reporting are not a random sampling of high 
schools so that the norms based on their re- 
ports would not be reliable." (Report of Norms 
on 1944 Psychological Examination for High 
School Students.) 

No data are available concerning the relia- 
bility of the current test. On this point we must 
rely on data from analogous tests prepared by 
the same authors and upon their general repu- 
tation as careful workers. One would estimate 
the reliability to be reasonably high, but it 


299 ] 


TESTS & REVIEWS: INTELLIGENCE—GROUP 


220 


would be most helpful if data on reliability 
were routinely supplied in some form by all 
constructors of tests, 

Validity is based on the a priori relevance 
of the material to scholastic aptitude and the 
similarity of the tests to others which have been 
validated in school situations. The tests have 
been used in various factor analyses aimed at 
determining primary mental abilities, and the 
subdivision of the content into linguistic and 
quantitative subtests is supported by such 
analyses. The authors are modest in stating the 
value of their tests: “While the scores do show 
roughly the mental alertness of the student, 
they should not be thought of as measuring 
mentality with high accuracy. The scores are 
roughly indicative of the level of mental alert- 
ness of the student, but they should not be 
taken so seriously as to exclude other evidences 
of intelligence and talent in individual cases.” 
(Manual of Instructions, 1946 Edition.) 


For reviews by V. A. C. Henmon and A. H. 
Turney of earlier forms, see 38:1038 and 
40: 1378. 
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*Army General Classification Test: First Civil- 
ian Edition. Grades 9-16 and adults; 1947 ; commonly 
called AGCT; IBM; Forms AH (hand scored), AM 
(machine scored); separate answer pads or answer 
sheets must be used; 45¢ per hand-scored test booklet; 
$1.65 per 25 hand-scored answer pads; 75¢ per speci- 
men set (Form AH); 35¢ per machine-scored test 
booklet ; $2.35 per 100 machine-scorable answer sheets ; 
$1 per set of machine-scoring keys; 40(55) minutes; 
Science Rescarch Associates. 
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formats are supplied: IBM machine scoring,- 
and a self-scoring form in a bound booklet of 
18 step-down sheets employing a replaceable 
pin-punch answer sheet similar to the familiar 
Kuder. Planned as a test of adult learning abil- 
ity, it uses three types of items, arranged in 
spiral-omnibus form. Verbal comprehension is 
measured by vocabulary, quantitative reasoning 
by arithmetic problems, and spatial thinking by 
block counting. No part scores are obtained. 
The reliability of the total score is about .95. 
Information is given concerning its correlations 
(.27 to .40) with success in several army train- 
ing courses. Interpretation is effected by con- 
verting raw scores to AGCT standard scores 
and percentiles. A chart gives the mean and 
range of scores for each of 125 civilian occu- 
pations. The occupational data may be misin- 
terpreted dangerously by poorly trained users, 
as no indication is given of the much higher 
predictive value of the test for some occupa- 
tions than for others. The manual’s statement 
that scores “will indicate aptitude for learning 
the majority of civilian occupations” seems un- 
justified in the light of known low validities. 
Let us hope that this test will escape many of 
the abuses that the Army Alpha suffered after 
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Army Group Examination Alpha and revisions. 
d) ARMY GROUP EXAMINATION ALPHA. Grades 5-16 and 
adults; released by the War Department for general 
use in 1910; Forms 5, 6, 7, 8, 9; $1.20 per 25, postpaid ; 
as¢ per specimen set, postpaid; 224 Gs) minutes ; 
manual revised in 1939 by H. E. Schrammel and E, R. 
Wood; prepared by the Psychology Committee of the 
National Research Council; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. 
b) BUREAU TEST VI, MENTAL ALERTNESS. Grades 9-16 
and adults; 1920-29; 15(20) minutes; out of print. 
C) REVISION OF ARMY ALPHA EXAMINATION. Grades 9- 
16 and adults ; 1925-35; Forms A, B ; $2.50 per 25, post- 
paid ; 35& per specimen set, postpaid ; 2124(35-40) min- 
utes; revision by Elsie O. Bregman; Psychological 
Corporation. 
d) SCOVILL CLASSIFICATION TEST. Adults; 1027; 2 parts 
(Part I is nonverbal, Part II is verbal) ; $3.75 per 25 
copies of Part I, postpaid; $2.70 per 25 copies of Part 
II. postpaid; 35¢ per specimen set of both parts, post- 
paid; (30) minutes for Part I; (20) minutes for Part 
II; prepared by the Scovill Manufacturing Co.; Psy- 
chological Corporation. 
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€) REVISED ALPHA EXAMINATION, FORMS 5 AND 7. 
Grades 9-12 and adults; 1932-33; Forms 5,75 $2.50 per 
25, postpaid ; 35¢ per specimen set, postpaid ; 212 (30) 
minutes; revision by F. L. Wells; Psychological Cor- 
poration, 

Í) REVISED ALPHA EXAMINATION FORM 6, SHORT FORM. 
Grades 9-16 and high-grade adults ; 1933; a short form 
of the experimental edition of Revised Alpha Exami- 
nation, Form 6, consisting of subtests 2, 4, 7, and 8; 
$1.50 per 25, postpaid; 35¢ per specimen set, postpaid; 
13%4(20) minutes; revision prepared under the super- 
vision of C. R. Atwell and F. L. Wells; Psychological 
Corporation. 

9) ARMY ALPHA EXAMINATION: FIRST NEBRASKA EDI- 
TION. Grades 9-16 and adults; 1937; revision of Army 
Group Examination Alpha ; $2.25 per 25; 40 per speci- 
men set; 2234 Gp minutes; revision prepared by J. P. 
Guilford; Sheridan Supply Co. 

h) ARMY GROUP EXAMINATION ALPHA: SCHRAMMEL- 
BRANNAN REVISION. Grades 7-16 and adults; 1936-38; 
Forms A, B, C; $1.30 per 25, postpaid; 25¢ per speci- 
men set, postpaid; 40(45) minutes; revision by H. E. 
Schrammel and Christine V. Brannan; Bureau of Edu- 
cational Measurements, Kansas State Teachers Col- 
lege of Emporia. 

i) MODIFIED ALPHA EXAMINATION FORM 9. Grades 7-12 
and adults ; 1941 ; separate answer sheets must be used ; 
$4.00 per 25, postpaid ; $1.25 per 50 answer sheets, post- 
paid; 35¢ per specimen set, postpaid ; 24(35) minutes; 
revision by F. L. Wells; Psychological Corporation. 
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*[Benge Employment Tests.] Adults; 1942; 1 
form; 2 levels; $3 per introductory set including 10 
tests, postpaid; sé per extra test, postpaid; introduc- 
tory sets may be ordered on approval; Eugene J. 
Benge; Management Service Co. 

a) EMPLOYMENT TEST, FORM B. Adults with an cighth- 
grade education or less; 12(20) minutes. 

b) BASIC EMPLOYMENT TEST. Ádults with more than 
an eighth-grade education; 22(30) minutes. 


Brent Baxter, Director of Personnel Re- 
search, Chesapeake and Ohio Railway Com- 
pany, Cleveland, Ohio. This pair of tests was 
designed for industrial and business use in 
measuring intelligence; they are the outgrowth 
of service to industrial clients. 

The test items resemble those of most group 
verbal intelligence tests. The author reports 


that “The Basic Test correlates with Otis 
Advanced between .80 to .85.” The items are 
classified as memory, numbers, vocabulary, 
spatial perception, and reasoning, with a pro- 
file chart being made for these aspects. No 
data are available to the reviewer on the inter- 
relationship of these parts or on their relia- 
bility or validity, For Form B there are only 
ten possible items on each factor ; this suggests 
that the part scores are not likely to be too 
reliable. The different kinds of items are ar- 
ranged in spiral order, which makes the scoring 
more complicated and subject to error. 

The test manual provides no data for the 
evaluation of the tests. A personal letter from 
the test author suggests that a great deal of 
data has been collected for many years but that 
it has never been assembled. But, “we have 
made no reliability studies of the Basic test 
whatsoever.” It would appear that the tests 
can be sold in industry without offering any 
scientific statistical data, a reflection upon in- 
dustry and test salesmen rather than upon the 
tests. * 

It is understood that these tests are usually 
sold as part of a management service, and it is 
therefore assumed that normally more help on 
interpretation and use is given to the test user 
than is offered in the manual. The manual 
merely suggests that the tests be tried out lo- 
cally to see if they discriminate between better 
and poorer workers. Then local “job stand- 
ards" for rejecting potential failures should be 
set up. No help is given for establishing these 
job standards. 

Decile norms are given for "usual appli- 
cants." Such a generality can certainly lead to 
wide errors, especially since the test author 
privately writes that for the Basic Employment 
Test the norms are for a sampling of "clerical, 
supervisory, and executive positions in indus- 
try." No provision is made for the wide vari- 
ety of skilled and semiskilled positions. The 
manual also suggests a change in time limits 
without changing the norms. 

Without data an adequate over-all evalua- 
tion of the tests cannot be made. On the sur- 
face they appear similar to other intelligence 
tests in industrial use. Their only unique fea- 
ture is the attempted measurement of five sepa- 
rate factors. While these factors are probably 
properly evaluated, is worth considering in in- 
not reliably measured, this kind of approach, 


dustry. 
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~ Marion A. Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Conneci- 
cut, BASIC EMPLOYMENT TEST. This test con- 
Sists of 100 items of five types—information, 
arithmetic, vocabulary, spatial perception, and 
reasoning—arranged in cyclic order. The di- 
rections are clear except that no examples are 
given of the spatial perception types of item. 
The author assumes that the different types of 
items may be scored separately, thereby giving 
a profile of the individual's ability. This prac- 
tice is questionable since the testee may have 
devoted much more time to some types of 
items than to others. Some of the items are of 
the puzzle type which might well increase this 
hazard. No validating data are given, and the 
author sounds warning that each company 
should give the test to workers of proved abil- 
ity and set up its own standards, . 

"The advantages of the test are that it is a 
short, self-administering test which avoids be- 
ing resented by an applicant as an intelligence 
test because of its name. For jobs where spa- 
tial.relations are essential, the test would have 
an advantage over tests not including this type 
of item. The test has the disadvantage of all 
time-limit spiral tests in that individual types 
of ability cannot be accurately scored sepa- 
rately. 

EMPLOYMENT TEST, FORM B. This test con- 
sists of 50 items of the same five types but 
much less difficult. No questions of the puzzle 
type are included. The test is a simple, self- 
administering, short, very easy mental alertness 
test which is not immediately recognized as an 
intelligence test. Disadvantages of the test are 
that it should not be scored for the various 
types of items, since the time devoted to any 
one type is not known and that it is too easy 
to discriminate between people of fair to good 
mental alertness, regardless of edueation, 
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*California Capacity Questionnaire. Grades 7-16 
and adults; 1941-42; IBM; an abbreviated adaptation 
of California Test of Mental Maturity; Forms A, B; 
Elizabeth T. Sullivan, Willis W. Clark, and Ernest W. 
Tiegs; California Test Bureau, 
a) HAND SCORING EDITION. 1941; $1.20 per 25; 35¢ per 
specimen set ; 30(40) minutes. 

MACHINE SCORING EDITION, 1942; 6¢ per test; 2¢ per 
machine-scorable answer sheet ; 35(45) minutes. 


Anne Anastasi, Associate Professor of Psy- 
chology, Fordham University, New York, New 
York. This 30-minute, self-administering test 
yields verbal, nonverbal, and total IQ's for 


ages 11 up. Its'two equivalent forms are avail- 
able in hand-scoring as well as IBM machine- 
scoring editions. The 132 items comprising 
each form are grouped into seven subtests ad- 
ministered with a single over-all time limit of 
30 minutes. Some effort has been made to 
equalize the distribution of content throughout 
the test, the different types of items (with the 
exception of right-left orientation) recurring 
at least twice in successive subtests with in- 
creasing difficulty level. It is doubtful, however, 
whether this expedient has allowed adequately 
for the use of a single over-all time limit. A 
subject who is poor in right-left orientation, 
for example, may spend so much time on the 
more difficult items of Test 1 as to be seri- 
ously handicapped on the other types of items 
—or one who experiences difficulty with the 
manipulation of areas in Test 3 may never 
reach some of the later vocabulary items in 
which he might have excelled. The effect of 
the present arrangement of items is therefore 
to minimize any existing difference between 
the subject's ability in the verbal and nonverbal 
parts, which are scored separately. In refer- 
ence to mechanical layout, it might be noted 
that some of the pictures used in the spatial 
items are very small and indistinct, a few be- 
ing difficult to recognize. 

MA equivalents for verbal, nonverbal, and 
total raw scores are furnished, but no infor- 
mation is given about the size or composition 
of the groups from which these mental age 
norms were derived. The use of the MA and 
the IQ in a test which is not an age scale is 
open to criticism, There are certain well- 
known statistical conditions which an intelli- 
gence test must meet in order to yield a con- 
stant IO, conditions which most group scales 
fail to meet. The authors give no indication 
that the present test meets these conditions. 
Moreover, since this test is oriented in large 
part toward adult testing, the IQ is especially 
unsuitable. The table of norms gives MA's 
ranging up to 32 years! It is also rather star- 
tling to find percentile equivalents for the IQ's. 
One wonders why such percentiles are not 
computed directly from the raw scores. 

Spearman-Brown reliability coefficients of 
-925 for the total test, .900 for verbal sub- 
scores, and .703 for nonverbal subscores were 
found on “320 persons” whose ages and other 
characteristics are not stated. The correlation 
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between verbal arid nonverbal subscores (pre- 
sumably on the same group?) is .45. The items 
of the present test were chosen from the much 
longer and more comprehensive California Test 
of Mental Maturity, Intermediate and Advanced 
Series. In discussing validity, the authors re- 
port a correlation of .835 between the present 
test and the California Short-Form Test of 
Mental Maturity (N — 121). The present test 
correlates .821 (N = 129) with the Kuhlmann- 
Anderson Intelligence Test. These correlations 
are difficult to evaluate since the characteristics 
of the subjects are unknown. Among children, 
for example, the large general factor usually 
found at the lower age levels may result in 
high correlations between tests which among 
older groups yield low, negligible correlations. 
For the proper interpretation of the given cor- 
relations, the degree of homogeneity of the 
group should also be known. 

No further data on validity are discussed in 
the manual. General references are made to the 
"exacting tryout" of items in all parts of the 
country as well as to the sampling of aptitudes 
which were identified through factor analysis. 
None of these data, however, are presented 
either in the present manual or in those of the 
earlier forms. Nor are specific references to 
the literature cited. The general vagueness and 
ambiguity of sources is also illustrated by a 
table (p. 6) showing the expected percentile 
ranks in such basic skills as reading compre- 
hension and arithmetic ftndamentals in rela- 
tion to median IQ for the class or school. In 
giving the derivation of this table, the authors 
state only that it is "based upon over 100 
school surveys and 48,000 individual measure- 
ments." In the absence of any statement to the 
contrary, the reader may well be left with the 
impression that these data were obtained with 
the present test. The identical table, however, 
is printed in the manual of the earlier Califor- 
nia Tests of Mental Maturity with the identical 
explanation. Finally, the authors' effort to sam- 
ple the various mental "factors" uniformly may 
have resulted in an instrument of lower valid- 
ity than the usual predominantly verbal intel- 
ligence test. There is a growing realization 
that verbal ability plays a major part in many 
of the practical achievements which intelli- 
gence tests try to predict. For maximum valid- 
ity, therefore, the tests themselves may have 
to be overloaded with verbal ability. 


Emily T. Burr, Director, Guidance Bureau, 
Inc., New York, New York. This test, devised 
in response to the need for a short, easily admin- 
istered, and dependable measure of mental abil- 
ity, is a welcome addition to the testing field. It 
takes but 30 minutes and is self-administering, 
thus satisfying the first two requirements, An- 
other of its distinct advantages is the fact that it 
is possible to derive separate mental ages in the 
area of language and nonlanguage ability. This 
is of significant value when a counselor is seek- 
ing to ascertain an individual’s assets so that 
suitable educational and vocational advice may 
be offered. 

The questionnaire is composed of tests to 
determine perceptual ability, memory, spatial 
orientation, logical reasoning and inference, 
mathematical reasoning, and practical ideas. It 
thus reveals a wide sampling of abilities that 
are basic to the understanding of relationships 
and situations. 

Besides statements regarding the validation 
of the tests, the manual includes interesting and 
helpful lists of jobs that may be performed by 
persons of very superior, superior, high aver- 
age, low average, and inferior ability. These 
lists give the counselor pertinent and most 
valuable information that always proves en- 
lightening in a practical discussion with the 
applicant who is trying to decide for what 
occupation he should aim. This factual mate- 
rial, however, must be used wisely and with 
discrimination. Test results alone cannot solve 
a guidance problem. Physique, health, age, 
training, experience, appearance, and interest 
are among the essential characteristics to be 
checked. 

As the demand for scientific aid in the selec- 
tion of employees in industry increases, such 
tests as these will become more and more valu- 
able. For this highly specialized purpose, it is 
desirable that critical scores be determined by 
means of careful, thorough research. We shall 
profit by knowing the minimum score which 
will enable a man to make good on a particular 
aspect of a certain occupation. As data regard- 
ing success or failure on a job is tabulated and 
checked with the results of tests administered 
to that individual previously, we shall have at 
our disposal a large amount of fundamental 
knowledge about tests that will greatly facili- 
tate our guidance service. 

This questionnaire, standardized after years 
of careful and extensive research, is a fitting 
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tool to be used for such a survey. Large num- 
bers of persons from all social and economic 
levels, living in various sections of the United 
States have taken part in its formation, so that 
it appears to be based on the responses of 
people throughout the country as a whole. 

Credit for this excellent piece of practical 
research must be given to Dr. Ernest W. Tiegs, 
formerly Dean of University College and Pro- 
fessor.of Education at the University of South- 
ern California; to Dr. Elizabeth T. Sullivan, 
Senior Educational Psychologist of the Los 
Angeles Schools; and to Willis W. Clark, Re- 
search and Guidance Consultant, Los Angeles. 
These three are among the most experienced 
and best qualifed psychologists who could 
haye undertaken the task. One cannot but have 
faith in the effectiveness and validity of their 
results. 
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California Test of Mental Maturity. Grades kgn-1, 
1-3, 4-8, 7-10 and adults, 9-adults; 1936-46; 1 form; 
35¢ per specimen set of any one edition at any one 
level ; 3¢ per answer sheet for the highest three levels; 
Elizabeth T. Sullivan, Willis W. Clark, and Ernest 
W. Tiegs; California Test Bureau. 

a) CALIFORNIA TEST OF MENTAL MATURITY: 1946 REVI- 
SION. 1936-46; $2 per 25; (90) minutes. 

b) CALIFORNIA SHORT-FORM TEST OF MENTAL MATURITY. 
1938-42 ; $1.20 per 25; 6¢ per copy of machine-scoring 
edition for the highest three levels; (45) minutes. 

C) CALIFORNIA INTELLIGENCE TEST: LANGUAGE SECTION. 
1942-43; same as language section in California Test 
of Mental Maturity ; 1943; $1.20 per 25; 6¢ per copy of 
machine-scoring edition for the highest three levels; 
(45-50) minutes. 

d) CALIFORNIA INTELLIGENCE TEST: NON-LANGUAGE 
SECTION. 1942-43 ; same as nonlanguage section of Cal- 
ifornia Test of Mental Maturity; $1.50 per 25; 6¢ per 
copy of machine-scoring edition for the highest three 
levels; (45-50) minutes. 
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A; the Stanford-Binet Intelligence Scale, Form-L; and the Co 
fornia Test of Mental Maturity, Elementary S-Form. Unpu 
lished master’s thesis, University of Southern California, 19 

14. Botton, Frovp B. "Value of Several Intelligence Tests 
for Predicting Scholastic Achievement.” J Ed Res 4151338 O 
Eo Ed bees “Scientific Selection of Sales Engi- 
neers.” Personnel 24:24-34 J1 47. * 

Henry E. Garrett, Professor of Psychology, 
Columbia University, New York, New York. 
These tests consist of five batteries designed to 
cover the range of mental ability from kinder- 
garten to college. Each battery contains sixteen 
subtests described as measuring memory, spa- 
cial relations, reasoning, and vocabulary, as 
well as tests of visual and auditory acuity and 
of motor coordination. Age norms are sup- 
plied from which MA’s and IQ's may be de- 
termined for the language and nonlanguage 
tests separately and for the test battery as a 
whole. A diagnostic profile permits one to ap- 
praise the relative strengths and weaknesses of 
the examinee in the various subtest groups. 

Several comments pro and con may be of- 
fered with regard to this test battery: (a) In- 
formation concerning the test is quite com- 
plete; directions for administering and scoring, 
data on reliability, age norms, relations to tests 
of school progress, and the like are adequate. 
(b) The pictorial tests are clear and well de- 
signed. (c) The tests of visual and auditory 
acuity and of motor coordination are undoubt- 
edly useful additions to the examination. 

Criticisms of the test may be summarized as 
follows: (a) The authors offer no evidence to 
show that their test batteries are so constructed 
as to fulfill the conditions of IQ constancy. 
The IQ, thought of as a developmental ratio, 
has little or no meaning when used with adults 
and is always suspect when calculated from 
group tests, It would have been far better if 
some other type of score had been employed. 
(b) No evidence is offered to show that the 
particular “factors” ostensibly measured by 
the subtests are independent or quasi-independ- 
ent variables. If they are, the total score (IQ), 
of course, has no meaning. Use of the term 
"mental factor" to describe a test group is in a 
sense a misnomer as the term "factor" has 
come to be identified with an ability-cluster lo- 
cated through factor analysis. (c) The test 
user must remember that the labeling of groups 
of tests as measures of logical reasoning, spa- 
cial relations, and the like is very often simply 
a common sense divination as to probable func- 
tion. Such classifications may be practically 
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useful but other groupings might be proposed 
with as much justification. The authors should 
provide more evidence as to the validity of 
their categories than a few references to factor 
studies. (d) The authors state that it is their 
desire to break down the IQ into its compo- 
nent parts. This is a laudable ambition. But it 
is doubtful whether two tests of memory are 
sufficient to give a valid measure of memory 
and whether there are enough subtests in the 
other groupings to provide an adequate basis 
for differential judgment. The score derived 
from the test battery as a whole is probably 
the most valid measure of mental level which 
the tests yield. 


Am J Psychiatry 97:993 Ja '41. W. Line. 
[Review of California Short-Form Test of 
Mental Maturity.] * The standardization ap- 
pears to be indicative rather than final, and the 
claims for the tests are based partly upon fac- 
tor analysis. The subdivision into Language 
and Non-Language categories would seem to 
be most pertinent at the lower level, and should 
everywhere be used with caution, The manuals 
give clear directions, and reflect the authors' 
point of view on the nature of intellectual de- 
velopment. This point of view, called a mul- 
tiple-factor theory of intelligence, is not suffi- 
ciently elaborated to be harmful. The tests 
themselves will probably prove to be quite use- 
ful, particularly at the intermediate level, and 
under circumstances that normally give reliable 
results. 

J Consult Psychol 11:156 My-Je '47. The 
chief features of this test are its analysis into 
language and nonlanguage abilities, and into 
five factors (memory, spatial relationships, 
logical reasoning, numerical reasoning, and 
vocabulary), on all of which useful scores can 
be obtained. * A pretest detects children whose 
visual and auditory acuity, and motor coordi- 
nation, might be so handicapped as to influence 
the results. * Validity is chiefly inferred, but 
a correlation of .88 with the Stanford-Binet 
Test is stated, for an unspecified level and 
range. * The large amount of nonverbal mate- 
tial, even at the higher levels, is a distinctive 
and commendable feature of this test. 


For reviews by Raymond B. Cattell, W. D. 
Commins, F. Kuhlmann, Rudolf Pintner, and 
Arthur E. Traxler, and excerpts from reviews 
by David Kopel and one other, see 38:1042 and 


40:1384. 
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Carnegie Mental Ability Tests: Clapp-Young 
Self-Marking Tests, Grades 9-16; 1932; Form A; 
separate answer booklets must be used; $2.75 per 25 
sets of test and answer booklets; 9o per 25 answer 
booklets; 12¢ per specimen set, postpaid ; 65(90) min- 
utes; Glen U, Cleeton; Houghton Mifflin Co. 
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W. D. Commins, Associate Professor of 
Psychology, The Catholic University of Amer- 
ica, Washington, D. C. Although the manual 
for this test does not give all the information 
desirable for its evaluation, such as the char- 
acter of and number in the groups employed in 
its standardization, the reported coefficients of 
validity (based on scholastic records in college) 
and of reliability are high, the former ranging 
from .50 to .70 and the latter being .95. It 
would appear, however, from the nature of the 
test that it is heavily loaded with science and 
mathematics items and would perhaps find its 
chief use in predicting scholarship in engineer- 
ing schools. 

The time required for administration prob- 
ably runs close to two hours, which would 
probably make it less available for general col- 
lege use. It enjoys, however, the Clapp-Young 
self-marking feature, making for ease of 
scoring. 

Although each item was checked against 
scholarship records of students before being 
chosen for the test, it does not seem that the 
items have been sufficiently well scaled. Certain 
items in the latter part of a subtest are not ob- 
viously of greater difficulty than those appear- 
ing in the earlier part. The information items 
seem at times to be largely “verbal,” such as 
“the toad is a mammal,” or “the larynx func- 
tions in speech," and would apparently need 
reworking. A fresh revision of the test would 
improve it considerably. 

Separate scores are obtainable on the Word 
Relations Test and on the Number Relations 
Tests, a feature of definite utility at times. The 
inclusion of two tests of High School Infor- 
mation probably increases the accuracy of pre- 
dicting college grades when the academic back- 
ground of the students is similar, but would 
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be of more doubtful value when a large variety 
of high schools is represented. 


Robert L. Thorndike, Associate Professor 
of Education, Teachers C. ollege, Columbia Uni- 
versity, New York, New York. This test was 
developed for the specific purpose of predicting 
academic achievement. The purpose should 
perhaps be defined even more narrowly than 
that as prediction of academic success in the 
freshman year of a pre-engineering curriculum. 
It is reported that each test item was retained 
only if it individually yielded some discrimina- 
tion between individuals at different levels of 
academic achievement as defined above. The 
test has, therefore, primarily a pragmatic rather 
than a theoretical basis. This pragmatic inter- 
est has led to the combining of subtests of the 
more general “aptitude” type with tests cover- 
ing information specifically taught in school 
courses, It is the contention of the author of 
the test that a combination of the two types of 
material permits a better prediction of aca- 
demic achievement than either taken alone, 
This may be true,* but one may still question 
the desirability of combining into a single test 
both types of items. However, the author does 
provide separate norms for the verbal, num- 
ber, and information sections, so that these 
parts may be treated separately, The eight sub- 
tests may be characterized as follows: (1) dis- 
arranged sentences, (2) verbal analogies, (3) 
mixed synonyms and antonyms, (4) number 
series, (5) numerical computations, (6) arith- 
metic reasoning, (7) information, academic in 
character (multiple choice), (8) information, 
academic in character (true-false). 

The test appears to place quite a premium 
upon speed. This may result in a more valid 
test of academic promise within a given time 
limit, though no evidence is presented to sup- 
port this. From the theoretical point of view, it 
means that this test, like many others, is in un- 
known mixture a measure of speed and level 
of mental functioning. The speed element has 
the one specific effect of making the reported 
odd-even reliability coefficients (for an unspeci- 
fied type of group) spuriously high. 

The "Manual" for this test consists of a 


single sheet printed on both sides. This sheet 
contains instructions for administering and 
timing the nine subtests, all material on norms, 
all material on statistical analyses of reliability 
and validity, together with descriptive and pro- 
motional material on the construction of the 
tests. The critic finds this rather meager, par- 
ticularly with regard to the rationale, develop- 
ment, and statistical constants of the test. 

The form of the test appears satisfactory, 
and use of the Clapp-Young type of answer 
sheet permits of efficient scoring. 
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*Chicago Tests of Primary Mental Abilities. 
Ages II-17; 1941-47; revision of Tests for Primary 
Mental Abilities; L. L. Thurstone and Thelma Gwinn 
Thurstone. 
@) SIX BOOKLETS EDITION. 1941; IBM; $6 per set of 
6 subtests; $1 per set of memory cards; $2 per set of 
scoring stencils; $1 per specimen set; $1.25 per 25 of 
any one of the following subtests: Number, Verbal 
Meaning, Space, Word Fluency, and Reasoning; 65¢ 
per 25 of the subtest Memory; (240) minutes; Science 
Research Associates. 
b) SINGLE BOOKLET EDITION. 1943; $3.75 per 25; $1 per 
set of memory cards; 75¢ per set of scoring stencils ; 
75¢ per 14 profile cards; $2.75 per specimen set; (120) 
minutes ; Science Research Associates. 
C) SRA PRIMARY MENTAL ABILITIES. 1947-48; a short- 
ened version of the Single Booklet Edition; IBM; 
Forms AH (hand scored), AM (machine scored) ; 
separate answer pads or answer sheets must be used ; 
43€ per hand-scored test booklet with answer pad; 
$1.65 per 25 hand-scored answer pads; 35¢ per machine- 
scored test booklet; $2.35 per 100 machine-scorable 
answer sheets ; $2 per set of machine-scoring keys; 45¢ 
per 25 profile sheets ; 75¢ per specimen set; 25(45) min- 
utes; Science Research Associates. 
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Cyril Burt, Professor of Psychology, Uni- 
versity College, London, England. [Review of 
the 1941 and 1943 editions.] Judged from the 
practical standpoint, the construction and ar- 
rangement of these tests seem admirable. The 
test material has been most carefully selected 
and prepared; and the explanations given to 
examiners regarding administration, scoring, 
and interpretation are simple, lucid, and easy 
to apply. 

The theoretical basis for the tests, however, 
will naturally evoke criticism from those who 
do not entirely accept Professor Thurstone's 
view on mental factors. Attempts to devise 
tests for intellectual abilities have been based 
on three alternative hypotheses: (a) that the 
measurement of "general" intelligence is of 
supreme importance (Spearman, Binet, most 
British group tests); (b) that the use of “a 
single index" for "general" intelligence does 
not "differentiate kinds of intelligence" and 
should therefore be replaced by tests measur- 
ing a number of primary abilities (Thur- 
stone) ; (c) that both the "general factor" of 
intelligence and specialized “group factors,” 


225 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 308 


like verbal, numerical, manual abilities, etc., 
should be measured (Burt). 

The object of the Chicago tests is to assess 
“independent factors of the mind.” Six factors 
are provisionally chosen as most readily iden- 
tifiable and measurable: (a) number, (b) ver- 
bal meaning, (c) word fluency, (d) space, 
(e) inductive reasoning, (f) rote memory. An 
endeavour has been made to devise tests "in 
which there is a heavy saturation of a primary 
factor and in which other factors are mini- 
mized." The marks obtained by each examinee 
are to be plotted in the form of an individual 
"profile," which, it is suggested, will be of 
value for educational and vocational guidance. 
The ultimate aim thus resembles that set out 
in this reviewer's 1917 Report to the London 
Education Authority on “The Distribution and 
Relations of Mental Abilities,’ where such 
special abilities as the arithmetical, manual or 
mechanical, and verbal (of two types) were 
reported and the pupil’s varying levels in a 
battery of standardized tests was exhibited by 
means of individual “psychographs.” 

CRITICISMS. (a) Omission of General Fac- 
tor. 'To most British psychologists the omission 
of a separate measurement for "intelligence" 

(i.e. innate, general, cognitive ability) will seem 
a serious defect, since in the past our assess- 
ments of pupils for backward classes, central 
schools, secondary schools, etc., have been 
based chiefly on written tests for "intelligence" 
(supplemented by estimates of special abilities 
and disabilities and by individual testing where 
necessary). In the Chicago Tests the presence 
of a general factor seems indicated by the posi- 
tive intercorrelations between the primary 
abilities (.15 to .55); and its similarity to 
"intelligence" is attested by the relative mag- 
nitudes of the factor-loadings for the several 
tests (highest for reasoning, decidedly high for 
the verbal abilities—possibly because there is 
so large a verbal element in most of the tests 
—low for memory and the space factor). It 
probably includes a marked element of speed 
and may underrate those whose capabilities are 
exhibited more freely in practical or manual 
problems. The authors recognize the presence 
of this "second order general factor" and its 
"psychological interest"; and presumably a 
measure of it, if required, could be obtained 
by taking a simple (or better a weighted) total 
of the six scores. 


In this country we find that, with pupils of 


II to 13, the general factor of "intelligence" 
will account for far more of the total variance 
than the specialized factors and that (save for 
exceptional cases) estimates of the latter have 
a low reliability at these earlier years and do 
not yield very safe indications of the way such 
special aptitudes or interests may develop at 
or after puberty. This seems to be in agree- 
ment with the experience of the authors, who 
note that at the ages concerned (11 to 17) the 
correlations "are higher than the correlations 
between primary factors for adults" and con- 
sider it "now an interesting question to deter- 
mine whether such correlations with still 
younger children will reveal, perhaps even 
more strongly, a second-order general factor." 
British psychologists would reply that the evi- 
dence available has already confirmed this an- 
ticipation, 

(b) Assessment of Special Abilities. The 
attempt to measure special abilities, however, 
is of great interest and should be particularly 
instructive to British users in view of the 
present plans (enjoined by the recent Educa- 
tion Act) to allocate pupils, at the age of 11 
plus, to different types of schools according to 
their special aptitudes. There can be little doubt 
that the six abilities selected by the Chicago 
Tests are of great importance both for school 
success and for vocational guidance; and, to 
judge by our own limited experience, this set 
of tests appears to assess them (at any rate with 
older pupils) far more effectively than any other 
existing set of group tests. 

Nevertheless, I believe that the assessment 
of special abilities or aptitudes is far more dif- 
ficult than the assessment of general intelli- 
gence. The following seem to be some of the 
more obvious difficulties and drawbacks. (7) 
Even with the fuller version, the “composite 
score” for each primary ability has fairly high 
saturations (often as much as .57 or .58) for 
other abilities, as well as for the particular 
ability which it is intended to measure. Hence, 
the tests are not sufficiently “pure” to dispense 
with the need for regression equations, if we 
are to obtain an adequate appraisal of the sepa- 
rate factors. (ii) These saturations or “validi- 
ties” supply only a theoretical validation, based 
on a hypothetical interpretation of the factors 
involved: they do not give correlations with 
independent criteria, However, it is stated that 
“teachers, etc., have expressed their satisfac- 
tion with the profile plotted for each child”; 
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and confirmatory evidence is found in the fact 
that "students in linguistic subjects excel in V 
and W; science students typically excel in the 
space factor, and have high or average N ; and 
it is expected that accountants will have high 
scores in N,” (iit) Few school psychologists 
would, I fancy, be satisfied with an assessment 
of general intelligence reached after a com- 
posite test lasting for much less than an hour; 
can we therefore be satisfied with an assess- 
ment of these more elusive special aptitudes, 
which has been based on a couple of tests each 
lasting about five minutes only? (iv) The only 
relevant score for pupils to be selected for 
technical or trade schools would be that given 
by the space factor; no attempt is made to as- 
sess manual or mechanical aptitudes, so far as 
they involve motor or kinaesthetic components. 

However, no tests can pretend to be either 
perfect or final. The authors emphasize that 
their tests are necessarily provisional and will 
be enlarged and improved in the light of fur- 
ther experience and research. There can, I 
think, be little question that, in the hands of 
suitably trained teachers or psychologists, they 
will yield helpful information that could not 
be obtained so rapidly by any other means; and 
those who consider them inadequate by them- 
selves will still find them suggestive as a pre- 
liminary to a more intensive study of selected 
individuals discovered by their means. Above 
all, they should stimulate the critic to produce 
something better if he can, and will provide 
him with a model of conscientious and sys- 
tematic research and of lucid and frank pres- 
entation, 


James R. Hobson, Director of Child Place- 
ment, Public Schools, Brookline, Massachusetts. 
[Review of the 1943 edition.] This series of 
tests does in a measure what every teacher 
charged with the guidance of adolescent youth 
has long felt needed to be done, namely, pro- 
vides measures of differentiated abilities which 
have educational importance and vocational 
implications. 

The six primary factors measured by this 
battery may be identified as: N, Numerical 
Facility; V, Verbal Comprehension; S, Spacial 
Orientation; W, Word Fluency; R, Inductive 
Reasoning ; and M, Visual Rote Memory. 

The manual of instructions accompanying 
the shortened single booklet edition oi these 
tests is quite complete but does not present any 
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new information about, or any additional aids 
in, the interpretation of the tests. The tables of 
age norms in the 1941 manual have been re- 
placed by a profile of age scores and a profile 
of percentile ranks on the back of the test 
booklet in the 1943 single booklet edition. 

There is one error on page 9 of the manual 
in Part I, Section 7, in which reference is 
made to the five tests dropped from the 1941 
edition to leave twelve remaining tests in the 
1943 edition. Actually six tests were dropped 
and eleven remain. Another minor criticism of 
the manual of the 1943 edition is the fact that 
all of the coefficients of reliability and validity 
quoted refer to the longer 1941 edition. This 
fact is pointed out in an inconspicuous foot- 
note. In any event, it is impossible to determine 
how the elimination of more than one-third of 
the tests in the 1941 edition has affected these 
important considerations in test selection. 

However, these criticisms are minor com- 
pared to the criticisms of the tests themselves 
which the writer has recorded over the past 
five years as a result of administering one edi- 
tion or the other to about 3,500 eighth- and 
ninth-grade pupils. For purposes of clarity and 
emphasis these criticisms will be enumerated 
as follows: 

(a) To begin with, a premium appears to 
have been placed upon speed rather than power 
in five of the six abilities measured. This is 
defensible perhaps in the cases of N and W 
because facility is frankly the thing which is 
being measured and speed is certainly a func- 
tion of facility. The tests of Verbal Compre- 
hension are not so highly speeded as the tests 
of Spacial Orientation and Inductive Reason- 
ing. The practical and vocational usefulness of 
extreme speed in the latter two primary abili- 
ties is open to question. 

(b) A second criticism is that there is no 
evidence that the test items in V, S, and R are 
arranged in order of difficulty as they should 
be if any attempt is to be made to measure 
power. As a matter of fact, a purely subjective 
survey of the test items involved raises a ques- 
tion as to whether the items in these six tests 
could be scaled satisfactorily for the entire age 
range of the battery. : 

(c) The norms developed for this test are 
in terms of age scores and percentile ranks 
based upon Chicago children. This reviewer 
cannot understand how sex differences in the 
primary abilities could have been ignored when 
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sex was one of the variables taken into con- 
sideration in the factorial analyses in “The 
Eighth-Grade Experiment.” As a result of ad- 
ministering this series of tests to about 2,000 
eighth-grade and 1,500 ninth-grade pupils, the 
writer has found markedly significant sex dif- 
ferences at these ages in all abilities except 
Numerical Facility and has since the first year 
of administration reported results in terms of 
separate sex percentile ranks in which the same 
raw score shows a percentile difference of as 
much as 15 points between the sexes. 

(d) The results obtained by this reviewer 
from the original data referred to above show 
a definite and consistent increase in mean per- 
formance from grade eight to grade nine in all 
abilities except Memory in which no increase 
was found. This would seem to indicate that 
this ability may not increase after age thirteen 
and one-half years and that increased age 
norms above that age are at variance with the 

- facts. 

(e) Again, the writer questions the choice 
of media for the measurement of R and M. 
The use of letter series and letter groupings as 
the vehicle for measuring inductive reasoning 
is, in the entirely subjective opinion of this re- 
viewer, responsible for the significantly higher 
mean rating of girls than boys in this ability. 
This result is contrary to published results in 
comparative problem-solving ability and the 
observed superiority of boys in academic sub- 
jects in which inductive reasoning is an impor- 
tant tool. In measuring memory, the ability to 
remember the first name when the last name 
is given would appear to be a rather sterile one 
so far as academic or vocational values are con- 
cerned. The ability to recall more meaningful 
and related material would appear to be more 
significant educationally and vocationally. 

(f) Finally, the Science Research Associates 
who publish and distribute the single booklet 
edition have rendered a real disservice to the 
cause of evaluation and guidance in making 
machine scoring of the 1943 edition impossible, 
(The 1941 edition is machine scorable.) Test 
publishers will have to learn that, with clerical 
labor as scarce, expensive, and often unreliable 
as it is, schools are going to turn in larger 
numbers to the use of the IBM scoring ma- 
chine. If the use of such a machine is decided 
upon, substitution is sure to be made in the 
cases of those instruments not adaptable for 
machine scoring, in which case exclusively 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 310 


hand-scored batteries of tests will not prove to 
be as profitable as anticipated. If test publish- 
ers do not wish to publish a machine-scored 
edition, they should so arrange the physical 
make-up of their tests as to permit the use of 
standard IBM answer sheets and stencils. 

The fact that a great deal of criticism, mostly 
constructive, has been leveled at this series of 
tests by outstandingly competent reviewers as 
well as by practical consumers, such as the 
writer, is more or less beside the point. These 
tests are trail blazers which, while quite useful 
and usable in their present stage of develop- 
ment, can be counted on to be constantly im- 
proved because of the eminence of their au- 
thorship and the society under whose auspices 
they were developed. 


F. L. Wells, Psychologist, Department of 
Hygiene, Harvard University, Cambridge, Mas- 
sachusetts. [Review of 1941 Edition.] This 
test series is a recasting of the 1938 Experi- 
mental Edition (see 40:1427), subtests of which 
were somewhat different and generally more 
difficult. Among those eliminated from the 
older series are those termed Arithmetic, Me- 
chanical Movements, Number Series, Number 
Patterns, and Marks, Among the accessions 
the more distinctive are those termed Three- 
Higher, Letter Series, and Pedigrees. The an- 
nounced range is 11-17 years, which should 
make it serviceable up to at least the upper 
ten per cent of adult population, possibly 
higher, with a reduction of time limits. Aside 
from factorial considerations, it impresses this 
writer as easily the best designed series adapted 
to mass testing within this range. 

Tests having the quality of these and their 
immediate predecessors raise a question whether 
clinicians are not missing a useful clinical ap- 
proach. Such tests do not, of course, replace 
the more qualitative and projective methods, 
from Bellevue to Rorschach; but they are able 
to supplement the latter more effectively than 
can be realized by those who confine their ob- 
servations essentially to free-answer proce- 
dures. The above-mentioned 1938 series has 
lent itself very well to this use, and the present 
gives similar promise. The approach must of 
course be a wholly eclectic one among subtests, 
and some procedural changes may be needed 
to adapt those chosen for individual use. It 
may thus be necessary to determine normative 
data for the specific uses. Neither is machine 


311] 


TESTS & REVIEWS: INTELLIGENCE—GROUP 


228 


scoring suited to clinical functions, if for no 
other reason than that it commonly omits ac- 
count of errors and "skips"; but separate an- 
swer sheets are easily adaptable. 

There is said to be a wide-spread illusion 
that the authors have put forward these tests 
as for the primary mental abilities, No such 
pretense is made; but it is put forward that 
from among an indefinite number of “primary 
abilities" certain ones have been isolated, for 
which the names Number, Verbal Meaning, 
Space, etc., are here appropriate. This can be 
conceded; though it is an unfortunate circum- 
stance that names like these denote so much 
broader functions than are subsumed in the 
tests concerned. It is very reasonable to expect 
that research will isolate other and further 
"primary" abilities in these various areas, and 
the authors themselves so indicate. 

One limitation, already hinted at, will be dif- 
ficult to overcome. There is practically com- 
plete dependence on multiple-choice technique, 
and whoever uses this in parallel with free- 
answer procedure, soon learns that they do not 
record the same abilities. Some concession to 
this fact is embodied in a curiously hybrid role 
assigned to the present “Word-Fluency” pro- 
cedures, In effect, these are very simplified 
projection tests; a large part of whose mean- 
ing is still qualitative, may even be affectively de- 
termined. The tests in question well exemplify 
a consideration above mentioned (“broader 
functions”), in that fluency (a) consists not 
merely, or so much, in the setting down of 
discrete language symbols, as in organizing 
them into a meaningful Gestalt; (b) is very 
much a function of social situations, and of 
speech or writing. In any case, certain modifi- 
cations of the tasks in respect to initial letters 
and number of letters might prove desirable 
with general use. 


For reviews by Henry E. Garrett, Truman L. 
Kelley, C. Spearman, Godfrey H. Thomson, 
and Robert C. Tryon, and excerpts from re- 
views by A. B. Crawford and John M. Stal- 
naker, see 40:1427. 
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Cole-Vincent Group Intelligence Tests for School 
Entrants. Grades kgh-1; 1924-28; 1 form; $1.20 per 
25; 25¢ per set demonstration cards; 40€ per specimen 
sct; (20-30) minutes; Lawrence W. Cole and Leona 
E. Vincent; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. 


Ruth W. Washburn, Consultant in Child 
Development, Shady Hill School, Cambridge, 
Massachusetts. When it was first published in 
1924, this should have been considered one of 
the good tests for the determination of a 
child’s readiness for the work of the kinder- 
garten and first grade. By its use the teacher 
could learn about a child’s understanding of 
language and his ability to follow verbal direc- 
tions, the development of his perception of 
form with special reference to likenesses and 
differences, and, most important, his ability to 
translate through his hand what he saw with 
his eye. Even though its standardization was 
based on the scores of 1,171 pupils whose Stan- 
ford—Binet mental ages were known, further de- 
tailed scientific work in the field of develop- 
mental readiness has made the norms of this 
test of questionable value. Parts of this test 
are very difficult even for well-developed five 
and six year old children; for example, Test 8 
in which 12 geometric figures of increasing 
difficulty are copied. Established norms for the 
Porteus maze test suggest that Test 9, a blind 
alley test, is also very difficult for candidates 
for the first grade. 

The promotion policy based on the findings 
of the test (“a plan which is very satisfactory 
arranges for the pupils of highest ability to 
complete three grades in two years by means 
of mid-year promotions”) has for some time 
been abandoned in many school systems. Ex- 
perience has proved that it is a rare child with 
a mental age of five years and two months who 
can carry on the work and the life of the first 


grade. 
[227 ] 


*Cotswold Mental Ability Test. Ages 10-11; gi 
2 forms (Series 1, 2) ; 6s. per 20; Is. per manual ; d. 
per single copy ; 40(50) minutes ; C. M. Fleming (man- 
ual) and J. W. Jenkins; Robert Gibson & Sons (Glas- 
gow), Ltd. 
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XCulture-Free Test. Mental ages 12 and over; 1944- 
46; 1 form; $4.00 per 25, postpaid; 35¢ per specimen 
set, postpaid; (80-90) minutes; Raymond B. Cattell; 
Psychological Corporation. 
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L. S. Penrose, Galton Professor of Eugenics, 
University College, London, England. The 
new test is a revision, which follows very 
closely the draft prepared under the title of 
A Culture-Free Test of Intelligence in 1940. 
There are now six parts: I, classifications; 
II, reflections; III, series completions; IV, 
matrices (2 X 2); V, matrices (3 X 3) ; and 
VI, incomplete matrices (3 X 3). A set of 
mazes, originally included, was found to be 
valueless and was deleted. Each part consists 
of a set of items arranged in order of difficulty. 
The test is essentially untimed, in that ample 
time, in all 80 to 90 minutes, is allowed for com- 
pletion, Although the material is pictorial and 
nonverbal, the instructions for the test are 
given verbally and are planned so that groups 
of subjects can be tested simultaneously. Some 
norms for high school students are provided 
with the manual of directions. 

The test is scored by adding up the total 
number of items correctly answered ; 72 is the 
maximal score. In its original form, perform- 
ance on the Culture-Free Test was found to be 
correlated highly with the Binet IQ (r = .66 

. to .87). The new test has been compared with 
the Army Alpha and the Minnesota Paper 
Formboard, and correlations varying from .32 
to .71 have been obtained. Intercorrelations be- 
tween the parts of the test itself proved to be 
rather surprisingly low, varying from .12 be- 
tween Parts I and II to .63 between Parts III 
and IV. As with all other intelligence tests, 
scores are found to improve after training, but 
the author considers that the errors in meas- 
urement caused in this way are less in the 
Culture-Free Test than in tests involving scho- 
lastic training. 

The test material consists of problems of the 
type known as “perceptual,” which are believed 
to measure Spearman’s general factor of in- 
telligence more accurately than any other type. 
Moreover, ability to do perceptual problems is 
said to be free from the influence of cultural 
and educational training. However, it is im- 
possible to get away entirely from differences 
in performance due to familiarity or unfamil- 
jarity of subjects with test situations in gen- 
eral. Some children are more predisposed than 
others by training to accept a test a$ worthy 
of serious and prolonged attention. Any pencil- 
and-paper test is only relatively culture-free. 
Furthermore, to deal with linguistically handi- 
capped or deaf subjects, the instructions would 
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have to be modified—though this should not 
present serious difficulty. 

The construction of perceptual tests needs to 
be carried out with great accuracy in order to 
avoid secondary solutions which are as reason- 
able as the primarily intended solutions. In the 
case of Item 8 of Part I of the present test, 
the author has actually had to admit two rea- 
sonable possibilities. Tests using classifications 
and series completions are commonly inaccu- 
rately constructed, and the subject's task is 
partly to discover what has been in the mind 
of the examiner. Part II is free from such am- 
biguities. So also are Parts IV and V, which 
are similar to the series of Progressive Matrices, 
which has been very widely used in England 
during the war in connection with personnel 
selection. Part VI ingeniously combines the 
ideas of matrix completion and series comple- 
tion but may not be absolutely free from am- 
biguity; Item 4 is not very well drawn. 

Tests relatively free from cultural bias are 
needed for many purposes, for genetical stud- 
ies and for comparing groups with different 
cultural backgrounds in anthropological stud- 
ies. They are also of practical value in the esti- 
mation of intelligence of immigrants. The 
present test has the advantage of covering a 
wide range of ability and, when adequate 
norms are available and minor errors corrected, 
should prove useful, provided that plenty of 
time is available for the examination. 


Walter C. Shipley, Professor of Psychol- 
ogy, Wheaton College, Norton, Massachusetts. 
This is a nonverbal, group intelligence test 
suitable for testing over the mental-age range 
from about twelve years to superior adult- 
hood. Like Raven’s Progressive Matrices, it is 
designed to measure Spearman’s “G” factor in 
highly saturated form and to be relatively un- 
influenced by verbal or cultural factors. It is 
composed of 72 abstract perceptual items of 
the multiple-choice type, divided into six parts: 
classifications, pool reflections, series, and three 
series of matrices. The first part requires the 
identification of two figures out of a group of 
six which do not embody the same principle 
as the other four; the second requires the iden- 
tification of figures as they would appear if 
inverted; the third, the identification of the 
figure which continues a series started by three 
others; the last three parts each require the 
selection of a figure which will complete a 
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sequential pattern; and in the very last part, 
portions of the pattern are omitted and must 
virtually be reconstructed mentally before com- 
pletion is possible. Within each part the items 
are arranged in order of difficulty. 

The test requires from 80 to 9o minutes to 
administer and can be scored in a minute or 
two. While there are time limits for each part, 
these are so liberal as to render the test one 
of power rather than speed. Norms, expressed 
in percentile equivalents, are available for high 
school freshmen (N = 1,009) and high school 
seniors (N = 918). The self-correlation of 
the test, Spearman-Brown adjusted, is re- 
ported for 121 high school freshmen to be .88. 

The test has much to commend it. Research 
with it in an earlier form has shown it to be 
relatively uninfluenced by cultural factors and 
to be highly saturated with a general intellec- 
tual factor. The reviewer finds it difficult, 
however, in view of a striking basic similarity 
between this and Raven’s Progressive Matrices 
(first published in 1938), to make any general 
recommendation of one over the other. For 
many purposes they would appear to be vir- 
tually interchangeable. Both appear to fill an 
urgent need for a highly discriminating, non- 
verbal, culture-free test of general intelligence. 


David Wechsler, Chief Psychologist, Psy- 
chiatric Division, Bellevue Hospital, New York, 
New York. The title of this test is misleading 
for, apart from the arbitrariness of the author’s 
definition of culture, there is little evidence to 
show that the test is free from over-all cultural 
influences, however defined. Actually, Dr. Cat- 
tell’s test is a nonverbal (paper and pencil) 
scale, consisting largely of visuo-perceptual 
items. In its latest form, it consists of 6 sub- 
tests: classifications, pool reflections, series, 
and three separate matrices tests modeled after 
Raven’s Progressive Matrices. Originally it 
also contained a Maze Test which was deleted 
because it discriminated poorly between senior 
and junior high school students who were used 
as criteria in differentiating test items. The 
test was standardized on approximately 1,000 
high school freshmen and 1,000 high school 
seniors, and the norms available are in terms 
of percentile ranks for these two groups, irre- 
spective of age. 

Apart from the constricted population used 
in its standardization, the most serious limita- 
tion of the Cultwre-Free Test is its restricted 
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subject (content) range. It consists, as already 
mentioned, exclusively of visual perceptual 
items, and while the abstract reasoning in- 
volved differs from test to test, the role of the 
capacity for the visuo-perceptual organization 
is much overworked. It is for this reason that 
the Culture-Free Test may be expected to be 
a less effective test of intelligence than the 
Kellogg-Morton Revised Beta Examination or 
the Chicago Non-Verbal Examination which 
have a broader range. For this reason too, the 
time required for its administration is much 
too long, to wit, 80 to 9o minutes. This is more 
than double the time required for the Kellogg- 
Morton scale. Nevertheless, the correlations of 
the Culture-Free Test with other intelligence 
tests are of about the same order as those ob- 
tained between the latter and other nonverbal 
scales (eg. Army Beta), namely, r's from 
.50 to .60. No important sex differences were 
obtained except in the case of one correlation 
with the Revised Minnesota Paper Formboard 
which is .32 for the boys and .64 for the girls 
(high school freshmen). 


J Consult Psychol 11:282 S-O '47. 
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*Duplex Series of Ability Tests. Ages 10-11, 11-12, 
12-13, 13-14; 1947; 1 form, 4 levels; 10s, 6d. per 25 
copies of Test No. 1 (ages 10-11) or Test No. 2 (ages 
11-12) ; 12s. 6d, per 25 copies of Test No. 3 (ages 12- 
13); I5s. per 25 copies of Test No. 4 (ages 13-14) ; 
25. 6d. per manual; 5s. od. per specimen set; Frank M. 
Earle; George G. Harrap & Co. Ltd. 
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Essential Intelligence Test. Ages 7-11 and reading 
ages of 74 or more; 1940-46; manual and publisher's 
catalog refer to the test as Essential Junior Intelligence 
Test; 1 form; 5d. per test; 25-49, 4d. each; Is. per 
manual; 45(55) minutes; Fred J. Schonell and R. H. 
Adams; Oliver & Boyd Ltd. 
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Group Test of Learning Capacity: The Dominion 
Tests. Grades kgn-1, 4-6, 7-9, 9-12; 1934-46 ; formerly 
called Group Test of Intelligence; Forms A, B ; pre- 
pared by Department of Educational Research, Uni- 
versity of Toronto; Vocational Guidance Centre, On- 
tario College of Education, University of Toronto. 

a) primary. Grades kgn-1; 1944-46; $1.20 per 30; 3o¢ 
per specimen set ; 33(65) minutes. 

b) junior. Grades 4-6; 1940-41; 65¢ per 25; 30¢ per 
specimen set; 25(45) minutes. 

C) INTERMEDIATE. Grades 7-9; 1934; 95¢ per 25; 30¢ 
per specimen set; 37(50) minutes. 

d) ADVANCED. Grades 9-12; 1939-40; 65¢ per 25; 3o¢ 
per specimen set; 30(45) minutes. 
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F. T. Tyler, Professor of Psychology and 
Education, The University of British Columbia, 
Vancouver, Canada. The Advanced forms are 
of the omnibus, self-administering type; the 
forms at the three earlier levels contain sub- 
tests which are separately timed. Total testing 
requires from 45 to 60 minutes, varying with 
the level. The manual includes complete direc- 
tions for administering and scoring the tests. 
As the form in which the answers are presented 
does not lend itself to speed or accuracy of 
scoring, users will need to prepare their own 
Scoring keys. 

Mental age equivalents are presented for all 
tests. The authors prepared their own norms 
for the Primary form and used the National 
Intelligence Test and the Terman Group Test 
of Mental Ability to determine mental age 
equivalents for scores on their other tests. 

Reliability coefficients, obtained by the equiv- 
alent-forms method, are usually reported by 
both grades and levels. The reliability coeffi- 
cient for the Intermediate form, .95, is based 
on 1,000 students in one (unspecified) grade. 
For the other levels, coefficients range from „81 
to .92. The numbers of subjects used in de- 
termining reliability ranged from 41 to 186, 
except for the Primary form, for which 933 
first grade pupils were used. 

The probable errors of scores, when reported, 
were very consistent, ranging from 2.5 to 3.0, 
except for grade 1 where the P.E. of an in- 
dividual score on each form was 4.5 IQ points. 
The standard deviations of scores ranged from 
9.36 to 13.38; variabilities of IQ’s were not 
reported except for grade 1, where the stand- 
ard deviation was 17.5. The mean IQ was re- 
ported only for grade 1, where it was 108.3. 
In the absence of more complete data it is dif- 
ficult to determine the comparability of 1Q’s 
at different grade levels. IQ's are of the well- 
known ratio type. 

The manuals, though not all equally com- 
plete, are comparable to those for most group 
tests. They do, however, fall short of the stand- 
ards set up by R. W. B. Jackson and G. A. 
Ferguson in their Studies on the Reliability of 
Tests (see 952). The relationship between sub- 
tests is not indicated at all. Incomplete data 
are given on item validities, although certain 
manuals indicate that the items in the final 
form were selected from a much larger number 
after a study of item validity, 


[232 ] 
XInductive Reasoning Test. Grades 9-12 and adults ; 
1946; IBM; 1 form; 50¢ per 25 machine-scorable test- 
answer sheets ; 25€ per key; 35¢ per specimen set, post- 
paid; 20(25) minutes; G. Bernard Baldwin; Educa- 
tional Test Bureau. 


Charles R. Langmuir, Director, New York 
Office, Educational Testing Service, New York, 
New York. This test consists of 40 number- 
series items printed on one side of an IBM-type 
answer sheet. One-third of the page is used for 
directions and practice exercises. While the 
directions are read, the test items are upside 
down. Upon the starting signal, the page is 
turned end for end. The arrangement is con- 
venient and practical. 

The details of the item form, variety, and 
range of difficulty are best shown by repro- 
ducing Items 1, 11, and 40: 


/518::9/10 4147 1,7 
7.49 6 36 5 25..... 
[40] 17 19 23 29 31 3..... 


n] 
[11] 


Two answers are required to each item. 
“Score is the number of items in which both 
choices are correct." No other scoring infor- 
mation is given. The catalog description indi- 
cates hand or machine scoring. The space pro- 
vided for recording the score was not designed 
for use in machine scoring. This defect can be 
overlooked since the requirement of two cor- 
rect marks for each item makes machine scor- 
ing impossible. 

All the information available about the test, 
including directions for administering, the 
method of item selection, the method and re- 
sults of determining reliability and validity of 
the test, and the report of norms, is presented 
in a one-page manual (582 words of text). 

Norms are limited to a table of scores equiva- 
lent to the Ist, 2nd, 16th, soth, 84th, 97th, and 
99th percentiles in the four high school grades. 
Score distributions are not reported. There is 
no information about the students forming the 
norms group other than grade placement in un- 
identified schools. 

The test will interest experimenters and 
teachers who want a brief, easily administered, 
and easily scored number-series test as a meas- 
ure of power in number factor. This reviewer 
recommends that users disregard the so-called 
norms, investigate the need for increasing the 
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time limit, verify the scoring key, and then ac- 
cumulate interpretive data from local experi- 
ence. 

In fourteen items the alternatives are so 
chosen that the series can be marked correctly 
for three or more spaces, as in Item 1, Should 
the student be marked wrong on "inductive 
reasoning" when he correctly extends the se- 
ries beyond two entries? The choice of a rule 
for scoring these items may have a consider- 
able effect on the validity of the scores. 

The scoring key is an International Test Scor- 
ing Machine key form punched by hand, Users 
would be wise to verify the key. In the key 
reviewed, the answers to Item 40 were given 
as 47 and 51. This reviewer, in consultation 
with 13 literate advisors, was forced to ascribe 
the obscurity of the item to defects in both 
the problem and the key. 


J Consult Psychol 11:156 My-Je '47 


[ 233 ] 
*Junior Scholastic Aptitude Test. Grades 7-9; 
1935-45; Forms DR, ER, FR; charges include test 
materials, scoring, and reporting of results; all test 
materials, used and unused, must be returned; $1 per 
individual; 86(105) minutes; prepared by the Bureau 
of Research (A. L. Lincoln, chairman), Secondary 
Education Board, Milton, Mass.; distributed and 
scored by the Educational Records Bureau. 
REFERENCES 

1. TRAXLER, ARTHUR E. “A Study of the Junior Scholastic 
Aptitude Test," E rap m 1939 Achievement Testing Pro- 
gon in Independent Schools and Supplementary Studies. By 

ducational Records Bureau. Educational Records Bulletin No. 
27. New York: the Bureau, June 1939. Pp. xiii, 76, 11. $1.50. 
Paper, lithotyped. (PA 1, 14274) 

2. traxtek AxruUR E. UA Study of the Junior Scholastic 
Aptitude Test." J Ed Res aeu $^ 1. * (PA 15:5392 

3. TowNsEND, AGATHA. “The Use of Results From the dcr 


Scholastic Aptitude Test,” pp. 34-9. I Fall Testing Pro- 
gram in Independent Schoolt ad Supplementary Studies, Edu- 
Fd Dee ES] uc 39. d esi UR Rec- 
20s Hureau Ja ry 1944. Pp. xii, 48. Paper, lithotyped. $1.00. 
[ 234 ] 
Kentucky General Ability Test. Grades 12-13; 1946; 
a revision of the Kentucky General Scholastic Ability 
Test; Form 46; 8¢ per test including scoring service; 
5€ per test without scoring service; 30(35) minutes; 
E. J. Asher; revised by P. L. Mellenbruch ; Kentucky 
Cooperative Testing Service, University of Kentucky. 
REFERENCES 
1-3. See 40:1402. 


For a review by Richard Ledgerwood of an 
earlier form, see 38:1048. 

; [ 235 ] 
Kingsway Intelligence Tests. 1939; a 20-page book- 
let of 15 practice tests ; 7d. per pupils’ edition; 10d. per 
teachers’ edition; Evans Brothers Ltd. 

J Ed (London) 72:394 Ag "40. Controversy 
still rages around the use and interpretation of 
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the results of group tests of intelligence. Where 
two different, but equivalent, forms of a group 
intelligence test are given successively to the 
same group of children, the gain of the second 
test over the first may be as high as twenty per 
cent. Gains of this size have been found in the 
American Army Tests, and in others. Any 
teacher can buy Intelligence Tests, coach his 
pupils in them, and they will obtain a consequent 
increase in the scores on subsequent tests of the 
same type. Questions of all the common types 
used in Intelligence Tests of the Entrance Ex- 
amination are obtainable in the Kingsway Intel- 
ligence Tests. Nothing could be more foolish 
or cruel to the borderline scholarship child than 
the use of such books. The child thus coached 
may manage to “pass” the Intelligence Test at 
the expense of a more intelligent child who has 
not had such coaching. But the result will be 
that he will spend a miserable four years trying 
to live up to his intelligence score, attempting 
work that is far too difficult for him, and be- 
coming a drag on the school. Admittedly, this 
kind of coaching is now done in many schools, 
but those teachers who do it show a complete 
lack of understanding of examinations. They 
too frequently regard the examiner as a foe to 
be outwitted rather than as a friend with whom 
co-operation is necessary. Some even consider 
training in intelligence tests to be training in 
intelligence. This is not so. The candidate who 
is test-sophisticated has learnt to recognize the 
instructions, and has acquired more efficient 
methods of working than the child who comes 
new to the test, but this is not to say that he 
is more intelligent. Those who have prepared 
and used intelligence tests know that they are 
sensitive instruments easily rendered invalid by 
mishandling. Their successful use depends on 
the standardization of the conditions under 
which they are used. Where some testees are 
familiar with tests and others are not, the test 
results become entirely invalid and, under such 
conditions, it would be well to abandon them 
altogether. It would be a pity to have to do this 
now, at a time when a number of county authori- 
ties are coming to depend on the Intelligence 
Test in the County Entrance Examination. 


For an additional review, see 40:1403. 


[ 236 ] 
Kuhlmann-Anderson Intelligence Tests, Fifth 
Edition. Grades 1B, 1A, 2, 3, 4, 5, 6, 7-8, 9-12; 1927-42; 
1 form; 9 levels ; $1.50 per 25 of any one level; 50¢ per 
manual; $1 per specimen set, postpaid; 15¢ per pad of 
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25 of any one subtest for individual testing; (40-60) 
minutes; F. Kuhlmann and Rose G. Anderson ; Educa- 
tional Test Bureau. 


REFERENCES 

1-15. See 40:1404. 

16. KeLLY, Price OnmviLLE. Comparative Validity and Re- 
liability of Four Intelligence Tests in the Ninth Grade. Unpub- 
lished master's thesis, University of Kentucky, 1933. Pp. 63. 
(Theses $n Education . . ., 1937, pp. 18-9.) 

17. NEMZEK, CLAUDE L, “Intelligence Testing at the College 
Level.” J Ed Res 26:617-8 Ap '33. * (PA 7:4196) T 

18. Apxins, Dozornv C. “The Effects of Practice on Intelli- 
apes Test Scores.” J Ed Psychol 28:222-31 Mr '37. * (PA 11: 
3629, 

19. Apxins, DoRorHY C. “The Efficiency of Certain Intelli- 
gence ‘lests in Predicting Scholarship Scores." J Ed Psychol 
28:129-34 F '37. * (PA 11:3391) 

20, Berson, P. M. Optimum Use of Reading Tests and of 
Non-Visual-Verbal and Visual-Verbal Subtests of the Kuhl- 
mann-Anderson Intelligence Test in Indicating the Rate of 
Progress through the Primary Grades. Unpublished master's 
thesis, College of the City of New York, 1937. Pp. 43. 

21. CAVELL, ANGELO CIAVARELLA. A Study of the Kuhlmann- 
Anderson Tests Administered to Inmates of the Western State 
Penitentiary. Unpublished master's thesis, University of Pitts- 
burgh, 1937. (Abstracts of Theses . . . 1937, PP. 454-5.) 

22, ALLEN, MiLpRED M. Prediction of Academic Success of 
Elementary School Pupils by Means of Kuhimann-Anderson 
Tur S npnbHished doctor's thesis, New York University, 1940. 

p. 128, 
_ 33. Bruce, MvnrLE. Factors Affecting Intelligence Test Per- 
Tormance of Whites and Negroes in the Rural South. Archives 
of Psychology, No. 252. Washington, D. C.: American Psycho- 
logical Spee lnc., July 1940. Pp. 99. Paper. $1.50. * (PA 
1534591 

24. KosLow, SypNnEy. “Reading Readiness and Reading 
ee in First Grade.” J Exp Ed 9:154-9 D '4o. * (PA 
15:2379. 

25. HARTSON, L. D., ANp Sprow, A. J. “The Value of Intelli- 
AA Quotients Obtained in Secondary School for Predicting 

llege Scholarship." Ed & Psychol Meas 1:387-98 O "41. * 
WA 16:1174) 

26. SPaCHE, GxonGE. “Deriving Lan; lage and Non-Language 
Measures of Intelligence From the Kuhlmann-Anderson Intelli- 
gence ts.” J Ed Psychol 32:673-80 D '41. * (PA 16:2907) 

27. LRAXLER, ARTHUR E. "Comparison Between L.).'s on the 
New Edition ot the Kuhlmann-Anderson Intelligence Wests and 
Binet 1.0.'5," pp. 27-32. In 1940 Fall Testing Program in Inde- 

pendent Schools and Supplementary Studies. Educational Kec- 
9rds Bulletin, No. 31. New York: Educational Records Bureau, 
TUR 1941. Pp. xi, 41. Paper, lithotyped. $1.00. * (PA 15: 


mergency Battery With the Revised Stanford-Binet and thi 
Kuhlmann-Anderson Tests." Ai t. D p^ 
k (1 77, fle SES bstract. Psychol B 39:591 O '42. 

33. ALLEN, Mitprep M, "Relationship Bet: x y 
Anderson Intelligence Tests and Pes niis MGE TEN 
Grade IV.” J Ed Psychol 35:229-39 Ap '44. * (PA 18:3902) 
Ad + ALLEN, Mitprep M. “Relationship Between Kuhlmann- 

nderson Intelligence Tests in Grade 1 and Academic Achieve- 
aum re 3 and 4.” Ed & Psychol Meas 4:161-8 su '44. * 

£53. 

35; Hotann, Pattie E. The Intelligence of Rural Chi 
a easured by the Kuhlmann-Anderson Intelligence fand 
the Sienford.Pinet Scale. Unpublished master's thesis, Univer- 


TowNsENÉ, AcATHA, "Some Aspects of Testing i; 
Primary Grades," pp. Sire. In 1944 Achievement Testing Pre 
gram in Independent Schools an Supplementary Studies. Edu- 
cational Records Bulletin, No. 40. New York: Educational Rec- 


ords Bureau, June Pp. xi i 
eens 1944. Pp. xii, 58. Paper, lithotyped. $1.50. * 


J Ed Psychol 36:252-6 Ap "4S. * (PA 19:2387) 
B. CHAPANIS, ALPHONSE, AND WILLIAMS, W. C. “Results of 
adest Survey’ With zm Median Anderson Intelligence 
ity. s i 4 
PS REED. VA 'ennessee. ‘enetic Psychol 67: 


e BERLY, STAN E. “A Systematic Error in Kuhlmann- 
Anfersom Mental Ages.” J Bd Psychol 37:193-218 Ap 46." 


bee Rose G. “‘Wimberly’s Criticisms of Kuhlmann- 


Anderson Tests." J Ed Psychol 38:45-50 Ja '47. * (PA 21:3283) 

W. G. Emmett, Lecturer in Experimental 
Education, University of Edinburgh, Edin- 
burgh, Scotland. These tests were first published 
in 1927. In the present edition, the level of 
difficulty of some of the items has been raised 
to allow more headroom for the brighter candi- 
dates. There are 39 tests in all, 10 for each grade 
from grade 1 to maturity, 6 or 7 tests being 
common to successive grades. There is little 
to criticise about the subject matter of the tests ; 
it might be said that the method of answering 11 
of the earlier tests by making a dot on a picture 
or diagram is somewhat hazardous, but doubt- 
less the authors have satisfied themselves that 
accurate marking is not imperiled. The instruc- 
tions for marking and scoring are clear, and 
aids for computing a measure of brightness are 
provided. 

The norms are based on the scores of some 
30,000 children. They are given for each test 
in terms of mental age corresponding to each 
point of score obtained. The median of the 
mental ages from the several tests within a 
grade is taken as the best estimate of the mental 
age of the candidate. From this point two 
courses of expressing the brightness of a candi- 
date are available. First, the mental age may be 
converted into an intelligence quotient, the max- 
imum chronological age to be used as divisor 
being 186 months. The authors, although allow- 
ing this method, are strongly critical of its use. 
"They prefer a measure based on Heinis's growth 
curve, the mental age being expressed as a per- 
centage of average development. If the IQ unit 
is adopted, the resulting standard deviation of 
the quotients given by the test should be quoted, 
in order to determine the size of the unit. It is 
well known that the standard deviation depends 
on the test material and may vary from 13 up 
to perhaps 25. The authors unfortunately do 
not give the information. 

The Heinis’s growth scale seems a more 
rational method of measuring brightness, if the 
scale is constructed from well-substantiated 
data. Best of all, however, in the reviewer's 
opinion is a scale of standard scores, centered 
round roo with a constant standard deviation 
Such as r5, determined by agreement among 
psychologists and educationalists. Individuals 
would then be rated in standard units above or 
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below the average performance of others of the 
same age. 

The authors have quaint and unacceptable 
notions about the validity and reliability of men- 
tal tests. They say, “Test scores themselves are 
a better criterion of intelligence than anything 
else available." This sounds very like, *Heads I 
win, tails you lose.’ There seems little point in 
giving an intelligence test if it is not to pre- 
dict a criterion. However, if there be virtue in 
measuring intelligence for its own sake, the 
measure should be as accurate as possible, that 
is, the reliability coefficient of the test should be 
high. But surprisingly the authors say, “close 
agreement between the first and second set of 
scores and (sic) as expressed by the coefficient 
of correlation may be evidence that the tests are 
poor instead of good.” We are left wondering. 

On the whole, the authors appear to have 
produced a reasonably good set of tests, pre- 
sumably by somewhat empirical methods. 


Stanley S. Marzolf, Professor of Psychology, 
Illinois State Normal University, Normal, 
Illinois. The preparation of the first edition 
of these tests was begun by the late Dr. Kuhl- 
mann in 1916 with a view to conserving for 
group use as many of the desirable features 
of an individual test as possible. Dr. Anderson, 
who joined in this task in 1920, has been re- 
sponsible for organizing the instructions for 
the tests and for the testing involved in pre- 
liminary trials for the first and subsequent 
revisions. 

Thirty-nine tests, each containing from 6 to 
72 items, are to be used in batteries suitable for 
the age or grade of the person tested. When used 
as an individual test, it is recommended that 
14 tests constitute a battery; for group use, Io 
are sufficient. Booklets containing the tests suit- 
able to a given grade are available. For use as 
an individual test, pads of the separate tests are 
available. The total time required for adminis- 
tration is approximately 45 minutes. Mental age 
credit depending upon the number of items cor- 
rectly answered is given separately for each sub- 
test. The median mental age of all subtests 
given, neglecting zero and maximum scores, in- 
dicates the general level of mental development. 
An alternative method of scoring is strongly 
recommended by the authors, namely, the use 
of Mental Units and PA (per cent of average 
development). Mental Units have been com- 
puted using the Heinis growth curve data and 


are assignable in accordance with the number 
of items of the subtest passed. The ratio of the 
median number of mental units attained to the 
average number for a given CA is the PA. 

The fifth revision was undertaken chiefly to 
increase the range of possible mental ages. The 
maximum "mental age" obtainable is now 40, 
as compared with 30-5 for the fourth edition, 
The maximum CA to be used in computing IQ's 
is now 15-6 rather than 16. The higher mental 
ages obtainable are due principally to added 
trials in 16 tests. Tests considered dependent 
upon specific school training do not appear in 
booklets below grade 3. The 25th printing, which 
appeared in 1942, contains norms which call for 
some downward revision of the maximum men- 
tal ages, since in nearly all cases the median MA 
of unselected ages from 6 to 16 was one or two 
months higher than expected. 

The directions for administering the tests are 
clear and specific, and scoring is entirely objec- 
tive. Some minor alterations in phrasing must 
be made when the test is used individually. Per- 
haps the biggest source of possible error in the 
use of these tests by the classroom teacher is the 
timing. Time intervals of 10, 15, and 20 seconds 
are quite common at the lower levels. At the 
upper levels, time limits up to 3 minutes occur. 
Time limits at the lower levels are quite ample, 
since speed is not intended to be a significant 
factor. 

The standardization groups have been selected 
from supposedly representative communities of 
the North Central and Middle Atlantic states. 
Repeated revision and the use of over 5,000 sub- 
jects for the final revision have produced mental 
age riorms which are doubtless quite dependable. 

Mental Units and PA represent a commend- 
able and straightforward attack upon the recog- 
nized inequality of mental age units, though 
the data Kuhlmann has accepted for the growth 
function could readily be improved upon. Doubt- 
less the principal reason PA has not challenged 
the IQ more successfully is the weight of tradi- 
tion and custom which had accumulated by the 
time PA was proposed. Kuhlmann's lead in this 
direction deserves more attention than it has 
received. 

The manual does not include validity coeffi- 
cients. In the opinion of the authors such co- 
efficients are misleading since a suitable criterion 
is not available. School marks cannot be used 
because they are contaminated by other factors 
than intelligence, such as health, diligence, 
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teacher proficiency, and the like. While it is true 
that school marks do not constitute a perfectly 
desirable criterion, we have a right to expect an 
intelligence test to show some relation to them. 
Certainly, correlations of the order of .60 have 
been found with sufficient frequency to cause us 
to view with suspicion any test with a validity 
coefficient much below this level. In fact, cor- 
relations of this magnitude have been found 
for this test (28, 34), and the publishers quote 
from two unpublished University of Washing- 
ton dissertations to the effect that these tests 
ranked high in validity when compared with 
other tests. No competent person is likely to 
find these studies misleading, the authors’ 
manual notwithstanding. 

In the final analysis, the validity of these 
tests rests upon the informed judgment of the 
authors. It is their claim that “chronological age 
is used as the criterion of what the tests pro- 
pose to measure.” As it stands this statement is 
absurd, since measures of height, weight, or 
degree of ossification would each meet this cri- 
terion, The authors’ wide experience and their 
knowledge of the experience of others with 
kinds of test items has undoubtedly enabled 
them to develop a good test. Manipulations in- 
volving chronological age provide a derived scale 
for quantifying responses; they do nothing to 
enhance the validity. The’ authors are explicit 
in admitting use of common-sense judgment as 
the basis for ruling out tests which depend upon 
training, but they are not explicit in admitting 
the use of such judgment in accepting tests 
which do not depend upon school training. 

Reliability coefficients are respected even less 
than validity coefficients. For the authors, the 
reliability which scores from these tests possess 
arises from careful scaling of difficulty, the care 
with which directions have been formulated, and 
the use of the median mental age method of 
scoring. Maybe so, but we still wonder what 
degree of reliability has resulted from these 
efforts. 

These tests are probably much better than 
the authors' reasoning about validity and relia- 
bility would lead one to expect. They will con- 
tinue to be widely used, largely because of their 
brevity. Perhaps, the best reason for their con- 
tinued use is that they are relatively less de- 
pendent upon reading skill than are most other 
group tests. 


For reviews by Psyche Cattell, S. A. Ci ourtis, 


Henry E. Garrett, and Austin H. Turney, of 
an earlier edition, see 38:1049 and 40:1404. 
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Laycock Mental Ability Test. eads 4-10; 19397 
; r test; 25-499 copies, 4¢ each ; 5¢ per manual ; 
2 ae hey 3214 (40) minutes; S. R. Laycock; Univer- 
sity of Saskatchewan Bookstore. 


George A. Ferguson, Assistant Professor of 
Psychology, McGill University, Montreal, Can- 
ada. This test, published in 1933, provides a 
measure of the mental ages and resulting in- 
telligence quotients of school children from 
grades 4 to 8. It consists of seven subtests of 
the more commonly used types of test material: 
classification, verbal analogies, figure analogies, 
opposites, inference, number series, and sen- 
tence completion. 

The items included in the present final form 
of the test were selected from three previous 
experimental drafts, the number of items being 
progressively reduced from an original 550 to 
the present 168. Items were initially selected 
from the original draft on the basis of their 
ability to discriminate between different mental 
age groups, mental ages determined by the 
National Intelligence Test serving as the cri- 
terion. The author reports a correlation of .83 
with the National Intelligence Test (N = 40, 
grade 4 pupils) and a correlation with teachers’ 
judgments of ability of .63 (N = 49, grade 8 
pupils). Other items of validation data are pre- 
sented. 

The test norms are based on 7,500 public 
school children in grades 3 to 8, selected from 
both rural and urban schools in seven provinces 
of Canada and 1,000 high school children 
selected from schools in the province of Sas- 
katchewan. 

Although this test contains nothing unique 
in format, test material, or methodology of con- 
struction, very considerable care and labour 
have been exercised in its development and in 


. the procuring of various forms of validation 


data. Its norms are based on a broad and repre- 
sentative sample of the Canadian school popu- 
lation, and it serves as a satisfactory instru- 
ment for appraising the mental level of school 
children within the age range specified. 


F. T. Tyler, Professor of Psychology and 
Education, The University of British Colum- 
bia, Vancouver, Canada, The Laycock Mental 
Ability Test sets out to measure Spearman's 
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general factor, "g," by means of seven sub- 
tests involving the eduction of relations and 
correlates. The author points out that "Tests of 
Information" have not been used. Nevertheless, 
such an item as "Subservient people are apt to 
be—aristocratic, conceited, proud, deferential" 
requires the testee to manipulate information. 
Still, the author seems to assume that all indi- 
viduals taking the test have had equal opportu- 
nity to acquire the information called for by the 
item. This is the case with most of the present 
mental ability tests. The author gives no data 
on the extent to which the test is measuring “g” 
or on the relationships existing among the seven 
subtests. 

The 550 items with which the author started 
were reduced to 168 by eliminating, first, all the 
items which failed to show an increase in per- 
centage passing with increase in mental ages 
obtained from the National Intelligence Test. 
The tests went through three revisions, but no 
other factor used in eliminating items is indi- 
cated. The tests were validated by correlating 
with teachers' judgments, other intelligence test 
scores, and average school marks. The median 
correlations were .67, .81, and .59, respectively, 
in small groups of 30 to 40 students. These cor- 
relations are typical of those found when other 
tests are used, but the groups used are rather 
small for validation. 

Reliability is indicated by means of coeffi- 
cients of correlation. The test-retest correlation 
for 40 pupils after a six month's interval was 
.8o; the split-half reliability for 195 students 
chosen at random from the 8,500 pupils in pub- 
lic and high schools used in making the norms 
was .95. These results are typical of those re- 
ported by other test makers. 

Test scores from 20 to 155 are changed to 
mental ages in order to compute the usual ratio 
type of IQ. The maximum divisor is 15, and 
CA’s of children beyond 13 years of age are 
adjusted in a manner similar to that used to 
compute Revised Stanford-Binet IQ's. The 
maximum IQ obtainable at age 12 is 153; this 
is reduced to 122 at age 16. 

The method of indicating responses, namely, 
underlining the choice, is not too well chosen. 

Mental age norms are based on 8,500 students 
in all grades from 3 through 11. Samples for the 
elementary school subjects were selected from 
seven Canadian provinces; all high school stu- 
dents were in Saskatchewan schools. The aver- 
age numbers of students per grade level are 


somewhat small as compared with numbers used 
by many other test makers. 

Judged by the recommendations of R. W. B. 
Jackson and G. A. Ferguson in Studies on the 
Reliability of Tests (see 952) more information 
is to be desired. Insufficient data on reliability 
and item validity are presented. The numbers 
of cases used in some of the studies were inade- 
quate. On the other hand, data presented in the 
manual compare favourably with those for 
many other tests of mental ability. 


[ 238 J 
Mental Alertness Test: George Washington Uni- 
versity Series, Revised Form. Grades 12-13; 1926- 
30; $10 per 100; 15¢ per specimen set, postpaid ; 50 (60) 
minutes; L. Omwake, S. Doran, F, Crowley, and H. 
Peterson, Center for Psychological Service, George 
Washington University. 


J. P. Guilford, Professor of Psychology, The 
University of Southern California, Los Angeles, 
California. This test has so many faults that 
one is relieved to find that the copyright date is 
1930 rather than 1947. 

To be more specific, some now well-recognized 
rules of good test construction have been vio- 
lated, (a) In the first three parts, the “Rights — 
Wrongs” scoring formula is used, without any 
specific instructions regarding guessing and 
penalties for wrong responses at the beginning 
of each part, and apparently without recognition 
of the fact that all examinees will have had 
ample time to complete each part. (b) The sev- 
eral parts have one over-all time limit. This is 
sound practice if the time is sufficiently liberal 
to allow every examinee to attempt all items. If 
there are slow examinees who do not even at- 
tempt items in part 5, however, and if part 5 is 
functionally different from other parts (it prob- 
ably is not in this examination), the total scores 
for slow examinees would be somewhat differ- 
ent in meaning from those of rapid examinees. 
(c) Scores for parts 4 and 5 are weighted 5 and 
2 times, respectively. One would like to see rea- 
sons for this given in the test manual. (d) Part 4 
contains only five items and hence must be quite 
unreliable. By virtue of its unusual weight it 
would tend to lower the reliability of the total 
score. 

The total score is apparently heavily weighted 
with verbal-ability variance. All the parts, ex- 
cept part 4, would also have their heaviest vari- 
ance in that same factor. In part 3 we find an 
attempt to measure memory for content of prose 
material. This is a rather novel feature in 


4 


- form in turn is divided into two parts. The first 
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scholastic aptitude tests and one that deserves 
to be validated. The aim of part 3 probably 
miscarries to a large extent, however, because 
the prose material is sufficiently familiar that 
many examinees could make a fair score with- 
out having read the specific material. 

In spite of these shortcomings, the test can 
still have a substantial degree of scholastic- 
aptitude validity, because of the probably heavy 
weighting with the verbal factor and the corre- 
spondingly great emphasis on this factor in 
measures of academic achievement. No data on 
reliability or validity are given in the test man- 
ual, or on intercorrelations of parts. 

The norms given with the test are highly 
inadequate. It is merely stated that they pertain 
to college freshmen, but information as to how 
many, what sex, and what college is not given. 
Centile equivalents are given only over the range 
from the roth to the goth centile. This restric- 
tion of range is considerable, since the scale dis- 
tance under the normal curve is as great from 
the goth to the goth centile as it is from the soth 
to the goth, 
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*Mill Hill Vocabulary Scale. Ages 434 and over; 
1943; Forms I, IT; 125. 6d. per 50 of either form; also 
for use with the complementary test, Progressive 
MUT (see 258) ; J. C. Raven; H. K. Lewis & Co., 


REFERENCES 


m AND Warsmaw, J. B. “Vocabula: 


in Rum: C. Tests.” 
Brit J Med Psychol 20:18. * (PA 19: 
RUPES a d pap 


2. HiwwzLWEIT, H. T. Intelli i 
z + Mace Ne Temperament.” J Personality Morir ete 


3. Eysenck, H. J, Dimensions of Personality, pp. A 


123-5. 

Hinmelwelt aod W. Tinted Seer ate with ET 
W. D. Furneaux, H. Halstead, O. Marum, M. McKinlay, A. 
Petrie, and) PLM. Yap. Foreword by Aubrey Lewis. London: 
bun [ERR » Trubner & Co. Ltd., 1947. Pp. xi, 308. 

David Wechsler, Chief Psychologist, Psychi- 
atric Division, Bellevue Hospital, New York, 
New York. The Mill Hill Vocabulary Scale is 
a wide-range vocabulary test standardized by 
the authors on an English population, ages 414 
through adulthood. It has two forms which can 


be used individually or as a group test. Each 


part of each requires the subject to define the 
words either verbally or by writing them out; 
the second part, administered only to the sub- 
ject who can read, is a multiple-choice vocabu- 
lary test in which the subject selects the nearest 
synonym (r of 6) to the stimulus word. 
Results for both forms are given in percen- 


tiles for different ages, Scoring makes use of 


both qualitative and quantitative criteria, but 
only the latter are used to obtain the subject's 
test score. Test-retest correlations between 
Forms 1 and 2 as a whole are in the “neighbor- 
hood of .97”; between the oral definitions parts, 
.95 and for the synonym selection parts “in the 
neighborhood of .9." A method is furnished for 
converting the quantitative scores into MA 
equivalents without, however, any validation of 
the technique employed. 

The 88 words which comprise the two vocab- 
ulary lists were taken from the Concise Oxford 
Dictionary supplemented by synonyms from 
Roget's International Thesaurus, In spite of the 
excellent criteria and statistical procedures used, 
the net result, as in the case of other vocabu- 
lary lists, is to furnish median scores which dis- 
criminate only modestly between successive age 
levels. Thus the differences between successive 
median age scores (4J4 to 14 inclusive) is on 
the average only about two words. On the other 
hand, the author's norms show a very marked 
jump from the median 14-year score to that of 
the average adult. The difference between these 
two age levels is much greater than those found 
by American authors (Terman, Wechsler). On 
the Mill Hill list (88 words) the jump is from 
38 to 55; on the Terman list (45 words) from 
16 to 20; on the Wechsler list (42 words) 
from 20 to 25. 

The authors originally presented their vocab- 
ulary test as a measure of verbal ability rather 
than of general intelligence and were opposed to 
combining it with other tests to give a single 
measure of intelligence. In their latest manual, 
however, they state that it was designed as a 
"complement" to Raven's Progressive Matrices 
(see 258), “both Scales (being) constructed to 
cover as nearly as possible the whole range of 
intellectual development from infancy to matu- 
rity." This claim is unsupported and would be 
difficult to prove. The Mill Hill Vocabulary 
Scale is nevertheless an excellent test measuring 
in a rather good way the same factors which 
may be said to be measured by other vocabulary 
tests. Comparative norms on an American popu- 
lation would be very desirable, 
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*Moray House Picture Intelligence Test 1. Ages 
78-103 months ; 1944; 125. 6d. per 25; 8d. per manual; 
Is. 4d. per specimen set, postpaid; 25(75) minutes; 
Margaret A. Mellone under the supervision of Godfrey 
H. Thomson and an Advisory Committee oí Edin- 
burgh Headmasters and Infant School Mistresses: 
University of London Press Ltd. : 


- 
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, t. Metrong, Margaret A. “An I igati - 

tionship Between Reading Ability and EQ, as Measured by a 

yere D TEE nce Test." Brit J Ed Psychol 12:128-35 
: 16:39) 

„2. MzLLoNE, Marcarer A. “A Factorial i 

Tests for Young Children." Brit J Peychol 5:916 8 544, * (BA 


19:371) 

[24 ] 
[Moray House Tests of Intelligence.] Ages 10-12.0 
Or I0-12.5; 1934-37 ; 5 forms; withdrawn from publica- 
tion in 1047; 45(60) minutes; Moray House Tests 10, 
11a, 13, and 20 by Godfrey H. Thomson and colleagues ; 
University of London Press Ltd. 
G) MORAY HOUSE TEST IO. Ages I0-I2.5; 1934. 
b) MORAY HOUSE TEST IIA. Ages 10-12.0; 1934. 
C) MORAY HOUSE TEST I3. Ages 10-12.0; 1935. 
d) MORAY HOUSE TEST BJ. Ages I0-12.5; 1937. 
£) MORAY HOUSE INTELLIGENCE TEST 23. Ages 10-12.0; 
no date; Alexander S. Mowat and William G. Emmett 
in consultation with Godfrey H. Thomson. 


REFERENCES 


1. Ropcer, Arran G. “The Application of Six Group Intelli- 


rence Tests to the Same Children, and the Effects of Practice.” 


Brit J Ed Psychol 6:291-305 N '36. * (PA 11:1521 
2, THOMSON, Goprrey H. What Are Moray core Tests? 
onm ie niveri of London Press Ltd., [no date]. Pp. 8. 


Patrick Slater, Chief Research Officer at the 
Social Survey and Honorary Psychometric Psy- 
chologist at St. Thomas's Hospital, London, 
England. All children educated by local educa- 
tion authorities under the English Ministry of 
Education undergo an examination between the 
ages of eleven and twelve; their performance is 
used in deciding what kind of secondary educa- 
tion would be best for them. English and arith- 
metic are the obligatory subjects, but nearly all 
authorities use an intelligence test as well. Most 
authorities employ a psychologist, and it is fre- 
quently one of his (or her) duties to prepare 
the test. But others make use of the Moray 
House tests, which are specially prepared for 
this purpose by a team of psychologists work- 
ing under the direction of Professor Godfrey 
Thomson. According to Thomson, the number 
of children to whom these tests are given is 
of the order of 400,000 per annum. 

“In carrying out this work,” Thomson writes, 
“the makers of Moray House tests have them- 
selves got much satisfaction and pleasure, and 
perhaps some reputation and, in the case of the 
younger, material for degree theses and the like. 
But they have not made any money out of it: 
for from the beginning all fees and royalties 
have been paid into a fund (nowadays adminis- 
tered by trustees) which is devoted to the fur- 
therance of research in education and particu- 
larly to research into the improvement of tests.” 

Meticulous care thus goes into the prepara- 
tion of the tests, and they are pre-eminently suit- 
able for children of normal mental ability aged 
from ten to twelve. After some 20,000 children 
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have been tested, permanent conversion tables 
are set up, giving the IQ corresponding to a 
given score on each test, and the test is there- 
after supplied to Directors of Education only, 
for use in their areas, and under promise that 
adequate care will be taken to preserve secrecy. 
A few of the tests which have served a large 
number of authorities in this way have been 
released to the general public, among them the 
tests enumerated here. 

They consist of problems of types used since 
the beginning of this century in verbal tests 
of intelligence—directions, analogies, classifica- 
tion, sentence completion, series completion, 
jumbled sentences, deciphering codes, and mis- 
cellaneous problems including some arithmetic. 
There are no sections; problems of all kinds are 
presented one after the other in their order of 
difficulty, Any teacher can administer them after 
studying the manual of directions. Perhaps in 
view of their extensive circulation more atten- 
tion to simplifying scoring methods, eg., by 
mechanical means, would be justifiable. 

For the age groups and the purpose intended, 
no better tests are published in the British Isles. 
But they do not measure intelligence independ- 
ently of other aptitudes, for success in them 
depends also on special abilities in the use of 
words and numbers. Particular precautions are 
necessary if they are to be introduced into ex- 
periments involving factor analysis, and gener- 
ally speaking they are unsuitable for this pur- 


pose. 
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Multi-Mental Scale. Grades 3-16; 1925-904 ipn D 
2; $1 100; 25¢ per specimen set; 20(30) min- 
a3 : ‘Vitam A. McCall; Bureau of Publications, 
Teachers College, Columbia University. 


REFERENCES 

1. Metar, WinLIAM A. anp His Stovents, "The Multi- 
Mental Scale." Teach Col Rec Led O 'as. * 

2. McCatt, WILLIAM A.; BINSON, CLARA; STRIBLING, 
Frances; LERCHE, MARIE; Hurcninos, MABEL; Lono, Ertza- 
petit; Garrison, N. L.; LONG, JOHN A.; BARTHELMESS, Harriet 
M.; Morrison, Nevure C.; OESCHLER, HAZEL; Saxnrontvs, I. C.; 
Faux, Rrra; Lazar, Mat; Owen, Rena; Donor, Amy B.; Mor 
PAT, Grace W.; Scuun, BELL; Kao, Kyuin SAN; AND OPPER- 
MANN, qued. EON of Mone Scale." Sum- 
mary. 1 eaci ol ec 27: 41 ja 26. 

7 CowarNs, W. D. V [o rison of Intelligence Tests.” 
sch & Soc 27:298-300 Mr 10 '28, * (PA 2:2340) 4 

4. Witty, PAUL A., AND TAYLox, J. F. “Some Results of the 
Muiti-Mental Test" J Ed Psychol 20:299-302 Ap '29. * (PA 


* ae J. C., anp Hause, Ext». “On the Reliabili! 
rived Scores Yield 
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Tests.” J Ed Psychol 24:371-9 My 
8. PuiLLIPS, Arexanver J. "Further Evidence Regardin; 
Weighted Versus Unweighted Scoring of Examinations." Ed 


Psychol Meas 3:151-5 su '43. * (PA 18:1925) 
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D. A. Worcester, Chairman, Department of 
Educational Psychology and Measurements, 
The University of Nebraska, Lincoln, Ne- 
braska. This scale, published in 1925, consists of 
100 groups, each of five words one of which 
is to be crossed out as not belonging with the 
others. While the type of response is the same 
for each item, at least 16 different mental sets 
are required. The subject is not told but must 
"divine" the necessary point of view in each 
case. Each word in the group is calibrated for its 
degree of correctness. Great care was given to 
the choice of the items and the statistical treat- 
ment was much more thorough than was usually 
given to tests at that time. The scale was vali- 
dated against several of the best intelligence and 
achievement tests then in use. 

Tt is hard to convince users, however, that a 
20-minute test can “be used with equal success 
from grade three through the university" and 
that it will yield grade scores from 1.4 to 30 
with age equivalents from 6.9 to 38. It is doubt- 
ful, too, that many psychologists at present will 
agree that, for the calculation of IQ’s, 20 years 
(CA) should be used as the maturation point 
for intelligence. One wonders if the authors still 
hold to this age, but no revision of the manual 
has been put forward. The scoring key was 
based on the responses of 250 pupils, and the 
adult user is warned that disagreements as to 
the weighting will be due to his sophistication. 
Since the scale is to be used with adults as well 
as with children, it would seem that some adults 
could have been well used in the standardization, 
The idea of the scale is good but revision is 
needed. 
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*Non-Language Multi-Mental Test. Grades 2 and 
over ; 1942; Forms A, B; $3.85 per 100; 25¢ per speci- 
men set; 30(40) minutes; E. L. Terman, William A. 
McCall, and Irving Longe; Bureau of Publications, 
Teachers College, Columbia University. 


5 Rr REFERENCES 

1, TERMAN, E. L. The i 

tional Education Survey Techniques With Speal Emphanis on 
M. Intelligi ic 


^ fe su . 
lished doctor's dissertation, New York University, 1930. 


Carroll A. Whitmer, Director of S, pecial Edu- 
cation, Public Schools, Pittsburgh, Pennsyl- 
vania. According to the manual, this test is 
designed to give an estimate of the intelligence 
of children and adults who do not speak English 
or who are deaf or illiterate. It is administered 
by simple verbal directions or pantomime. Scor- 
ing is done by a marked copy, and the score is 
the number of correct responses. 
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The authors made an original tryout of this 
type of material on 149 WPA employees who 
were also given Otis Self-Administering Tests 
of Mental Ability and found that the two tests 
measured a common function. The standard 
scores obtained in this testing were averaged and 
used as a criterion score for the selection of the 
120 items used in the two forms of the final test. 
Each form is composed of three cycles of twenty 
items arranged in order of difficulty. 

The norms for the test were obtained from 
statistical treatment of the scores of 2,531 New 
York City school children in grades 3 through 
8, selected for socio-economic background and 
intelligence. The table of mental age equivalents 
in ‘the manual ranges from 33 months to 236 
months, Scores at the extremes (above 47, MA 
173 months, and below 20, MA 72 months) have 
been extrapolated. Grade equivalents are also 
given for grade 1.5 to 8.5. Grade equivalents 
below grade 3.4 are extrapolated. 

The reliability of the test for grades 3 through 
8, estimated by the Kuder-Richardson reliabil- 
ity formula, is .86 and .9o for the separate forms 
and .94 for both forms. The authors recommend 
using both forms. 

“In essence, the test requires the same type 
of intelligence performance among its pictorial 
symbols as is required among more conventional 
verbal symbols. The kinds of relationship re- 
quired are of the form of relations of opposites, 
of similar pairs, of part to whole, of whole to 
parts, and of sequences.” This generalization 
quoted from the manual may be interpreted to 
represent a quite different assumption from 
that usually used in the justification of non- 
language tests. Overlapping abilities are usually 
admitted, but it is generally assumed that the 
ability to work with nonverbal symbols repre- 
sents another type of intelligent performance. 

A 30-minute gioup test composed of 60 items 
which purports to differentiate mental age from 
33 months to 236 months assumes almost mirac- 
ulous discriminative power. One point of score 
determines a mental age difference of three to 
six months. 

The motivation of the young or dull subject 
in a test which has all the practice exercises at 
the beginning of the test period is questionable, 
particularly in a test which uses pantomime 
directions and has items arranged in difficulty 
cycles. 

This test may be a serviceable instrument for 
rough survey, but any of the tests standardized 
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on a narrower age range should be a more accu- 
rate measure. 
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Ohio State University Psychological Test. Grade« 
9-16 and adults; 1919-47 ; separate answer sheets must 
be used; prepared by Herbert A. Toops for the Ohio 
College Association. 

a) FORM 2I. 1920-41; 45¢ per test booklet with answer 
pad; $1.80 per 25 answer pads; 75¢ per specimen set; 
worl limit (90) minutes; Science Research Associ- 
ates. 

b) FORM 23. 1920-47 ; IBM ; test booklets may be either 
rented or purchased ; 4¢ rental charge per test booklet; 
5¢ per used test booklet; 8¢ per new test booklet; 334¢ 
per answer pad for stylus marking (nonmachine-scor- 
able); %4¢ per pin stylus; 2¢ per stylus; 3¢ per ma- 
chine-scorable answer sheet; 6¢ per electrographic 
pencil ; 5¢ per hand-scoring stencil; scoring service for 
machine-scorable answer sheets is available at 3¢ per 
test plus postage; earlier forms available include Form 
I9 (1935), Form 20 (1937), Form 21 (1941), and 
Form 22 (1943); work limit (120) minutes; Ohio 
College Association. 
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J. P. Guilford, Professor of Psychology, The 
University of Southern California, Los Angeles, 
California. [Review of Form 21.] This well- 
known examination has gone through numerous 
revisions, and in these revisions some of the 
most exacting test-construction procedures have 
been employed. Its three parts have survived re- 
peated item-analysis and validity studies. With 
a reported validity coefficient of .68 (against the 
criterion of “Point Hour Ratio"), the total score 
would seem to offer the best predictions now 
available for an over-all academic-aptitude in- 
strument at the college level. It is wisely set up 
as a power test, all students being permitted to 
attempt all 150 items. The reliability estimate 
(N = 300) is .93. 3 

Part 3, a reading comprehension test, may be 
interpreted on the basis of separate norms pro- 
vided. The author should have presented data 
on intercorrelations of the three parts, however, 
so that the user can judge whether the separate 
scoring of part 3 is justified. Owing to the high 
degree of verbal-factor variance that seems 
evident in all parts, one would expect very high 
intercorrelations. No estimate of reliability of 
part 3 scores is provided. 

In spite of the unusually high validity coeffi- 
cient, which probably rests on the fact that 
verbal ability is by far the greatest contributor 
to average scholastic achievement as measured, 
it is time to question whether other qualities 
that are really important, particularly in the 
more specialized curricula and courses, should 
not receive more consideration. The Ohio exam- 
ination, while achieving its unequalled validity, 
may be the result of a vicious circle which has 
overstressed verbal ability in education. In order 
to gain more light on this point, we need meas- 
ures of achievement that are more in line with 
genuine objectives. It will then probably be 
found by the same procedures by which the 
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Ohio test has been brought to its present degree 
of refinement that a much more varied content 
is desirable, both for the selection of students 
and for their guidance. It is time for us to think 
more in terms of academic aptitudes than of 
academic ability. It is time for us to validate 
tests less often against grade-point averages and 
more often against specific course achievement 
properly measured. 

The norms provided with the test are based 
upon samples of adequate size, though not rep- 
resenting the national scope that is true of the 
American Council on Education Psychological 
Examination. Norms are conveniently given for 
each year of high school as well as for college 
freshmen. 

The printing format of the test booklet is 
largely determined by the convenient pinhole 
scoring system, but something might still be 
done to avoid the extremely long lines of print, 
a feature that psychological studiés have con- 
demned as a reading handicap. There is no evi- 
dence that this affects the value of the test re- 
sults, but it is a matter of reading comfort. 


For reviews by Louis D. Hartson, Theos A. 
Langlie, and Rudolf Pintner of Form 20, see 
38: 1051. 
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XOral Directions Test: An Intelligence Test on 
Phonograph Records. Adults; 1945-46; 2 styles of 
vinylite recordings; each record set includes manual, 
plastic-covered key, and roo answer sheets; $16 per 
E record (3374 rpm) set; $17 per standard 
record 78 rpm) set; $4 per 100 answer sheets ; 28(35) 
mime Charles R. Langmuir; Psychological Cor- 
poration. 


Charles D. Flory, Rohrer, Hibler and Rep- 
logle, New York, New York. This test purports 
to be a measure of the general intelligence of 
adults, and especially of adults in industrial 
situations. The major features of this test are 
simplicity of administration, range of coverage, 
and ease of scoring. 

The Oral Directions Test is unique in that 
both directions and questions are given by 
phonograph reproductions. The recordings are 
available in two styles: (a) the Transcription 
Record which is a double-faced, 16-inch disk 
to be reproduced at 3314 rpm, and (b) the 
Standard Records, consisting of four double- 
faced, 12-inch records for use on a standard 
phonograph. In the standard model the test is 
broken by natural pauses so that the records 
- can be changed without disturbing the continu- 


ity of thought for the subjects. Both styles of 
recording are made on vinylite discs, which are 
practically permanent. Directions are provided 
so that the test can be given from printed script, 
but this procedure defeats one of the major 
advantages of the test. 

The items are intended to fit an unselected 
adult population but in some instances favor the 
city dweller. The author contends that “illiter- 
ates can be tested in the same group with high 
school graduates without embarrassment to the 
examiner or the applicant.” The range of diffi- 
culty among the items suggests the possibility 
of using this instrument in a screening pro- 
cedure preparatory to the application of more 
rigorous classification techniques. 

The test is composed of only 30 items, but 
through weighted scoring a maximum of 58 
points can be earned. The scorer is instructed 
to memorize the key, which can be done very 
quickly. The weights per item vary from 1 to 10, 
with 23 items receiving only one point. 

This test, because of its uniqueness, should 
be given further experimental use. There are 
several points, however, at which critical users 
will undoubtedly raise questions. Few psychol- 
ogists will agree with the author when he says, 
“Any available office worker is qualified to ad- 
minister the test.” The attitude of subjects is 
important even when a test is self-administering. 
Desirable attitudes for reliable test results are 
not produced by “any office worker.” 

The norms have been established by testing 
768 Army enlisted men who were selected as 
representative of all enlisted men. A table of 
general population equivalents was devised by 
comparing the Oral Directions Test scores with 
the Revised Alpha Examination, Form 5 scores 
for these 768 men. Tt is evident from the equiva- 
lent scale that the Oral Directions Test dis- 
criminates less well at the upper intelligence 
levels than it does at the lower levels. The 
spread in points for the upper 50 per cent is 
only r1 while the spread for the lower: 50 per 
cent is 24 points. Logic also leads to questions 
about norms in which half of the people tested 
in an industrial group were in the lowest 30 per 
cent of the general population and the average 
for those who graduated from high school was 
only slightly above the 3oth percentile on the 
equivalent scale. These results suggest need for 
more typical standardizing data. 

This test seems to the reviewer to be heavily 
weighted with the perceptual aspects of intelli- 
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gence and relatively light on judgment items. It 
is therefore possible that the test has greater 
industrial application to special jobs than as a 
general screening test. Only research can answer 
this question. The tempo is so slow in places 
that brighter subjects consider the tasks tedious. 
The complexity is likely, however, to cause some 
unselected adults to lose interest or to answer 
at random in an unsupervised situation. 

It seems likely that an examiner who has 
thirty minutes of testing time will prefer, in 
most situations, a test with more items and items 
which will probe more facets of intellectual 
competence. 


Irving Lorge, Professor of Education, and 
Executive Officer, Institute of Psychological 
Research, Teachers College, Columbia Univer- 
sity, New York, New York. The subtitle for 
this test is “An Intelligence Test on Phono- 
graph Records.” Basically, the test is an orally 
administered omnibus-form intelligence test. 
The item types include simple directions, figure 
recognition, digit span, memory for detail, 
number completion, and comprehension. Among 
the items are several requiring spatial orienta- 
tion and direction. Such items seem to cause 
a special difficulty to corrected left-handers. 

The manual suggests that the test may be 
given without phonograph records by a trained 
examiner, but implies that the records will be 
superior to such administration. There can be 
no doubt that the record controls the stimuli 
and the timing, yet there may be some difficul- 
ties with the recording. First, the record is 
spoken by a male whose lower register may not 
be understood by older women. Second, occa- 
sional misunderstandings may affect the relia- 
bility and validity of the test, e.g. the different 
meanings for “blocks” in Item 14 (where 
“streets” or "squares" would be the preferred 
localism), the enunciation of the letter “N” in 
Item 20, and the confusion of “X” and “J” in 
Item 18, Such possible auditory disadvantages, 
however, will be relatively small for a young 
adult group provided that the amplification 
system is adequate. 

The idea of an orally administered intelli- 
gence test is not new. As a matter of fact, the 
first test of the Army Group Examination 
Alpha is an orally administered directions test. 
The oral administration has the advantage of 
controlling time on each question so that can- 
didates are not penalized by spending too much 


time on any single question. The test can be 
given to a wide range of ability without undue 
ennui on the part of able candidates or undue 
frustration in duller people. 

The reliability of the test is reported on the 
basis of the correlation of first half by second . 
half as .&r. The manual suggests that the first 
half of the test may be given as a short form. 
As such, the test would be basically a rough 
screening test with a rather large standard 
error. The total test would have a reliability 
of about .9. 

The validity is reported on the basis of the 
correlation of the Oral Directions Test with 
Wells’ Modified Alpha Examination Form 9. 
The correlation for various samples hovers 
about .84. 

The norms are reported on the basis of the 
method of equal centiles. The author had avail- 
able data for a representative sample of en- 
listed men on the Army General Classification 
Test and Wells’ Revised Alpha Examination, 
Form 5. Since the relationships between Wells’ 
Forms 5 and 9 were available, the author pro- 
duced norms for a specified sample of the popu- 
lation. Since the norms were produced as a first 
approximation under the stress of a war situa- 
tion, it is hoped that the author and publisher 
will produce a better set of normative data soon. 
The test undoubtedly will show sex and age dif- 
ferences. Since the enlisted population is trun- 
cated at the low end of the ability continuum, and 
truncated on age, the currently available norms 
do not represent the general population. Auxil- 
iary norms for a specialized group of clients in 
a community advisory service center are also 
given. 

The relationship of the test score to age and 
sex is given in terms of mean differences be- 
tween semidecades for men and women. As 
may be expected with an auditory speed test, 
significant differences exist between age groups. 
There is a significant age decrement. The man- 
ual, however, does not suggest any correction 
for age nor does it suggest any clinical evalu- 
ation of lower scores in older groups. Sex dif- 
ferences are reported. These, however, cannot 
be evaluated adequately, since there is no evi- 
dence as to the representativeness of the sam- 
pling of successive age groups. Some of the 
poorer performances of the older women may 
be attributable to the fact that older women do 
not understand male speech patterns as well as 
men or younger women. 
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The test is simple to give and fairly easy to 
score. Except for possible minor misunder- 
standings of the speech pattern, the test should 
be fairly adequate for large scale screening 
projects. 


William W. Turnbull, Secretary of the Board 
and Head of the Test Construction Department, 
College Entrance Examination Board, Prince- 
ton, New Jersey. pEscRIPTION. This test is sub- 
titled “An Intelligence Test on Phonograph 
Records,” and was designed as “a test of gen- 
eral mental ability which would be simple, valid, 
and practical for use in testing adult groups.” 
The complete test, both directions and ques- 
tions, has been recorded on unbreakable discs. 
It consists of thirty items, some of which are 
subdivided, affording a total of 58 scorable 
units, The subject’s task is to listen to each ques- 
tion, remember it, solve it, and record the 
answer, The questions sample a wide variety of 
abilities. Memory plays a large part in almost 
all questions, but ability to follow complex 
verbal directions, spatial reasoning, number 
facility, and ability to concentrate are also im- 
portant. 

SCORING. All questions are answered by writ- 
ing a symbol, number, or word in a specified 
position on the answer sheet; and thus the test 
cannot be scored by machine or with a stencil. 
The scoring key consists of an answer sheet, on 
which the correct answers and rules for scoring 
variations have been overprinted in red, which 
is sheathed in celluloid—an excellent protective 
device, 

NORMS. Percentile equivalents of raw scores 
are given to provide norms for the general 
population, as represented by a controlled sam- 
ple of 768 Army enlisted men. Further norms 
are given for 258 male applicants for vocational 
and educational guidance at a community ad- 
visory service center. More extensive norms are 
obviously needed, and will no doubt be gathered 
as the test finds wider application. 

VALIDITY. Correlations in the low .8o's have 
been established between scores on this test and 
scores (total, verbal, and numerical) on the 
Modified Alpha Examination, Form 9. The 
population included over 2,500 adult men and 
women, whose mean scores on the ODT fell 
between the 20th and 3oth percentiles of the 
norms for the general population. When given 
to a group of some 3,000 adult applicants for 
employment in a war plant, the test showed a 


total score difference of approximately one 
standard deviation (12 raw score points) be- 
tween applicants whose education had extended 
to grade 8 or below (mean score at 13th percen- 
tile), and those who had attended high school 
(mean score at 34th percentile) ; and a raw 
score difference of 7 points between the high 
school group and those who had attended col- 
lege (mean score at 59th percentile). It is unfor- 
tunate that both studies were made with groups 
whose mean ability was markedly below aver- 
age, since the question of the test's validity 
for the general population is thereby left in 
doubt. Since the test seems to be most discrim- 
inating at the lower levels, the reported validity 
may well exceed the actual validity for the gen- 
eral population. 

RELIABILITY. The reliability is estimated (by 
controlled split-half technique with 737 cases) 
as .93, which is excellent in view of the un- 
speeded nature of the test. The corresponding 
standard error of measurement is 3.3 raw score 
points, compared with a raw score standard 
deviation of 12 points. It should be noted that 
the mean score of the population on which the 
reliability is based fell below the 35th percentile. 
Thus for the general population the reliability, 
like the validity, is probably lower than the 
figure reported. 

MANUAL. The manual is well set up, and gives 
adequate information concerning the construc- 
tion and standardization of the test, and about 
the studies on which reliability and validity 
figures are based, as well as full directions for 
giving and scoring the examination. A copy of 
the script accompanies the manual. 

EVALUATION. This test seems likely to serve 
its purpose very well when applied to a relatively 
low-level group. It is doubtful if it is difficult 
enough for effective use with a group of above- 
average ability. The norms show a spread of 
24 raw score points between the sth and soth 
percentiles, but of only 11 points between the 
5oth and 9sth percentiles. The test material it- 
self shows every evidence of careful prepara- 
tion, The recording is distinct, the directions 
clear, the items unequivocal. The presentation 
on records has a considerable degree of face 
validity for use with adults at low ability levels, 
since in most work situations their instructions 
will be given orally. The oral feature is advan- 
tageous also in testing memory. One of the ob- 
jects in recording the test is to standardize the 
wording and timing of the directions, and so to 
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avoid errors of administration in these respects, 
Against this advantage will have to be counted 
the differences which will be introduced by vary- 
ing excellence of play-back equipment and by 
differences in room acoustics. 


J Consult Psychol 11:282 S-O °47. 
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O'Rourke General Classification Test, Senior 
Grade, 1942 Edition. Grades 12-13 and adults; 1927- 
42; Form L; $3.25 per 50, postpaid ; 25¢ per specimen 
set, postpaid; 40(55) minutes; L. J. O'Rourke; Psy- 
chological Institute. 
M REFERENCES 

1 WI ». LS i i T 
DA yea SIM H7! EMT] of Clerical Tests.” Per- 

2, DAVIDSON, CHARLES M. ^ Evaluaton of Clerical Tests," Per- 
sonnel J 16:57:04. Te 787. (PA 11:4713) 
E aa E bel E DARIN TEPE 
ces.” J Appl Psychol pner onbres 

Marion A, Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Connecti- 
cut, This 40-minute mental alertness test con- 
sists of 105 items of eight types, two of which 
are vocabulary, two of logic, one of arithmetic 
of the problem type, one information, one spell- 
ing, and one grammar. Each type becomes more 
difficult as the test progresses, the different types 


-appearing in cycles. The directions are clearly 


given, the norms are based on approximately 
three thousand males. 

Its advantages are that it is easily adminis- 
tered and contains enough variety of items so 
that a person of high school level has a chance 
to demonstrate ability. 

Its disadvantages are that the various types 
of ability cannot be scored separately, and the 
mixture of different types of items might con- 
fuse the less mentally alert. 


[247 ] 
Otis Classification Test, Revised. Grades 4-8; 
1923-41 ; Forms R, S, T ; consists of an achievement 
test and an intelligence test (the Otis Quick-Scoring 
Mental Ability Tests, Beta Test) ; $1.70 per 25; 35¢ per 
Specimen set, postpaid; 60(70) minutes; Arthur S. 
tis; World Book Co. 
REFERENCES 

1. Banker, Howarp J. “Correlation Studies of the Student's 
zu Index." J Ed Res 20:31-7 Je 'a9. * (PA 3:4621) 

2. Sancent, Ruta F. “The Otis Classification Test—Form A, 
Part IL" Teachers Forum. 453053 N '31, * (PA £3") 
Confined in the Restevky State Raten J ASPI Areko] 
24:628-32 O '40. * (PA 15:987) P D. deis 

Ed Res B 21:186-8 S 16 "42. William J. 
Jones. The manual..contains more figures, 
tables, and charts for determining indexes, 
norms, and various types of ages than for any 
other test known to the present writer. The 


test, which is a combination mental-ability and 
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achievement test, yields not only raw and stand- 
ard scores, but also MA, EA, IQ, EQ, CI, PR, 
AR, IE, and MEA! For those who revel in 
statistical gymnastics, the possibilities of the in- 
terpretative charts included in the manual are 
well-nigh limitless. The 80 items comprising 
the mental ability test (Part I) are the same as 
those in the widely used Otis Quick-Scoring 
Mental Ability Tests. The material consists of 
opposites, analogies, proverbs, vocabulary, rea- 
soning, mixed sentences, similarities, and the 
like. The achievement test of 115 items (Part 
II) is largely made up of items dealing with 
language, arithmetic, geography, history and 
civics, physiology and hygiene, music, and gen- 
eral information which were included in the 
original Otis Classification Test, published for 
the first time twenty years ago. * It can almost 
be said that the test is self-administering and 
self-scoring; it is so practicable. It is necessary 
merely to pass out the booklets, allow the pupils 
time to study the first page with a minimum of 
directions, and then let them go ahead and take 
the test, allowing thirty minutes for each of the 
two parts. * One of the important assumptions 
upon which the test is based is that innate 
mental ability and achievement based on school 
training are different, and can be measured sep- 
arately. Part I is therefore designed to rule out 
or hold constant the influence of the school by 
including only “elements of their environment 
that are common to all children," while Part II 
is made up of items which depend on “knowl- 
edge gained in school." Due to the fact that the 
reported coefficients between Parts I and II 
average .79, it seems unnecessary, even unwise, 
to figure out the relationships between scores 
on the separate parts (as suggested in the man- 
ual) since the magnitude of the r shows that 
they measure the same ability to a large extent. 
Dependable interpretations cannot be drawn 
from the resultant spurious relationships. In 
this connection it is interesting to note that in 
compiling the test norms it was necessary to 
make up fictitious mental ages for certain scores 
in order to have a “more nearly uniform corre- 
spondence between mental ages and scores." Be- 
cause of this, the manual recommends determin- 
ing the intelligence quotient by addition or 
subtraction, instead of division. For the reasons 
just mentioned, it would be better not to take 
the recommended statistical manipulations too 
seriously. The total score which the test yields, 
however, is a worth-while index which can be 
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— used along with other criteria by those who 
sometimes find it desirable to classify pupils. 


[ 248 ] 
Otis General Intelligence Examination: Designed 
Especially for Business Institutions. Applicants 
for clerical and executive positions; 1920; I form; 
$1.20 per 25; 35¢ per specimen set, postpaid; nontimed 
(40) minutes; Arthur S. Otis; World Book Co. 


Frederic Kuder, Professor of Psychology, 
Duke University, Durham, North Carolina. 
As a collector’s item this test (copyright date, 
1920) is of interest. Adequate evidence of its 
worth, however, is conspicuously lacking. For 
example, there are no data on the reliability of 
the test given in the manual. Validity was de- 
termined by correlating scores of 100 clerks 
with the combined ratings of “intelligence.” 
Correcting for attenuation in the criterion, a 
coefficient of .73 resulted. The size of this sam- 
ple, derived from a single business concern, 
and the dubiety of the criterion measure make 
this less than sufficient demonstration of valid- 
ity by present-day standards. Norm tables are 
not titled to show the number of cases on 
which they are based. Finally, no bibliography 
is provided in the manual to permit the test 
user access to information concerning empirical 
studies of the test. 

The test bears considerable resemblance to 
the newer Otis Self-Administering Tests of 
Mental Ability. Many of the items are similar 
to those incorporated in the latter test. How- 
ever, there are also several items so ambiguous 
in construction as to lack reliability entirely. 
Significantly, these items do not occur in the 
later well-validated tests. 

This test thus stands in the same relation to 
the well-established tests of general ability as 
the crystal set to the frequency modulation 
radios—it might deliver, but the consistency 
and quality of performance is doubtful. It can- 
not compete with the Wonderlic Personnel Test 
nor its lineal descendants, the Otis Self-Ad- 
ministering Tests of Mental Ability and the 
Otis Quick-Scoring Mental Ability Tests. 


[ 249 ] 
Otis Quick-Scoring Mental Ability Tests. Grades 
L.54, 4-9, 9-16; 1936-39; IBM for grades 4 and over; 
ae 4 forms; 3 levels; Arthur S. Otis; World Book 
0. 


a) ALPHA TEST. 
$1.50 per 25; 
DEA 
,BETA TEST. Grades 4-9; 1937-39; IBM; revision of 
Otis Self-Administering Tests of Mental Ability, In- 
termediate Examination; Forms A, B, CM, DM; $1.10 


Grades 1.5-4; 1936-38; Forms A, B; 
35€ per specimen set, postpaid; 20(25) 


per 25 (Forms A, B) ; $1.30 per 25 machine-scorable 
tests (Forms CM, DM) ; 35¢ per specimen set, post- 
paid; 70¢ per 25 machine-scorable answer sheets; 20¢ 
per machine-scoring key; 30(35) minutes. A 
c) GAMMA TEST. Grades 9-16; 1937-39; IBM; revision 
of Otis Self-Administering Tests of Mental Ability, 
Higher Examination; Forms AM, BM, C, D; $1.10 
per 25 (Forms C, D) ; $1.35 per 25 machine-scorable 
tests (Forms AM, BM); 35¢ per specimen set, post- 
paid; 7of per 25 machine-scorable answer sheets ; 20¢ 
per machine-scoring key; 30(35) minutes. 
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pervisory Workers in the Aircraft Engine and Propeller Indus- 
tries" Appl Psychol 29:185:90 Je as. n (PA 19:8198) e 
Qdodeats With College Aptitude est uino J Ed Psychol 
a E S aea onL Apalit ofthe 
Silent Reading Abilities of Acoustically Handicapped Chil- 
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Predicting Scholastic Achievement.” J Ed Res 41:1338 O 47. " 

Frederic Kuder, Professor of Psychology, 
Duke University, Durham, North Carolina. 
ALPHA TEST. These picture tests appear attrac- 
tive enough to gain and hold the attention of 
a jaded seven-year old. By an ingenious plan, 
it is possible to use these tests for both non- 
verbal and verbal testing, using the same items 
and test blanks. The author of the tests recom- 
mends that both tests be given for increased 
reliability. 

From the standpoint of convenience of ad- 
ministration the Alpha tests combine the fea- 
tures of self-administration with rapid scoring 
by means of a hand stencil for the nonverbal 
test. The verbal tests require the recitation of 
test items by the teacher or examiner. The 
amount or quality of reading skill of the ex- 
aminees does not affect their test scores. Mul- 
tiple-choice items composed of series of pictures 
graduated in difficulty from familiar, concrete 
objects to more abstract “geometrics” make up 
the test content. 

Unlike the Beta and Gamma Tests, these are 
not revisions of the Otis Self-Administering 
Tests of Mental Ability but the method of de- 
termining item inclusion is the same, i.e., per 
cent of “passes” by groups of students making 
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rapid progress against those making slow 
school progress. Further evidence of validity is 
offered in reports of correlation of the Alpha 
Tests with the Primary Examination (sic) and 
Grade Placement, the biserial coefficients being 
.65 and .86, respectively, with total score. 

Reliability for total score is reported as .81, 
computed from total scores of Form A and 
Form B, for grade range of 2 to 3. This reli- 
ability is not impressively high ; however, con- 
sidered with regard to the high variability of 
performance in young children, the limited 
sampling of abilities and the narrow range of 
grades, the reliability appears acceptable for 
group prediction or appraisal. The reliability 
compares favorably with other nonlanguage 
tests for about the same grade range: Haggerty 
Intelligence Examination, Delta I, reliability 
reported as .79; Chicago Non-Verbal Exami- 
nation, coefficients ranging from .80 to .93; 
Pintner-Cunningham Primary Test as .89 and 
.93 (N = 22 and 23). It would be indeed 
hazardous to attempt prediction and control in 
the individual case from information derived 
from these test scores, but the reliability seems 
adequate for group comparisons. 

Scores may be reported in terms of “IQ’s” 
and “MA’s” with norms based on nearly 18,000 
cases of school children. 

Since test scores will be influenced by visual 
acuity, especially the discernment of differences 
among the “geometrics,” a welcome addition 
to the tests would be some simple determination 
of eye efficiency such as is provided on the 
California Test of Mental Maturity. 

BETA TEST. These revisions of the Otis Self- 
Administering Tests of Mental Ability are 
designed for either machine scoring or hand 
scoring by means of a stencil. 

The reliability, as computed by the correla- 
tion of comparable forms, yields a combined 
coefficient of .96, based on grades 4 to 9, in- 
clusive. However, it is observed that the relia- 
bility coefficient for the grade levels taken 
separately is an average coefficient of .79. 
Therefore, the accuracy of measurement for 
any individual examinee is considerably less 
than might be inferred from the coefficient of 


The only advantage these tests offer over the 
Otis Self-Administering Tests is that of rapid 
scoring. Where a small group of papers, 25 to 
50, is to be hand-scored the gain is small in- 
deed and must be balanced against the in- 


creased time required for oral instructions. 
Furthermore, the answer sheets of Forms CM 
and DM require rather complex manipulation 
and may result in lowering of scores for the 
less test-wise members of the group. 

The tests would be improved if practice re- 
sponses were made on the answer sheet proper 
rather than in an analogous, but not identical, 
manner. Moreover, a more specific check of 
the accuracy of sample responses would in- 
crease the assurance of both the examiner and 
the examinee that the method of response is 
understood. The writer has in mind the well- 
known American Council examinations wherein 
“doing” and demonstrating are effected on the 
answer sheet proper. A further flaw in the 
design of Forms CM and DM is the procedure 
of giving the oral instructions with the test 
material ‘exposed to the roving eye. The ex- 
aminer may control the actual marking of the 
answer sheet in advance of the “Go” signal, 
but it would not be possible to control the read- 
ing of items by the student who requires little 
or no instruction. Since these disadvantages do 
not occur in Forms A and B of the Beta series, 
it is believed that they offer forms preferable 
to Beta CM and DM. Forms A and B do not, 
however, permit machine scoring. 

Neither these tests nor the Self-Administer- 
ing group instruct the examinee concerning the 
advisability of guessing on doubtful test items. 
This question frequently arises in practice, par- 
ticularly among the test sophisticates aware that 
various methods of counting “wrongs” exist. 
The absence of any definite instruction in this 
matter may result in the test measuring other 
factors in addition to test content. The major- 
ity of well-constructed tests in the field are 
explicit on this point. Since there is no infor- 
mation provided in the manual on this matter, 
it is recommended that, in the interest of com- 
parability of scores, the examiner urge against 
guessing. 

For reviews by Psyche Cattell, F. Kuhl- 
mann, Rudolf Pintner, and C. Spearman, and 
excerpts from reviews by J. A. Long and one 
other, see 38:1053, 40:1413, and 40:1427. 


[ 250 ] 
Otis Self-Administering Tests of Mental Ability. 
Grades 4-9, 9-16; 1922-28; Forms A, B, C, D; $1 per 
25 of either level; 35¢ per specimen set, postpaid; 
30(35) minutes or 20(25) minutes; Arthur S. Otis; 
World Book Co. 
a) INTERMEDIATE EXAMINATION. Grades 4-9. 
b) HIGHER EXAMINATION. Grades 9-16. 
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Frederic Kuder, Professor of Psychology, 
Duke University, Durham, North Carolina. 
For purposes of prediction of school and col- 
lege success these tests compare favorably with 
other measures of general ability. Recent stud- 
ies have indicated a high relationship between 
the American Council on Education Psycho- 
logical Examinations and the Otis Self-Ad- 
ministering Tests, with the latter test showing 
as high, or higher, relationship to college 
grades. There is some evidence to indicate that 
when the IQ’s on the Otis and the familiar 
individual intelligence tests are compared, the 
IQ’s of the Otis will correspond more closely: 
with those of the Wechsler-Bellevue than 
those of the Stanford-Binet. (The IQ's for the 
three tests are, of course, computed on the 
basis of different postulates.) 

The Otis tests are among the easiest and 
most economical to administer and score of all 
the general ability tests. A consideration of 
these factors of cost and administrative ease 
brings to mind the Wonderlic Personnel Test. 
This “restandardized abridgement of the Otis 
S-A test" was standardized on employees and 
some student groups. It may be given as a 
speed test (12 minutes) or a power test. The 
reliability of this shorter test is somewhat 
lower than that of the Otis Self-Administering 
Tests. Correlations between the Wonderlic and 
the Otis are not as high as between various 
forms of the Otis tests. A good feature of the 


Wonderlic Personnel Test is the correction of 


score to allow for the reduced speed of older 
adults. Wonderlic furnishes a table for adding 
points, according to age, to the scores of ex- 
aminees more than thirty years old. The use 
of the Otis tests in adult counseling centers 
suggests the need for deriving similar norms 
for various age levels of the population, since 
current norms are based on young people pre- 
sumably working at optimum functioning level. 
The current norms do not permit the compari- 
son of adults in one age group with adults in 
other age groups which have experienced a 
greater or less amount of the known diminu- 
tion of mental alertness as measured by a speed 
test. 

The procedure of administering the Higher 
Examination on a 20-minute basis with scores 
transmuted into terms of 30-minute scores, ac- 
cording to a table in the manual, has stimulated 
some research into the accuracy of such con- 
versions. The results of these experiments are 
not entirely conclusive but generally imply that 
the conversions are not applicable to all age 
and ability groups. In addition to the inaccu- 
racies of measurement obtained on the 20-min- 
ute basis, there may be inadequate measure- 
ment where certain types of items are grouped 
near the end of the test and are unlikely to be 
attempted by examinees tested at the 20-minute 
rate. For example, in the Higher Examination, 
there are three or four questions based on a 
geometric figure, presumably designed to meas- 
ure some factor related to space perception; 
these items all appear near the end of the test, 
and only there. It is thus likely that this par- 
ticular function would not be tested because the 
examinee’s time ran out before he reached the 
items. 

The manual is more comprehensive than 
most in describing procedures of standardiza- 
tion, various methods of interpreting scores, 
devices for wide and varied application in 
schools, and even some elementary instruction 
in statistical techniques. It is, however, seri- 
ously deficient in bibliography. The increasing 
use of tests, with consequent stimulus to re- 
search, indicates the need for a list of refer- 
ences to be incorporated in the manual. 
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*Perception of Relations Scales. Adults; 1944-46; 
1 form; $1 per 25; nontimed (30-60) minutes; Educa- 
tional Test Bureau. 
G) PERCEPTION OF RELATIONS SCALES. 1044; 25¢ per 
specimen set; M. J. Van Wagenen. 
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b) ADVANCED PERCEPTION OF RELATIONS SCALES. 1946; 
IBM; 25¢ per specimen set; Lindsey R. Harmon and 
M. J. Van Wagenen. 


Charles D. Flory, Rohrer, Hibler and Rep- 
logle, New York, New York. “The Perception 
of Relations Scales are designed to provide a 
quick, simple, valid means of predicting the 
learning ability of adults, particularly in per- 
sonnel work where a complete testing program 
is impractical.” These tests are intended by the 
author to be a short intelligence test, a screen- 
ing instrument to be used with adults in indus- 
try or education. 

The tasks provided require an understanding 
of the relationship between two elements that 
are given and the detection of a similar rela- 
tionship between a third given element and one 
of five possible choices, There are sixty items 
scaled in difficulty. The relationships involve 
time, place, identity, integral parts, quality, and 
cause. The difficulty of the relationship pre- 
sented is intended to exceed the difficulty of 
the vocabulary or information required for 
solution of the item. 

The performance of 18-year-old men and 
women has been used as a basis for determin- 
ing norms. The raw scores of 1,000 subjects 
were scaled so that raw scores can be trans- 
mitted directly into per cent placement, stand- 
ard scores, or percentile rank. The scales cover 
approximately ten probable errors. The ad- 
vanced scales arranged by Van Wagenen and 
Harmon are similar in form and content to the 
original scales but are more difficult. In the 
original and simpler scales, the middle score 
for 18-year-old subjects is the 5oth percentile 
while on the advanced scales, the middle item 
was answered correctly by only 10 per cent of 
the 18-year-old group. The authors therefore 
feel that the advanced scales are well adapted 
to use with students above the secondary level 
and at the supervisory-executive levels in in- 
dustry. 

These scales are practically self-administer- 
ing. They can be given individually or in 
groups. The scoring can be done manually in 
less than one minute per test. The advanced 
scales are also arranged for machine scoring. 
The scales are given as a power test rather than 
a speed test. The time suggested is about thirty 

minutes but wide variation occurs with rapid 
workers finishing in half the suggested time 
or less and the slowest workers requiring more 
than twice the suggested time. This feature is 


" 


a definite handicap in large-scale industrial 
testing where the measurement of intelligence 
is only one part of a selection or placement test 
battery. 

Meager data are supplied concerning the 
standardization and validation of these scales. 
The statement is made that validation results 
from the use of outside criteria but no hints 
are given concerning these criteria. The items 
are purported to discriminate both from grade 
to grade and between pupils varying in ability 
within each grade. None of the statistical evi- 
dence supporting these contentions is to be 
found in the manual. The norms are reported 
as tentative. 

The test is intended for adults but stand- 
ardized on 18-year-old subjects. The assump- 
tion is made that gains in ability to sense the 
types of relationships involved have stabilized 
by age eighteen. Ability to handle analogies in 
contrast to vocabulary proficiency does appear 
to reach its peak in the late teens but there 
must be some decline in the ability to perceive 
relationships during adulthood. This point has 
not been treated in the interpretative data ac- 
companying these scales. 

Analogies have long been used in the meas- 
urement of intelligence, but these scales have 
the weakness of any single-type test such as 
vocabulary, memory, or number as an index 
of intellectual competence. Research has yet 
to demonstrate whether the short intelligence 
test should sample a single aspect of intellectual 
functioning or seek to sample the various abil- 
ities that compose intelligent action. 
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*Personnel Classification Test [Henderson]. 
Adults; 1946; IBM; 1 form; separate answer sheets 
must be used ; $20 per 100; $2 per 100 machine-scorable 
answer sheets; 35€ per scoring stencil; 25¢ per single 
copy; 25€ per manual; Robert W. Henderson; the 
Author, University Personnel Service, University of 
Kentucky, Lexington, Kentucky. 

Brent Baxter, Director of Personnel Re- 
search, Chesapeake and Ohio Railway Com- 
pany, Cleveland, Ohio. This test appears to 
have been been published prematurely as far 
as being a standardized test is concerned. The 
test has two parts, verbal ability and arithmeti- 
cal ability, but there is no data about their re- 
liability, relationship, or validity. The norms, 
based on 100 persons of an atypical group, are 
quite inadequate. No studies of the validity of 
the test are available. A single reliability co- 
efficient based on an unspecified group is given. 
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A system of adjusting total scores for the 
length of time used to complete the test is 
offered, but it is based on "rule of thumb" 
developed from the scores of 25 people. 

"Therefore, in its present form this test is 
not an effective contribution to industrial test- 
ing. As the author collects future evidence, 
however, he may then be able to offer a useful 
tool. At present, it represents only a typical- 
looking group of items on vocabulary, general 
information, and arithmetic ready for evalu- 
ation. 


John C. Flanagan, Professor of Psychology, 
The University of Pittsburgh, Pittsburgh, 
Pennsylvania, According to the manual, this 
test was designed to measure mental ability in 
an employment or work situation. There is no 
time limit and the test is designed to be used 
with a standard five-choice answer sheet. The 
first part contains approximately 100 items, 
with alternate items testing information and 
word knowledge. The second section of the 
test is composed of 25 problems "selected to 
measure practical arithmetical ability." 

Although no time limit is used in adminis- 
tering this test, a table is provided for allow- 
ing speed credit or demerit points in accordance 
with the number of minutes required to com- 
plete the test. This table is to be used only if 
it is desired “to determine the speed score.” 

The norms presented were determined from 
100 clients of a private employment agency. 
Ages ranged from 17 to 50. Eighty per cent 
were high school graduates, 17 per cent were 
college graduates. One half were men. The 
manual states: “They represent many occupa- 
tions and should not be considered as typical 
of any one employee group.” It is stated that 
the norms are illustrative and that “after a 
sampling has been made, local norms [should] 
be established that will be representative of the 
group being studied.” 

The manual also states the odd-even relia- 
bility of the test in one study was .9r. The 
number of cases included in this study and the 
results for other studies are not presented. 

The test items are short and appear likely to 
differentiate between individuals with varying 
amounts of vocabulary information and arith- 
metical skill. However, they do not seem to be 
quite so carefully edited as some in the better 
published tests of this type. For example, one 
of the items states: "Which substance will not 


conduct electricity ?", and the answer given is 
"rubber." Although there should be no diffi- 
culty in identifying this as the intended an- 
swer, it would be more accurate to say that it 
is a poor conductor of electricity rather than 
that it is not a conductor of electricity. Simi- 
larly, an item regarding mercury gives the an- 
swer that it is a liquid, and other choices are 
“gas” and "solid." It would be more accurate 
to state that under ordinary conditions of pres- 
sure and temperature mercury is a liquid. 

One misprint was noted ; one of the options 
for "thyroid" was printed as “figament.” Pre- 
sumably "ligament" was intended. 

The arithmetic items include not only ques- 
tions which require arithmetical computations 
but also questions measuring an understanding 
of numbers, such as, "Which is the smaller 
fraction?”, and, “25 divided by zero equals.” 

In conclusion, it may be stated that this set 
of items will tend to differentiate people on 
much the same basis as many currently pub- 
lished tests of general intelligence. The fact 
that it has been designed for use without time 
limits represents a possible advantage, but since 
usable norms have not been provided, the test 
user could as easily utilize other test materials 
and develop his own norms for the no-time- 
limit situation. The items tend to be somewhat 
more practical than academic; and the test 
should prove useful to persons interested in 
obtaining a combined measure of vocabulary, 
information, and arithmetical abilities for gen- 
eral classification purposes, 


[ 253 ] 
Personnel Classification Test [Wesman]. Grades 
8-16 and adults; 1947; Forms A, B ; $1.50 per 25, post- 
paid; 35¢ per specimen set, postpaid ; 28(35) minutes; 
Alexander G. Wesman; Psychological Corporation. 


J Consult Psychol 11:340 N-D ’47. Intended 
primarily for industrial and business use, this 
short general ability test yields three scores: 
verbal, numerical and total. Part I consists of 
40 multiple-choice verbal analogies items, in 
which the first and fourth terms of the analogy 
must be chosen, with the second and third 
terms given. Part II contains 20 arithmetic 
computation items. Reliabilities range from .77 
for 93 college freshmen, to .92 for 97 life in- 
surance salesmen. Norms are supplied for sev- 
eral educational and occupational groups. “Va- 
lidity” is indicated by the correspondence of 
test scores with educational level and with an- 
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other group intelligence test, and by data show- 
ing a relationship to supervisors' ratings of 
chain store clerks. The manual stresses the value 
of the separate verbal and numerical Scores for 
occupational placement. 


[254 ] 
*Pintner General Ability Tests: Non-Language 
Series. Grades 4-9; 1945; may be administered in 
pantomime; Forms K, L; $2.30 per 25; I5¢ per pan- 
tomime directions; 35¢ per specimen set, postpaid; 
50(60) minutes; Rudolf Pintner; World Book Co, 


Carroll A. Whitmer, Director of Special 
Education, Public Schools, Pittsburgh, Penn- 
sylvania. The nonlanguage series of the Pint- 
ner General Ability Tests was projected to 
parallel the author's verbal series as closely as 
Possible in “mechanics of standardization and 
general structure.” The advanced form has not 
been published. 

The intermediate test consists of six sections, 
namely, figure dividing, reverse drawings, pat- 
tern synthesis, movement sequence, manikin, 
and paper folding, Although the total working 
time on the test is 50 minutes, the directions 
are quite elaborate and include practice exer- 
cises for each section of the tests, The author 
recommends that the test be administered in 
two sittings. The test is hand scored with a 
cutout stencil. 

The manual of directions gives a detailed 
discussion of construction and standardization 
of the tests and the interpretation of scores. 

The test items were selected for their dis- 
criminative power. The scores for over 6,000 
children in the four yearly age groups from 
974 to 1314 were used in the standardization. 
The children were selected by taking all the 
members of three consecutive school grades in 
eleven communities from different sections of 
the United States. 

Reliability data for the two forms of the 
test are presented as split-half correlations of 
858 and .890 corrected by the Spearman- 
Brown formula and probable errors of meas- 
urement of 4.1 and 3.7. The standard devia- 
tions of median scores are 16,0 and 16.4. 

_ A full-page discussion of the validity of the 
test is given in the manual in terms of the uses 
~ for which the test is recommended. These are 
“as a supplement to the Pintner Verbal Test 
in order to get a better measurement of all- 
round mental ability”; as a measure of the type 
of mental skill which functions in School and 
life situations involving shop work, drafting, 


and vocations of a mechanical nature; and as 
a substitute for verbal tests in measuring the 
hard-of-hearing, deaf, and foreign-language- 
speaking. Some statistical evidence is pre- 
sented in defense of the validity of the test for 
the last-mentioned purpose, but research is 
recommended for the establishment of the 
validity of the test for the second purpose 
stated. 

Five different end scores may be obtained 
from the raw score on this test. The first of 
these, the subtest standard score, is not recom 
mended for separate use with present evidence. 
The other end scores are median standard score, 
mental age, deviation IO, and percentile rank. 
The use of these various scores is discussed in 
the manual, 

It is gratifying to the reviewer to find a test 
manual as detailed as this one and a nonlan- 
guage test which is as well constructed as this 
one. It appears to be the best test in its field, 
but like all group tests which cover a wide age 
range, it should prove most efficient in the 
middle of the age range for which it is in- 
tended. 
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Pintner General Ability Tests: Verbal Series. 
Grades kgn-2, 2-4, 4-9, 9+ ; 1923-46; 20¢ per manual; 
World Book Co. 
d) PINTNER-CUNNINGHAM PRIMARY TEST. Grades kgn- 
2; 1923-46; Forms A, B, C; $145 per 25; 35¢ per speci- 
men set; Rudolf Pintner, Bess V. Cunningham, and 
Walter N. Durost. 
b) PINTNER-DUROST ELEMENTARY TEST. Grades 2.5-4.5; 
Scales 1 (requires no reading) and 2 (requires read- 
ing) may be used separately or together; Forms A, B; 
$2.00 per 25 Scale 1; $1.60 per 25 Scale 2; 35€ per speci- 
men set; (45) minutes per scale; Rudolf Pintner and 
Walter N. Durost. z 
c) PINTNER INTERMEDIATE TEST. Grades 4.5-9.5; 1931- 
g: revision of Pintner Intelligence Test; Forms A, B; 

1.70 per 25; 35¢ per specimen set; $1.20 per 25 
machine-scorable answer sheets 3 45(55) minutes; 
Rudolf Pintner, 
d) PINTNER ADVANCED TEST. Grade 9 and above; 1938- 

; Forms A, B; $170 per 25; 35¢ per specimen set; 

duet per 25 machine-scorable answer sheets; Rudolf 

intner. 
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Diferent Intelligence Tests. Unpublished in need vat 


versity of Washington, 1942. (Abstracts of Theses . . . 


143 De 189.) 

Stanley S. Marzolf, Professor of Psychol- 
ogy, Illinois State Normal University, Normal, 
Illinois. Four batteries of subtests make up this 

"series. The primary battery (of which Form A 
is a slightly modified version of the original 
Pintner-Cunningham Primary Tvst) consists of 
seyen subtests, each involving responses to pic- 
torial materials. The elementary battery is made 
up of two parts or scales which are published in 
separate booklets, The first part, Picture Con- 
tent, has six subtests composed entirely of pic- 
türes. Scale 2, Reading Content, also has six 
parts, namely vocabulary, number sequence, 
analogies, opposites, logical selection, and arith- 
metic reasoning. Separate norms are available 
for these two scales. The Intermediate and Ad- 
vanced Tests have eight subtests each, namely 
those included in the elementary battery and, 
in addition, a classification test and a best- 
answer test. Both the Intermediate and Ad- 
vanced Tests are adapted for use of a hand- 
scoring stencil or for machine scoring. Except 
for the Primary Test, standard score norms are 
provided for all subtests, and the general level 
of ability is the median of the subtest scores. 
Mental-age equivalents of standard scores are 
provided. Ratio IQ's may be computed in the 
usual way, but the use of deviation 1Q’s is 
urged. In addition to the usual instruction man- 
ual provided with each battery, a Manual for 
Interpreting the Intermediate and Advanced 
Tests is obtainable. 

This last-named manual is very complete and 
satisfactory. It contains material which will be 
profitable to the users of the primary and ele- 
mentary batteries also. It explains the functions 
which intelligence tests may serve, describes 
the standardization procedures, reports sub- 
test intercorrelations, explains standard scores, 
gives norms, and describes the procedure for 
computing IQ’s. 

According to this manual, a valid intelli- 


1942, 


gence test for school use must sample'a broad 
range of the effects of the child’s total environ- 
ment so as to assist in predicting future school 
achievement. This practical approach defines 
intelligence as that which enables a child to do 
school work, yet is not dependent upon’ the 
school experience itself. Validity is to'be tested 
by logical analysis of the items, intercorrela- 
tions between the subtests, and correlation: 

with school achievement. t 

Whether a variety of abilities has been sam- 
pled is determined by intercorrelations of the 
subtests of the intermediate and advanced bat- 
teries. The median intercorrelation for the 
former is .58 and for the latter, .43. However, 
when the contribution of the vocabulary test . 
is partialled out these medians drop to .30 and 
.24 respectively. No subtest intercorrelation 
data are given for the Primary and Elementary 
Tests. The Picture Content and the Reading 
Content scales of the latter correlate .70 ap- 
proximately. 

Some data on correlation with achievement 
are given. For 168 fifth-grade children, a cor- 
relation with total scores on the Metropolitan 
Achievement Test of .84 was found. For 209 
seventh-grade children the correlation was 785. 
At the college level, test scores were found to 
correlate .59 with grades in all courses, 77 
with the Cooperative English Test, and «56 
with the Cooperative General Achievement 
Test (12). Correlations with other intelligence 
tests range from .71 (Binet) to .87 (Otis). The 
Primary Test was found to correlate 63 with 
reading test scores of first-grade pupils. No 
data are given for the most recently developed 
battery, the elementary. 

Lack of perfect correlation with school 
achievement is explained on the basis that the 
“tests are not supposed to be measuring iden- 
tical abilities." In view of the purported aim 
of the tests it would be more to the point to 
make specific mention of such factors as the 
fallibility of school marks, differences in pre- 
vious school training, and differences in moti- 
vation which doubtless account for part of 
achievement variance. 

Frequent reference is made to the compara- 
bility of scores at one level with those at an- | 
other. It is claimed that the scores are com- 
parable because the subtests have been selected 
to make this possible. Emphasis of this point 
had better be delayed until factor analysis stud- 
ies give empirical evidence for it. 
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The reliabilities obtained by the split-half 
and interíorm methods for the various bat- 
teries are, in the majority of cases, in excess 
of .90. Sources of reliability data are given in 
all instances, 

Standardization has been based on “approxi- 
mately 100,000 tests from widely separated 
parts of the country.” Further collection of 
scores for normative purposes is now in prog- 
ress, 

The computation of deviation IQ's is amply 
explained and illustrated. For the Intermediate 
and Advanced Tests a monograph which facili- 
tates computation of IQ’s and centile equiva- 
lents is provided, 

This series is one of the best available for 
school use. The tests are easy to give and score. 
Raw scores are easily converted to a norma- 
tive form. The same score system, standard 
Score, mental age, and deviation IQ, is used 
throughout the series, The attempt to make the 
tests comparable at all grade levels is com- 
mendable, even though empirical evidence that 
this has been accomplished is lacking. 


D. A. Worcester, Chairman, Department of 
Educational Psychology and M. easurements, 
The University of Nebraska, Lincol n, Nebraska. 
The above tests constitute a series running 
from the kindergarten age to maturity. Norms 
on the advanced test are given for MA's to 2r, 
but apparently the standardization was on 
public school pupils. 

The primary test, originally published in 1923 
was revised in 1938, though the changes re- 
quired for this "thorough revision" seem to 
have been very few. Seven subtests, all using 
pictures or drawings are included, involving 
common observation, aesthetic comparison, as- 
sociated objects, discrimination of size, picture 
parts, picture completion, and dot drawing. 
The test requires detailed instructions and ac- 
curate timing, in seconds, on many items. This 
may account for the fact that in some instances 
children secure higher scores on this than on 
some other tests; inexperienced testers are 
likely to allow a little more than the stated 
times. 

The elementary test is in two parts, one en- 
tirely of pictures, the other employing read- 
ing. The parts are meant to be as nearly as 
possible equivalent, but inspection suggests 
that they are by no means wholly so. Correla- 
tions between them are substantial but not 


high. The attempt is to measure on this test 
the same elements as are included in the inter- 
mediate and advanced tests. The administra- 
tion of the test is quite complicated ; there are 
many, detailed oral instructions each with its 
separate timing. Like the primary test, good 
results necessitate an examiner not only ex- 
perienced in testing but in handling groups of 
children. Some of the pictures are not as clear 
as is desirable. 

The intermediate and advanced tests each 
have eight subtests. All are timed but with lim- 
its so liberal, intentionally, that they are not to 
be considered as speed tests. The materials of 
the tests are on the whole of the kind that one 
finds in most of the conventional intelligence 
tests. 

Each test of the series has received careful 
statistical treatment and the statistical findings 
are given in the manuals. Norms for the tests 
are articulated with each other, making pos- 
sible comparable measures at the various age 
levels. Scores may be interpreted in almost any 
way which the user may wish: standard scores, 
ratio or deviation IQ's, percentile ranks, men- 
tal ages, or grade equivalents. Machine scoring 
is available for the intermediate and the ad- 
vanced tests. While the task of administering 
these tests is somewhat greater than that for 
some of the tests constructed more recently, 
there is evidence that they have been con- 
structed with care and may be employed with 
good results. 


For excerpts from reviews by Roger M. 
Bellows and J. Wayne Wrightstone, see 
40:1416. 
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Pintner Non-Language Primary Mental Test. 
Grades kgn-2 ; 1929-30; $2.20 per 25; 30¢ per specimen 
set; Rudolf Pintner; Bureau of Publications, Teachers 
College, Columbia University. 


REFERENCES 


1. PiNTNER, RupoLr. "Results Obtained with the Non-Lan- 
guage Group Test." J Ed Psychol 15:473:83 N '24. * 

2. Brown, Anvrew W. “The Correlations of Non-Language 
Tests With Each Other, With School Achievement, and With 
Teachers’ Judgments of the Intelligence of Children in a School 
for the Deaf." J Appl Psychol 14:371-5 Ag '30. * (PA 5:880) 

3. PiNTNER, R. ^A Group Intelligence Test Suitable For 
Younger Deaf Children." J Bd Psychol 2215603 My "31. * (PA 
5:35! 

4. PiNTNER, RupoLr. “The Influence of Language Back- 
Nou Es R Tests.” J Social Psychol 3:235-40 My '32. 

13793. 

5. SENOUR, A. C. “Necessity for Use of a Non-Language 
Mental Test in Group Intelligence Testing.” J Ed Res 27:435- 
41 F '34. * (PA 8:2816) 

. GanrH, Tuomas R., AND Smitu, Owen D. “The Perform- 
ances of Full-Blood Indians on Language and Non-Language 
Intelligence Tests.” J Abn & Social sychol 32:376-81 O-D '37. 
* (PA 12:3064) 


337 ] 


TESTS & REVIEWS: INTELLIGENCE—GROUP 


256 


Psyche Cattell, Clinical Psychologist, Lan- 
caster, Pennsylvania, This test was devised for 
use in testing the intelligence of kindergarten 
and first- and second-grade children in situa- 
tions where a test which requires no knowl- 
edge of written or spoken language is desired. 
All the directions are given in pantomime. The 
children’s responses consist of marking or com- 
pleting pictures and diagrams in the record 
booklet. 

_The data regarding the validity and relia- 
bility given in the mimeographed manual of 
directions are somewhat meager, making it dif- 
ficult to evaluate the tests. Mental-age norms 
are given for ages 4 9/12 to 9 7/12 years, but 
the number of cases on which they are based is 
not reported. The lowest norm given is for 
only two correct responses, and the highest, 
for a perfect score. Obviously a mental-age 
norm which is based on a near-zero score or a 
near-perfect one cannot be a reliable measure 
of ability. No distribution of scores is given 
to aid the examiner in determining within what 
range the scores will give a reasonably reliable 
mental-age measure. 

Correlating the odd and even items and cor- 
recting for the length of the test gave a relia- 
bility coefficient of .go for the first-grade and 
86 for second-grade children. A validity co- 
efficient obtained by correlating the mental ages 
of 80 kindergarten children on this test and on 
the Stanford-Binet is .61. Using this test and 
the Pintner-Cunningham Primary Test, the 
coefficient was only .51, even though the range 
here included three school grades. No infor- 
mation regarding the distribution of children 
within these grades is given, nor any informa- 
tion regarding the socio-economic level of the 
groups. 


Carroll A. Whitmer, Director of Special 
Education, Public Schools, Pittsburgh, Penn- 
sylvania. The purpose of this test is to provide 
a means of testing children with all types of 
language background, from good or poor 
homes, without permitting language facility to 
influence the test results. The directions for 
the test are given entirely in pantomime. The 
examiner must not use any verbal supplement 
to the pantomime. 

The manual warns that this test is much 
more difficult to administer than the ordinary 
test with verbal directions and recommends 
that the examiner rehearse the directions two 


or three times before attempting to give the 
test, It is the reviewer's experience that it takes 
many more than two or three practice rehear- 
sals to become proficient in administering test 
directions to five- and six-year-olds in deaf- 
mute style when they are not accustomed to 
that type of performance. 

The arrangement of test material for this 
test is quite similar to that used in the older 
Pintner-Cunningham Primary Test. This type 
of arrangement with large figures and a small 
number of items on a page has a great advan- 
tage over the crowded page so frequently seen 
in paper-and-pencil tests intended for young 
children. The appropriate responses to be made 
by the child are done with a large pencil and 
do not require finer co-ordinations than should 
be reasonably expected of children beginning 
school. 

The score on the test is the number of cor- 
rect responses. Raw scores are converted to 
mental ages by the use of a table in the man- 
ual. Mental ages on the test range from 4 years 
9 months (raw score 2) to 9 years 4 months 
(raw score 74). IQ's are computed in the 
usual manner. Ít is immediately apparent that 
the test has little range at the low end. Even 
though the test is suitable for children in the 
age range of 4.5 to 5.5, the test does not meas- 
ure the younger kindergarten child whose abil- 
ity is much below average. Nor does it meas- 
ure the six-year-old whose mental retardation 
is more than a year. 

The test manual presents very little statisti- 
cal data, Split-half reliabilities of .90 for III 
first-grade children and .86 for 129 second- 
graders are presented without mention of 
whether or not a correction formula was used. 
The evidence for validity offered in the man- 
ual consists of correlations of .61 with Stan- 
ford-Binet mental ages for 80 kindergarten 
children, .78 for 28 other cases not identified, 
and .31 with Pintner-Cunningham mental ages 
of 154 children in kindergarten to third grade. 

The scores from this test should be used 
with great caution because the test samples a 
very limited range of a very limited kind of 
ability. The many uncontrolled variables in the 
administration of a group test at the primary 
level become even more significant when a 
strange form of test administration is used. 
The test may serve a good purpose as a rough 
screen when a large number of children of the 
type for which it is intended must be exam- 
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ined quickly. Certainly, individual testing with 
more comprehensive tests would be desirable if 
any important decisions depended upon the 
test results. 
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*Profion Dealltwriaeth Cyfaddasiad Cymbraeg. 
Ages 7-10; 1946; a Welsh edition of the Ryburn Group 
Intelligence Tests for Junior Children, 1936; 6s. per 
20; Is. per manual; 4d. per single copy; 30(40) min- 
noU H. V. Clark; Robert Gibson & Sons (Glasgow) 
td. 
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Progressive Matrices: A Perceptual Test of In- 
telligence, 1938. Ages 6 and over; 1938-43; 1 form; 
£8 8s. per 25; Is. 6d. per key and norms; 3s. 5d. per 100 
record forms; 15s. per specimen set; (60) minutes; 
J. C. Raven; H. K. Lewis & Co., Ltd. 
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Walter C. Shipley, Professor of Psychology, 
Wheaton College, Norton, Massachusetts. This 
is a nonverbal intelligence test designed to 
measure Spearman's "G" factor (eduction of 
relations). It comprises sixty untimed multiple- 
choice problems, each consisting of a design or 
"matrix" from which a part has been removed. 
The testee examines the matrix and chooses 
the correct part for completing it. The easier 
problems draw principally on discrimination; 
the more difficult ones, on reasoning by anal- 
ogy. The problems are divided into five series, 

‘each developing a different theme, with twelve 
problems to a series arranged in order of diffi- 
culty. Choices are recorded on a separate an- 


swer sheet so that the test booklets can be used 
over and over again. Norms, expressed in per- 
centiles, are based on data from 1,407 children 
and 3,665 men. Statistical data on reliability 
are not given. 

The test has much to commend it. Since the 
content is limited to highly abstract material 
which is largely foreign either to the classroom 
or to everyday experience, it is highly prob- 
able that what is being measured relates more 
directly to “native” abstract intelligence and 
less to academic achievement, educational op- 
portunity, or cultural background than is the 
case with most tests of general intelligence. 
Striking attestation to its probable value is the 
fact that in World War II, a more difficult 
form of the test was administered by the Brit- 
ish as a first sieve for all naval candidates and 
as a part of their standard army battery applied 
at primary training centers, 

In many ways this test is resembled by Cat- 
tell’s Culture-Free Test. Because of the rather 
striking similarity between the two with re- 
spect to both rationale and content, the re- 
viewer finds it difficult to recommend one over 
the other. For many purposes they would ap- 
pear to be virtually interchangeable. The pres- 
ent test, however, seems to require less time to 
administer, and to be applicable over a some- 
what wider range. Both appear to fill an urgent 
need for a highly discriminating, nonverbal and 
culture-free test of general intelligence. 


David Wechsler, Chief Psychologist, Psy- 
chiatric Division, Bellevue Hospital, New York, 
New York. This test consists of a graded 
series of logically designed patterns (matrices) 
serving to measure what the author variously 
calls “innate eductive ability,” “eductive intel- 
ligence,” or simply “eduction.” Eventually, 
however, they are offered as a test of general 
intelligence and as such will be reviewed in this 
article. 

The Matrices are divided into 5 sets, each 
involving a somewhat different principle but 
all requiring the subject to supply a missing 
part to a design by choosing one of six possible 
inserts. The “themes” of the successive sets or 
subtests are: (a) continuous patterns, (b) fig- 
ure analogies, (c) progressive alteration of 
patterns, (d) permutations of figures, and (e) 
resolution of figures into constituent parts. In 
an earlier form of the test, the first two sub- 
tests were put out as formboards (intended 
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for use with young children), but as now avail- 
able all items are contained in a 6-page printed 
booklet. 

The Progressive Matrices may be adminis- 
tered as either a group or an individual test 
with percentile norms available for each of the 
subtests as well as for the test as a whole. 
There are also separate norms, as are seem- 
ingly necessary, for the individual as against 
the group form of the test. The age range is 
from 6 to 1374 years (individual form) and 
from 8 to 14 (group form), with a single 
adult group. The adult group is made up ex- 
clusively of males (British soldiers). 

In appraising the Progressive Matrices it 
seems well to emphasize three points. In the 
first place the Progressive Matrices are one of 
the few or relatively few new tests that have 
come from psychological laboratories in the 
last fifteen years. By new I mean contextually 
different. The second important point is that 
the Matrices do seem to be excellent tests of 
intelligence. I use the word seem, because the 
author, while claiming high correlations with 
other tests of intelligence including the Stan- 
ford-Binet, has only published one coefficient 
of correlation (.60), that obtained with the 
Mill Hill Vocabulary Scale. The last point is 
that the Progressive Matrices present what 
may be called the first comprehensive attempt 
to measure intelligence in terms of a single 
intellectual function, to wit, that of visual per- 
ception. 

The main limitation of the Progressive 
Matrices is that they attempt to measure gen- 
eral intelligence (or even eductive ability) 
through a single modality of performance. To 
this there. are serious theoretical objections; 
but leaving these aside there still remains the 
practical consideration that there is no test, 
no matter how excellent, but that certain indi- 
viduals may fail to do justice to themselves be- 
cause they simply are not good in doing the 
test or type of task used. The Progressive 
Matrices are no exception. A second limita- 
tion of the test is that it has a rather low ceil- 
ing. Considering that there are 60 items, the 
mean test score for each age group (age 12 
and above) increases negligibly or not at all; 
the average adult score is the same as that of 
subjects at age 14. Findings of the Progressive 
Matrices are thus in line with results obtained 
with most other sensory and perceptual motor 
tests which, at higher levels at least, turn out 


to be relatively undifferentiating. There is yet 
another problem which will confront the author, 
and that is the probable rapid decline of scores 
on the test with ages beginning with adult- 
hood. It is possible that the identical scores 
obtained for average adults and the 14-year- 
olds are due to the fact that the adult group 
included a considerable number of subjects 
whose scores already showed this age decline. 

The above strictures notwithstanding, the 
Progressive Matrices should be a welcome 
addition to psychometrics because they con- 
stitute a real contribution to the field. 


For a review by T. J. Keating, see 40:1417. 
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Revised Beta Examination. Grades 3 and over; 
1931-46; 1 form; a French edition is also available; 
$3.50 per 25, postpaid; 35¢ per specimen set, postpaid ; 
15(45-60) minutes; 1946 restandardization by Robert 
M. Lindner and Milton Gurvitz; C. E. Kellogg assisted 
by N. W. Morton; Psychological Corporation. 
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Raleigh M. Drake, Professor of Psychology 
and Head of the Department, Kent State Uni- 
verity, Kent, Ohio. There is a real need for a 
group non-verbal general intelligence test which 
can be given to illiterate adults, In spite of the 
amount of work which has been devoted to the 
Army Beta, including the present restandardiza- 


tion, this test does not fulfill this need ade- — 


quately, although it may do about as well as 
most other tests made for the same purpose. 

No changes of any kind have been made in 
the test section. The only significant modifica- 
tions are confined to assigning equal weights to. 
the six subtests and to constructing a new table 
of norms by employing Wechsler's method of 
computing IO's. The authors (Lindner and 
Gurvitz) of the 1946 restandardization report 
an r of .92 between Beta IQ’s and Wechsler 
IQ's for 192 male prisoners. If this were typical, 
and if the reliability could be taken as satis- 
factory, the test would be very valuable since ~ 
it can be given to groups at a great saving of 
time and money over individual tests. 


x 
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Many items are ambiguous, as pointed out by 
Porteus in an earlier review (40:1419). This is 
true of Test 3, Item 11 in which one is required 
to select the one wrong item out of four pre- 
sented—at least three of the items are equally 
wrong. Items 18 and 20 in the same section are 
poor because of unclear detail in the drawings. 
In Test 6, the last item in the pictorial simi- 
larities is drawn differently but is scored as 
being the same. Some of the pictorial items 
depend upon information about, or experience 
with, tools, which makes them inappropriate for 
women. This fact should be kept in mind in con- 
nection with the comments that follow because 
the Army Beta was specifically constructed for 
males and no attempt has been made to adapt it 
for use with the opposite sex. 

The reviewer gave the Revised Beta Exam- 
ination to a group of college women and found 
the following: (a) A reliability coefficient of .30 

(N = 116) by correlating the odd subtests with 
the even subtests. Kellogg and Morton report 
:90, or better, by odd-even items and by sub- 
tests 1, 2, and 3 with subtests 4, 5, and 6. Be- 
cause the subtests are very dissimilar in content, 
and the odd-even method accentuates this dis- 
similarity, it is to be expected that lower self- 
reliabilities would be found when they are cor- 
related with one another than when odd-even 
items are correlated ; but unless it can be shown 
that each subtest is a valid measure of general 
intelligence, we must require that the parts of 
the total be consistent with one another. Test 4 
(geometrical forms) is similar to the now well- 
known Revised Minnesota Paper Formboard 
Test and is no longer considered a good test of 
general intelligence. Test 6 (number and pic- 
ture checking) is essentially the same as the 
Minnesota Vocational Test for Clerical Work- 
ers, which is definitely not an intelligence test. 
What the Army Beta does seem to measure well 
is the ability for close observation. Success in 
Test 1 (maze tracing), Test 3 (detecting miss- 
ing parts of pictures), Test 5 (drawing in miss- 
ing parts of pictures), and Test 6 (similari- 
ties) depends upon quick visual observation, an 
important factor in general intelligence but 
hardly as important as the overloading given it 
in the present test (i.e., four out of six subtests 
are of this nature). The remaining two subtests, 
(Tests 2 and 4, number substitution and geomet- 
tical forms), have not been very successful 
measures of intelligence. The reliability is low 
also because the range for each subtest is from 


the third grade to the adult level, according to 
the publisher's catalog, so there can only be one 
or two items which are really critical at each 
age or ability level, which in effect makes each 
subtest very short. Actually, each subtest re- 
quires only from one and one-half to four min- 
utes working time. (b) These considerations 
may account for the low reliability found in the 
reviewer's group and for the low correlations 
found when the Beta was correlated with other 
tests of mental ability, an r of .58 being obtained 
with the Henmon-N elson Test of Mental Abil- 
ity (raw scores), an v of .30 with the Otis Self- 
Administering Test of Mental Ability (raw 
scores), and ati r of .23 between Beta IQ's and 
Otis IQ’s. (c) The average IQ for the group 
on the Army Beta was 111.06, and for the Otis 
IQ it was 111.45. The average difference be- 
tween IQ’s on the two tests was 9.4 points, and 
the standard deviations were 9.6 and 10.5 re- 
spectively. Considering the difficulty of obtain- 
ing IO's for ages over 16, the two tests agree 
rather well; and if the r of .92 between the Beta 
and the Wechsler-Bellevue reported is repre- 
sentative, we may conclude that the Beta yields 
a usable IO score, considering the purpose for 
which it is intended that it be used, i.e., illiterate 
male adults. (d) Using a long, reliable, and 
finely scaled final examination in general psy- 
chology, Otis, Henmon-Nelson, and Beta raw 
scores yielded r’s of .57, .34, and .19 respec- 
tively. This low r (.19) for the Beta, together 
with the low reliability found for our group of 
women, indicates that, for our group at least, the 
test is much inferior to other group tests avail- 
able. This is partly due to the restricted range 
and the high level of talent for our group, be- 
sides the fact that the test is not intended for 
use with women. 

The 1946 restandardization was based on 
1,225 male white adult prisoners at the U.S. 
Federal Penitentiary at Lewisburg, Pennsyl- 
vania with sampling controlled to represent the 
educational and socio-economic distribution 
found in the 1940 census. The norms are pre- 
sented in convenient tables so that for a given 
weighted score the computed IQ can be deter- 
mined for ages 16 to 59 inclusive. The authors 
wisely warn against assuming that the Beta IQ’s 
are equivalent to Stanford-Binet IQ’s or even 
to Wechsler IO's, but it is doubtful if such a 
warning will be heeded for there is no practical 
way of differentiating them. The test will find 
its greatest usefulness in cases of illiteracy where 
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other superior tests are inappropriate and when 
interpreted in a liberal manner as being only a 
rough approximation to the conventional IQ. 


Walter C. Shipley, Professor of Psychology, 
Wheaton College, Norton, Massachusetts. This 
is not a new test but rather a new and radically 
different standardization of a well-known and 
widely used instrument, the Revised Beta Ex- 
amination, prepared by Kellogg and Morton, 
intended—like the United States Army Group 
Examination Beta from which it was derived— 
to serve as a measure of general intellectual 
ability for persons who are relatively illiterate 
or who are non-English speaking. 

The restandardization, while introducing only 
minor refinements in administrative and raw- 
scoring procedures, involves a very significant 
change in the general procedure for computing 
the IQ. This procedure is patterned directly 
after that originated and employed by Wechsler 
for computing IQ's for the Wechsler—Bellevue 
Intelligence Scale. The procedure involves (a) 
converting the raw scores on the separate parts 
of the test to weighted scores so that each part 
contributes equally to the total score—a method 
which readily lends itself both to the plotting 
of meaningful profiles and to proration in case 
parts of the test must be omitted—and (b) 
allowing, in computing the IQ, for the fact of 
decreasing performance (raw score) with age. 

The standardization sample on which the 
norms are based comprised r,225 white male 
adult prisoners at the United States Federal 
Penitentiary at Lewisburg, Pennsylvania. While 
the criticism might well be made that a prison 
population is not a typical cross-sectional adult 
sample, this objection is in a large measure 
mitigated by the facts that (a) the population 
was carefully selected so that education and 
socio-economic status were in proportion to the 
distributions for white male adults in the 1940 
census and (b) the group was composed of 
individuals considered as improvable offenders, 
i.e., lawbreakers, rather than criminals. 

Although the reviewer has not had occasion 
to use the new procedure, he feels that the appli- 
cation of the Wechsler refinements to the scor- 
ing of the Beta Examination is a highly desirable 
step and thoroughly recommends it. 


For reviews by S. D. Porteus and David 
Wechsler, see 40:1419. 
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*Scale of Nonverbal Mental Ability. Ages 10-12; 
1947 ; 1 form; £3 10s. per 100; Qs. per 12; 1od. per sin- 
gle copy; 6d. per table of norms; rod. per booklet of 
answers; 30(35) minutes; J. W. Jenkins; National 
EIU for Educational Research in England and 
ales. 
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*SRA Non-Verbal Classification Form. Grades 
9-12 and adults; 1946-47 ; preliminary manual; IBM ; 
Form AH (hand scoring) ; Form AM (machine scor- 
ing) ; 10(15) minutes; Robert N. McMurry and Dale 
L. Johnson; Science Research Associates. 
2 FORM AH, $1.90 per 25; 50€ per specimen set. | 
b) FORM AM. $1.30 per 25; $2.35 per 100 machine- 
scorable answer sheets; 50¢ per scoring key. 


J Consult Psychol 11:340 N-D 47. This is 
a brief nonverbal intelligence test of 60 items. 
The items are all of one form (to select the 
“most different" one of five pictures), but in- 
volve two types of tasks: pictured logical rela- 
tionships, and- geometrical relationships. The 
format, scoring methods, and interpretative 
tables are uniform with the SRA Verbal Form 
...and a common manual serves the two tests. 
The reliability is stated as .85. The test will be. 
a useful supplement to verbal mental ability 


tests. 


[ 262 ] 

XSRA Verbal Classification Form, Grades 9-12 and 
adults; 1946-47; an abbreviated edition of the Thur- 
stone Test of Mental Alertness which is an abbreviated 
adaptation of the American Council on Education Psy- 
chological Examination for High Si chool Students; 
IBM; preliminary manual ; Form AH (hand scoring) ; 
Form AM (machine scoring) ; 15(20) minutes; Sci- 
ence Research Associates. J 

a) FORM AH. $1.90 per 25; 50¢ per specimen set. , 

b) FORM AM. $1.30 per 25; $2.35 per 100 machine- 
scorable answer sheets ; 5o¢ per set of scoring keys. 


[ 263 ] 
XTerman-McNemar Test of Mental Ability. 
Grades 7-12; 1942; revision of Terman Group Test of 
Mental Ability, 1920; IBM; Forms C, D; separate an- 
swer sheets need not be used; $1.50 per 253. 35¢ per 
specimen set, postpaid; 85¢ per 25 machine-scorable 
answer sheets; 20¢ per stencil for scoring answer 
sheets; 40(45) minutes; Lewis M. Terman and Quinn 
McNemar; World Book Co. 

REFERENCES 
1-25. See 40:142. 
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Carl I. Hovland, Professor of Psychology and 
Chairman of the Department, Yale University, 
New Haven, Connecticut. The authors are to be 
commended on the thoroughness of this revision 
and restandardization of the Terman Group 
Test of Mental Ability first published in 1920. 

The test consists of 162 items arranged in 
seven subtests: (a) Information, (b) Syno- 
nyms, (c) Logical Selection, (d) Classification, 
(e) Analogies, (f) Opposites, and (g) Best 
Answer. The number of items in each subtest 
is too small to provide separate scores so no 
norms for the separate subtests have been pre- 
pared by the authors, Four and five alternative 
multiple-choice items have replaced the two- 
choice items. Provision has been made for scor- 
ing by means either of a perforated key sheet or 
by machine methods, 

The new tests eliminate the arithmetical and 
numerical subtests used in the original forms. 

& 


This was done because factor analysis had indi- 
cated that these did not scale with the other 
subtests; this revision makes the scores of dif- 
ferent individuals “lie along the same con- 
tinuum." The result is that the test is primarily 
one of verbal intelligence and may be somewhat 
inadequate as a measure of general intelligence. 
The alternative would have been to strengthen 
the quantitative portion and thus obtain two 
scores, one verbal and one quantitative, as in the 
Thurstone American Council on Education Psy- 
chological Examination for High School Stu- 
dents. 

The arrangement of the items of each subtest 
in order of increasing difficulty and the “ample 
time allowance” should make this more a power 
than a speed test, but no data bearing on this 
point are presented. 

The construction appears to have been done 
very carefully and systematically. The item anal- 
ysis was based on 1,200 pupils in the 7th and 
11th grads. The comparability of the old and 
new tests was tested on a sample of 1,400 cases, 
grades 7 to 12, and that of the two forms of the 
new test on a sample of 1,500 pupils in grades 
7, 9, and 11, 

The norms appear to be extremely well stand- 
ardized, based on a 10 per cent sample of results 
from 148 communities in 33 states. Data are 
available for interpreting the results in terms of 
normalized standard scores, mental ages, per- 
centile ranks, and "deviation IQ's." The latter 
requires that the difference be found between 
the obtained standard score and the average 
standard score for other individuals of the same 
age. This difference is then interpreted from a 
table in terms of IQ. This IQ is thus not a 
quotient like the usual IQ but is a general meas- 
ure of brightness. Because the use of a separate 
answer sheet affects performance on the test, 
particularly at the lower ages, separate norms 
are given for interpretation when the separate 
answer sheet is used. This seems to complicate 
matters, but one wonders how often this point is 
neglected in other standardizations. 

The Manual of Directions is unusually com- 
plete and useful. It makes a fairly good compro- 
mise between the simplification required for 
nonstatistically minded test users and the sophis- 
tication expected by research users of tests. 

The reliability of .96 reported for age 14 (and 
said to vary only slightly for other ages) is very 
satisfactory. Reliability has been analyzed in 
terms of split-half correlation, inter-form cor- 
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relation, and the probable error of measurement. 

It is a criticism of the entire test-making busi- 
ness that we have no adequate theoretical formu- 
lation of intelligence against which we can meas- 
ure practical tests to determine validity. It is 
regrettable that such outstanding test construc- 
tors as Terman and McNemar have to say, “The 
best evidence of the validity of the Terman test 
is to be found in its successful use over the 
period of years since the test was first issued. 
Many instances may be cited where the Terman 
test has been used with great success in guidance 
and administration.” The validity of the new 
test is measured by the correlation of .91 be- 
tween it and the original Terman test. 


Robert L. Thorndike, Associate Professor of 
Education, Teachers College, Columbia Univer- 
sity, New York, New York. This test is a re- 
vision of the Terman Group Test of Mental 
Ability which has been one of the standard in- 
struments for testing at the junior and senior 
high school levels for 25 years. The revision 
undertakes to incorporate into the test the ad- 
vances which were made both in the theory and 
in the mechanics of mental measurement be- 
tween 1920 and 1940. 

Tn content, the chief change in the new form 
is its restriction entirely to verbal materials. All 
problems involving numerical or quantitative 
thinking have been excluded. This has had the 
effect both of narrowing and of purifying the 
function measured; that is, the test becomes a 
better measure of a more restricted segment of 
human ability. The choice was one which was 
deliberately made in constructing the test and 
one which can certainly be defended. However, 
the user should realize that it is verbal intelli- 
gence which he is assessing. For a number of 
purposes it may be desirable to supplement this 
test with some other measure specifically aimed 
at quantitative mental operations. 

The superficial form of the examination has 
been modernized so as to permit more efficient 
testing and scoring. The present form consists 
entirely of multiple-choice items. Jt permits 
either hand scoring, using a superimposed sten- 
cil, or machine scoring. 

In the norms of this test, an effort is made 
to bridge the gap between the IQ, with its long 
tradition and wide popular acceptance and mean- 
ing, and the statistically more acceptable stand- 
ard score. The “deviation intelligence quotient” 
is the index of brightness preferred by the au- 


thors. This is essentially a standard score with 
mean for the age group set at 100 and standard 
deviation of any age group set at 16. Thus, the 
numerical values have very nearly the same 
meaning, in terms of degree of excellence, as 
the IQ's on the Revised Stanford-Binet. How- 
ever, the ambiguities of quotient indices, which 
become meaningless as adulthood is reached, 
are avoided. This type of compromise index is 
probably the most satisfactory for use in school 
testing at the present time. 

The item analyses underlying the selection of 
items for the test and the extensive testing to 
establish norms bear witness to careful and thor- 
ough work in the test’s construction. The test- 
ing to establish norms was on a truly imposing 
scale. No indication is given, however, to sup- 
port the representativeness of the 200 commu- 
nities in which testing was carried out to estab- 
lish norms. It is to be hoped that they were as 
adequate qualitatively as they were quantita- 
tively. The homogeneity of the content and the 
care in construction appear between them to 
have yielded a test of quite satisfactory relia- 
bility. 

Allin all, this test appears to be a professional 
and workmanlike successor to the Terman 
Group Test. 


For reviews by Anne Anastasi and Howard 
Easley of the original edition, see 40:1424. 
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*Tests of Primary Mental Abilities for Ages 5 
and 6. 1946; also called SRA Primary Mental Abilities 
for Ages s and 6; 1 form; $2.35 per 25; Sof per speci- 
men set; (60) minutes; Thelma Gwinn Thurstone and 
L. L. Thurstone; Science Research Associates. 


REFERENCES 
1. Tuurstonz, TueLwa Gwinn. Play and Learn: The Red 
Book: Learning to Think Series. Chicago, Ill.: Science Research 


iates, . Pp. 8o. Paper. $0.55. 
AS RLURSTQUE, ELMA l Winx, Teacher's Manual for Play 
and Learn: The Red Book: Learning to Think Series. Chicago, 


Ill.: Science Research Associates, 1948. Pp. iii, 75. Paper. 


0.50. * 
See also references for 225. 


Florence L. Goodenough, Research Profes- 
sor, Institute of Child Welfare, University of 
Minnesota, Minneapolis, Minnesota. With the 
publication of these tests, the Thurstones have 
extended their work on the factor analysis of 
mental ability to the early childhood level. Their 
view of intelligence as a molecular rather than 
a molar trait is exemplified in the definition 
given on page ii of the Examiner’s Manual, 
where they state, “Intelligence may be defined 
as the composite of abilities for acquiring knowl- 
edge of various types.” This obviously calls for 
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a multiple rather than a unitary type of appraisal. 

After presenting this definition, they proceed 
to a brief overview of the primary abilities 
isolated in their previous work with adolescents 
and young adults. This discussion includes a few 
curious statements. For instance on page ii it is 
said that “word fluency involves the ability to 
think of isolated words with little concern for 
their meaning. It is important . . . in occupations 
requiring creative writing or reporting." Some 
"creative writing," it is true, seems based on this 
principle! And of the "space factor" it is noted 
(p. iii) that "the fact that young children have 
had no specific training in this factor suggests 
that it is fundamental to the child's intelligence." 
As stated, this argument is not easy to grasp 
though what is probably meant is that whatever 
ability along this line the child may show is the 
result of innate ability to profit by the casual 
experiences of everyday life rather than of 
formal instruction, 

In devising the new battery, seventy tests, 
"all of the known types of tests for children 
of this age were included—the current intelli- 
gence, reading readiness, arithmetic readiness, 
and other measures of learning ability," to- 
gether with a number of newly devised tests, 
were administered to 200 children in kinder- 
garten and first grade. On the basis of a factor 
analysis, five factors, identified as Verbal-Mean- 
ing (V), Perceptual-Speed (P), Quantitative 
(Q), Motor (M), and Space, (S) were iso- 
lated. Mental age norms for each of the five 
factors separately, and for V, P, Q, and S com- 
bined, were derived from approximately 1,200 
Chicago school children from five to eight years 
old. By extrapolation, standards for ages three 
to nine years were added. 

Approximately 35 items are provided for each 
of the five abilities. These consist entirely of 
pictures to be marked by the children. The pic- 
tures would have been improved in some in- 
stances by the omission of confusing small de- 
tails and greater emphasis upon the salient 
features. Some of the items may be criticized 
on the ground that they are likely to fall outside 
the experience of young children from rural 
areas or small towns. There are others for which 
the choice of a “right” answer seems question- 
able, e.g., one on page 5 where a padlock is to 
be chosen in response to the question, “If you 
do not have a key, which one will lock the 
door?” The picture of a doorknob below which 
is a keyhole that might well belong to a Yale 


lock seems to me an equally sensible choice. 

The instructions for administering and scor- 
ing the tests are clear and sufficiently detailed. 
One or two typographical errors were noted. 
The word “driving” in place of “diving” on 
page 8 might lead to confusion since it is a part 
of the examiner’s instructions to the children, 
though not one of the scored items, No data 
are presented with respect to the self-correla- 
tions of the tests, their intercorrelations, or their 
agreement with other criteria. 
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XThurstone Test of Mental Alertness. Grades 9-16 
and adults; 1943; an abbreviated adaptation of the 
American Council on Education Psychological Ex- 
amination for High School Students; preliminary 
manual; IBM; 1 form; 2 editions; 40(50) minutes; 
Thelma Gwinn Thurstone and L. L. Thurstone; Sci- 
ence Research Associates. 

G) HAND-SCORED EDITION, FORM AH. $1.85 per 25; 6o¢ 
per set of scoring folders; 50£ per specimen set, post- 
aid. 

) MACHINE-SCORED EDITION, FORM AM. IBM; $3 per 
25 machine-scorable test-answer booklets; $2 per set 
of scoring keys. 

REFERENCES 
1. Maner, Howanp, AND Fire, IsaseLLe E. “A Biological- 


Pharmaceutical Checker Selection Program.” J Appl Psychol 
31:469-76 O '47. * (PA 22:3665) 

Anne Anastasi, Associate Professor of Psy- 
chology, Fordham University, New York, New 
York. This is a short adaptation of the Ameri- 
can Council on Education Psychological Ex- 
amination, designed for use with high school 
and general adult populations. It is virtually 
self-administering, all the necessary instruc- 
tions appearing on the test booklet. The test 
proper, consisting of 70 five-response and 28 
four-response multiple-choice items, has a sin- 
gle, over-all time limit of 20 minutes. The total 
testing time, including the instructions and 11 
practice problems which precede the test proper, 
is estimated as 25 minutes, The items are pre- 
sented in spiral omnibus arrangement and in- 
clude: word definitions, same-opposite, arith- 
metic reasoning, and number-series completion. 
Besides a total score, the test yields an L-score 
(Linguistic Ability) and a Q-score (Quantita- 
tive Thinking), based upon the two former and 
the two latter types of items, respectively. The 
layout of the test is exceptionally compact, all 
the items of the test proper being printed on 
the two sides of a single sheet which is detach- 
able for scoring. 

The test is available in a hand-scoring as 
well as an IBM machine-scoring form. Con- 
venient transparent stencils are furnished for 
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hand scoring. For the benefit of those planning 
to use the test, it might be added parenthetically 
that the instructions on the scoring stencils 
ought to read, "Place the test in the (L- or Q-) 
stencil folder so that the page number of the 
test coincides with the page number of the 
stencil.” If one follows the present instructions 
to place the test paper "so-that the lower edge 
of the paper is at the fold of the stencil," the 
markings on test and stencil will not corre- 
spond—at least, they failed to do so in the 
specimen examined by the present reviewer! 
Percentile norms on 3,820 high school students 
attending nine high schools in scattered parts 
of the country are given for total as well as 
for L- and Q-scores. Separate percentiles are 
reported for each of the four years of high 
school. 

No correlations between scores on this short 
form and on the long forms of the American 
Council on Education Psychological Examina- 
tion are reported, nor are reliability or validity 
coefficients of the present form given. The 
present adaptation may be somewhat more 
difficult than the original because of the scram- 
bled arrangement of items which necessitates a 
constant shifting of mental set on the part of 
the examinee. To this should be added the fact 
that the specific instructions for each item are 
omitted in the test proper, having been given 
only in the practice problems. Thus the subject 
must first recognize the type of item and recall 
the corresponding instructions, before he can 
proceed with the item in the usual way. The 
unduly high difficulty level of the test is re- 
flected in the distribution of scores. For exam- 
ple, although a “chance” score on this test 
would correspond to approximately 21 correct 
items, the lowest percentile in the gth, roth, 
11th, and 12th grades extends down to 12, 14, 
18, and 22 items correct, respectively. These 
represent in effect zero scores. The differentia- 
tion among the four high school classes, more- 
over, was not very great, the differences in 
median scores between successive classes being 
614, 6, and 3% items, respectively, out of 98. 
These relatively small inter-class differences 
may suggest that the test is not sufficiently sensi- 
tive to individual differences at the high school 
level, since its unduly high difficulty level tends 
to dilute the scores with a large element of 
guessing or “chance” performance. It should 
be noted that if the test is actually too difficult 
for high school students, then its difficulty level 
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is all the more inappropriate for unselected 
adult populations, whose average education 
falls considerably short of high school gradua- 
tion. It is unfortunate that norms are furnished 
only for a high school population. The principal 
advantages of this test are its brevity, ease of 
administration, ease of scoring, and compact- 
ness, in each of which the test represents a high 
degree of efficiency. 


Emily T. Burr, Director, Guidance Bureau, 
Inc., New York, New York. This test is an 
abbreviated adaptation of the American Coun- 
cil on Education Psychological Examination for 
High School Students. The adaptation is ar- 
ranged more compactly, requires less time, and 
is easier to administer and score. Inasmuch as 
the results of these tests correlate highly with 
actual classroom standing, they are immensely 
useful in the analysis of the individual pupil's 
progress. 
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xTime Appreciation Test. Ages 10 and over; 1946; 
formerly called JNB Time Test; nontimed (10) min- 
utes; 1¢ per form sheet; 25¢ per manual (reprint of 
John N. Buck’s, "The Time Appreciation Test,” Jour- 
nal of Aenea Psychology 30:388-98 August 1946) ; 


John N. Buck; Steward's Office, Lynchburg State 
Colony, Colony, Virginia. 
REFERENCES 


1. Buck, Joun. “The Time Appreciation Test.” J Appl 
Psychol 30:388-98 Ag '46. * (PA 21:305) 

2. LANDISBERG, SELMA. “A Personality Study of Institution- 
alized Epileptics." Am J Mental Def 52:16-22 Jl '47. * (PA 22: 
2267) 

E. J. G. Bradford, Senior Lecturer in Edu- 
cation, The University of Sheffield, Sheffield, 
England. The name of this test is unfortunate 
since it is not a test of time sense or of time 


appreciation, but is based on a sampling of the 


-knowledge about time that might be acquired in 


school, through conversation, or by reading. A 
similar series of questions could be devised by 
sampling the subject matter related to weight 
or to distance. In spite of the restricted field 
of sampling, the resulting estimate-of mental 
age obtained is usually within one year for 
people of about normal intellectual level, some- 
what closer for subnormals, and less close for 
those above normal. About half the questions 
can be answered by morons and borderlines, 
ie. the lowest 10 per cent of the population, 
leaving the remaining 15 or 16 questions for 
differentiating the remaining go per cent of the 
population. Obviously, the best field for the ap- 
plication of the test is the subnormal. With a 
few modifications needed to make the test 
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suitable for use in England, the Time Apprecia- 
tion Test was found to give results roughly one 
year higher than other tests, including the Kent 
E-G-Y Test. 

Boredom or offence would be avoided if a 
start were made with Item 13, followed by 
Items 15 and 17, and then straight on to Item 
22. Should any errors be made in answering 
these four items, the examination might be re- 
started with Item 1. Otherwise the subject 
might be given full credit for the first 22 items. 

In the reviewer's opinion, the test would be 
improved if the Kent plan of dividing the 
series into sections were adopted. In such a form 
it could be used to advantage as a supplement 
or alternative to the Kent E-G-Y Test, and 
would be valuable in giving mote precision to 
the range of mental ages above 14 years. 


Charles N. Cofer, Associate Professor of Psy- 
chology, University of Maryland, College Park, 
Maryland. This test is designed as a "screen" 
or “emergency” device to allow rapid deter- 
mination of intellectual level. It is composed of 
30 items relating to knowledge of holidays, 
temporal terms, and temporal units and to pre- 
sent orientation. It is designed for either indi- 
vidual or group administration, has no time 
limit, and requires 10 minutes or less to give. 
The test is applicable to subjects whose chrono- 
logical age is 10 years or more, and it yields 
adult IQ’s from 30 to 123 and MA's from 4%4 
to 1874. It appears to be an interesting and po- 
tentially valuable test, and the author is to be 
commended for his brief but perceptive re- 
marks on its interpretative use. Questions con- 
cerning its standardization, however, prevent 
unqualified endorsement of it at the present 
time, 

Information given concerning the standardi- 
zation groups is very meager. The first norms 
were developed on 675 white Virginians by 
group administration. Although the norms de- 
veloped on this group were established by 
“chronological selection” (1, p. 389), no data 
are given as to the number and kind of subjects 
available at the ages 8 to 23 years represented 
in this group. These norms “were later amended 
in certain instances” in view of the data ob- 
tained from individual examination of 350 ad- 
ditional whites. The range in educational and 
chronological age of the latter group was wider 
than for the first group, and the intelligence of 
the second group had been determined by the 


use of standard intelligence tests. No data are 
given to describe the second group more ade- 
quately than the first, and the method of 
“amendment” of the initial norms is not de- 
scribed. 

Test-retest reliability was determined on 
small and unrepresentative groups (student 
nurses and psychiatric and mentally defective 
patients). Group administration yielded a re- 
liability coefficient of .80, and individual ad- 
ministration a coefficient of .96. Directions for 
scoring the test appear to be adequate. The 
items are not arranged in strict order of diffi- 
culty; all of them must be presented to each 
patient, an inconvenience not present in the 
Kent Scales which are designed also as screen 
tests. 

Correlations are reported between the Time 
Appreciation Test and the Revised Stanford- 
Binet Scale, the Wechsler-Bellevue Intelligence 
Scale, and the Otis Quick-Scoring Mental 
Ability Test. 'These data, obtained from seven 
relatively small groups which, for the most 
part, contained defectives and epileptics, show 
the test to have fairly high "validity" (range 
of r from .67 to .88). The range of intellectual 
level in these groups was fairly wide. 

On the whole, the reviewer is well impressed 
with the potentialities of this test as a screen 
device. The criticisms made arise largely from 
the inadequate presentation by the author of 
standardization data and methods and from the 
difficulty that any single investigator must face 
in making his standardization and validity 
studies on large and representative samples 
from more than a local population. 
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xV.G.C. Intelligence Indicator. Grades 3-8, 7-12; 
1943; an adaptation of the Henmon-Nelson Tests of 
Mental Ability (40:1398) ; Forms A, B ; 2 levels; sepa- 
rate answer sheets must be used ; $2.25 per 25 of either 
level ; 40 per 25 answer sheets for either level; 25¢ per 
specimen set of either level ; specimen set must be pur- 
chased to obtain the manual; 30(35) minutes; adapta- 
tion by M. D. Parmenter; Vocational Guidance Centre. 

[ 268 ] 
XWillis-Smith Advanced Mental Test. Grades 8- 
12; 1939; 1 form, 2 parts (Forms A, B) ; sé per set of 
both forms; 100(115) minutes; C. B. Willis and H. E. 


Smith; Institute of Applied Art, Ltd., Edmonton, Al- 
berta, Canada. 


Harold H. Bixler, Director of Research and 
Guidance, Public Schools, Atlanta, Georgia. 
Although this test is listed as including Forms 
A and B, this statement is somewhat mislead- 
ing inasmuch as both forms must be given in 
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order to compute an IQ: The purpose, say the 
authors, is that the forms may be given on 
successive days, using one period each day. 
'Fhis test is compiled by Canadian authors, 
and presumably the norms are based on Cana- 
dian students. 

This test is of the self-administering type 
and resembles the Otis Self-Administering 
Tests of Mental Ability very much. Like the 
Otis test, the items include vocabulary, num- 
ber series, analogies, and arithmetic problems. 

Neither in the test manual nor in the pub- 
lisher's catalog are there given any data about 
the validity or reliability of the test. One might 
guess that the test would be reliable because of 
the length of testing time, that is, 100 minutes. 
The test is probably adapted for the grades for 
which it is advertised, i.e., grades 7 to 12 in- 
clusive. The test yields an IQ. 

The test is probably not as good as other 
mental tests for the high school grades, such 
as the Otis Quick-Scoring Mental Ability Test, 
the Terman-McNemar Test of Mental Ability, 
and the Pintner General Ability Tests. 


F. T. Tyler, Professor of Psychology and 
Education, The University of British Columbia, 
Vancouver, Canada. 'This test consists of Forms 
A and B, both of which must be administered 
before the IQ can be computed. Each form 
contains 80 items and is of the omnibus, self- 
administering type. Both completion and multi- 
ple-choice types of items are used. 

The manual discusses three points—adminis- 
tration, scoring, and computation of IQ's. 
Scoring rules are described, and a convenient 
scoring key is provided. IQ's are determined 
from a graph which has ages from 1374 to 18 
on the horizontal axis and scores from 4 to 
136 on the vertical axis. IQ's for the ages and 
scores indicated are read directly from the 
graph. The authors state that "scores [sic] 
above 136 can be estimated from the amount 
by which they exceed 136.” Apparently this 
means that IQ's can be computed for scores 
above 136 by determining the amount by which 
the score exceeds 136. How this is done is not 
at all clear. 

It is to be noted that since the manual of 
directions gives no information beyond that 
just described, there are no data on such fac- 
tors as validity, item validity, reliability, sam- 
ple used in preparing the test, uses of the test 
results, etc. 
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Wonderlic Personnel Test. Adults; 1939-42; Forms 
A, B, D, E, F; Forms D, E, and F are adapted for busi- 
ness use from the Otis Self-Administering Tests of 
Mental Ability, Higher Form; $6 per 100; $4 per spe- 
cial introductory package containing 10 copies of each 
form; 12(20) minutes; E. F. Wonderlic; the Author, 
750 Grove St., Glencoe, Ill. 
REFERENCES 

1-2, See 40:1415. 

3. Carson, WALTER A, “Intelligence Testing of Flyi 
Cadet Applicants: A Report on Psychometric Measurement.' 
J Aviat Med 12:226-9 S '41. * (PA 16:305) 

4. Wnaicnr, James H., anv Laing, DONALD M. "The Time 
Factor in the Administration of the Wonderlic Personnel Test." 
J Appl Psychol 27:316-9 Ag '43. * (PA 18:274) EE 
RNS De dr ese Dias ADU a 

6. Wonveruic, É. F. Dine to Use the Personnel Tests.” 
Personnel Dig 1:19-23 Su '44. * 

7. McMurry, Rosert N., AND Jounson, Dare L, “Develop- 
ORE Ede AB eue err 
P na ENSEN, Mirton B., AND Ror, vtr B. ihe Value 
of Thirteen Psychological Tests in Oficer Candidate Screen- 
ing.” J Appl Psychol 31:312:22 Je '47, * (PA 21:4107) 

9. THompson, CLAUDE EDWARD, ‘Selecting Executives by 
Psychological Tests." Ed & Psychol Meas 7:7738 w "47: * 

H. E. Brogden, Research Psychologist, Per- 
sonnel Research Section, The Adjutant Gen- 
eral's Office, Washington, D. C. The Wonderlic 
Personnel Test is a short (50 items), rapidly 
administered (12-minute time limit) general 
intelligence test. Forms D, E, and F are adapta- 
tions of the Otis for industrial use. Forms A 
and B, developed subsequently, are similar but 
include a greater variety of item content. Per- 
centile norms based on industrial populations 
are presented in the manual. The directions, 
format, and scoring precedures offer little room 
for criticism in view of the intended use of the 
test. 

Reliabilities, determined by correlation be- 
tween forms, are reported as ranging from .82 
to .94. The figure .93 is said to result after cor- 
rection for homogeneity of the population. 
Slightly higher corrected odd-even coefficients 
are also quoted. It might be noted in passing 
that the odd-even coefficient is not applicable 
to time-limit tests such as the Wonderlic and 
that considerable positive bias can usually be 
expected to result from its use. In all, the 
above quoted figure of .93 appears possibly to 
be an overestimate. The reliability is in any 
event sufficiently high that an increase in length 
would not appear justifiable in expectation of 
any resulting increase in validity. 

Use of a general intelligence test might be 
criticized from the viewpoint of the factor 
analysts. If, however, the presence of a gen- 
eral factor among adults is admitted —and the 
reviewer does not believe that its existence has 
been disproved—this criticism could not be re- 
garded as serious. Since the test contains a con- 
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siderable variety of pertinent content, it is prob- 
ably a reasonably good measure of G. With 
the exception, possibly, of a heavy verbal bias, 
it is also probably not heavily loaded with 
group factors other than G. It is realized that, 
from the strictly pragmatic viewpoint of the 
average personnel director, the factor struc- 
ture of the general content of the test is not 
important in relation to the primary considera- 
tion of validity. Even though the test were a 
composite of a number of unrelated variables, 
high validity is still quite possible since criteria 
are probably themselves composites of a num- 
ber of unrelated activities. Knowledge of factor 
Structure may, however, be useful in predicting 
validity or lack of it in various possible appli- 
cations. 

While the manual is commendably cautious 
—although not too informative—in discussing 
the validity of the test, the write-up entitled 
“A Summary of Experiences With the Won- 
derlic Personnel Test" leaves the definite im- 
pression that the Wonderlic test can be used in 
predicting success in almost any type of indus- 
trial work. To the reviewer this impression is 
undesirable. There is sufficient evidence in the 
literature to indicate that a general intelligence 
test cannot be expected to show high validity 
in general industrial use. Limitations in the use 
of a general intelligence test should be dis- 
cussed. It might help if validity were consid- 
ered in relation to particular jobs or job areas 
so that waste motion on the part of those per- 
sonnel managers not acquainted with the test- 
ing field could be avoided. Probably, for ex- 
ample, emphasis on usefulness of a general 
intelligence test in predicting success in clerical 
work and ability to absorb training and on its 
general lack of usefulness for predicting suc- 
cess in routine assembly-line types of jobs would 
be appropriate even though no more restrictive 
statements were attempted. 

In general, if a measure of general intelli- 
gence is desired for industrial selection, the 
Wonderlic seems adequate and can probably be 
more cheaply and conveniently administered 
than most competing tests, 


Charles D. Flory, Rohrer, Hibler and Replo- 
gle, New York, New York. Many testmakers 
have, in the interest of reliability, extended the 
testing time beyond practical limits. Wonderlic 
was among the first to take an accepted intelli- 
gence testing instrument and scale it down to 


twelve minutes with ‘little loss in reliability. 
The Wonderlic Personnel Test was originally a 
restandardization and revision of the Otis Self- 
Administering Tests of Mental Ability. The 
format was retained in the revision but the 
content was reduced by eliminating those items 
with the least predictive value in an industrial 
situation. These tests originally were available 
in two forms, subsequently a third was added, 
and recently two more have been provided. The 
publicity concerning these tests sometimes 
mentions four and sometimes five forms, but 
the reviewer has seen five forms, A, B, D, E, 
and F. Statements by the author that “there are 
four equal and similar forms” and that “the 
new forms A and B are exactly equivalent to 
... forms D and F” seem slightly exaggerated. 
The correlations reported for one test taken 
immediately following another range from .82 
to .94. These results suggest “similar” but 
hardly “exactly equivalent" forms. 

The Personnel Test has been used widely in 
industry, and therefore has norms based upon 
more than 35,000 subjects. The author wisely 
recommends the use of percentiles in interpret- 
ing results for the adult population. He also 
cautions against the use of psychological data 
by those who are untrained in the field. 

'These tests are weakest in two areas, format 
and content, The print is so small and the items 
so variable in type and arrangement that unini- 
tiated subjects become confused in taking the 
test. The items are also strongly academic in 
flavor since they follow the Otis pattern. These 
conditions make the tests of limited value for 
individuals in the lowest two deciles of intelli- 
gence and for individuals with little schooling. 

The contention that the name of the tests 
conceals their purpose is probably incorrect 
and also of doubtful value. There is little value 
to be gained by subterfuge in industrial situa- 
tions. The brightest subjects know better and 
are critical of the ruse, Subjects of less insight 
care little about labels but react strongly to atti- 
tudes in the group. The competent psychologist 
has little trouble examining applicants except 
when a defensively incompetent person is en- 
countered. These instances are rare enough to 
be insignificant. 

These tests have many uses where a quick 
measure of intelligence is desired. A test of this 
length is more likely to meet the needs of indus- 
trial personnel directors than the typical intelli- 
gence tests which are from two to five times as 
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long. These tests are easy to administer and 
can be scored manually in less than one minute. 
Minimum scores have been established for 
more than twenty-five different vocations. 


Irving Lorge, Professor of Education, and 
Executive Officer, Institute of Psychological 
Research, Teachers College, Columbia Univer- 
sity, New York, New York. The Wonderlic 
Personnel Test is a 50-item omnibus-type in- 
telligence test prepared for 12-minute time limit 
administration. The items for Forms D, E, and 
F were selected from the Otis Self-Adminis- 
tering Tests of Mental Ability: Higher Ex- 
amination. The general approach was to nor- 
malize the distribution of item difficulties for 
adult groups with an average difficulty at fifty 
per cent subject to a triple check on item va- 
lidity. The validation was based on the differ- 
entiation of each item between “successful and 
unsuccessful industrial employees,” on the dif- 
ferentiation of each item between “good and 
poor school records” of college students, and 
on the biserial correlation coefficients of each 
item using the total score as a criterion (2). 
Forms A and B “were developed with the same 
formula which was used in developing Forms 
D, E, and F." 

The reliability of the test is reported in the 
manual, on the basis of comparable forms, as 
ranging from .82 to .94 and, on the basis of 
equivalent halves raised by Spearman-Brown 
prophecy formula, as ranging from .88 to .94. 
In a study, however, Wright and Laing (4) 
report the reliability on equivalent halves for 
Form D as .87 on 318 cases, for Form E as 
.70 for 88 cases and for Form F as .81 for 116 
cases with the comment "The reliability of this 
test varies with the different forms and is not 
as high as it might be." 

The validity is considered to be demon- 
strated by the reporting of the correlation with 
the original Otis test as ranging from .81 to 
.87, and the statement, without evidence, “This 
test has been shown to be a valid instrument 
in determining success on a number of differ- 
ent jobs.” Furthermore, a table in the manual 
on “Minimum Scores on Personnel Test for 
Various Occupations” implies a relationship be- 
tween test score and job success. While the 
author suggests “These minimum scores are 
not presented as final critical scores, but merely 
to guide the research worker,” the average user 
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may be confused since he is supposed to be 
the practical business man. 

The item types are like those of the Army 
Alpha: analogies, geometric figures, arithmetic 
reasoning, directions, disarranged sentences, 
logical inference, proverbs, opposites, similari- 
ties, etc. Forms D, E, and F are well arranged 
for writing the appropriate answer between 
parentheses; forms A and B, however, while 
continuing the same arrangement do not pro- 
vide enough room for reporting the complete 
answer. 

Forms D, E, and F are not completely inde- 
pendent since some of the items are repeated 
in several forms. Item 50, for example, occurs 
in each form. The overlap, moreover, is par- 
ticularly disconcerting since a number of mathe- 
maticians to whom the item was given, neither 
agree with each other nor with the key. Item 
50 apparently puts an artificial ceiling on the 
test. Forms A and B are not printed as well 
as the earlier forms. In my opinion, older 
people would be penalized by the over-black 
appearance of the test and the small type for 
the proverbs and clerical matching items. 

The keys are of simple strip variety. Scoring 
is fairly easy. There would be an advantage to 
include the right answer for Item 50 of Form 
B (the answer is 650). 

The author suggests that the test can be 
given either with a 12-minute time limit or as 
a power test in unlimited time. Unfortunately, 
the manual provides no data for interpreting 
the unlimited time scores. Wright and Laing 
report the correlation between a 12-minute and 
a 24-minute time limit score as .944 which is 
not surprising since on the average over ninety 
per cent of the 24-minute score is the 12-minute 
score. Their evidence, moreover, does not 
justify the failure to report power norms. 

The time limit norms are based on about 
37,000 cases. While the norms are categorized 
separately by highest grade reached, the sam- 
ples are not well defined as to their representa- 
tiveness. The practical business man does not 
know with what to compare employees' scores. 

The author suggests the use of an age cor- 
rection ranging from 3 points at age 30-39 to 
1I points at age 60-69. The latter allowance 
exceeds the average eighth grade norm. 

This is a simple self-administering test of in- 
telligence which for its short time limit may 
give fair group results, but which lacks suffi- 
cient reliability for individual appraisal. The 
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author overclaims the validity of the test and 
further makes an arcanum of test construction 
by stating “. . . it was scientifically constructed 
following a very complicated formula.” 

The norms could be improved by specifying 
with scientific accuracy the population of which 
the sample is supposed to be representative. 


INDIVIDUAL 
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Alexander Performance Scale: A Performance 
Scale for the Measurement of Practical Ability. 
Ages 8-18; 1935-46; individual; 1 form; £9 7s. per 
testing outfit; 2s. per 50 record sheets; (40-50) min- 
utes; W. P. Alexander; Thomas Nelson and Sons Ltd. 


REFERENCES 
1-3. See RAUS e F, 
4. Amin, D. L. “Differences Among the Deaf and Hearing 
Children." Indian J Psychol 21:91-2 pts 1-4 '46. * 
- John Cohen, Lecturer, University of Leeds, 
Leeds, England. This is a performance scale 
for measuring practical ability, based on W. P. 
Alexander's well-known study Intelligence, 
Concrete and Abstract (Cambridge Unversity 
Press, 1935). The scale comprises three tests— 
the Passalong Test (devised by Alexander him- 
self), Kohs' Block Design Test, and the Cube 
Construction Test—and is specially designed for 
children between 11 and 16 years of age. It 
can, however, be used, though not so effectively, 
with children 2 or 3 years younger than 11 and 
older than 16, Testing time is about 40 minutes. 
The scale was published to meet the need, 
created by the English Education Act of 1944, 
for allocating all children at 11+ to one of 
three forms of secondary education, viz., (a) 
literary or humanistic, (b) scientific, and (c) 
technical or practical. It is to test aptitudes for 
the third type, i.e., technical and practical edu- 
cation, that the scale is primarily intended, In 
addition to its use in allocating for secondary 
education, the scale can also be usefully em- 
ployed for ascertaining educationally retarded 
children, for classifying pupils within second- 
ary schools, for child and vocational guidance, 
for selecting apprentices, and with the deaf 
and partially deaf. The tests are attractively 
designed and convenient to handle in compact 
form. There can be no doubt that they will be 
extensively employed in the next few years in 
large numbers of English schools. 
The scores on the three component tests yield 
a Practical Ability Ratio (i.e., a ratio of prac- 
tical to chronological age) distributed like 
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Binet IQ's with a mean of 100 and a standard 
deviation of 15. The pupil’s PAR, as Alexan- 
der suggests, should be noted on the primary 
school record form together with mental and 
attainment ratios and ratings of personality. 
This scale is the most up-to-date of its kind in 
England and is a marked advance on the Col- 
lins and Drever performance scale, which is 
too complex for wide use in schools. 

The mean intercorrelation, corrected for at- 
tenuation, of the three component tests is .78. 
The scale as a whole has a correlation of about 
8 with “g” and about .7 with the practical 
factor (“F”). The reliability coefficient, com- 
puted from successive testings, is of the order .8. 

The record sheets accompanying the tests are 
unusually simple and convenient to use, and the 
Instruction Book is a model of conciseness and 
clarity, Altogether the scale is very strongly 
recommended, though further validation work 
is doubtless ‘desirable and would add to the dis- 
criminative and predictive worth of what prom- 
ises to be a first-rate testing device. 


Schoolmaster 151:395 My 8 ’47. G. W. * 
Attention is drawn especially to the handbook 
of instructions. The first half contains a clear 
description of the tests and how to administer 
them. The purposes for which the scale can be 
used are clearly outlined. The second half re- 
lates the use of the scale to general educational 
theory. Particularly illuminating are its refer- 
ences to school records and its analysis of the 
considerations, determining the allocation of 
children at 11 plus to different types of second- 
ary education. * A valid instrument for meas- 
uring practical ability is of great importance to 
educationists. During recent years considerable 
evidence for the validity of the Alexander Per- 
formance Scale has been produced. It is with 
confidence that the scale is recommended for a 
thorough trial. 

Times Ed Sup 1670:206 My 347. This is a 
battery of tests for measuring practical as dis- 
tinct from academic or verbal ability. The most 
important purpose of the scale is to select chil- 
dren for secondary education of a technical 
kind; and the most appropriate time to apply 
it is at the end of the primary school course. * 
The test material is supplied in a well-made 
wooden box about a foot long, 8 in. wide, and 
3 in. deep. * The price of the apparatus seems 
a little alarming, but it must be borne in mind 
that one box is enough for a school and that the 
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material will last for many years. The impor- 
tant question is: Does the test fulfil its pur- 
pose? There is good reason for thinking that it 
does. Dr. Alexander has carefully tried out the 
scale for the last 10 years, and has been able to 
provide norms of performance for the various 
ages, and to fix a level above which he contends 
that a technical training is indicated; Success at 
a technical school is in his opinion assured if 
the child's practical ability ratio is 120 or over 
in a scale where the average is 100 and the 
standard deviation 15. * Among the other pur- 
poses for which it is claimed that the Alexan- 
der scale can be used, besides the allocation of 
pupils at rr plus to appropriate secondary 
schools, the following may be mentioned: the 
testing of the deaf and the partially deaf, child 
guidance, vocational guidance, and the selection 
of apprentices, 


For a review by J. M. Blackburn, see 40: 
1376. For reviews by James Drever, T. J. Keat- 
ing, and Grace H. Kent of the Passalong Test, 
see 40:1414. See also 36:B3 and 38:B293. 
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[Arthur] Point Scale of Performance Tests. Ages 
4% or 5% through superior adults; 1925-47; individ- 
si 2 o ego Grace aur 

a) ForM t. Ages 544 through superior adults; 1925-43; 
C. H. Stoelting Co. 4 icai 
b) REVISED FORM II. Ages 434 through superior adults ; 
1947 ; all prices include postage; $2.50 per Knox Cube 
Test, Arthur Revision; $15 per Seguin Formboard, 
Arthur Revision; $3 per Stencil Design Test 1; $10.50 
per set of 14 pads of the Porteus Maze Test, Arthur 
revision; $20.50 per Healy Picture Completion Test 
II; $2.50 per 100 score sheets; $2.50 per 100 score 
cards; $225 per manual ; $49 per complete testing out- 
fit without carrying case ; $3.75 extra for carrying case; 
Psychological Corporation. 


REFERENCES 
1-16. See 40:1379. 


17. Deant, KArHARINE, The Arthur Performance Scale and 
the Stanford’ Binet: An Examination of One Hundred Sixty- 
nine Ten-Year-Old Children. Unpublished master's thesis, 
Stanford University, 1934. 

18. Huma, Katuryn A. Applicability of the Arthur Per- 
formance Scale to an Adolescent Group. Unpublished master’s 
thesis, University of Southern California, 1935. 

" 


19. DiTorrLA, EucENIA E. A Comparative ly Deom 
ance Tests. Unpublished master's thesis, Fordham University, 
1939. (Dissertations . . ., 1940, Dp. 97:98.) 


20. Bruce, MvnrLE. Factors Affecting Intelligence Test Per- 
formance of Whites and Negroes in the Rural South. Archives 
of Psychology, No. LES Washington, D. C.: American Psycho- 

"nce. July 1940. Pp. 99. Paper. $1.50. * 


logical Association, 
(PA 15:4591) us 

21. BagAsoN, Seymour Bernar. The Effects of Training on 
Four Intelligence Tests. Unpublished master's thesis, Clark 
University, 1940. (Abstracts of Dissertations . . . 1940, PP- 
113-4. a 

22. AmrHUR, Grace. “An Experience in Testing Indian 
School Children." Mental Hyg 25:188-95 Ap "gr. * (PA 15:3481) 

23. Bryou, Srpney W. DA "Experimental Analysis of Ar- 
ERIT frentes J Consult Psychol 6:247-52 S-O 
42. 16:5041 

24. Burcwarp, Epwarp M. L., AND MYKLEBUST, Hermer R. 
“A Comparison of Congenital and Aiventitious Deafness With 
Respect to Its Effect on Intelligence, Personality, and Social 
Maturity: Part I, Intelligence.” Am Ann Deaf 87:14^-54. Mr 
42. * (PA 17:75) 


24a. ArTHUR, Grace. A Point Scale of Performance Tests: 
Vol. I, Clinical Manual, Second Edition. New York: Common- 
wealth Fund, 1943. Pp. xi, E $1.50. * (PA 18:1539) A 
. 2$, ARTHUR, E ‘A Non-Verbal Test of Logical Think- 
ing." J Consult Psychol -4 Ja-F '44. * (PA 18.3485). 

26. Havicuurst, ROBERT Ves AND Hirkevitcu, Ruea R. “The 
Intelligence of Indian Children as Measured by a Performance 
Scale.” J Abn & Social Psychol 39:419-33 O '44. * (PA 193475) 

27. WaLLIN, J. E. WALLACE, AND HULTSCH, CATHARINE. 
“The Pathognomonic Significance of Psychometric Patterns.” 
Am J Mental Def 48:269-77 Ja 44. * (PA dix 

28, Ape, THEopoRA M. “The. Relationship Between Aca- 
demic Success and Personality Organization Among Subnor- 
mal Girls.” Am J Mental Daf 50:306 o (As. * (PA 20:2883) 

29: CAPwELL, Dora F, “Performance of Deaf Children on the 
d ur SEES Scale." J Consult Psychol 9:91-4 Mr-Ap '45. * 

19:239 

30. Rossars, A. Duprey. “Intelligence and Performance 
Test Patterns Among Older Mental Defectives.” Am J Def 
49:300-3 Ja '4s. * (PA 19/5529 fi 

31. SHOTWELL, AxNA M. “Arthur Performance Ratings of 
Mexican and American High-Grade Mental Defectives.” Am J 
Mental Def 49:445-9 AY "45. * (PA 20:170), 3 

32, PATTERSON, R. Mercuer, "The Significance of Practice 
Effect Upon Readministration of the Grace Arthur Performance 
Scale to High Grade Mentally Deficient Children." 4m J Men- 
tal Def 50:393-401 Ja '46. * (PA S 

33. SARASON, SEYMOUR B., AND SARASON, ESTHER Kroor, 
“The Discriminatory Value ‘of a Test Pattern in the High 
Grade Familial Defective.” J Clin Psychol 2:38-49 Ja "46. * (PA 


20:1940) 7 
34. Frazeur, HELEN A., AND HOAKLEY, Z. PavLINE. "Sig- 


nificance of Psychological Test Results of Exogenous and Em. 
dogenous Children (Arthur Point Scale)." Am J Mental Def 
51:384-88 Ja '47. * (PA 32:467) - 

25. Sarason, SEYMOUR B., AND SARASON, Estuer Kroor. 
“The Discriminatory Value of a Test Pattern With Cerebral 
Palsied, Defective Children." J Clin Psychol 3:141-7 Ap '47. 
(PA 21:3029) 


J Consult Psychol 1r:224 Jl-Ag ’47.* A 
notable feature of the selection of subtests is 
the elimination of the complex, speeded, form 
boards that were once the core of performance 
testing, which recent research has shown to re- 
ward rapid trial and error, rather than planned 
response. Form II has two primary uses, as. a 


retest for children of school age who have been — 


examined with Form I, and as a first test for 
use with pre-school children, in connection with 
the forthcoming Arthur adaptation of the Leiter 
International Performance Scale. The conven- 
ient spiral-bound manual gives clear descrip- 
tions of the tests, directions for administration 
and scoring, and tables for obtaining MA’s 
from each test and from the entire scale. * 


For reviews by Andrew W. Brown and Car- 
roll A. Whitmer and an excerpt from a review 
by Donald Snedden, see 40:1379. See also 36: 
Bro, 38:B304, and 40:B830. 
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Canadian Intelligence Examination. Ages 3-16; 
1940; a modification of the 1916 Stanford Revision of 
the Binet-Simon Intelligence Scale; individual; 1 form ; 
456 per set of test materials; $1 per manual ; 75¢ per 50 
record forms; Harry Amoss and Charles G. Stodgill ; 
Ryerson Press. 
REFERENCES 

1. Stotuers, C. E. “The Use of the Canadian Intelligence 

Test.” Sch 30:920-3 Je '42. * 


Sch 30:946-- Je '42. C(lara) B(lacklock). 
For some time there has been an urgent need 
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for an individual intelligence test published in 
Canada and standardized on norms obtained in 
Canada. American tests are both expensive and 
difficult to obtain. Experience with these Cana- 
dian tests proves them to be of interest and 
enjoyment to students. They have somewhat 
more originality than various tests that are 
definitely based on a revision of the Binet, Un- 
doubtedly the inclusion of a greater number 
of mathematical tests gives a more complete 
picture and a fairer chance to the not-too- 
articulate person. The clarity, general plan, and 
set-up of the manual makes it decidedly helpful 
to the administrator. The administering and 
scoring requires approximately the same time 
as the Binet test and the results prove to be 
practically the same. Keen interest is felt in a 
further report which Dr. Stothers, Inspector of 
Auxiliary Classes, is preparing, including com- 
parative results of a series of Binet Tests and 
Canadian Intelligence Examinations given to a 
wide range of groups throughout the province 
last year. A standardized Canadian test will cer- 
tainly be most welcome to all interested in this 
field of study. 

Understanding the Child 9:34 Ja ' 41. It will 
be of considerable interest to teachers in Can- 
ada to know that at last there has been pre- 
pared an individual test of general intelligence 
of the Binet-Simon type, designed particularly 
for use with Canadian children. The Canadian 
Intelligence Examination is based largely on 
the Terman, Kuhlmann, and Gesell tests, but 
original items have been devised and included. 
The chief points of difference from the Stan- 
ford revision are that the language of instruc- 
tion used in the presentation of the various 
problems has been simplified; the acceptable 
responses have been more rigorously defined; 
and an attempt has been made to reduce the 
weighting heretofore given to acquired literary 
skills by the inclusion of more mathematical 
and constructional problems, In order to avoid 
local and literary expressions, the word list in 
the vocabulary test has been selected from lan- 
guage common to both urban and rural com- 
munities in Canada rather than from the 
dictionary. The materials necessary for the 
administration of the test are simple and easily 
procured at little expense. The simplicity of 
the arrangement of the test items shortens the 
time usually required to administer an individ- 
ual intelligence examination. The test has been 
standardized chiefly in Ontario, but it is prob- 


able that it will prove more applicable to Cana- 
dian school children anywhere in the Dominion 
than the present revisions of the Binet test 
currently sold in the United States. 


[ 273 ] 
Carl Hollow Square Scale. Ages 10 and over ; 1939- 
45; individual ; 1 form ; $42 per testing outfit, postpaid ; 
(25-60) minutes; George P. Carl; Psychological Serv- 
ice, Institute of the Pennsylvania Hospital, 111 North 
49th St., Philadelphia 30, Pa. 


REFERENCES 


1-2. See 40:1385. T 

3 d own A. “The Reliability of the Lincoln Hol- 
low Square Form Board and a arison of Hollow Square 
Seoras. Ms PE pine Mental Ages.” J Appl Psycho) 
15:79-81 HE 14334. ate 
Sa Estes, Stantey G. A Study of Five Tests of ‘Spatial 
Ability.” J Psychol 13:265-71 Ap ‘42. * (PA 16:3771), 3 

5. PRE H., AND SLATER, PATRICK. “An Experiment in 
the Vocational Adjustment of Neurotic Patients.” J Mental Sci 
92:509-15 Jl *4s. * (PA 21:281) 


Grace H. Kent, Visiting Professor of Psy- 
chology, The University of Miami, Coral Ga- 
bles, Florida. This test might appropriately 
have been named the “Bevel-Edge Formboard 
Series.” It differs from the Lincoln Hollow 
Square in the use of beveled edges, and from 
the Ferguson Formboards in that the beveled 
edge is used systematically and consistently. It 
is an exquisite piece of apparatus, one which 
apparently must have required years of pre- 
liminary experimentation. It can be said of the 
standardization also that it represents a vast 
amount of-labor. Each of the 1,500 subjects 
was given a very thorough examination, and 
the norms are based upon correlation with 
other tests. 

The recess into which the blocks are to be 
fitted is a hole about 434 inches square. There 
are twenty tasks, all except two requiring four 
blocks apiece. For each task the edges of the 
blocks which touch the frame are perpendicu- 
lar, but the matchings within the square in- 
variably involve the fitting together of two 
beveled edges. There are 29 blocks, no two 
alike, whose basic forms are rectangular, tri- 
angular, trapezoidal and somewhat irregular. 
Each block is numbered, and the four blocks 
needed for a given task are selected by their 
labels. The entire outfit, including a well-built 
container, is so compact that it can be carried 
in a briefcase. 

A very strong feature of the test is its at- 
tractiveness to the subject. The problem is one 
of absorbing interest to the subject who is able 
to grasp the significance of the beveled edge; 
and his interest is well sustained throughout a 
series of increasing difficulty. s 


i 


SS 00 


353 ] 


„This reviewer regards the test as a master- 
piece, and has nothing but the highest praise 
for the basic idea and for the way in which it 
has been worked out in the apparatus, All criti- 
cisms here expressed relate wholly to the way 
in which the test has been used. 

The formula for presentation includes nearly 
500 words to be read aloud to the subject, and 
the reading occupies at least three minutes. 
This is not only a waste of time; it is also 
somewhat irritating to a subject who compre- 
hends the task and who is impatient to get the 
new toy into his hands. Even a highly trained 
subject finds it difficult to listen attentively to 
the instructions, For the average subject it 
should be sufficient to show the performance 
of a demonstration task, explaining that the 
straight edge of any block should always be 
placed at the edge of the square. One whose 
performance of the first task shows that he has 
failed to follow this demonstration may safely 
be considered an unsuitable subject for the test. 

Each task is evaluated by points, based upon 
an arbitrary combination of the time record 
and the move count. The score is modified fur- 
ther by additional points bonus for completing 
a task within a “sub-limit” in respect to time 
or moves, or by penalty for going beyond an 
upper limit in time or number of moves, The 
computation of results is somewhat compli- 
cated, and it takes the examiner about thirty 
minutes to evaluate the record. 

The final score is the sum of the points 
earned in all the tasks. The score for male 
subjects has been found to be 31 points less 
than the IQ by the Stanford-Binet scale, and 
for female subjects it was 39 points less than 
the Stanford-Binet IQ. Accordingly, the author 
derives an IQ for a male subject by adding 31 
points to his score, and for a female subject by 
adding 39 points. The “mental age,” if desired, 
is derived from the IQ by reckoning back- 
wards, 

To the reviewer this method of evaluation 
seems neither justifiable nor necessary. What 
we call "intelligence," however defined, is a 
factor in the performance of any test, as it is 
also a factor in the performance of such house- 
hold tasks as cooking and dishwashing. It is 
natural that any two tests should show a mod- 
erately high correlation, but it does not follow 
that they measure the same thing. To consider 
the Carl test a special aptitude test does not 
imply any disparagement. The aptitude test has 
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an important place in the clinical study of ado- 
lescent children, especially with reference to 
vocational guidance; and it is not essential that 
such a test yield an IQ. 

It is at the lower age levels and lower mental 
levels that formboard tests are most useful in 
the clinic. The very simple formboard, dealing 
with form-experience so elementary as to be 
universal among normal children of school age, 
may rank almost with the language test in its 
measurement of what we wish to measure by 
our so-called "intelligence tests"; but as we go 
higher in the difficulty scale, we deal increas- 
ingly with form-experience which is far from 
universal and which our subject may or may 
not have had. The tridimensional formboard is 
at the upper end of the formboard scale. The 
mechanical engineer may be expected to achieve 
a high score, and so also may the cabinetmaker 
who left school before reaching instruction in 
geometry; whereas the professional man who 
deals chiefly with words and abstractions may 
show amazing stupidity in his failure to recog- 
nize the most obvious similarities and differ- 
ences in form. 

Most formboard tests (including those de- 
veloped by the reviewer) involve so many 
factors that it is impossible to say what is 
measured by them. The strength of an aptitude 
test depends largely upon the success with 
which some one particular factor is isolated 
and measured; but this single aptitude, how- 
ever well measured, cannot fairly be used as an 
index to a subject's mental capacity in general. 

As a means of measuring the subject's apti- 
tude for dealing with beveled edges, the Car] 
Hollow Square is a superb instrument. Let it 
be standardized and used for just exactly what 
it is, and it should rank very high in service- 
ability. It cannot be recommended for routine 
clinical use. There are a few clinics in which 
time is allowed for an exhaustive study of a 
subject's special aptitudes, but they are excep- 
tional. For the clinic in which the psychometric 
examination is crowded into one hour, this test 
is too limited in scope to yield results com- 
mensurate with the time required for its 
presentation. 

Whatever defects may be found in the au- 
thor’s methods of presentation, recording, and 
standardization, the test itself can be uncondi- 
tionally recommended for use in the industries ; 
and it seems to be especially rich in potential 
value for the boys’ vocational school. On the 
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chance that some student will see its possibili- 
ties and undertake its standardization as an 
aptitude test, the following suggestions are 
offered for its use: 

a) The presentation should be adapted to the 
comprehension of the subject. It is far more 
important that it be natural and informal than 
that it be uniform for all subjects. One of the 
"extra" tasks may be used frankly for demon- 
stration, the subject to be given whatever help 
he requires for its solution. Any subject who 
then fails to solve the first task of the series 
within a reasonable time limit should be ex- 
cused from further work with this test. He 
may be given a little help in completing this one 
task, after which he should be given a task 
which is easily within his reach. The Lincoln 
Hollow Square, because of its superficial sim- 
ilarity to the Carl test, would be a very good 
formboard to have at hand for the purpose of 
easing over a failure. 

b) Time limits for the various tasks should 
not be too liberal, There are enough tasks to 
make the test abundantly discriminative at any 
level, even if some tasks are scored as failures. 
When the time limit of a task is reached, the 
subject may be shown the solution and then 
given the next task. He will not be wholly de- 
prived of its practice-effects on the remaining 
tasks, 

c) In the face of the author's confidence in 
his system of scoring the results, the reviewer 
recommends that the move count be omitted 
and that the score be based wholly upon the 
time record. This plan will permit the examiner 
to take notes on the subject's mode of pro- 
cedure, and the author concedes the value of 
supplementary qualitative observations, 

The fundamental objection to the move 
count, except in a test designed specifically 
with reference to an accurate move count, is 
that it is too subjective to be trustworthy. The 
movements which a subject may make are too 
varied in kind to be accurately defined, There 
is no opportunity for deliberation concerning a 
hair-splitting distinction, and no check on a 
move that has been recorded or left unrecorded. 
While the examiner is trying to decide what to 
do with some particular half-move, the subject 
continues to make more and more moves. It is 
true that the highly practised examiner may be 
able to establish his own criteria for evaluating 
the moves, but it does not follow that two ex- 
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perienced examiners would agree as to their 
criteria. : : 

The subject who tries to work rapidly is 
likely to make unnecessary moves, and the one 
who concentrates upon economy of moves is 
almost sure to take more time than is needed. 
There is an irreconcilable interference between 
the two variables. Scoring them both in terms 
of one or the other is an attempt to get some- 
thing for nothing, and is almost in the class 
with the perpetual-motion concept. Any form- 
board which permits an accurate move count 
should be scored wholly in terms of moves, and 
the subject should be informed that he is not 
being timed; whereas the test which does not 
permit an objective move count may better be 
evaluated wholly in terms of speed. 

The Carl test is almost in a class by itself in 
respect to the essential fairness with which it 
may be scored by speed. As there are only four 
placements required in any given task, there is 
little opportunity for serious confusion. The 
same principle runs through the series, and 
there are enough tasks to offset the effects of 
chance. Almost any subject is likely to have 
accidental difficulty with some task, possibly 
with one of the easier tasks in the early part of 
the series; but his speed score on the series as 
a whole will furnish a reasonably fair index to 
his aptitude in the matching of beveled edges. 
When the test is used for study of this aptitude 
and not as an intelligence test, the move count 
is as unnecessary as it is inaccurate. 

It is recommended that the time records in 
seconds be totaled for the several tasks, and 
that this figure be accepted as the final score, 
exactly as it stands. The stopwatch should be 
used openly as a means of letting the subject 
know that he is being timed. There is no need 
of instructing him to work rapidly. 

d) It is suggested that the norms be relative 
rather than absolute. Standardization for a 
selected group of subjects is a much simpler 
process than is the development of norms in- 
tended to be generally applicable. Only within 
the age limit for compulsory school attendance 
is it easy to find a representative sampling of 
the population ; and the standardization of tests 
for adults and adolescents presents inherent 
difficulties. It is much easier to standardize a 
test for a given industry or for a given school ; 
and the norms thus obtained may be used with 
reservations for individual study of a person 
belonging to the same selected group. The 
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norms for the group may be expressed in decile 
or quintile or quartile scores, according to the 
range of scores and the number of subjects. 

e) The norms, in whatever form offered, 
should be worked out independently for several 
different levels of achievement. The subject 
who can barely complete two or three tasks 
within the time limit should not be required to 
struggle through the entire series for the sake 
of obtaining a score. A given set of records 
can be used for the establishment of as many 
levels of norms as are needed, with preferably 
not over three tasks for the lowest level. The 
intervals between consecutive levels should be 
small enough so that no subject need be re- 
quired to continue after two failures. There 
should be definite rules for bringing the per- 
formance to a conclusion as soon as the sub- 
ject appears to have reached his limit; and a 
frank failure should invariably be followed by 
a formboard task easy enough to insure success, 
The examination should always leave the sub- 
ject with a sense of achievement rather than 
a sense of failure, 


For a review by T. J. Keating, see 40:1385. 
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Detroit Kindergarten Test. Kgn entrants ; 1921-25; 
individual; t form; $1.40 per 25; 35€ per specimen set; 
(7-12) minutes; Harry J. Baker and H. J. Kaufmann; 
World Book Co. * 


REFERENCES 
eel Pe Haste, V. a enfe HP Genetic: PINO 
58:407-17 Je '41. * (PA 16:99) 

Psyche Cattell, Clinical. Psychologist, Lan- 
caster, Pennsylvania. The Detroit Kindergarten 
Test is designed for a short individual intelli- 
gence test for children entering kindergarten. 
The test consists of a booklet of pictures which 
the examiner marks according to the child’s re- 
sponses to his instructions. 

Percentile curves based on 895 cases and age 
norms based on over 5,000 kindergarten pupils 
are given for ages 3 0/12 years to 7 6/12 years. 
The lowest norm is based on a score of two, 
and the highest, on a perfect score. Obviously 
mental age norms at the extremes of the possi- 
ble range of scores cannot be reliable. In gen- 
eral, as the authors point out, the younger 
children attending a kindergarten are more in- 
telligent than the average child, whereas the 
older ones are below average, and therefore the 
norms are “necessarily somewhat hypothetical 
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except for about the age of five years.” The 
authors do not, however, state what if any 
measures have been taken to correct for the 
atypical sampling of the younger and older age 
groups. No information on reliability and va- 
lidity is reported. 

The items for the test appear to have been 
carefully selected ; the directions for giving and 
scoring the tests are simply written in language 
that can be understood and followed by an in- 
telligent teacher; the test can be administered 
in less than half the time required for the 
Stanford-Binet; and it requires considerably 
less training on the part of the examiner than 
such a test as the Stanford-Binet. 

It is unfortunate that a test with the above 
advantages should not report data on reliability 
and validity and that unselected groups at the 
upper and lower age levels could not have been 
obtained for the standardization. 

From the inadequate information offered, it 
appears probable that this test should not be 
used for obtaining mental ages or intelligence 
quotients, but that it might well prove of value 
in classifying kindergarten children into per- 
centile ranks. 


Ruth W. Washburn, Consultant in Child De- 
velopment, Shady Hill School, Cambridge, 
Massachusetts. The publication of this test in 
1922 has been followed by so much excellent 
work in the field that the test has little left to 
recommend it save the short time necessary for 
its administration as an individual test (7 to 12 
minutes). Though the norms for ages 3 years 
o months to 7 years 6 months are based on 
total scores of 5,069 pupils and “partly upon 
Binet mental ages of 102 pupils,” the authors 
of the test felt that norms were so difficult to 
obtain as to make them “necessarily somewhat 
hypothetical except for the age of about five 
years where they are fairly reliable," Had the 
test been standardized more recently the norms 
could have been more reliably established by 
using the test in conjunction with some of the 
excellent tests for preschool children now avail- 
able. 

A major criticism of the test is that the 
young child is given no chance to use a pencil 
or other concrete material, so nothing is learned 
of the development of eye-hand coordinations, 
so important in kindergarten and preprimary 
work. 
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Detroit Tests of Learning Aptitude. Ages 4 and 
Over; 1935-39; individual; 1 form; $4.20 per testing 
outfit; 10¢ per record booklet; Harry J. Baker and 
Bernice Leland; Public School Publishing Co. 


REFERENCES 

1. Worcester, D, A., anp Corey, Sternen M. “In Criticism 
of the Detroit Tests of Learning Aptitude.” J Ed Psychol 
27:258-62 Ap '36. * (PA 10:6004) 

F. L. Wells, Psychologist, Department of 
Hygiene, Harvard University, Cambridge, 
Massachusetts, The earlier edition of the De- 
troit Tests of Learning Aptitude (DTLA) was 
reviewed in the 1938 edition of this Yearbook 
(38:1058). The present revision cites further 
experience with its use (12,000 cases) with 
some minor evaluative changes resulting there- 
from. Only slight procedural alterations have 
been made; no change was observed in the 
pictorial material. Thus, it is not clear that the 
presentation of the series now offered mate- 
tially alters the pertinence of the judgments 
given in the earlier reviews. But in a “forced 
choice,” clinicians would be generally well ad- 
vised to prefer a test series organized by, for 
example, Grace H. Kent with 500 cases to one 
organized by machinery with 50,000. The sort 
of criticism above directed at the DTLA ap- 
plied with considerable force to the original 
Kent E-G-Y Test; yet for its dimensions there 
has never been so usefal a procedure in clinical 
psychometrics. The clinical value of a test is 
too much a qualitative function, and too much 
a function of the background the examiner 
brings to it, to be properly assessable in terms 
of experience-tables of whatever magnitude. 

As in the original edition, the norms are in 
the common terms of age level, uniformly at 
intervals of three months. Such age symbolism 
is undeniably specious, but its lucidity causes it 
variously to persist. A more rational procedure 
would be to cite for somewhat less frequent 
age intervals mean scores with sigmas; this 
would give, with little added effort, a much 
clearer idea of where an individual stands with 
reference to his own developmental group. 

Like other tests, the DTLA is limited in the 
age range to which it is most appropriate. Its 
chief usefulness is in the grade-school years. 
At both extremes it has effective competitors, 
but also the persistent advantage of physical 
convenience. Indeed, the layout has received 
uniform commendation, and no better is known 
to have been achieved; but there should be a 
worth-while gain in spiral binding for the Ex- 
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aminer’s Handbook also. This would greatly 
facilitate learning, as well as accurate use of 
the detailed scoring data, at all levels of prac- 
tically attainable skill. Everyone, except the 
beginner with such tests, should learn not to be 
dependent on printed record forms. Compara- 
tively little of the DTLA record booklet is 
expendable as such, and the regular user might 
well effect economies of cost and space and, 
with practice, of time, with ordinary ruled 
paper. ; i 

The subtests of the series embody well-tried 
devices, such as absurdities and “comprehen- 
sion,” and others less well known but ingenious, 
as pictorial opposites and disarranged pictures. 
The most widely useful single subtest has 
seemed to be verbal opposites. Like other series 
of similar organization, the DTLA is not a 
closed system. The clinician should feel quite 
free to brigade selections from its subtests with 
others from the Wechsler-Bellevue, the Kuhl- 
mann-Anderson, and the like. In this way cer- 
tain quantitative and spatial underemphases of 
the DTLA can be compensated; for with all 
concessions to the penetrating review of these 
matters recently contributed by Garrett, this 
series also puts.a good many of its eggs in the 
verbal basket. 


For reviews by Anne Anastasi and. Henry 
Feinberg and an excerpt from a review by 
D. A. Worcester and S. M. Corey of an earlier 
edition, see 38:1058. 
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Gesell EDU Schedules. Ages 4-60 weeks; 
1925-40; individual; also called Gesell Maturity Scale, 

esell Norms of Development, Gesell Tests, Preschool 
Child Development Scale, Preschool Child Test, Yale 
Psycho-Clinic Developmental Schedules, Yale Tests of 
Child Development, and other variations; Arnold 
Gesell and others: 
a) 1925 SERIES. 1925; manual (Gesell’s The Mental 
Growth of the Preschool Child) is out of print; test- 
ing materials (listed in publisher’s catalog as Pre- 
School Child Test)—C. H. Stoelting Co. 
b) 1938 SERTES. 1 5-38 ; manual (Gesell, Thompson, 
and Amatruda’s he Psychology of Early Growth)— 
Macmillan Co.; testing materials (listed in publisher’s 
catalog as Gesell Maturity Scale)—Marietta Appa- 
ratus Co. 
C) 1940 SERIES. 1925-41; manual (Gesell, Halverson, 
Thompson, Ilg, Castner, Ames, and Amatruda's The 
First Five Years of Life) —Harper & Brothers test- 
ing materials must be assembled by examiner ; 6o per 
specimen set of printed materials and record forms— 
Psychological Corporation. 
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Nancy Bayley, Lecturer in Psychology and 
Research Associate, Institute of Child Welfare, 
University of California, Berkeley, California. 
The Gesell Developmental Schedules have been 
widely used and copied since their appearance 
in 1925. Because in their early form the pro- 
cedures, scoring, and standardization were very 
incompletely described, efforts to use the tests 
have resulted in a variety of adaptations, all 
independently standardized, with each author 
interpreting them differently and establishing 
his own specific methods of giving and scoring 
essentially the same testing situations and ma- 
terials. ; 

Recently, in a series of books, Gesell (with 
various collaborators) has published revisions 
of the tests with very full descriptions. The 
Psychology of Early Growth (13) describes 
the normative sample and presents in great 
detail the methods and behavior norms for the 
first year; The First Five Years of Life (20) 
among other things includes directions for the 
developmental tests from one through five 
years; and Developmental Diagnosis (22) gives 
directions for administering, scoring, and inter- 
preting the tests from four weeks through three 
years. It is not possible to get all of the in- 
formation about the tests and their standard- 
ization from any single book ; and in each book, 
the organization of the directions and the forms 
for recording the child's behavior are different. 
Developmental Diagnosis is probably the most 
practical to use as a testing manual. The record 
forms developed in this book are less compli- 
cated (though one may question the advisability 
of their emphasis on "key" ages), and the di- 
rections are better organized for exploring the 
behavior of a given child in a testing situation. 
However, even in this book, the specific direc- 
tions for administering the tests are in an 
entirely different section from the descriptions 
of characteristic behavior to be expected and 
scored. 

The "normative sample" on whom the tests 
were standardized consists of 107 children (49 
boys and s8 girls) selected to represent à 
homogeneous middle-class background (on the 
basis. of the Barr rating scale). These children 
were tested repeatedly, so that there were at 
least 26 children and usually about 35 tested at 
each of 20 age levels from four weeks through 


five years. 
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The descriptions of characteristic behavior, 
developmental trends in behavior, and tech- 
niques for eliciting the desired behavior at dif- 
ferent levels of development are very superior, 
They evidence much experience, careful ob- 
servation, and good clinical insight in these 
matters, The test materials are very suitable 
for their purposes and by now are well known 
to anyone at all familiar with infant tests, The 
complicated equipment of crib, photographic 
dome, one-way vision screen, etc., though im- 
portant for research, are not essential for the 
tests, and practical alternatives are indicated. 

The tests are arranged to give a general im- 
pression of a child's maturity of performance. 
Many characteristic types of behavior which 
may or may not occur are listed, and there is 
no attempt to have equal numbers of items at 
successive age levels, In scoring the Gesell Di 
velopmental Schedules, the items of behavior 
are rated plus or minus, according to whether 
or not they occur, and then the highest age at 

is determined 


l age may be modified by the 
tester s judgment about the general quality of a 
child's performance, It may be further quali- 
fied by a plus or minus, to indicate whether the 


Gesell’ 
method of scoring, and his claims for predict- 
ing later intelligence from these scores in in- 
Lor (lie en to rest on several confusions 
with a failure to make any careful 
MEUM oi the relation. betwen the 

s scores made in infancy and later, 

As for the scores, Gesell repeatedly insists 


an infant's f behavior f 

et pattern o by some form 
is that such a numerical score always falls at 
the most representative level of a child's be- 
havior if the items are arranged approximately 
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interpreting a child’s performance ; the qualify- 
ing factors are considered in addition to the 
test score. 

The insistence on the superiority of the DQ 
over the IQ is supported only by emotionally 
toned words and phrases to the effect that the 
DQ unlike the IQ may be used as a starting 
point in diagnosis, that it is not “limited to a 
single inclusive formula” but can be computed 
independently for the various fields of behav- 
ior, making possible better judgments about a 
child's real developmental status. (He does not 
even mention other methods of indicating rela- 
tive performance.) A better justification for 
the use of the term developmental quotient is 
that in infancy the most important factor in 
determining individual differences in perform- 
ance is variation in rates of maturing. 

However, he fails to note that as the general 
process of maturing slows down, this factor 
loses ground rapidly to innate differences in 
capacity (and environmental factors) which 
eventually become paramount after all indi- 
viduals have reached maturity. As there is no 
logical necessity for rates of maturing to be 
related to eventual individual differences, it be- 
comes important to test whether or not this is 
so, This Gesell fails to do. He continues to hold, 
by assertion and a few selected illustrative 
cases, that the DQ remains relatively constant 
and that both feeble-mindedness (simple amen- 
tia) and intellectual superiority can be diag- 
nosed in infancy, But there is nowhere any 
quantitative evaluation of the extent to which 
the children in the normative sample (or 
others) remain in the same diagnostic classifica- 
tion, and no recognition that other investigators 
have consistently found that these early behav- 
ior scores have little relation to scores made 
after three or four years, (See, for example, 
the tests of P. Cattell, Goodenough, and Bay- 
ley.) 


Florence M. Teagarden, Professor of Psy- 
chology, The University of Pittsburgh, Pitts- 
burgh, Pennsylvania, One is confronted with 
selecting from the dozen or more books put out 
by Dr. Gesell and his collaborators those that 
will most nearly clarify for the clinical psychol- 
ogist the actual procedures used by Dr. Gesell. 
Books and motion pictures have rather clearly 
demonstrated Dr. Gesell's approach to the child 
and his handling of his own test materials. No- 
where, however, has any one been able to dis- 
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cover how Dr. Gesell uses his normative data 
for a quantified appraisal of an individual child, 
The following review attempts to compare his 
descriptions of examining and appraising in- 
fants as described in the three most recent of 
his books in this field, namely: The Psychology 
of Early Growth (13), The First Five Years 
of Life (20), and Developmental Diagnosis 
(22). 

In The Psychology of Early Growth Gesell 
and Thompson give maturity level summaries 
for ages 4, 6, 8, 12, 16, 20, 24, 28, 32, 36, 40, 
44, 48, 52, and 56 weeks, These summaries 
list developmental sequences critical for each 
age level (short and less detailed summaries are 
also given in an appendix for ages 15 months 
through 66 months). The summaries through 
the 56 weeks level include items on head con- 
trol, arm-hand posture, leg-foot posture, body- 
posture and progression, regard, prehension, 
language and social development, and adaptive 
behavior, There are differing numbers of items 
in these different categories and at different 
ages. In this respect the summaries do not con- 
stitute a "scale" as scales are ordinarily con- 
structed. "The summaries, however, are fairly 
clear-cut and are rather easily used by anyone 
who has had experience in the infant field. The 
writer has been using them extensively for a 
number of years and has enjoyed the possi- 
bility of comparing a child's development in 
the different fields mentioned above, This is 
true even though one may not see the need, or 
indeed the wisdom, of using Gesell's very elab- 
orate “analytic development schedule” provided 
in this volume, This is a very time-consuming 
method of finding indications of “more ma- 
turity" and “less maturity" than the “two dis- 
tinctive and adjacent age levels from which to 
reckon deviations.” One must in the end work 
out his own method of scoring, recording, and 
summarizing the child's behavior in respect to 
these summaries, 

In The First Five Years, Dr. Gesell gives 
descriptive accounts of the behavior of infants 
of 4, 16, 28, and 40 weeks and of 1 year of age. 
These descriptions conform fairly closely to the 
corresponding age level in Psychol- 
ogy of Early Growth. No discussion is 
however, in The First Five Years as to 
use for appraisal purposes, Dr. Gesell also 
gives similar descriptions of behavior of chil- 
dren of 18 months, 2 years, 3 years, 4 years and 
5 years. At still another place in this volume 
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Dr. Gesell begins a technique which he develops 
further in the text discussed below, that is, to 
give in parallel columns the percentage of chil- 
dren performing each behavior item at two or 
three successive age levels, No method of com- 
puting final scores or measures is given, how- 
ever, 


Developmental Diagnosis by Gesell and 
Amatruda is the most recent volume dealing 
with infant measurement, Again behavior items 
are given which represent motor, adaptive, lan- 
guage, and personal-social development. Begin- 
ning at 4 weeks, pictures and descriptions are 
given of development at succeeding 12-week 
intervals through 12 months; likewise, for 18, 
24, and 36 months, The gaps between these 
levels, however, are supplied through addi- 
tional charts of parallel columns. For example, 
where 16 weeks is the key age, comparison 
is shown of the 12 weeks and of the 20 weeks 
levels. In this volume, too, Gesell provides for 
a Developmental Quotient or DQ derived by 
dividing maturity age by chronological age. “A 
distinctive DQ can be derived for each of the 
four major fields of behavior or for specific 
functions like prehension, locomotion, — 
lation, etc." "In any field of behavior the 's 
maturity level is at that point where the aggre- 
gate of + signs changes to an aggregate of — 
signs. If the + and — signs are so irregularly 
distributed that this point of change becomes a 
band or zone of change, the maturity level must 
be thought of as a zone with a basal level.” The 
usual confusion of Gesell's books occurs in this 
volume also since "growth trend charts" give 
almost but not quite the same comparisons as 
those just described. This volume then, in 
short, gives more clearly the Gesell method of 
infant measurement and appraisal than any of 
his 
nom eie poe dei inventories a 
scale, The number of items differs from one 
ca! to another and from age to an- 
other. is not a weakness of the method so 
much as it is an evidence of our inability to 
break down infant behavior into small seg- 
ments, One may not care to use a developmen- 
lal quotient for an infant. What is of more 
importance for practical clinical purposes is to 
be able to distinguish those phases of an in- 
fant's behavior which partake of what we shall 
later in his life call “intelligence” or “social 
behavior" from those which are "gross motor" 
in nature. When a clinician is faced with the 
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task of making repeated appraisals of an in- 
fant's development with a view of determining 
"suitability for adoption," for example, he 
realizes the contribution that Dr. Gesell has 
made and at the same time the enormous 
amount of knowledge and sagacity still to be 
developed. 


[ 277 ] 
*[Re Gesell Developmental Schedules.] GEsELL, 
ARNOLD, AND AMATRUDA, CATHERINE S. Developmen- 
tal Diagnosis: Normal and Abnormal Child De- 
velopment: Clinical Methods and Practical Ap- 
lications. New York: Paul B. Hoeber, Inc., 1941. 
p. xiii, 447. $6.50. * (PA 15:4836) 


Am J Mental Def 46:430 Ja '42. I. N. K. * 
this work is the first of its kind to give the 
practitioner quantitative measures for the eval- 
ulation of defective children. 

Am J Orthopsychiatry 12:372 Ap '42. Mar- 
garet T. Wilson. This book....is principally ad- 
dressed to pediatricians and family physicians. 
Child psychologists can use it advantageously. 
A thorough treatise on the developments in 
childhood, its value is increased because it in- 
cludes both normal, and abnormal child devel- 
opment. Deviations such as the amentias, 
endocrine and convulsive disorders, cerebral 
injuries, sensory defects, prematurity and pre- 
cocity are noted in special chapters devoted to 
each. A new diagnostic technique is evident, 
and the last quarter of the book can be used as 
a manual for these tests. The content of the 
test items is not entirely new, but their arrange- 
ment is. The practicability of this method in 
diagnosis and the interpretation of findings is 
exemplified in many case studies. Developmen- 
tal diagnosis requires study of the major fields 
in development, considered to be the motor, 
adaptive, language, and personal-social behav- 
for. These four fields develop interdependently 
while they also parallel each other. This test of 
behavior the authors consider as logical as any 
of the functional tests of clinical neurology. 
Rate of development is expressed by the devel- 
opmental quotient (D.Q.) which represents the 
proportion of normal development present at 
any given age. The advantage of this expres- 
sion over the intelligence quotient is that it can 
be used for each one of the four fields and is 
therefore more adaptable as a device. The 
trend of growth is divided into zones or key 
ages from 4 weeks to 36 months, An interest- 
ing distinction is made between permanent and 
temporary patterns in behavior and this is one 


of the keynotes of diagnosis for prognosis. In 
addition, the total symptom complex of each 
individual case must be considered, and the 
many ramifications that are so often coincident, 
such as environmental influences. An additional 
advantage of the book is the inclusion of a sec- 
tion on arrangements for hospitals and institu- 
tions, and clinical cinematography. Any child 
psychologist who must examine cases for adop- 
tion should not be without this book. 

Am J Psychiatry 99:470 N ’42. Milton H. 
Firestone. * a compilation of previous publica- 
tions by the authors, and although much 
broader in scope confines itself to diagnostic 
and prognostic procedures. They have had an 
unusual opportunity for research with both 
normal and abnormal children in the infant and 
preschool age groups and have very ably con- 
densed their results to furnish a guide to the 
understanding of growth in all of its aspects. * 
Special attention is called to a new term in 
psychological parlance, the Developmental Quo- 
tient, which is technically computed in a man- 
ner similar to the Intelligence Quotient but is 
distinct from it. The Developmental Quotient 
evaluates the child from an integrated total 
picture, taking into account behavior in its 
widest meaning as the standard rather than 
measuring the so-called intelligence which is a 
part function. This fact is important, because 
recognition of the D.Q. at an early age may 
make it possible to predict the rate of growth 
with accuracy, the D.Q. being a constant factor 
—i.e., a D.Q. of 50 at age 3 will indicate that 
the child will be 50 per cent developed at age 
6 or at any other age selected. Furthermore, 
it is possible to determine the D.Q. of any 
specific aspect of major behavior in the 
motor, adaptability, language or personal-social 
spheres, which makes it a flexible device. * 
An interesting use of the cinema to study be- 
havior of children is introduced. * 

Am J Psychol 55:611-2 O '42. Edgar A. 
Doll. The able authors of many publications 
on infant development and clinical pediatrics 
have collected and reformulated herein much 
valuable material on normal and abnormal 
child development. How much of it is new and 
original will depend on the reader's familiarity 
with the work of others in the same fields. The 
treatment is essentially clinical and is directed 
primarily to neurologists and pediatricians. * 
One is inclined to deprecate the heavy claims 
of the publishers on jacket and advertising cir- 
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cular, and to commiserate the authors on the 
professional embarrassments these must have 
caused them. Let us hope, too, that such ingen- 
uous publisher's enthusiasm will not mislead 
the unsophisticated student. The well-known 
four-aspect Yale schedules of early child de- 
velopment (birth to five years) are presented 
with explicit clarity but without material evi- 
dence on normative versus clinically significant 
deviation. The developmental quotient is re- 
affirmed as essentially constant and predictive 
without. much regard for evidence to the con- 
trary, and only a minimum of comment on 
delayed development and delayed retardation. 
The clinical difficulties of forecasting the 
course of development in endogenous versus 
exogenous cases, and the individual vagaries of 
predictability so seldom anticipated yet often 
encountered, are only lightly emphasized. As 
material on the experience, point of view, and 
working methods of one laboratory the volume 
is helpfully illuminating, so illuminating indeed 
that one could wish that the treatment were 
more representative of the field of develop- 
mental diagnosis as a whole. The authors have 
performed a genuine service in bringing the 
normative and developmental points of view of 
clinical and child psychology to medical atten- 
tion. A larger function might have been served 
had more comprehensive reference to other 
methods and standpoints been offered—some 
supportive, some elaborative, and some contra- 
dictory, the more so because of the authors' 
high prestige, skill in formulation, extended 
clinical orientation, and especially their posi- 
tion as intermediaries between clinical child 
psychology and clinical pediatrics. 

J Am Med Assn 118:259 Ja 17 '42. * The sec- 
ond part of the book deals with defects and devi- 
ations of development. This is a most valuable 
section, which will be appreciated by physicians 
who tend to be baffled by problems of mental 
retardation. Differential diagnosis, in terms of 
the author's use of the developmental quotient, 
is succinctly dealt with, and an etiologic classi- 
fication of developmental defects and devia- 
tions is offered. Mental retardation, endocrine 
and convulsive disorders are discussed diag- 
nostically. A chapter on special sensory handi- 
caps is particularly informative. A discussion 
of clinical aspects of child adoption presents 
an excellent point of view with which all physi- 
cians should be familiar. * It is recommended 
that all medical students and youthful physi- 


cians read the small section entitled "Imparting 
a Diagnosis" (p. 312). Here some important 
mental health considerations in dealing with re- 
tarded children are stressed. * This book may 
be warmly recommended to pediatricians and 
physicians. It offers comprehensive and well 
organized information about infants and chil- 
dren. The authors state specifically that they 
are chiefly concerned with the maturity and or- 
ganization of the neuromotor system; with de- 
velopmental neurology. This approach, most 
useful as it is, results in less emphasis on some 
psychologically and psychiatrically important 
aspects of child development. The intense fo- 
cusing of interest on the conduct of the infant 
is not paralleled or supplemented by equally 
full consideration of his primary relationships 
with the mother and other members of the 
family. These considerations are not neglected 
in the book but are not sufficiently brought into 
focus to give as fully rounded a picture of the 
early development of the child as the rich mate- 
rial of the book might permit. A word of cau- 
tion also is in order. It would be easy to gain 
the impression that the use of the develop- 
mental schedules and the conduct of the de- 
velopmental examination is a simple matter and 
that any physician without particular prepara- 
tion might apply them in his office practice. As 
a matter of fact, if authentic use of Dr. Gesell’s 
developmental norms is to be made, the physi- 
cian should be carefully schooled and practiced 
in the conduct of the examination, and the 
greatest care and meticulousness exercised. 
Successful use of the examination, particularly 
for prognostic purposes, requires closest at- 
tention to very minor deviations in the infant's 
responses. The use of the refined tool Dr. 
Gesell has developed calls for skill and pre- 
cision, and therefore training in its use. The 
volume is well printed, the language through- 
out is clear, simple and concise, and there are 
abundant photo tracings. 

J Consulting Psychol 6:272-3 S-O ’42. 
Helen L. Koch. * Since no new basic data are 
being presented, those crudities of standardiza- 
tion and quantification which obtained in the 
case of the findings reported previously, still ob- 
tain. Simplification of presentation is achieved 
by the use of charts, phototracings, and by 
tabular arrangements which make it easy to get 
both horizontal and vertical views of the be- 
havior complex Gesell is trying to analyze. The 
behavior picture, furthermore, is sketched in 
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detail only for what have been called key ages 
—4, 16, 28, and 40 weeks and 12, 18, 24, and 
36 months. The physician is encouraged to 
compute a Developmental Quotient for his 
child patient, or even several quotients. In the 
derivation of these he is allowed much liberty. 
Unfortunately, Gesell gives no hint of the effect 
upon the magnitude of the D.Q. of such fac- 
tors as test composition, statistical artifacts, or 
standardization procedures. One is told that 
"in uncomplicated cases we assume the rate 
will remain more or less constant throughout 
the whole cycle of behavior growth." (P. 114) 
The California, Fels Foundation and Harvard 
results, for example, whether because of the 
nature of mental growth or the instruments 
used to reveal it, would scarcely support this 
statement unless the terms “uncomplicated” 
and “more or less” are stretched seriously. 
Since the average physician has no training in 
psychometrics, statistics, child psychology, or 
allied disciplines and since, furthermore, his 
judgments are likely to reflect themselves 
promptly in recommendations for the treatment 
of his patient, the writer wonders if both the 
physician and his patient might not fare better 
if the former employed some one adequately 
trained to make and evaluate the behavior de- 
terminations just as in the case of blood chem- 
istry analyses, etc. * In spots the content of the 
volume resembles that of a text on differential 
diagnosis, though the treatment of particular 
disorders and symptoms is scarcely sufficiently 
thorough and detailed to justify this classifica- 
tion. The volume apparently aims rather to in- 
form the physician of how children with vari- 
ous defects—birth injuries, lesions of specific 
areas of the nervous system, sensory handi- 
caps, convulsive disorders, endocrine anamolies 
and amentia—may be expected to perform 
when faced with the tasks around which Gesell's 
normative schedules have been built. Consis- 
tent with this objective, the authors seldom 
mention any contributions of other workers to 
the problems considered or provide any sub- 
stantial.evaluation of controversial theories as 
to the causation of any of the disorders dis- 
cussed. Since the writer is limited to a very 
brief review, she will simply list a few out of 
a considerably larger number of statements 
which she believes are due some challenging. 
(1) “Formed connections have been made be- 
tween: nerve fibers and muscle fibers. A be- 
havior pattern- has taken place. All behavior 


patterns both in prenatal and postnatal life 
take shape in a comparable manner.” (P. 4.) 
(Is not this at best, even if the particular 
wording is ignored, an oversimplification not 
necessary in a medical text?) (2) “The D.Q., 
unlike the I.Q., is not limited to a single inclu- 
sive formula. A distinctive D.Q. can be de- 
rived for each of the four major fields of be- 
havior; for specific functions like prehension, 
locomotion, manipulation, etc. This makes the 
D.Q. an adaptable device. The I.Q. registers 
changes in the growing complex of behavior. 
Fluctuations in a general D.Q. or in specific 
D.Q.’s denote intrinsic and extraneous factors 
which are subject to interpretation.” (P. 115.) 
(3) “In some of its manifestations precocity 
may be regarded as a form of environmental 
acceleration, It may reflect a disproportionate 
emphasis on verbal behavior ; but such an over- 
stimulation can scarcely produce primary ac- 
celeration. It is more in the nature of excessive 
specification and overorganization." (P. 272.) 
(Since "primary" in the text is used to mean 
genetic, Line 4, in effect, offers proof by defini- 
tion.) (4) “Well-defined speech prior to the 
first birthday, however dependent on models 
for imitation, can scarcely be ascribed to en- 
vironmental forcing. It represents an inherent 
growth characteristic, biologically determined." 
(P. 273.) (Evidence? Of course, for anything 
the organism does it must have the necessary 
structure, but this consideration does not illu- 
minate the problem of individual differences 
which is implied.) The format of the volume 
is good. Type and margins are large; punctuat- 
ing devices numerous. 

J Ed Res 35:226-7 N "41. Carter V. Good. 
* another of the important contributions by 
Gesell and his associates * While....addressed 
primarily to the physician, it contains much of 
value to educational and psychological workers 
* much of the background material on which 
the new book is based has appeared elsewhere, 
addressed more directly to workers in educa- 
tion and psychology. * 

J Nerv & Mental Dis 01:89 Ja’45. A(rthur) 
N. F(oxe). * the fruit of many years of indus- 
trious work and collecting of data on the in- 
fant and child. The authors' work in this field 
is already well known and here they add to the 
accomplishment. The value of having norms is 
undisputed, They are presented one after the 
other for each age period in the child. As 
stated by the authors, the volume should be 
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very helpful to the pediatrician and general 
practitioner, * 

J Speech Disorders 11:247-8 S '46. Warren 
H. Gardner. Speech pathologists are interested 
in the development and maturation of the nor- 
mal and abnormal child, especially as these re- 
late to the acquisition or lack of average speech. 
They seek to learn the causes of deviations in 
speech development. Development Diagnosis is 
replete with hints, suggestions and diagnostic 
indications of abnormal development that leave 
little doubt of possible implications with speech 
deviations. Normal total integrated functioning 
of behavior is presented in many stages of de- 
velopment together with retarded or halted 
stages. * A most interesting chapter on sensory 
handicaps presents the correlation of such 
deviations to behavior responses, for example, 
the relation of deafness to quality and degree 
of vocalization, visual attention, social adapta- 
tions, and emotional behavior. Throughout the 
discussions the presence or absence of speech 
or stages of its development reveals the sig- 
nificance of growth deviations to speech de- 
fects. An interesting chapter entitled ‘Environ- 
mental Retardation' points to the tragedy of 
institutionalized children or those with unfor- 
tunate backgrounds together with symptoms of 
their misfortune. Speech deviations often are 
among these symptoms, This book should be 
on the ‘must’ list of collateral reading in speech 
pathology, and should be on the desk of every 
speech specialist. 

Psychosom Med 5:108 Ja '43. Alfred A. 
Washburn, Many of us who have watched with 
admiration the painstaking studies of Dr. Gesell 
and his group at the Yale Clinic of Child Devel- 
opment have hoped for a book from Dr. Gesell's 
pen which would present the gist of his work in 
a form such that it might be applied by the busy 
practitioner to the everyday problems of child 
care. "Developmental Diagnosis" is a delightful 
and fascinating answer to this hope. Presented 
with clarity and simplicity, with a choice of 
words and illustrations inimitably Gesell's, it 
should furnish both an ideal text for medical 
students and an indispensable reference book 
for the young practitioner, Nearly every one 
dealing with children might read “Develop- 
mental Diagnosis" with both pleasure and profit. 
The four chapters of Part I are concerned with 
the principles and methods of establishing a de- 
velopmental diagnosis. * The simplicity of this 


presentation should appeal to busy practitioners 
or welfare workers who must have such material 
in a form easily and pleasantly available if they 
are to make good use of it. The growth of be- 
havior is made to stand out, as it should, as an 
integral part of the essential unity of the de- 
veloping organism. Part II contains Chapters 
V-XVII, dealing with the diagnosis of defects 
and deviations of development. * Although there 
were moments when I found Gesell a little too 
rigid and dogmatic for my point of view yet 
I found a store of wisdom and kindliness in 
every chapter. * The discussion of cerebral in- 
juries is thought-provoking. * In his discussion 
of special sensory handicaps and of child adop- 
tion Gesell demonstrates the depth of his under- 
standing of human nature during its early years 
of development. And in his chapter on environ- 
mental retardation he balances nicely the genetic 
and environmental forces, * Part III closes the 
volume with the emphasis on treatment in two 
brief and concise chapters, "Diagnosis and 
Guidance" and "Developmental Supervision." 
Here he presents the form for a preliminary 
behavior inventory which is surely brief and 
simple enough to be of value to even the busiest 
of clinicians. * In order to make the book of 
maximum use to those readers who wish to 
follow the study of developmental diagnosis in 
their own work the authors have included 84 
pages of appendices. Here may be found de- 
tailed descriptions, charts and tables which cover 
the techniques and equipment of the examina- 
tion, of charting growth trends, and of making 
cinema studies, 

Q R Biol 17:92-3 Mr '42. A series of publica- 
tions on the pioneer work of Gesell and his 
co-workers has preceded this book. Develop- 
mental Diagnosis, which is destined to become 
a classic, has the advantage over these earlier 
books in that it is much more inclusive as regards 
subject matter. It also gives the most concise 
and workable presentation to date of the Gesell 
clinical testing material. * Chapter III, by far 
the longest in the book, is the most basic in that 
it "integrates the developmental tests, the be- 
haviour characteristics, and the growth trends 
of the behaviour patterns for the period from 
4 weeks to 3 years. This chapter is organized 
for convenient reference and is illustrated with 
over a hundred photo-tracings of normative 
behaviour patterns." * an excellent working 
index. 
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[Re Gesell Developmental Schedules.] Geser., 
ARNOLD; AMATRUDA, CATHERINE S.; CASTNER, Bun- 
TON M.; AND THompson, Heren. Biographies of 
Child Development: The Mental Growth Careers 
of Eighty-Four Infants and Children: A Ten- 
Year Study From the Clinic of Child Develop- 
ment at Yale University. New York: Paul B. Hoe- 
ber, Ses 1939. Pp. xvii, 328. Out of print. * (PA 13: 
3325 


Am J Psychiatry 97:096-8 Ja '41. W. Line. 
* The interested reader will find in each case 
history a considerable amount of fascinating and 
definitely stimulating material. Insofar as any 
comprehensive thesis emerges, it is that later 
development can be predicted, along certain 
potentials at least, with a degree of accuracy that 
is indeed encouraging—provided that the pre- 
dictions are based upon discerning observations. 
Even individuality often shows itself in infancy 
significantly for prediction purposes, and, again, 
along some lines more than others, So that, on 
the whole, the report gives considerable evidence 
of the validity and solidarity of the objective 
observational approach to the field of child de- 
velopment. In addition to the general thesis, 
almost every page of the book reflects some 
challenging tentative suggestions * The aim of 
dynamic psychology, as expressed by Allport, 
for example, is to be increasingly able to predict 
what can be accomplished with an individual, 
rather than what he (because of physical or 
mental constitution) will become. This point of 
view is in marked contrast with that of the 
volume under review—not in being contrary, of 
course, but only in being complementary. It is 
one, however, to which the whole field of child 
study must give increasing attention. 

J Consulting Psychol 5:90 Mr-Ap ' 41. Eliza- 
beth E. Lord. In Biographies of Child Develop- 
ment we are shown the practical application of 
the testing techniques described in the earlier 
books by Gesell. A familiarity with these earlier 
books, more particularly with The Mental 
Growth of the Pre-School Child and Infancy 
and Human Growth would seem to be a pre- 
requisite for a vivid, meaningful appreciation of 
these biographies, since “the analytic appraisal 
of maturity” is obtained through the clinical 
use of the Gesell schedules. For the clinical 
psychologist concerned with the appraisal and 
prediction of mental growth these studies are 
of special interest. * 

J Genetic Psychol 58:221-4 Mr '41. Living- 
ston Welch, * Examples of biographical analysis 


that should remain as classics are to be found 
in this book. The cases under discussion are 
limited to 84 and these have been carefully 
selected from many thousands. * Apart from 
any specific interest in predictability, much is to 
be gained by a careful study of a great number 
of the biographies of atypical cases. * Both the 
clinical material and theoretical discussions are 
the most interesting products that so far have 
come from the Clinic of Child Development at 
Yale. They constitute invaluable contributions 
to child psychology and psychiatry. 

Nature 145:140 Ja 27 '40. * The primary aim 
of the authors is to assess the prognostic value 
of the normative criteria used at the Yale Clinic 
of Child Development. The book is carefully 
written and may be recommended to psycholo- 
gists in Great Britain. 


For additional reviews, see 40:B913. 
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ke Gesell Developmental Schedules.] GeseLL, 

RNOLD; Hatverson, Henry M.; THompson, HELEN ; 
Irc, Frances L.; Castner, Burton M.; Ames, Louise 
Bates; AND AMATRUDA, CATHERINE S. The First 
Five Years of Life: A Guide to the Study of the 
Preschool Child. From the Yale Clinic of Child De- 
velopment. New York: Harper & Brothers, 1940. Pp. 
xiii, 393. $4.00. * (London: Methuen & Co. Ltd., 1941. 
215.) (PA 14:3804) 


Am J Dis Children 60:1009 O '40. * This 
authoritative work...probably gives the most 
accurate picture of physical and intellectual 
growth which has appeared. * Like all of the 
books by Gesell and his associates, this book is 
written in a very direct style, the terminology 
‘is easily understood and it is interesting through- 
out. It should be read by. every one working 
with children. 

Am J Phys Anthrop 27:171-2 Je 29 '40. 
A(les) H(rdlicka). * Dealing only with the 
White American child and that of a restricted 
locality, it is of but secondary interest to anthro- 
pology; and it gives but little attention to the 
numerous highly interesting behaviors reminis- 
cent of human ancestry. But it includes....many 
tests and observations which shed considerable 
light on child mentality. How much more valu- 
able would all this have been had it included 
some racial—at least the Negro—and other 
group comparisons, made by the same methods 
and same observers. But perhaps that is re- 
served for a further endeavor. 

Am J Pub Health 30:1125-6 S ’40, Richard 
A, Bolt. There is no book of which this reviewer 
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is aware that gives a more complete picture of 
the normal growth and development of the pre- 
school child in so compact a form as this... 
volume * The care with which the material has 
been selected out of twenty years of funda- 
mental research is evident on every page. We 
have here not only the results of carefully 
planned scientific observations but also a philos- 
ophy and methods which should point the way 
to productive future studies. The task “is to get 
a better appreciation of the individual ways in 
which individual preschool children mature." 
We grasp the central concept of this illuminat- 
ing volume when we realize in its own words 
that “The total child ceases to be an academic 
abstraction as soon as we try to ascertain the 
grouping of his behavior traits and the trends 
of his growth career." The contributors to this 
volume have preserved a remarkable unity of 
purpose in presenting a great diversity of mate- 
rial. * This reviewer having a keen interest in 
photography is especially impressed with the 
pictorial illustrations covering twenty-one pages. 
* The section on Examination Records and 
Arrangements should prove very helpful to all 
those actually interested in growth and develop- 
ment of preschool children. A list of 142 selected 
references completes the volume. Here is a book 
that cannot be too highly recommended. 

Brit J Ed Psychol 11:236 N '4r. C. W. 
Valentine). * The first part....by Dr. Gesell... 
provides an up-to-date survey of the results of 
the large amount of research that has gone on 
now for many years at the Yale Clinic. After 
general discussion of the nature of mental 
growth, Gesell turns to a detailed study of the 
first year of life, giving typical descriptions of 
the behaviour and accomplishments of the infant 
at one, four, six, nine, and twelve months of 
age. Of course, there are, at these stages, enor- 
mous individual variations ; but Gesell attempts 
to draw a recognizable picture of the 'average' 
child of these periods. The next chapter gives 
similar sketches of typical behaviour at eighteen 
months, two years, three years, four and five 
years. These are all valuable in that they are 
more than the itemized results of specific tests, 
though they are based to a very considerable 
extent upon tests and observations on specific 
points. Part IL..gives more detailed, special- 
ized accounts of certain aspects of development. 
Thus Dr. Halverson first writes on motor de- 
velopment, including walking and prehension. 
Dr. Helen Thompson provides a long chapter 


on 'adaptive behaviour,' including drawing and 
number concepts. Dr. Castner deals with lan- 
guage development and others collaborate on 
personal and social behaviour, including play 
activities, developmental detachment, and so 
forth. All these form valuable supplements to 
the more general unified descriptions given in 
Part I. In Part III the chief section deals with 
what is called the ‘development examination.’ As 
usual, Dr. Gesell avoids confining his tests to 
those of general intelligence. The book con- 
cludes with schedules of the standard tests. It is 
useful to have a revised list of these according 
to the latest results obtained at the Yale Clinic. * 
even for those who are familiar with the numer- 
ous publications of Dr. Gesell and others of the 
Yale Clinic, this further general summary and 
survey forms a most useful and important con- 
tribution to the study of the pre-school child. 

J Ed (London) 74:134-6 Mr ‘42. J. W. 
Jenkins. * an admirable guide to the study of 
behaviour development in young children. It is 
a book which all nursery-school teachers and 
clinicians should consult. Dr. Gesell edits the 
book....and gives a preliminary outline of the 
process of development from birth to the sixth 
year. The terminology he uses is, in general, 
non-technical, his style is attractive and well 
suited to his subject. It is designed to appeal 
to a wide circle of readers, and many parents 
will find much to interest them in these pages. 
The use of a pictorial survey is a characteristic 
feature and shows children of various ages 
performing both individually and collectively. 
The second section, which is contributed by Dr. 
Gesell's collaborators, is an objective study of 
behaviour development in four major fields— 
motor characteristics, adaptive, language, and 
personal-social behaviour. Full details are given 
of the standardized procedures followed by the 
research workers and, as far as possible, they 
describe their technique for enlisting the co- 
operation of the more wayward infants. The 
experimental method as a whole is dominated 
by a liberal and humane application of psycho- 
logical principles. It is difficult to overestimate 
the value of this work, and one cannot help 
feeling that funds should be available in this 
country for clinics to be set up, similar to the 
Yale Clinic, in which child guidance work could 
be supplemented and research organized into 
the behaviour development of children at these 
early formative stages. Finally, in the last sec- 
tion the conduct of the developmental examina- 
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tion is considered. The object is to diagnose a 
child's behaviour level in the four fundamental 
categories treated in section two. This is done 
by matching the performance of an individual 
child with a series of schedules which are de- 
rived from an intensive study of a ‘normative 
group' of children, This group was small and 
represents a fairly homogeneous sample of chil- 
dren under 6 years of age. It is clear, from a 
study of these schedules, that in a number of 
cases the variability in performance is consid- 
erable and it is to be expected that with an un- 
selected population this would be even greater. It 
seems desirable that some further investigation 
should be made into the age distribution of 
children responding in a way considered typical 
of an age group. As these schedules are more 
widely used it may be found necessary to give 
a more specific measure of this variability. The 
results are stated as percentages and when inter- 
preting these results the size of the groups in- 
volved must be borne in mind. The percentage 
„scale is liable to mislead when used with such 
small numbers of children. These are very minor 
criticisms which merely concern the form in 
which the results are presented. There can be 
no question of the great significance and value 
of this work to child study, and should these 
Schedules require some slight modification in 
the light of further research it is doubtful 
whether the differences made will seriously af- 
fect diagnosis. 

Psychoanalytic Q 10:475-8 Jl "41. Ives Hen- 
drick. To all students of the development of 
mind and personality, a new book by Arnold 
Gesell is of special interest. His life work has 
become one of the important cornerstones of 
our knowledge of how the infant becomes a 
child. * There is a cleanness about both the 
technique and conceptual vision of Gesell which 
is seldom muddied by work from other fields. 
He can frequently refer to the importance of 
social and environmental factors which he has 
not intensively studied without being sidetracked 
by the work of those who have. This perfec- 
tionist individualism has contributed both to the 
clarity of his work, and also to his scientific 
isolation. The psychoanalyst will be the first to 
point out that Gesell has studied only the build- 
ing material and not the carpentry of the indi- 
vidual’s personality development. Gesell himself 
says in the book we are reviewing (p. 13): 
‘Environment determines the occasion, the in- 
tensity, and the correlation of many aspects of 


behavior; but it does not engender the basic 
progressions of behavior development. These 
are determined by inherent, maturational mech- 
anisms.’ Freud’s own reiterated views of the 
importance of constitution refer more especially 
to the hereditary determination of drives and 
‘choice of neurosis’ than to the effectors which 
Gesell especially studies; but there is no es- 
sential contradiction. The analyst can find in 
Gesell's results much empirical material for 
checking his own inductions concerning infan- 
tile development, and especially the rudiments 
of the ego. The importance of such factors as 
fantasy, pleasure and pain, object relationships, 
and identification in determining the eventual 
selection and configuration of potentialities in 
the development of personality are vaguely rec- 
ognized by Gesell, but not intensively studied. 
But morphology and physiology are none the 
less fundamental because they are not the whole 
story. The First Five Years of Life....will not 
become a basic book in child psychology only 
because it is essentially a restatement of those 
methods, results and fundamental concepts 
which were definitively presented in The Mental 
Growth of the Preschool Child (Macmillan, 
1925) and Infancy and Human Growth (Mac- 
millan, 1928). The new book does not present 
a basically new research, as did Feeding Be- 
havior of Infants, by Gesell and Ilg (Yale Uni- 
versity Press, 1937), nor is it a new technique 
of clinical reporting as was the detailed photo- 
graphic manual, An Atlas of Human Behavior 
(Yale University Press, 1934). Some of the 
unity and literary effectiveness of the earlier 
books are sacrificed to the advantages of col- 
laboration with his associates. It gains chiefly 
from the amplification and refinement of many 
details in consequence of the work of the inter- 
vening years and from the reiterated emphasis 
on the clinical viewpoint and depreciation of 
laboratory quantification for its own sake. * 
There is a bibliography and an excellent index. 

Q R Biol 15:509 D ’4o. * beautiful photo- 
graphic illustrations * probably the most author- 
itative and comprehensive work on a subject in 
which the work of the author and his group 
have been greatly responsible for such progress 
as has been made. 

Univ Mich Sch Ed B 12:32 N ’40. * The 
book is detailed and authoritative, replete with 
illustration and example, and written in clear, 
nontechnical style. It will be an important refer- 
ence in psychology and a source of much specific 
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background for prospective teachers of nursery 
groups. 


For additional reviews, see 40:B914. 
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[Re Gesell Developmental Schedules.] Geszx1, 
ÅRNOLD, AND THOMPSON, HELEN; ASSISTED BY CATH- 
ERINE STRUNK AMATRUDA. The Psychology of Early 
Growth: Including Norms of Infant Behavior and 
a Method of Genetic Analysis. New York: Mac- 
ig is 1938. Pp. ix, 290. Out of print. * (PA 
24211 

Brit Med J 4046:182 Jl 23 '38. * important 
book of reference * Thanks to this work we are 
beginning to obtain a very fair idea of what 
the normal infant can do at various ages and 
how far deviations from the normal are to be 
regarded as non-pathological. It is only by estab- 
lishing this norm that we can study abnormali- 
ties with intelligence and discover what condi- 
tions in infancy may be recoverable and what 
must be regarded as inevitably permanent handi- 
caps. The medical man who is concerned with 
the diagnosis and prognosis of injuries and de- 
velopmental defects in infancy will find a wealth 
of material in this volume which will serve him 
in very good stead in controlling his own obser- 
vations. 


For additional reviews, see 40:Bg16. 
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XInfant Intelligence Scale. Ages 3-30 months; 
1940; a downward extension of the Revised Stanford- 
Binet Scale; individual; $45 per testing outfit, post- 
paid; oog per 25 record forms, postpaid; $4.00 per 
manual (The Measurement of Intelligence of Infants 
and Young Children by Psyche Cattell) ; (20-40) min- 
utes; Psyche Cattell; Psychological Corporation. 


REFERENCES 
1., CATTELL, PsvcHr. The Measurement of Intelligence of 


Infants and Young Children. New York: Psychological Corpora- 
tion, 1940. Pp. 274. $4.00, postpaid. * (PA 15:2397) 

2. CarrELL, PsvcnE. “Intelligence of Infants and Its Meas- 
urement.” Trans N Y Acad Sct, Series II 3:162-71 Ap ‘41. * 
(PA 15:4041) 

Florence M. Teagarden, Professor of Psy- 
chology, The University of Pittsburgh, Pitts- 
burgh, Pennsylvania, The research from which 
the test was developed was conducted at the 
School of Public Health of Harvard University. 
The “standardization is based on 1,346 examina- 
tions made on 274 children at the ages of three 
(months), six, nine, twelve, eighteen, twenty- 
four, thirty, and thirty-six months.” As is 
usually true in the construction of a new scale, 
currently existing scales provided some of the 
material for test items. “A study of the reliabil- 
ity and validity coefficients and of the individual 
IQ curves gives indications that the tests from 
about fifteen months on have nearly as high 


reliability and predictive value as do most intel- 
ligence tests for children of school age." Most 
clinicians will probably agree with Dr. Cattell 
that the scale at the lower levels, where predic- 
tion is not high, nevertheless is “of considerable 
assistance in appraising an infant." 

The testing materials, nearly all of which are 
again available, are relatively simple and inex- 
pensive. They come packed in a convenient con- 
tainer. What is of even greater importance is 
that the materials appeal to infants and young 
children. Some of them are familiar to those 
who have had experience with earlier infant 
tests. Directions for administering and scoring 
are, for the most part, clear and definite. 

The testing begins at the two-month level, 
each of the five test items at this level being 
given .2 month (two-tenths month) credit. Two 
alternate test items are also provided at this 
level. This plan is carried out for each month ~ 
through the twelve-month level. At that point 
two months are grouped together. For example, 
at the thirteen and fourteen months level there 
are five items, each item carrying .4 month 
(four-tenths month) credit. Beginning at the 
thirty-one-month level the test is the same as 
the corresponding levels of the Revised Stan- 
ford Binet Scale, Form L. The test runs through 
the four-year level. 

Cattell’s Infant Intelligence Scale is a great 
comfort to a clinician who wants to add some 
quantifying data to his own observations, infant 
behavior inventories, and hunches. The test does 
not permit an analytical breakdown into the 
various functions or mental operations dis- 
played by the infant. Such a breakdown is a 
merit of the different Gesell standardizations. 
Of course an experienced clinician can make his 
own analysis of the baby’s Cattell performance, 
but there is no provision for quantifying or 
recording separately skill in gross motor abil- 
ities, prelanguage ability, social characteristics, 
and so on. 

The answer at which the clinician arrives by 
the use of the Cattell scale is a “mental age” 
to tenths of a month! One may not care to avail 
himself of the IQ technique for which the test 
thus provides. One may wish that some discrep- 
ancies in age placement between the manual, 
the test-blank items, and the test recording 
spaces could be eliminated (as they no doubt will 
be). Nevertheless, the Infant Intelligence Scale 
is a valuable clinical adjunct in the field of infant 
and preschool testing. 
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Beth L. Wellman, Professor of Child Psy- 
chology, Iowa Child Welfare Research Station, 
The State University of Iowa, Iowa City, Iowa. 
Cattell's scale represents an assembly of a large 
number of test items very well adapted to in- 
fants in terms of motivation and interest. The 
procedures are clearly described, and the manual 
is well documented with photographic illustra- 
tions, 

Selection of test items to be included in the 
scale was mostly made from the Gesell array. 
Many items, however, were summarily dis- 
missed, without trial, as those “which were 
thought to be unduly influenced by home train- 
ing or to depend mainly on large muscular con- 
trol” (1, p. 23). It is difficult for the reviewer 
to grasp the rationale of omitting items influ- 
enced by home training, since if this procedure 
were to be followed in other tests at older ages 
Such items as vocabulary, making change, and 
many others would be eliminated. With the ob- 
jective of eliminating items dependent on large 
muscular control the reviewer is heartily in 
accord, since the usual finding at all ages is lack 
of relationship between muscular control and 
intelligence. However, inspection of the items 
included in the Cattell scale leads one to feel 
that a good many items did creep in, neverthe- 
less, that were mainly motor in character. For 
example, such items as hand preference, lifts 
head, lifts chest, head erect and steady, appear 
highly questionable. Cattell made no item analy- 
sis or validation of items except to give per cent 
passing by age. It is unfortunate that the work 
of Richards and Nelson * on the Gesell items 
appeared too close to the time of the publication 
of the Cattell scale for the results to be utilized. 

The avowed purpose of the Cattell scale was 
that it was “so constructed as to constitute an 
extension downward of Form L of the Stan- 
ford-Binet tests” (1, p. 24). In spite of the fact 
that the test items were placed in such a man- 
ner as to bring the median IQ's as close as 
possible to that obtained with the Stanford- 
Binet at three years, the scores at the youngest 
QUERER TCU NT a 
and Its Predictive Value for Performance on Mental Tests at 
Two and Three Years." J Genetic Psychol sugas Je '38. 


Nelson, Virginia L., and Richards, T. W. “Studies in Mental 


elopment: III, Performance of Twelve-Months-Old Chil. 
dren on the Gesell Schedule, and Its Predictive Value for Men- 


tal Status at Two and Three Years." J Genetic Psychol 54:181- 


91 Mr 739. 

Richards, T. W., and Nelson, Virginia L. “Studies in Mental 
Development: II, Analysis of Abilities Tested at the Age of 
Six Muss by the Gesell Schedule." J Genetic Psychol 52:327- 
31 Je '38. 

ards, T. W., and Nelson, Virginia L. “Abilities of In- 


fants During the First Eighteen Months.” J Genetic Psychol 
7$$:299-318 bj ^39. w = xs 


ages failed to correlate to any appreciable de- 
gree. The correlation of the score at 3 months 
with Form L at 36 months was .10, at 6 months 
.34, and at 9 months .18. From 12 months on 
the correlations ranged from .56 to .83. Thus, 
if validity is to be judged in terms of correla- 
tion with later measures, this scale is of doubt- 
ful validity below 12 months, and the scale 
did not meet its main purpose for the first year. 
of life. 

It is unfortunate that the author showed a 
complete unawareness throughout of the work 
of Kuhlmann and the fact that he had pub- 
lished a scale in 1922 (revised in 1939) ex- 
tending through infancy. While Cattell's scale 
has a larger number of items, it remains to be 
seen whether the selection of items was more 
adequate. 

Cattell's standardization group was not large 
(274 children), and she makes no claim that 
they represented a random sample. 


Sch & Soc 54:475-6 N 22 '4r. Norma V. 
Scheidemann. * This scale represents an enor- 
mous task painstakingly carried out. It is a 
definite improvement over all other infants 
tests now available. Because of its many advan- 
tages it is by far the most usable device for 
testing infants and should prove to be of in- 
estimable value to clinical and research psy- 
chologists who are competent to administer and 
interpret intelligence tests with due regard to 
their significance and full awareness of their 
limitations. Even psychologists not specifically 
concerned with mental measurement will be 
impressed with the amount of objectivity and 
scientific accuracy the author has found pos- 
sible to incorporate in this scale. It is the re- 
viewer's belief that the Cattell scale will be- 
come the standard instrument for testing in- 
fants ; that it will attain a prestige among infant 
scales comparable to that attained by the Stan- 


ford—Binet among intelligence scales for older 
children. 
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X[Re Infant Intelligence Scale.] CarrELL, PSYCHE. 
The Measurement of Intelligence of Infants and 
Young Children. New York: Psychological Corpo- 
ration, 1940. Pp. 274. $4.00. * (PA 15:2397) 


Am J Orthopsychiatry 11:607 Jl 4r. Helen 
Speyer. * a test which many psychologists en- 
gaged in testing infants and young children 
will welcome * is so constructed that it con- 
stitutes a downward extension of Form L of 


369 ] 


TESTS & REVIEWS: INTELLIGENCE—INDIVIDUAL 


283 


the Stanford-Binet. This gives it a tremen- 
dous advantage over other preschool tests, 
since it provides a simple scale with which a 
child may be examined from infancy to ma- 
turity. One error in re-testing is thus elimi- 
nated, as results obtained on different scales 
will not need to be compared and equated. * 
Statistical findings are clearly presented. Chap- 
ter 3, which gives instructions for administer- 
ing the examination, is especially fine and 
shows that Dr. Cattell is well aware of some 
of the difficulties involved in the clinical use of 
preschool tests. Some delightful illustrations 
of infants' responses to test situations will help 


an examiner in scoring. This scale fills a defi- . 


nite need and many of us will look forward to 
widespread use and early confirmation of its 
value. 

Am J Psychol 58:153 Ja '45. T. L. McCul- 
loch. * Apparent errors of item-placement are 
numerous. According to the tables, e.g. all 18- 
month items are more difficult than any regu- 
lar items placed at 20 and 22 months. Other 
types of error throughout the book are of 
relatively minor importance. Validity and reli- 
ability are reported as being very low at lower 
age-levels, but acceptable at higher levels. The 
author stresses the importance of not attaching 
too great significance to scores from the scale 
alone. Most of the test-items offer great prom- 
ise of usefulness, but, because of irregular 
placements at in-between ages, results from 
administration of the present scale should be 
treated with even greater reserve than is liber- 
ally recommended by the author. 

J Appl Psychol 25:366-7 Je "41. C. M. 
Louttit. * Cattell gives valuable suggestions 
for examining young children and warns 
against using the scale as an instrument of pre- 
cision. The quantitative characteristics of the 
scale make it valuable for research purposes. 
However, its relatively low validity (r = .10 
between 3 and 36 months, and .71 between 2 
years and 3 years) makes individual prediction 
hazardous unless experienced clinical judg- 
ment supports the test score. This new scale 
appears to be a useful addition to present in- 
struments. Its value can only be proved by use. 

Psychol B 38:900 N ’41. Rachel Stutsman 
Ball. A rare degree of insight into the nature 
of young children and into the requirements of 
a satisfactory test for them has been combined 
in this new scale with a highly practicable or- 
ganization of the materials and methods of 


testing. * Some of the outstanding merits of 
this scale are: (1) The test age levels are set 
at each month for children from two to twelve 
months, for two-month levels from twelve to 
eighteen months, and for three-month intervals 
from eighteen to thirty months. This offers the 
potentiality of much greater accuracy in test- 
ing than earlier scales with greater intervals 
between age levels. (2) There are five items at 
each age level, with at least one alternate and 
often more. (3) The scoring of the scale is 
easily understood, since it follows the same 
method used in the Stanford-Binet scale. (4) 
Certain of the Stanford-Binet items are inter- 
mingled with the items from other sources at 
the ages twenty-two to thirty months to facili- 
tate the supplemental use of the scale with the 
Stanford-Binet scale. (5) The scale avoids 
time limits and timed tests. (6) The test scale 
needs to be administered in no definite order, 
and it has been standardized so that the child 
may be held on the mother's lap during the 
test if it seems desirable. (7) The examina- 
tion takes from twenty to thirty minutes. (8) 
The standardization is based on 1346 exami- 
nations on 274 children. (9) The directions for 
test administration are clearly and fully given 
with illustrative photographs for each test. 
(10) The author has given a helpful analysis 
of the adjustments and points of view neces- 
sary in testing infants and young children. * 
There is no exact statement of the criteria 
used for the placement of tests at the different 
age levels, although a table is given showing 
the per cent passing each test at all the signifi- 
cant age levels. Whatever method was used. 
apparently there needed to be considerable 
realignment of the items during the course of 
the standardization. * There is a tacit assump- 
tion that correlation with another test evalu- 
ates the validity of the test. The split-half 
method was used to determine reliability, 
which method some might question with mate- 
rial of this type. The test results upon which 
the scale was standardized are used to deter- 
mine the predictive value of the scale. * While 
this meagerness of statistical treatment is dis- 
appointing to those who realize its significance 
and danger, this fact should not be allowed to 
detract from this really valuable contribution 
to the mental measurements of young children. 
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X*Intelligence Tests for Young Children. Ages 
2-8; 1945; individual; requires only such apparatus 
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"that can readily be found or made in the school or 
home"; 4s. per manual; (30) minutes; C. W. Valen- 
tine; Methuen & Co. Ltd. 


REFERENCES 
1. Waxetam, B. B. “The Application of a New Intelligence 
ET 23 m piney oneal C ie Prediction of PAXNRCOR 
"rit 'syci 14:142-50 N '44. 19:1355. 
2. VALENTINE, CW Ii mcelligence Tests for jae Children. 
London: Methuen & Co. Ltd., 1945. Pp. xii, 67. 4s. * (Cleve- 
land, Ohio: Sherwood Press. $1.50) (PA 1913520) 


Brit J Ed Psychol 15:105 Je "45. C. Burt. 
For long there has been an urgent need for 
Some simple scale of intelligence tests for use 
with younger children in this country. The best 
known scales, such as the Terman-Merrill re- 
vision or the Binet tests, require apparatus 
that is at once bulky and expensive; and, fur- 
ther, they have been constructed and stand- 
ardized for American rather than for British 
children. * The author has selected from vari- 
ous well-known scales those tests which are of 
recognized validity and at the same time easy 
and interesting to apply; but in addition he 
has devised some extremely ingenious and at- 
tractive tests of his own. The clear and de- 
tailed instructions make the procedure perfectly 
comprehensible even to those with little or no 
formal training in the technique of mental 
testing, and include suggestive comments which 
could be read with advantage even by experi- 
enced testers at child guidance clinics. The con- 
struction, standardization and different uses of 
the tests are fully described in an Appendix; 
and the discussion incidentally brings out sev- 
eral pressing problems, which deserve the at- 
tention of future research workers. The amount 
of useful and instructive matter here packed 
into one small volume makes it a miracle of 
condensation; and the whole is based on a 
unique combination of common sense, scien- 
tific insight and a life-long study of the prob- 
lems of early childhood. A word of gratitude 
is due to the publishers for the excellent pro- 
duction under war-time conditions. 

Brit J Psychol 36:106 Ja ’46. * for the use 
principally of teachers and those who are not 
expert intelligence testers. * An excellent se- 
lection of both verbal and performance tests 
has been made; they are taken from Burt’s 
revision of the Binet-Simon tests, the Terman- 
Merrill and Merrill-Palmer scales ; some of the 
Gesell items are included; and the author has 
added some of his own. He also discusses the 
selection, use and standardization of the tests, 
and gives a very useful set of instructions and 
description of procedure. The whole forms an 


exceptionally valuable testing scale, consisting 
of items likely to be interesting and attractive 
to children and well designed to show up every 
facet of their intelligence. For certain of the 
biographical items relating to the younger chil- 
dren, which are taken from Gesell, the tester 
might have to rely on hearsay evidence; but 
these items are well worth including since they 
are such good indicators of the child's general 
maturity, If one point of criticism may be of- 
fered, it is that the scoring of mental age is 
rather involved, owing to the fact that the num- 
ber of tests, and hence their score value, differs 
at different ages. Perhaps in a second edition 
the author could simplify this—and also extend 
the scale for older children, including tests dif- 
ficult enough for the controversial 11+ group. 
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*xKent Series of Emergency Scales. Ages 5-7, 6-8, 
6.5-10, 9-14; 1932-46; also called E-G-Y Scales; indi- 
vidual; 1 form; 4 levels; 60¢ per manual; Grace H. 
Kent; Psychological Corporation. 
a) SCALE A. Ages 5-7; 1044-46; formerly called An- 
dover School-Entrance Test; $1.75 per 100, postpaid. 
b) SCALES B AND C. Ages 6-8, 6.5-10; 1946; $1.75 per 
100, postpaid. 
C) SCALE D. Ages 9-14; 1032-46; revision of Emer- 
gency Test or Kent E-G-Y Test; $1.75 per 100, post- 
paid. 
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Charles N. Cofer, Associate Professor of 
Psychology, University of Maryland, College 
Park, Maryland. For fifteen years the Kent 
E-G-Y Scales have served a real need, found 
especially in the military services during the 
war, for a rapid and adequate means of roughly 
evaluating an individual's intellectual ability. 
In this revision, the valuable features of the 
earlier forms have been preserved, and the 
scales seem to be stronger than before. 

The new scales contain many more questions 
than did the old forms. There are four scales 
in the revision as compared to three in the 
1932 edition, and the new scales cover the age 
range from 5 to 14 years. The use of independ- 
ent norms for each scale and the overlapping 
of each scale with the ones above and below 
it are important and valuable features. The 
content of the questions is varied and would 
appear to have interest value for almost any 
English-speaking subject. The reviewer's ex- 
tensive experience with the old forms of the 
scales leads him to recommend them highly as 
clinical instruments for rough evaluations; in- 
deed, in a few cases they have given a more 
satisfactory index of the patient's ability, as 
seen independently in his occupational and so- 
cial history, than have elaborate psychometric 
devices. 

"There are, however, serious limitations in the 
standardization and other methodological as- 
pects of the development of the scales. Since 


Dr. Kent is fully aware of these limitations and 
discusses them briefly in the manual, it seems 
unnecessary to mention them further here. Dr. 
Kent justifies the presentation of the scales in 
terms of their use as rough, approximate, 
screen devices not designed to supplant the 
formal tests. In the hands of competent and 
experienced clinicians the scales would not, in 
most cases, be used unwisely. However, they 
are apparently so simple and easy to adminis- 
ter and require so little material for adminis- 
tration that untrained, careless, or irresponsible 
persons are easily able to use them and may 
do serious harm through attaching great sig- 
nificance to the mental ages obtained. This re- 
viewer has seen often enough psychiatrists, 
social workers, and others regard any mental 
age as being as fixed as the patient's chrono- 
logical age. This type of occurrence points to 
the need for even more careful evaluation of 
sampling, reliability, standardization, and valid- 
ity for the simple techniques than perhaps for 
the more elaborate devices. 

A number of studies have been reported 
which showed the old scales to give good agree- 
ment with results from the Stanford-Binet, 
Wechsler-Bellevue, etc. E-G-Y scores have 
sometimes been found to vary with some con- 
stancy above or below scores obtained from one 
or another of such psychometric devices. It 
wotld add to the usefulness of the present scales 
if such data were available, and a revision of the 
manual might include such evidence at least 
for the old scales. A more complete description 
of standardization groups and processes than 
is given would likewise be valuable. 
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Linfert-Hierholzer Scale for Measuring the Men- 
tal Development of Infants During the First Year 
of Life. Ages 30-152, 15965 days; 1928; individual ; 
out of print; Harriette-Elise Linfert and Helen M. 
Hierholzer; Catholic University of America Press. 
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. Nancy Bayley, Lecturer in Psychology and 
Research Associate, Institute of Child Welfare, 
University of California, Berkeley, California. 
This is one of the early scales for measuring 
infant intelligence, and its value is now pri- 


_ marily historical. It is derived in part from the 


1925 Gesell tests. The scale is notable, not only 
because it is one of the first efforts to describe 


carefully the administration and scoring of in- 


fants’ responses to a varied selection of stand- 


atdized test situations, but also because it in- 
cludes statistical evaluations of the reliability 
and validity of numerical scores based on test 
items passed. The sample of infants tested was 
selected to be a representative cross-section of 
the American white urban population, using 
Chapman-Sims socio-economic scores as a cri- 
terion. A total of 300 infants (50 each at 
months 1, 2, 4, 6, 9, and 12) were tested. The 
materials are simple, including colored balls, a 
ring, feathers, cubes, teaspoon, colored pic- 
tures, etc. The tests have been divided into two 
series. Series I, comprising 29 tests, applies 
to infants 1 through 5 months: Series II, com- 
prising 35 tests, is applicable to infants aged 6 
through 12 months. 

There may be some question as to the suita- 
bility of calling these tests scales of "mental" 
development because of their inclusion of 
varied types of behavior. The items in Series 
lare for the most part tests of sensory per- 
ception, gross motor coordinations, and simple 
reflex reactions. About one third of the items 
in Series IT are of gross bodily coordinations, 
such as standing and walking. About a fifth 
involve language comprehension and use. The 
others test adaptive responses or learning as 
evidenced in simple games and habit training 
(the latter often depending on the mother's 
report). 

"Test scores are the per cent passed of the 
total number of items given (usually the num- 
ber in a series). The means for each age were 
plotted and found to show essentially straight- 
line progression within each series. Smoothed 
age norms for each series were read from the 
plots. 

The split-half reliability of the tests (with 
Spearman-Brown correction) was calculated 
for each age. These 7s range from .70 to .88 
and average Sr. In testing validity, the scores 
at each age were correlated with the parents' 
rin Scores. The average of these 

:06. There were no significant differences 


in the scores of groups divided om the basis 
of number of siblings, of breast or bottle feed- 
ing, of health or of dentition. 

It is interesting to note that in a later study 
(5) the infants’ scores at 6, 9 and 12 months 
were found to have low negative correlations 
(of —.11, —.32 and —.20 respectively) with 
their scores on the Stanford-Binet at four 
years. 
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Minnesota Preschool Scale. Ages 1.5-6; 1932-40; 
individual; Forms A, B; $14.50 per testing outfit of 
either form ; $1 per 25 record blanks; $1 per specimen 
set of sample blank and manual, postpaid ; Florence L 
Goodenough, Katharine M. Maurer, and M. J. Van 
Wagenen; Educational Test Bureau. 
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Press, 15s. 6d.) (PA 21:286) 

Beth L. Wellman, Professor of Child Psy- 
chology, Iowa Child Welfare Research Station, 
The State University of Iowa, Iowa City, Iowa. 
The materials for this test are compact and very 
conveniently arranged in Manila envelopes 
bound together in a booklet with attached in- 
structions. In general, they are attractive to 
children, although not particularly colorful or 
artistically appealing. The 1940 manual is an 
improvement over that of 1932 in the follow- 
ing respects: presentation of test procedures 
and scoring, greater detail on scoring, and con- 
solidation of several tables into one for trans- 
lating score points into C-scores, per cent place- 
ment and IQ equivalents. 

Theoretically it would be an advantage to 
compute separate verbal and nonverbal IO's, 
for which the scale provides, if the term non- 
verbal could be clarified. This the authors have 
not been able to do, either in the manual or in 
subsequent publications. No help is given the 
examiner on how to interpret the two scores. 
Curiously enough, the nonverbal items do not 
extend below 2 years 6 months, the very ages 
at which verbal responses are most difficult to 
elicit and at which negativism and resistance 
are most frequently encountered. 

Although two forms of the test are avail- 
able, Forms A and B, the manual states that 
both should be administered whenever any- 
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thing of real importance is to be determined 
Írom the score. This, of course, obviates one 
of the chief advantages of alternate forms 
that of interchangeability. i 

Little information is given about the stand- 
ardization group. In the r932 manual there 
was promised a future monograph which, so 
far as the reviewer has been able to ascertain, 
has never appeared. The size of the standardi- 
zation group (900 cases, ages 174 to 6 years) 
appears adequate. The main bases of selection 
were occupation of father and sex. All of the 
children were urban and from one section of 
the country. 

In their r942 publication (4) the authors 
state that the computations of C-scores and IO 
equivalents were based on a wrong table of 
standards, but the correct values have not been 
published. 

The maximum IQ equivalent that is provided 
for at any age is 152; this maximum will be 
confusing to those accustomed to dealing with 
IQ's on the Binet tests ranging from 150 to 
200. That the scale lacks sufficiently difficult 
items to sample the abilities of bright children 
older than about 3% years is shown by the 
table of score points necessary to obtain the 
maximum IQ equivalent of 152 (or roo per 
cent placement). On the verbal scale these are, 
at ages 3-6, 4-0, 4-6, 5-0, and 5-6, respectively : 
71, 74, 76, 78, and 79 points out of a possible 
80. Out of a possible 55 on the nonverbal they 
are: 48, 5I, 53, 55 (at 5-o where a perfect 
score is required). Since it is not at all un- 
common for younger children to obtain IQ’s of 
150, the implications are, to note only two: 
(a) groups of superior children at 5 years are 
likely to show considerably lower IQ’s than 
groups similarly sampled at 3 years; (b) the 
test should be used with great caution in a 
study of IQ changes. 


For reviews by Rachel Stutsman Ball, Nancy 
Bayley, and Florence M. Teagarden of the 
original edition, see 40:1407. 
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Bie Minnesota Preschool Scale.] GoopENoucH, 
LORENCE L., AND MAURER, KarnaAnINE M. The Men- 
E Growth of Children From Two to Fourteen 
NS A Study of the Predictive Value of the 
innesota Preschool Scales. University of Minne- 
Rue Institute of Child Welfare, Monograph Series, 
Ae 20. Minneapolis, Minn.: University of Minnesota 
ress, 1942. Pp. xv, 130. Out of print. (PA 16:4562) 


Am J Psychol 58:151-2 Ja "45. Willard C. 
Olson. * Interesting questions for further in- 


vestigation grow out of the study. Is there. à 
difference in rate of maturing and stabilization 
of status in girls? Would improvement in the 
predictive value of tests for young children 
be brought about by means of an item-analysis 
in which terminal rather than initial status is 
used as one of the criteria for selection? To. : 
what extent is early differential success in tests 
according to type-of-content predictive of simi- 
lar specialization at a later time? Answers to- 
these and similar questions are being sought in 
further work. The monograph should be stud- 
ied and pondered by the clinician and by the 
student of the testing methods. 

Brit J Ed Psychol 16:102 Je "46. C. W. 
V (alentine). * The whole research provides a 
valuable piece of evidence, with special refer- 
ence, of course, to the Minnesota tests. The 
authors, however, do not, I think, attach suf- 
ficient importance to the need for giving to 
young children under five years three or four 
re-testings within a few months of each other 
to allow for varying moods and fluctuations 
and newly developing abilities. 

Brit J Psychol 34:79 Ja’44. * describes first 
the preparation of new scales of verbal and non- 
verbal intelligence test, each with alternative 
scales, for pre-school children aged 2-5 years. 
* The second part....deals with the testing, re- 
testing and follow-up over a twelve-year pe- 
riod of about 500 children. The main result 
that emerged was that although re-tests of 
children ages 3-4 years after periods up to one 
year showed a fair reliability, yet even then the 
average shift was 10 I. Q. points, and shifts of 
20 points were not uncommon. Prediction of 
mental status at later periods, up to the time 
of college entrance, was extremely uncertain. 
Nevertheless, characteristic sex differences 
emerged at very early ages, and also differences 
between those who excelled on the verbal 
scales, and those who excelled on the non- 
verbal. One would like to know if there were 
any consistent relationships between the type 
of fluctuation or shift in mental development 
as demonstrated by the test results, and the 
children's personality characteristics. It is to 
be hoped that the authors will continue their 
valuable investigations to include the study of 
such qualitative relationships, and thus throw 
more light on individual differences in mental 
development. 

El Sch J 43:372-3 F '43. Gustav J. Froeh- 
lich. * Among the desirable features of the 
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monograph one might mention the graphic 
presentations of individual patterns of growth. 
These demonstrate clearly the fallibility of men- 
tal estimates based on a single test administra- 
tion. The appendix giving “I.Q.-Equivalents 
for the Merrill-Palmer Tests,” the relatively 
adequate bibliography, and the frequent cross- 
references to related investigations are also 
noteworthy. In general....shows many evidences 
of care, thought, and meticulous research. On 
the other hand, one might question the con- 
tinuity of the sampling. We are told that “in 
spite of the changes in its personnel, . . . the 
general character of our group has not shifted 
greatly in the course of the twelve-year period” 
(p. vii), but this statement is not backed up 
with sufficient objective evidence, The statisti- 
cian may be somewhat disturbed by the arbi- 
trariness resorted to in the derivation of the 
scale values for the Minnesota Preschool 
Scales. This weakness has been recognized by 
the authors, but their argument that relative 
rather than absolute values are of primary 
concern in this study is well taken. This mono- 
graph is not a textbook; it is highly theoreti- 
cal. Only the professional worker in the field 
of mental measurement will find in it much of 
value for him, The casual student and the gen- 
eral reader in educational psychology and child 
guidance will find little to interest him. 

J Consulting Psychol 7:207-8 Jl-Ag ’43. 
Nancy Bayley. * Part L...summarizes the his- 
torical background of tests for young children, 
and then describes briefly the Minnesota Pre- 
school Scales, * In Part II....scores on the Min- 
nesota scales are compared with retests on the 
same scale after elapsed intervals of one and 
two years, and with retests at various intervals 
on other scales: the Merrill-Palmer, both re- 
visions of the Stanford-Binet, the Arthur 
Point Scale, and the American Council on Edu- 
cation College Entrance Examinations, Corre- 
lations between tests increase with increasing 
age at the time of first testing. However the 
tendency (reported by others) for r's to de- 
crease with increasing intervals between tests 
was found for some tests but not consistently. 
The Merrill-Palmer test showed the lowest re- 
lation to the Minnesota, all r’s being positive 
but low. Correlations with the Arthur scale are 
somewhat higher, but still lower than with the 
Stanford-Binet. In neither Merrill-Palmer nor 
Arthur scales is there any consistent change in 
the correlation as the interval between tests 


increases. This lack of agreement with other 
studies is puzzling. However, the authors do 
not make clear to what extent the same cases 
enter into the 7s between different tests at dif E 
ferent ages, nor whether the number of previ- 
ous tests per child has been controlled in the 
r's for prediction. If the cases used in each r 
are in large part different, or have unequal 
amounts of testing experience, then the failure 
to get consistently decreasing r’s with increas 
ing intervals between tests may be due, at least 
to some extent, to these variations in the sam- 
ple. * This book makes an important contribu- 
tion to our knowledge of the values and limita- 
tions of several widely used intelligence tests. 
In general the authors’ results from repeated 
tests of the same children corroborate those of 
other studies which have shown low predicta- 
bility of later intelligence from preschool tests. 
They further point to the areas in which pre- 
diction may be improved, and indicate fields for 
valuable future research on the processes of 
mental growth. 
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*[Re Minnesota Preschool Scale.] Maurer, KATH- 
ARINE M. Intellectual Status at Maturity as a Cri- 
terion for Selecting Items in Preschool Tests. 
University of Minnesota, Institute of Child Welfare 
Monograph Series, No. 21. Minneapolis, Minn.: Uni- 
versity or Minnesota Press, 1946. Be ix, 166. Litho- 
typed. $2.50. * (London: Oxford University Press. 
15s. 6d.) (PA 21:286) 


J Consult Psychol 11:155 My-Je ’47. This 
study makes a direct attack upon a psychologi- 
cal problem of long standing: the low correla- 
tion of measures of intellect at preschool ages 
with those obtained later. * provides valuable 
source material for future test construction. 

Times Ed Sup 1657:58 F r "47. * lucidly 
written * The investigation revealed that only 
half the pre-school test items possessed long- 
term predictive value. Equally significant is the 
inference that the best predictors are those 
tests involving perception, It is not surprising, 
therefore, to find in this study further evidence 
of "a constant core of mental functioning." In 
other words, the truth may be that aptitude 
tests for the under-sevens are reliable, in the 
non-technical sense, to the extent to which 
they measure “g.” Careful analysis of individ- 
ual test items is essential if comparative “pur- 
ity" of the “g” saturation is to be obtained. 
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Especially Devised for Deaf and Hard of Hearing 
Children From Four to Ten Years of Age. 1941; 
individual; 1 form ; $42 per testing outfit; $1.75 per 100 
sets of blanks and record forms; (50-60) minutes; 
Marshall S. Hiskey; distributed by the Psychological 
Corporation. 
REFERENCES 
1. Hrskgv, Marsuart S. A Non-Verbal Test Li i 

Aptitude (Especially Adapted for Young Deaf Chil laren). "Un. 
published doctor's thesis, University of Nebraska, 1940. (Ab- 
stracts of Doctoral Dissertations . . . , 1940, pp. 77-84. 

„2. Hisxey, MansuatLL S. “A New Performance Test for 
Young Deaf Children." Ed & Psychol Meas 1:217-32 Jl '41. * 


(PA 15:5405) 
3. Hiskey, Marsuatt S. “A Nonverbal Test of Learning 


Aptitude for Young Deaf Children.” Abstract. Psychol B 38: 
O '41. * (PA 16:338, title only) dme 


[ 290 ] 
Performance Tests of Intelligence: A Series of 
Nonlinguistic Tests for Deaf and Normal Chil- 
dren, Third Edition. Ages 5-6, 7-16; 1928-44; indi- 
vidual; Sets A (Ages 5-6), B (Ages 9-16) ; £23 for 
Set A materials; £14 2s. 3d. for additional materials 
needed for Set B; 5s. per manual ; manual must be or- 
dered from Oliver and Boyd Ltd.; test materials must 
be ordered from A. H. Baird, Instrument Maker, 
Lothian St., Edinburgh, Scotland; James Drever and 
Mary Collins. 
REFERENCES 

1, MacKaxz, Kerrg. A Comparison the Intelli 
Deaf and Hearing Children: A Study PY Wed ction o Deak 
and Hearing Children to Three Performance Scales and a Non- 
Language Test. Columbia University, Teachers College, Con- 
tributions to Education, No. 585. Rudolf Pintner, faculty spon- 
sor. New York: Bureau of Publications, the College, 1933. Pp. 
ix, 47. Out of print. * 

2. Drever, JAMES, AND Corr1Ns, Mary. Performance Tests of 
Intelligence: A Series of Non-Linauistic Tests for Deaf and 
Normal Children, Third Edition. Edinburgh, Scotland: Oliver 
and Boyd Ltd., 1946. Pp. $9. ss. * 


For excerpts from three reviews, see 36:B87. 
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*Preliminary Test of Intelligence: A Brief Test 
of Adult Intelligence Designed for Psychiatric 
Examiners. Adults; 1947; individual; r form; the 
test and instructions for administration, scoring, and 
interpretation are presented in American Journal o 

Psychiatry 103 :785-92 May 1047; no materials need be 
purchased; (5-15) minutes; Margaret Keller, Irvin L. 
Child, and Frederich C. Redlich. 
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Revised Stanford-Binet Scale. Ages 2 and over; 
1937 ; a revision of the Stanford-Binet Scale published 
1916; Forms L, M ; individual; $12 per testing outfit for 
each form ; $2.50 per 25 record booklets for each form ; 
Bod per 25 abbreviated record forms; $1.75 per abbre- 
viated directions for administering; $2.25 per set of 
printed card material for each form ; $3.00 per manual 
(08); Lewis M. Terman and Maud A. Merrill; 
Houghton Mifflin Co. 


REFERENCES 

1-134. See 40:1420. 

135. Terman, Lewrs M.; ASSISTED BY ARTHUR S. Orts, Vir- 
ciL Drcxsow, O. S. Hussanp, J. K. Norton, Lowry HowARD, 
. K. Franpers, AND C. C. CAssiNGHAM. “A Trial of Mental and 

edagogical Tests in a Civil Service Examination for Policemen 
and Firemen.” J Appl Psychol 1:17-29 Mr "17. * a 

136. Yerkes, Rosert M. “The Binet Versus the Point Scale 
Method of Measuring Intelligence.” J Appl Psychol 1:110-22 
ei. 

137. Beeson, M. F. “Intelligence at Senescence.” J Appl 
Psychol 4:219-34 Je-S 20. * 

138. Cusu MAN, C. L. “A Study of the Reliability of Mental 
Tests as Used in Oklahoma City." J Appl Psychol 11:509-11 D 
27. * (PA 2:2944) NES CN 
139. Dunn DD D. The Effect of Special Disability in 
Reading on Performance on the Stanford Revision of the Binet- 
Sawo Test. Unpublished master’s thesis, State University of 

Wa, 1927. 


140. DEARBORN, Water F., AND Lonc, Howanp H, “On 
Comparing IQ's at Different Age Levels on the Same Scale." 
J Ed Res 18:265-74 N '28. * (Pa Ago 

141. CHIPMAN, CATHERINE E. “The Tarrapondenig of School 
Achievement and Industrial Efficiency With Mental Age as De- 
termined by the Stanford-Binet.” Psychol Clinic 18:21-8 Mr-Ap 
'29. * (PA 3:4267) 3 z 

142. Riley, G. “Stanford Binet ‘Indicators’ of Mechanical 
Ability." Psychol Clinic 18:128-32 Myte *29. * (PA 4:1429. 

, 143., Brown, Anprew W. “The nge in Intelligence Quo- 
tients in Behavior Problem Children.” J Ed Psychol 21:341-50_ 
My ’30. * (PA 434976) 7 SURA 

144. WEISMAN, Sara E. “Case Studies of the Relationshi 
ere h-School Sse end AES Counsel- 
ing.” es 21:357- 730. 14070 

Tis. Woooxt, Citas. 8, "Analysis of 1. Q. Variability.” J 
Psycho-Asthenics a ep discussion 262-6 '31. * (PA 6:3274) 

146. Hoaxrey, Z, PAuLINE. “The Variability of Intelligence 
Pr gend iran i Eis n 37:119-46, discussion 146-8 '32. 
* :2667 
147. Brown, Ratpx R. “The Time Interval Between Test and 
Re-Test in Its Relation to the Constancy of the Intelligence 
Quotient.” J Ed Psychol 24:81-96 F '33. * (PA 733094), 

148. CaxpeE, BEATRICE, AND SMITH, Heren, “Relation Be 
tween Vocational and Other Interests and the Type of Perform- 
ance on the Stanford-Binet Tests.” Abstract. Psychol B 30:570 
O '33. * (PA 8:1363, title only) E 

149. TuLcHIN, Stuon H. “The Evaluation of Psychological 
Test Result Am J Orthopsychiatry 3:349-54 ]1 '33. * (PA 7: 
4909) 

150. Davis, Atonzo T, “Personality, Parent Intelligence and 
soe on C. Stanford-Binet.” J Juvenile Res 18:175-8 Jl '34- 

:5992 
151. SHIPLEY, Watrer C, “Stanford-Binet Test pem 
." Abstract. 5 


Related ted in Cn Posey Psychol B 313 
212) le only. 
152. MáxrIELD, mANCIS N. “Substitutes for the 'LO." J 


Psycho-Asthenics 40:147-56 '35. * (PA 10:1698) 

153. MAYER, Basam A. “Negativistic Reactions of Preschool 
Children on the New Revision of the Stanford-Binet.” Abstract. 
Psychol B 32:541-2 O '3s. * (PA 10:652, title only) 

154. Haury, GrnTRUDE. An Analysis of the Responses of 
Normal and Delinquent Individuals to Certain Test Items of the 
Stanford-Binet Scale and the Pintner-Paterson Scale, Unpub- 
lished master’s thesis, University of Kansas, 1986. i 

155. MICHAELS, JOSEPH J., AND SCHILLING, MangAnT E, “An 
Attempt to Determine the Degree of Anti-Social Behavior in 
Psychopathic Personalities and Its Correlation With the Porteus 
Maze and Binet-Simon Tests.” Am J Orthopsychiatry 6:397-405 
Ji ’36. * (PA 11:407) ve 

156. MICHAELS, JOSEPH J., AND SCHILLING, MARGARET E. “The 
Correlations of the Intelligence tients of the Porteus Maze 
and Binet-Simon Tests in Two Hundred Nenropeyentati Pa- 
tients." Am J Orthopsychiatry 6:71-4 Ja "6 * (PA 3053460) 

157. PALMER, ELEANOR MARGARET, A Study 4 Intellectual 
Deterioration in a Series of Schizophrenics and Organic Brain 
Diseases. Unpublished doctor's thesis, Universi! of Iowa, 1936. 

158. Kent, Grace H. "Suggestions for the Next Revision of 
the Binet-Simon Scale.” Psychol Rec 1:409-33 N' 37. * 


12:2662 

159. scc Dororuy Mancanzr. Serial Testing Versus Non- 
serial Testing of the 1937 Revision D the Stanford-Binet, Un- 
published master’s thesis, Columbia University, 1937. 

160. Trapp, Cart E., ann James, Evita B. “Com arative In. 
telligence Ratings in the Four T. of Dementia Praecox,” J 
Nerv & Mental Dis 86:399-404 37. * (PA 12:3016) 

161. Bonp, Expen A. "Some Verbal ts of the ny Re- 
vision of the Stanford-Binet Intelligence Test, Form L.” J Exp 
Ed 6:340-2 Mr '38. * (PA 12:6113) | z 

162. HirTY, Boxoruv PAULINE. Differential Responses of 
Epileptics and Nonepileptics to Items of the 1916 Stanford-Binet 
Scale. Unpublished master’s thesis, Ohio State University, 1938. 
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5 MALAMUD, WILLIAM, AND PALMER, ELEANOR M. "Intel- 
lectual Deterioration in the Psychoses.” Arch Neurol & Psychi- 
atry 39:68-81 Ja '38. * A 

164. McNzMAm, Quinn. “The Equivalence of the General 
Factors Found for Successive Levels on the New Stanford Re- 
MS Abstract. Psychol B 35:657 N 38. * (PA 13:1727, title 
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s Wzapkowskv, Epirn. “A Preliminary Study of Mental 
Growth After the Age of Fourteen Years in an Institution for 
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TA Aorer, DANIEL LESLIE. Types of Similarity and the 
Substitute Value of Activities at Different Age Levels. Unpub- 
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167. ANDERSON, L. Dewey. “The Predictive Value of Infancy 
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168. Cottins, ANNA Louise, Analysis of Stanford-Binet Test 
Responses in Three Types of Mental Disorder. Unpublished 
doctor’s thesis, Boston University, 1939- $ " 

169. GeRLACH, Marjorie. “A Study of the Relationship Be- 
tween Psychometric Patterns and Personality Types." Child De- 
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170. Jonson, ANNA P., AND Fexxow, Dosorny L. “Com- 
parison of Results of Stanford-Binet and Performance Tests 
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339. MAYMAN, MARTIN. “An Analysis of Scatter in Intelli- 
gence Test Results: A Review of the Literature.” Trans Kan 
Acad Sci 48:429-44 Mr '46. * (PA 20:2923) 
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340. Sarason, SEYMOUR B., AND Sarason, EsTHER Knoop. 
“The Discriminatory Value of a Test Pattern in the High 
Grade NUUS Defective." J Clin Psychol 2:38-49 Ja '46. * 

20:1940, 

341. SARTAIN, A. Q. “A Comparison of the New Revised 
Stanford-Binet, the Bellevue Scale, and Certain Group Tests of 
Intelligence." J Social Psychol 23:237-9 My '46. * (PA 21:309) 

342. SPAULDING, Patricta J. “Retest Results on the Stanford 
L With Mental Defectives." Am J Mental Def 51:35-42 Jl 
"46. * (PA 21:491) 

343. Tacrurt, R. “Comparison of Results Obtained From the 
Wechsler-Bellevue Vocabulary Test With Those From the 
Stanford-Binet Vocabulary Test Using a Population of Normal 
Subjects and Mental Patients,” Abstract. B Can Psychol Assn 
6:101 O-D 46, * (PA 21:920, title only) 

344. WALLIN, J. E. W. “A Comparison of the Stanford 1916 
and 1937 (Form L) Test Results With Those From the Artur 
Performance Scale (Form I) Based on the Same Subjects." 
J Genetic Psychol 69:45-55 S '46. * (PA 21:921) 

345. WELLMAN, Bern L., anp McCanotess, Boyn R. Factors 
Associated With Binet IQ Changes of Preschool Children. 
American | Psychological Mono- 
graphs, Vol. 60, No. 2, Whole No. 278. Washington, D. C.: the 
Association, Inc., 1946. Pp. iii, 29. Paper. $0.75: * (PA 22:1103) 

346. Dott, Encar A. "Note on the Age Placement of Year- 
Sene, Teji J Consult Psychol 11:144-7 My-Je '47. * (PA 
22:19 

347. Hutt, Max L. “A Clinical Study of ‘Consecutive’ and 
Dp Testing With the Revised Stanford-Binet." J Consult 
Psychol 11:93-103 Mr-Apr '47. * (PA 21:3631) 

,,,249. Sarason, Seymour B., AND Sarason, ESTHER Knoop. 

The Discriminatory Value of a Test Pattern With Cerebral 
Palsied, Defective Children." J Clin Psychol 3:141-7 Ap '47. * 
(PA 21:3029) 

349. SLOAN, WiLLIAM, AND Currs, Ricuarp A. “Test Pat- 
terns of Mental Defectives on the Revised Stanford-Binet 
Scale." Am J Mental Def 51:394-6 Ja ’47. * (PA 22:374) 

350. SLOAN, WILLIAM, AND OBLINGER, BARBARA. “Diagnostic 
Value of Picture Anomalies and Verbal Absurdities for Institu- 
tional OH EISE Am J Mental Def 51:532-4 Ja '47. * (PA 
22:375 

351. TuoMPsoN, CLARE WRIGHT, AND Macaret, Ann. “Dif- 
ferential Test Responses of Normals and Mental Defectives.” 
J Abn & Social Psychol 42:285-93 J1 '47. * (PA 22:1240) 


For reviews by Francis N. Maxfield, J. W. 
M. Rothney, and F. L. Wells and excerpts 
from reviews by Cyril Burt, R. B. Cattell, 
C. J. C. Earl, J. C. Flanagan, F. N. Freeman, 
F. L. Goodenough, E. N. Hay, E. Kawin, 
Grace H. Kent, M. Krugman, A. H. Martin, 
E. M. Nevill, C. Spearman, A. E. Traxler, P. 
E. Vernon, and eleven others, see 38:1062, 
38 :B 497, 40:1420, and 40:B1093. See also 203 
and 294. 
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*[Re Revised Stanford-Binet Scale.] McNEMAR, 
Quinn. The Revision of the Stanford-Binet Scale: 
An Analysis of the Standardization Data. Intro- 
ductory chapter by Lewis M. Terman. Boston, Mass.: 
Houghton Mifflin Co., 1942. Pp. ix, 189. $3.50. * (Lon- 
don: George G. Harrap & Co. Ltd. 15s.) (PA 16:3329) 


Brit J Ed Psychol 14:112 Je ’44. This book 
reports an extensive statistical analysis of the 
data obtained in the standardization of the 
new Terman-Merrill tests. Professor Terman 
writes the first chapter, which forms an ad- 
mirable introduction to the rest of the book. * 
There are many detailed tables which will be 
of special interest to statisticians. 

Ed Res B 22:51-2 F ’43. Francis N. Max- 
field. * In 1917 Terman published an account 
of the data on which his 1916 Stanford-Binet 
scale was based. Unfortunately little attention 
was paid to this publication. It contained a 


great deal of material that was significant for 
research, and still more significant for the clini- 
cal interpretation of psychometric scores. The 
Stanford-Binet scale outran all competitors, and 
its popularity was a major factor-in causing 
the psychometric method not only to outrun 
the method of clinical case studies, but to a 
considerable extent to supplant this method. 
More careful attention to Terman’s 1917 pub- 
lication might well have prevented some of the 
misjnterpretations of Stanford-Binet scores 
that became current. * It will be unfortunate 
if McNemar’s statistical analysis of Terman 
and Merrill’s data is similarly ignored, The 
author’s concise summary (Chap. XII) can be 
read in ten minutes and will serve the reader’s 
purpose better than any statement the reviewer 
can make here. 

J Consulting Psychol 7:60-1 Ja-F ’43. Ethel 
L. Cornell. The long awaited statistical analysis 
of the standardization data of the revision of 
the Stanford-Binet Scale has been compressed 
into a small volume, but one that will reward 
close study. The material that has been selected 
for the volume has been carefully chosen and 
admirably presented to clarify some of the 
clinical, as well as. statistical, questions that 
have been raised. * A clearer realization of the 
difficulties of both construction and adequate 
statistical analysis of such an age-scale raises 
some question in the reviewer’s mind as to 
whether psychologists will not soon discover a 
way to preserve the clinical advantages of this 
type of test while securing an instrument with 
more statistical flexibility. * The chapter deal- 
ing with the shape of LQ. distributions at 
different ages is perhaps of most general in- 
terest. * About a quarter of the book is de- 
voted to a discussion of factor analysis. Dr. 
McNemar has done a superb job of condensing 
an enormous amount of material and of pre- 
senting it so that it is readable both by the 
statistical and the nonstatistical minded. * The 
book is one that should be accessible for ready 
reference by all users of the revised scale, and 
should be carefully studied by critics of the 
scale. 

Occupational Psychol 17:204-6 O *43. Cyril 
Burt. In standardizing their latest revision... 
Terman and his colleagues arranged for the 
tests to be given to 3,184 persons—roughly 
speaking, 100 boys and roo girls at each age 
from 6 to 14, and 50 at each age above and 
below that level. The inquiry is thus almost as 
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extensive as that on which the standardization 
of the British revision of the tests was based 
some twenty years ago * By far the longest 
and most instructive chapter is that which deals 
with the factorial analysis of the several tests. 
Prof. Terman, in his introduction to Dr. Mc- 
Nemar's account, observes that ‘the Binet 
Scale has often been criticized because of its 
great variety of brief, disconnected tests—a 
‘motley array,’ as Spearman has somewhat 
scornfully described them.” It was, however, 
originally designed to test a ‘general factor’ of 
intelligence. That being so, it is astonishing 
that so few investigators have attempted either 
to demonstrate that there is, in point of fact, 
a ‘general factor’ running through all the tests, 
or to measure the amount which that factor 
contributes to the total variance. In this coun- 
try, we have, on a somewhat limited scale, en- 
deavoured to factorize the correlations between 
the tests both in the earlier and in the later 
revisions. We found a general factor contribut- 
ing 42 per cent of the variance, and at least two 
subsidiary factors contributing about 12 per 
cent and 16 per cent respectively. Dr. McNe- 
mar's results are almost identical. He finds that 
on an average "the first common factor ac- 
counts for about 40 per cent of the variance of 
the tests" and "the second factor for 5 to 11 
per cent, and the third for 4 to 7 per cent." 
His evidence suggests that the general factor is 
nearly the same at different ages. And he con- 
cludes that the tests may fairly be said to 
measure general intelligence, and "not a hodge- 
podge of abilities, as some have suggested." 
To discover the nature of the subsidiary fac- 
tors he relies on a rough graphic procedure. 
They seem to differ at different ages; the more 
conspicuous are apparently a ‘verbal,’ a ‘nu- 
merical,’ a ‘memory,’ a ‘problem,’ and (at the 
earlier ages) a ‘motor’ factor. These are almost 
exactly the same as were obtained by British 
investigators with a somewhat different pro- 
cedure, namely, a ‘group-factor analysis.’ * 
those familiar with reports of Binet surveys 
undertaken in this country will note that... 
there is a remarkable correspondence between 
the results obtained from the American data 
and the English. That of itself should greatly 
strengthen the confidence felt in the methods 
now available for measuring the intelligence of 
children of school age. This brief summary by 
no means exhausts the interesting problems 
discussed in Dr. McNemar's book. The whole 


series of studies constitutes by far the most 
extensive, thorough, and up-to-date analysis 
of data obtained by means of the Binet tests ; 
and should provide a model for future inves- 
tigators. : 

PO EUROS 22:70 O "43. Forrest H. Kirk- 
patrick. * McNemar’s statistical analysis of 
Terman and Merrill’s data is highly significant 
and must not be ignored. His concise summary 
(Chap. XII) can be read in ten minutes and 
adequately reports the study. * 

Psychol B 40:194-201 Mr '43. Henry E. 
Garrett. * McNemar...is certainly to be com- 
mended for the compact and succinct form in 
which the material is presented. * Various 
psychometricians, as our author notes, have held 
that an item is correctly placed when assigned 
to that age level at which just 50 per cent of 
the subjects pass it; and mathematically this 
would seem to be reasonable. McNemar, how- 
ever, vigorously condemns reliance upon the 
50 per cent criterion. He lists three “consider- 
ations" (p. 87) to explain why higher percents 
are necessary in the New Revision. These are 
(1) "the fact that it is simply impossible other- 
wise to construct an age scale of the Binet 
type that will yield mean mental ages equal to 
mean chronological ages"; (2) "the fact that 
the location and grouping of items at a given 
level is mainly one of convenience which facili- 
tates testing and scoring" ;...and (3) "the fact 
that the individuals of any age group encoun- 
ter items which are actually of 5o per cent dif- 
ficulty for their age group even though the 
items placed at their own age levels may be 
less difficult." It is not immediately obvious 
that either (2) or (3) of these considerations 
is directly relevant to the issue of test place- 
ment, while (1) is simply a flat pronounce- 
ment of what scale standardization may. have 
shown to be true without showing why this is 
true. Terman's explanation of the allocation of 
tests in the Old Revision is much more clarify- 
ing (Stanford revision of the Simon-Binet 
Scale, 1917, pp. 154-155) than is the present 
discussion. The rapid rise of the age progress 
curve in the early years and the resulting dras- 
tic changes in percents passing, plus the in- 
crease in spread at the later ages, are appar- 
ently the crux of the matter, * Essentially the 
allocation of test items to specific age levels boils 
down to a cut-and-fit process. The authors 
could have saved themselves much space by 
saying so at the outset. * Fourteen separate 
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factor analyses were then made (using Thur- 
stone's centroid method) * a strong first factor 
is conspicuous in all 14 tables. This factor ac- 
counts for the intercorrelations of the items 
and for from 35-50 per cent of the variance 
of the battery. * The fact that such a diversity 
of test items can be assembled to yield a gen- 
eral factor—even when carefully chosen so to 
do—is strong evidence (it seems to the re- 
viewer) for the existence of such a general 
factor in mental activity as Spearman has long 
urged. * In Chapter XI (Units of Measure- 
ment), McNemar drops the role of expositor 
to express positive opinions upon several of 
the persistent problems which confront the test- 
maker. * A standard score, he writes (p. 158), 
is simply a linear transformation of raw scores 
into SD-units; such a transformation (he 
points out) does not change the form of the 
distribution nor equalize the units of the test. 
This is entirely true, but is not directly rele- 
vant to the problem of scaling test scores into 
"truly equal units" (the phrase is Thorndike's). 
I cannot agree with McNemar that there are 
"a surprisingly large number of psychologists 
"who claim' that the use of the standard-score 
method will yield units which are equal or 
‘truly equal’ (p. 157)." * I think that Mc- 
Nemar has confused standard-scores with 
T-scores,—at least so far as equal unit scaling 
is concerned. T-scaling converts the percents 
achieving each score into equivalent SD-posi- 
tions along the baseline of the normal curve, 
No assumption is made regarding the distribu- 
tion of raw scores, but the true distribution of 
ability measured by the test is assumed to be 
normal. The distribution of calculated T-scores 
will have the same form as that of the original 
scores; only in the ideal normal distribution 
assumed for the trait are the T-scores normally 
distributed. SD-units along the baseline of the 
normal curve (upon which our T-scores will 
fall at irregular intervals) are mathematically 
equal or in Thorndike's words "truly equal." 
The point should be stressed, perhaps, that the 
units on any scale are equal only with respect 
to certain defined criteria. Psychologists have 
done too much breast-beating over the great 
advantage possessed by the physicist in having 
scales with equal units and a true zero. To be 
sure, the much-vaunted “cgs” scale does pos- 
sess many advantages, among them that of 
having equal units; but these units are equal 
with respect to certain criteria,—not all. The 


Weber-Fechner Law has shown that an inch 
added to a man’s nose is not perceptually the 
same as an inch added to his height, and the 
same number of pounds or of seconds is often 
psychologically quite unequal. The same situ- 
ation holds even more truly of other physical 
scales. * McNemar (p. 159) in his example 
showing that pounds are unequal to T-scale 
units exhibits clearly the confusion which arises 
when systems of measurement are compared 
which are set up on different criteria and are 
not directly commensurable. The scaled scores 
of the psychologist are “truly equal" with re- 
spect to a defined criterion of performance on 
the assumption of normality in the trait. That 
such scales are useful and efficient experience 
has amply shown. Closely related to his discus- 
sion of equal units is McNemar's criticism of 
Thorndike’s demonstration that intellect in man 
is normally distributed. McNemar contends 
that since Thorndike scaled his tests into equal 
units by means of the normal curve, it is not 
surprising that distributions expressed in these 
derived units should turn out to be normal. He 
writes: (p. 17) “Thorndike has only demon- 
strated the obvious ; a normal curve can be pro- 
duced by assuming it in advance." It may be 
noted that Terman has also made essentially 
the same criticism (Measuring Intelligence, p. 
25) : he calls it "lifting oneself over the fence 
by one's bootstraps." This criticism, I feel, is 
not only superficial and incorrect, but coming 
from such sources is liable to do real harm. It 
may be pointed out, in the first instance, that 
Thorndike fully understood the “circularity” 
objection advanced by McNemar. In Chapters 
II, VIII, and Appendix III of his Measure- ` 
ment of Intelligence (1927), he states explic- 
itly that a normal distribution of test scores 
might arise in lieu of other causes from (1) 
the fact that the test was constructed with the 
normal hypothesis definitely in mind; and (2) 
from chance errors, the scores of all subjects 
being the same. To meet the first difficulty, 
Thorndike employed two procedures. First, he 
scaled the score distributions for several well 
known tests (Otis, Alpha, National, are exam- 
ples) into "truly equal" units by means of the 
normal curve. The normal distribution was se- 
lected for scaling (others were considered, see 
Chapter II) because the distributions of scores 
in the school grades employed as standardiza- 
tion groups were bell-shaped and not noticeably 
skewed. Distributions of scores, expressed in 
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terms of corrected units, were then demon- 
strated to be normal for new and different 
groups taken at various age levels. It is im- 
portant to note that the scaling procedures 
corrected the units of the test (lengthened or 
compressed them) but did not force the new 
distributions into the normal form. As a second 
procedure, Thorndike showed that a test scored 
in truly equal units derived from Grade VI 
will return a normal distribution for Grade IX. 
He argues that while a test-maker might con- 
ceivably select his items so as to return a nor- 
mal distribution for a given group, it is too 
much to expect him to repeat for other and 
quite different groups. To meet the second dif- 
ficulty, Thorndike averaged four to five repeti- 
tions of the same test before scaling, on the 
hypothesis that the mean of several determina- 
tions would be largely free from chance error. 
One may offer valid objections to certain de- 
tails of Thorndike’s procedure. I am not en- 
tirely convinced that he has really demonstrated 
intellect to be distributed in strictly normal 
fashion, and I agree with McNemar that his 
use of the Chi-Square test is dubious. But I 
feel sure that Thorndike has shown intellect 
in man (as measured by standard tests) to be 
essentially symmetrical and bell-shaped, and at 
least to be approximately normal. And this 
demonstration is not invalidated by the rather 
shallow charge of circularity. In summarizing 
the material in this monograph, I should like 
to express the opinion that the New Revision 
is the most useful and is certainly the best con- 
structed instrument for measuring the intelli- 
gence of children which we now possess. It 
represents an achievement of first rank; and 
one of which all psychologists, no matter what 
their persuasion, may well be proud. 
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*[Re Revised Stanford-Binet Scale.] Pry - 
DOLF; Dracosrrz, ANNA; AND Koa Rom See 
plementary Guide for the Revised Stanford-Binet 
Scale (Form L). Applied Psychology Monographs of 
the American Psychological Association, No. 3. Stan- 
ford, Calif.: Stanford University Press, 1944. Pp. 135. 
Cloth, $2.25 ; paper, $1.50; lithotyped. * (PA 19:578) | 


Clearing House 19:458-60 Mr "45. E(arl) 
R. G(abler), * a necessary supplement to... 
Measuring Intelligence, by Terman and Mer- 
rill * As one who has given many Binets, it 
seems that this monograph is a veritable gold 
mine of definitive information for the ex- 
aminer. The responses are sufficient in number 


and variety to fit the range of pupil perform- 
ance. By good organization the material is 
made easily accessible and usable. k 

Ed Res B 25:55-6 F 13 '46. Francis N. 
Maxfield. Terman and-Merrill’s revised Stan- 
ford-Binet scale (1937) seems likely to main- 
tain the popularity of Terman’s earlier revision 
(1916) as an individual test of intelligence. 
One reason for the very general adoption of 
these scales has been the fact that the authors’ 
manuals in both cases have given fuller direc- 
tions for scoring the various test items than 
were provided by the authors of other revi- 
sions. More than half of the one hundred 
twenty-nine test items of either Form L or 
Form M of the revised Stanford-Binet scale 
can be scored as rigidly as the stencil-scoring 
of a group test. In the other sixty test items 
there is a flexibility characteristic of Binet's 
method that may elicit responses that are not 
scored so readily. Pintner and his collaborators 
have prepared this manual to supplement the 
directions given by Terman and Merrill for 
scoring responses to test items of the latter 
type in Form L. Actual responses taken from 
several thousand test records are scored plus, 
minus, or “Q,” as in the Terman-Merrill man- 
ual. A notable original feature is a listing of 
responses in categories according to type, thus 
emphasizing the principle underlying the scor- 
ing. On the whole, the scoring will commend 
itself. As the authors have purposely chosen 
many items on the borderline between plus 
and minus scoring, most experienced examiners 
will disagree with an occasional scoring. For 
example, in S. A. I, 6, the examiner asks, 
“What is the principal way in which melting 
and burning are alike?" One subject says, 
"Both break down a substance." Another Says, 
"Both make whatever is melted or burned dis- 
integrate." Why should one of these be scored 
minus and the other plus? The scoring of the 
first figure on page 27 is clearly in error. But, 
in general, this "supplementary guide" is ex- 
cellent, and the emphasis on underlying prin- 
ciples is particularly commendable. 


75€ per 25 record forms for either level postpaid ; 

35€ per set of manuals; Grace Arthur; Payetiolowical 
orporation. 

4) STENCIL DESIGN TEST I. 


b) STENCIL DESIGN TEST 11. 
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REFERENCES 
1. ARTHUR, Grace. “A Non-Verbal Test of Logical Think- 


ing.” J Consult Psychol 8:33:4 Ja-F 44. * (PA 18:2286) 

, 2. BoULGER, CATHERINE, AND ARTHUR, Grace. “An Unpub- 
lished Design Test." J Consult Psychol 8:31-2 Ja-F '44. * (PA 
18:2287) 

James M. Anderson, Instructor in Clinical 
Psychology, University of Rochester, Roches- 
ter, New York. Although intended as a subtest 
for the alternate forms of the Arthur Point 
Scale of Performance Tests, this ingenious 
series of tasks will undoubtedly find much wider 
use. Form I consists of 20 geometrically sym- 
metrical designs in color to be reproduced by 
using varying numbers of the 18 paper stencils 
of various colors and shapes. Form II is similar 
to Form I except that the designs are in black 
and white and they are reproduced by super- 
imposing varying numbers of the r2 paper 
stencil shapes, each of which is provided in both 
dark blue and yellow. Both the designs and the 
stencils are 3 by 3 inches so that it is possible 
to reproduce exactly the design, which remains 
before the subject. 

These tests seem to have universal appeal 
and they usually command spontaneous inter- 
est. There appears to be some unevenness in 
the scale of difficulty, but it is not marked. 
The range of difficulty is quite broad with a 
very high ceiling (some graduate students ex- 
perience difficulty), and, unfortunately, the 
lower level is of such difficulty as to prevent 
its use with most preschool subjects. 

There are two mechanical weaknesses of the 
test associated with the present material used 
for the stencils. Since they are made of paper, 
they are subject to very rapid deterioration. 
The bending experienced after a few uses is 
sufficient to give the reproduction a three- 
dimensional effect (as many as 5 stencils are 
needed for some designs). This was bother- 
some to some subjects. 

The use of dark blue and yellow stencils 
for the reproduction of the black and white 
designs of Form II seems to facilitate the re- 
versal of figure and ground in the reproduc- 
tions. These reversals are scored as being cor- 
rect. It would be interesting to see if reversals 
are as common with black and white stencils. 

Form II is noticeably more difficult than 
Form I, apparently because of the absence of 
colors which give a clue as to the successive 
stencils to be used for successive layers in 
evolving the designs. 

The theoretically possible time limit of ap- 
proximately 83 minutes for each form might 


possibly be prohibitive, especially if the tests 
are to be included in a battery of performance 
items. 

Some of the subjects observed by this re- 
viewer evolved a “system” of proceeding from 
the center of the design outward. Once the 
“system” was grasped, the nature of the task 
seemed to change radically, and the time re- 
quired for the test seemed to be a function of 
how early the “system” became apparent. This 
may not be a serious weakness, but it seems 
well to point out that the tests do not seem to 
test the same function(s) for all individuals. 

The present scoring system which uses pass 
or fail within the 4-minute time limit for each 
design appears to be too crude. Such an intrin- 
sically fine instrument deserves a more dis- 
criminating scoring system if it is to achieve 
its maximum usefulness. 

Tentative norms, in terms of CA (by whole- 
year steps) and number of designs passed, are 
available for Form I. There are no published 
norms for Form II. 
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Van Alstyne Picture Vocabulary Test for Pre- 
school Children. Ages 3-4; 1929; 1 form; individual; 
out of print; (15) minutes; Dorothy Van Alstyne; 
Public School Publishing Co. 


REFERENCES 
1. Van Atstynz, Domoruv. The Environment of Three- 


Year-Old Children? Factors Related to Intelligence and Vo- 
cabulary Tests. Columbia University, Teachers College, Contri- 
butions to Education, No. 366. New York: Bureau of Publica- 
tions, the College, 1929. Pp. 108, Out of print. (PA 4:386) 

Ruth W. Washburn, Consultant in Child 
Development, Shady Hill School, Cambridge, 
Massachusetts. The publisher’s catalog states: 
“This test is valuable not only for testing the 
vocabulary comprehension of normal children, 
but is also especially recommended as a sub- 
stitute for a mental test with shy or negativis- 
tic children, or with those who have retarded 
speech development.” For these purposes it has 
excellent features in that no oral response is 
necessary and the scoring is objective. The 
child either does or does not point to the cor- 
rect picture, one of four, on each of forty-five 
cards presented serially. 

The standardization of the test is, however, 
based on its use with a small group of eighty 
children whose age range was only seven 
months (33 to 39, inclusive), a group in which 
the average IQ was high (114). The cultural 
background of the children in the group was 
so similar as to make the test of improbable 
value for other cultural groups. For example, 
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a word such as “chipmunk” is rarely known by 
children in urban communities. 

The basis for the selection of the test words 
represented by pictures was good. On each 
card there is, first, at least one word of an 
equal level of frequency with the test word; 
second, at least one word associated with the 
test word in life situations such as “socks— 
shoes”; and third, at least one word with a 
sound similarity, of the first letter, with that 
of the test word. The drawing throughout the 
test is of very amateur quality. In some in- 
stances the drawing is so poor as to make fail- 
ure to identify the picture understandable, 
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*Visual Retention Test for Clinical Use. Adults ; 
1946; individual; $2 per manual (no other materials 


are required) ; (4) minutes; Arthur L. Benton; Psy- 
chological Corporation. 


REFERENCES 
1. Benton, AnrRUR L, “A Visual Retention Test fon Clay 


PARE Arch Neurology & Psychiatry $4:212-6 S "45. 

J Consult Psychol 11:225 Jl-Ag ’47. The 
Visual Retention Test is a clinical supplement 
to the more common auditory memory span 
test. Each of the alternative forms, A and B, 
consists of seven cards containing designs of 
increasing complexity. Administration, which 
Tequires only four minutes, follows the usual 
procedure of a metory-for-designs test, a ten- 
second exposure, and an untimed reproduction, 
Normative data are given for 160 adults, mostly 
men, and children’s norms are in process of 
compilation. Data indicate a rough correspond- 
ence of scores with levels of intelligence, Clini- 
cal use with patients suffering from cerebral 
injury indicates the diagnostic value of the 
test in relation to aphasia, and also suggests 
implications similar to those of the Bender 
Gestalt test. The spiral-bound manual contains 
information on administration and interpreta- 
tion, specimens for scoring, and the 14 plates 
for Forms A and B. 
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Robert I. Watson, Associate Professor of 
Medical Psychology, Department of Neuro- 
psychiatry, Washington University School of 
Medicine, St. Louis, Missouri. This individually 
administered point scale is particularly suitable 
for appraising selected verbal and performance 
abilities of adolescents and adults. Two forms 
are available, the second appearing in the early 
part of 1946. Since the first form is deservedly 
rather familiar, but the second not as well 
known, some attention will be devoted to a 
comparison of the two. 

Both forms contain eleven subtests, all suit- 
able and applicable throughout the age range 
covered by the scale. The six of these which 
Wechsler classifies as verbal tests are titled: 
general information, general comprehension, 
digit span forward and backward, arithmetic, 
similarities, and vocabulary. Originally it was 
assumed that the vocabulary test was unfair 
for illiterates and persons with a language 
handicap. Consequently, the normative tables 
for the derivation of a verbal IQ were designed 
on the assumption that five verbal tests, not 
six, would be used in many instances. Prorat- 
ing is therefore necessary if one follows the 
current recommendation of Wechsler to ad- 
minister all six verbal tests. The so-called per- 
formance scale includes picture arrangement, 
picture completion, block design, object assem- 
bly, and digit symbol. The sum of the weighted 
scores of this group of tests yields a perform- 
ance IQ and, in combination with the weighted 
scores of the verbal group, makes possible the 
derivation of a full-scale IO. All three IO's 
are found for a given individual by point score 
entry into tables appropriate to the chrono- 
logical age group within which he falls. This 
makes irrelevant any procedure calling for the 
intermediate step of the derivation of à mental 
age. 

Personal experience in a military setting 
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demonstrated again and again a clinical validity 
superior, for example, to that demonstrated by 
the Revised Stanford-Binet Scale, which had 
a tendency to yield IO's customarily interpreted 
as demonstrating mental deficiency in many 
individuals who could not be said clinically to 
be functioning at that level. The verbal mate- 
rial is couched in language suitable for the 
chronological ages and the culture of the indi- 


viduals under consideration, with the exception . 


of the picture arrangement subtest, which 
proved too sophisticated for the average re- 
cruit tested. It also appeared easier to prepare 
nonclinically trained psychologists to admin- 
ister this device correctly and with acumen than 
was the case with certain measures involving 
the mental age approach. 

The only information available in the litera- 
ture, as yet, on the standardization of Form II 
as an alternate form appears in the introduc- 
tion to the manual for this form. Essentially 
the only data supplied are to the effect that 
norms are based on 1,000 male adults, 18 to 40; 
that preliminary comparison shows a "high 
correlation" between the scales; that there is 
a mean difference of less than 2 points be- 
tween the full scale scores; and that there is 
only a “small difference" in retest scores in the 
subtests, except that in Form II the compre- 
hension subtest is “a little harder" and object 
assembly is "a little easier." There is no reason 
to doubt that this is a fair summary ; neverthe- 
less, an opportunity to examine the data on 
which these conclusions are based should be 
available. 

The directions for Forms I and II are de- 
ceptively alike, but close scrutiny will show a 
considerable number of minor changes in 
wording. Most of these changes exhibit at least 
a greater surface clarity, but for the sake of 
these changes one must pay the price of learn- 
ing two sets of directions differing so slightly 
and yet so pointedly that a certain amount of 
confusion and irritation inevitably results. 

The Form II test material boxes revive the 
original Form I practice, later abandoned, of 
having the word "intelligence" prominently 
displayed on the cover with the consequent 
“mental paralysis" that overcomes some sub- 
jects when faced with this ominous word. The 
reason for this revival is not readily apparent. 

In contrast to the measures using the mental 
age method of calculating IQ's, each Wechsler- 
Bellevue subtest yields at all levels of func- 


E 


tioning a point score which is possible of con- 
version to a weighted score. Each weighted 
subtest score is a discrete measure capable of 
direct comparison with the scores for other 
subtests. This has made almost inevitable two 
prominent trends of research interest. There 
has appeared a plethora of abbreviated forms 
based upon the selection of three to six sub- 
tests, from the sum of the weighted scores of 
which an attempt would be made to derive a 
total score corresponding to that for the entire 
scale. Also, there has been considerable atten- 
tion paid to the rationale or meaning of the 
mental abilities measured by the subtests and 
their relative standards of functioning either 
within an individual or a diagnostic grouping. 
A shorthand of sign patterns for various diag- 
nostic groups derived from the absolute and 
relative magnitude of the weighted scores on 
the various subtests was an accompaniment of 
this latter trend. Wechsler himself has shown 
a commendable caution in regard to both 
trends, pointing out that the use of shorter 
forms depends upon the objectives of the ex- 
aminer in using the test and that the diagnostic 
features are to be interpreted with careful at- 
tention to all other sources of data. 

In the opinion of the reviewer, the Wechsler- 
Bellevue Intelligence Scale yields a more com- 
plete and valid picture of the intellectual func- 
tioning of an adolescent or adult than does any 
other single measure of intelligence. Only at 
the extremes, idiots and imbeciles on the one 
hand and very superior adults on the other, do 
other measures possibly make available a richer, 
more complete picture of the general function- 
ing ability of the individual. 


For review by F. L, Wells, see 40:1429. See 
also 299, 300, 301, and 40:B1121. 
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[Re Wechsler-Bellevue Intelligence  Scale.] 
Wecuster, Davin. The Measurement of Adult In- 
telligence. Baltimore, Md.: Williams & Wilkins Co., 
1939. Pp. ix, 220. * (London: Bailliére, Tindall, and 
Cox.) (PA 13:5389) For latest edition, see 301. 


Am J Psychol 54:154 Ja "4r. Edward E. 
Cureton. * The tests appear to have been 
reasonably well selected, and the directions are 
clear. * Nearly all the cases used in stand- 
ardizing the scale were obtained in and near 
New York City. The method of selection prob- 
ably gives approximately correct averages at 
the various ages, but there is some evidence 
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that the variabilities were not well controlled. 
As a result the interpretation of the IQ's is 
somewhat doubtful except where they are close 
to 100. On the other hand, adjustments for 
changes in average intelligence from 16 to 60 
remove one major cause of error in previous 
scales. The work is marred by numerous statis- 
tical errors. In attempting to insert a theoreti- 
cal foundation under what is obviously an em- 
pirical structure the author bogs down badly 
in factor theory. He confuses the standard 
error of the mean with the standard error of 
estimate (p. 135), proposes to measure the 
mental deterioration (with age) of individuals 
by a method that seems logical (p. 65) but 
ignores the fact that the tests would need to 
be much more reliable than any now in exist- 
ence, computes coefficients of variation from 
Scores based on an arbítrary zeropoint (p. 123), 
and assuming that standard scores are scaled 
Scores, presents 2 skew distribution of IQ's 
(p. 129) which probably proves only that the 
tests possess inadequate numbers of hard items. 
Since the empirical work is apparently good 
and the scale fills a real need, it will probably 
be used widely during the next few years in 
spite of its technical shortcomings. 

Ed Res B 20:238 N 12 ^41. Emily L. Stog- 
dill. * The book is one that will be included 
in any training program for psychologists espe- 
cially those who plan to work with the older 
age groups. The chapter on “The Problem of 
Mental Deterioration” is a particularly signifi- 
cant one. The chapters on “The Nature of In- 
telligence” and on “The Need for an Adult 
Intelligence Scale” are of general interest also 
* should be known at least by name to all 
teachers interested in tests and personnel 
work * 

J Abn & Social Psychol 35:598-9 O ’40. 
Alfred L. Baldwin. * Not only dose book 
make. a practical contribution by introducing 
an intelligence test designed for adults, but it 
makes a theoretical contribution as well. In his 
discussion of the nature of intelligence the 
author exposes implicit assumptions which, 
lying beneath current terminology, are mislead- 
ing and inconvenient if not, indeed, theoreti- 
cally incorrect. * Wechsler indicates that he 
wishes to define intelligence entirely in terms 
of a normal distribution of individuals of the 
same age as the individual being tested. * Al- 
though he may be quite correct in his prefer- 
ence for this definition of intelligence, his is 


not necessarily the only definition. Current 
procedure in standardization of tests is clearly 
based on other assumptions either explicitly or 
implicitly, and proponents might well find jus- 
tification for expecting that the distribution of 
IQ's would spread with an increase in age. 
This difference in basic assumptions is not an 
adequate basis for a criticism of current test 
procedure unless there is some empirical dem- 
onstration of the superiority of the new defini- 
tion. The import of the author's other criti- 
cisms, however, is independent of the answer 
to this question, and we are grateful that this 
important question in test methodology has 
been brought to the surface for discussion. * 

J Higher Ed 12:499-500 D '41. Emily L. 
Stogdill. * an interesting combination of theo- 
retical and practical considerations of the prob- 
lem of intelligence testing, especially at the 
adult level. Everyone concerned with the edu- 
cation or the clinical treatment of adolescents 
and adults should be familiar with the book 
and the test. It will be an important reference 
book in the training of psychologists and social 
workers. The first two chapters deal with the 
theoretical considerations underlying the con- 
cept of intelligence, and its measurement in 
adult years. The need for a new test is clearly 
shown. The third chapter discusses the concept 
of mental age and the intelligence quotient. The 
statistical considerations involved in the prob- 
lem are presented quite clearly. * the author 
forcefully expresses his position on the tend- 
ency of some medical men to relegate intelli- 
gence examinations to what he calls “apersonal 
psychometrics." * For many readers Chapter 
VI on the problem of mental deterioration will 
be the most significant. In the past, much that 
has been written on this topic was unscientific. 
Studies in the last decade are revealing that 
much that has passed as deterioration is actu- 
ally not in this category at all. The whole field 
is newly re-opened for consideration ; this book 
is a contribution to clarity of thinking in this 
area. * Chapter X on limitations and special 
merits of the scale will be of importance to 
persons using the scale, rather than to the gen- 
eral reader. * The Bellevue scale requires about 
an hour each to give and score. Its items are 
chosen with reference to the adults’ vocabu- 
lary, which will make it more acceptable to 
them than the Binet tests which have been so 
widely used to determine the level of intelli- 
gence. The practice of using group tests of in- 
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telligence with young adults and college stu- 
dents, however, is well intrenched, and has the 
added advantage of greater speed in giving and 
scoring, and of widely standardized norms. 
For these reasons, it is probable that this test 
will have a somewhat restricted usage, par- 
ticularly with groups which are already accus- 
tomed to the use of the older measure (Binet) 
or the group-test scores. It would seem rather 
evident that this test is destined to be an im- 
portant one in the history of work with adults, 
and for this reason, the book is one which 
should be known to all adult educationists and 
clinical psychologists. 

J Social Psychol 12:229-39 Ag ’4o. Charles 
L. Vaughn. * This work is extremely impor- 
tant and needs to be considered carefully be- 
cause he has accumulated a mass of test data 
by individual examination of adults, and be- 
cause he has constructed an intelligence scale 
which purports to do with adults what the 
Stanford Revision of the Binet-Simon Intelli- 
gence Scales has done with children. * Wechs- 
ler’s most significant contribution to theoretical, 
and perhaps to applied, psychology is to be 
found in his chapter on the “Problem of Men- 
tal Deterioration,” because it is in this chapter 
that he discusses the data from his study which 
bear upon the important topic “Variation of 
Scores on Different Tests with Age.” * Be- 
cause of the number and types of subjects and 
the variety of tests that he has used, Wechs- 
ler’s study will stand as a landmark in this 
field. * presents some suggestive ideas about 
the measurement of mental deterioration, but 
because of the difficulties involved, Wechsler 
makes no definite commitments on this topic. * 
Wechsler’s chapter on the “Nature of Intelli- 
gence” is perhaps his poorest. It is not an ex- 
ceptionally lucid review of divergent stand- 
points on the subject; one gets the impression 
that he has attempted to reconcile the irrecon- 
cilable. * The norms are based upon examina- 
tions of 1751 adults and children in various 
occupational and educational groups of New 
York State, principally of New York City. 
Wechsler has apparently surmounted quite well 
the difficulty of obtaining adult subjects, so 
frequently mentioned by others. * Although the 
adult subjects were not selected on the basis of 
their educational attainment, the percentages 
of U. S. population and Bellevue adults in the 
respective educational Categories agree rather 
well; there are, however, relatively more cases 


at the upper educational levels among the Bel- 
levue group than in the total population. One 
would expect Bellevue JQ's to be lower than 
they should be for this reason; but the error 
involved is probably not a large one. Holding 
age constant, Wechsler has attempted to repre- 
sent each occupational group according to the 
percentages of total white population in each 
occupational category of the U. S. Census of 
1930. However, because of the predominantly 
urban character of his population, it was neces- 
sary to substitute for the rural groups not avail- 
able certain urban occupational groups which 
might have a mean intelligence rating of the 
same order. He took mean scores of the vari- 
ous occupational groups represented in the 
U. S. Army draft. “Thus we found that bar- 
bers, bakers and teamsters could be substituted 
for farmers, and that other missing occupa- 
tional representatives could be similarly re- 
placed" (p. 111). Though this procedure is an 
ingenious solution to a difficult problem, it 
raises the question as to whether his distribu- 
tion represents that of the total population. 
This question cannot be answered until data 
are accumulated from other parts of the coun- 
try. Another similar criticism is that most of 
the children used for standardization came 
from the public schools of the City of New 
York, Yonkers, and unspecified New Jersey 
cities; he did, however, obtain some subjects 
from institutions for the feebleminded. Wechs- 
ler insists upon using 7Q's, although I believe 
that theoretical and practical purposes would 
be as well served by other methods of express- 
ing brightness. * There is a glaring omission 
of the distributions of scores by age groups 
from this book. Certain facts would indicate 
that his distributions are not normal. * Except 
for the omission of Vocabulary in the original 
scale and the inclusion of Object Assembly, 
the subtests are well chosen. * The seven pri- 
mary abilities isolated by Thurstone (number, 
visualizing, memory, word facility, verbal re- 
lations, perceptual speed, and induction), as 
well as certain other abilities, are pretty well 
represented. Word facility per se does not ap- 
pear to be as well represented as, the others ; 
the author realizes that he was mistaken in not 
including the Vocabulary Test in his original 
battery. Intertest correlations ate substantial, 
but are low enough to indicate that different 
abilities are being measured by the subtests. 
Whether the author has selected the subtests 
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with the lowest possible intercorrelations and 
highest possible correlations with the criterion, 
is difficult to determine. Although it is desir- 
able to select tests thus, one is confronted by a 
number of insurmountable practical problems 
in attempting to select tests in this manner; 
and probably no one has abided by the letter 
of this rule in test construction. Wechsler has 
used a crude item selection technique, but he 
does not present details of the work; and his 
criterion appears to be intelligence ratings from 
other tests. Regarding the correlations between 
the subtests and the total scale, Wechsler is not 
clear, for he gives two different impressions 
of his procedure. * Unfortunately, the author 
has given no differential weighting to the sub- 
tests even though they do not correlate equally 
well with the remaining tests. The reliability 
of the scale is fairly high, but still somewhat 
lower than that of the Stanford-Binet. * The 
correlation between two groups of four tests 
(N = 355, ages 20-34) was .go when corrected. 
In discussing these procedures the author leaves 
an unfavorable impression by such misuse of 
terms as this: “When corrected for attenuation 
by the Spearman-Brown formula, the correla- 
tions . . ." (p. 126). Obviously he is using the 
Spearman-Brown correction for double-length 
and not the correction formula for attenuation, 
* Although the verbal and performance scales 
are not equivalent, there does seem to be a 
closer agreement between them than there is 
between other verbal and performance scales, 
and for this reason can be more readily substi- 
tuted for each other when the need arises. * 
Wechsler is properly enthusiastic about the 
validity of the scales. * The correlation for 75 
cases (ages 14-16) between Bellevue and Stan- 
ford-Binet IQ's is 82 + .026. This correlation 
is fairly high, and shows that the two tests 
measure about the same thing. It does not tell 
how closely Binet and Bellevue IQ's corre- 
spond. * On the surface, at least, the Bellevue 
scale appears to be better for commitment pur- 
poses with adults than does the Stanford- 
Binet. In using the test we meet with the same 
major objection that Wechsler has; namely, 
Bellevue IQ's seem too high for older per- 
sons. This situation is most likely due to the 
fact that the Bellevue JQ takes into account the 
decline in test performance of persons after 
maturity. In connection with the validation of 
this test a question arises, which is raised by 
all tests of intelligence with which I am famil- 


iar. Instead of assembling a test upon the basis 
of some general, ill-defined conception of intel- 
ligence, going through an elaborate standardiz- 
ing procedure, and then trying to find out what 
a test will do, why not first set up specific quan- 
titative criteria, formulate questions which 
would seem to test intelligence, and then pro- 
ceed to construct the scale by using the better 
item selection techniques, such as those of 
Horst? * Record sheets and test materials for 
the Bellevue Scales may be purchased from 
the Psychological Corporation. A most careless 
job has been done on the record sheets. In 
comparing them with the manual, one finds 
numerous reversals of items on the subtests 
and other errors, which, although of probably 
no great importance, are annoying when one 
uses the record form. Possibly unjustly, this 
situation casts a shadow on the whole test, 
causing one to wonder whether the buried de- 
tails of the test construction work have been 
attended to as well as they should have been. 
The test materials on the other hand have been 
carefully designed. Materials for the Object 
Assembly Test are a definite improvement over 
those in the Pintner-Paterson Performance 
Test. The Picture Arrangement cards meet with 
much less resistance among adults than do those 
of the Cornell-Coxe Performance Ability Scale. 
This situation is partly due to the fact that the 
Cornell-Coxe is so difficult. The materials are 
relatively inexpensive, and the parts can be pur- 
chased separately in case of breakage. It would 
be well, however, if a manual with tables were 
forthcoming, as it is necessary to purchase the 
whole book in order to obtain the manual con- 
tained in one-third. Rather than attempt to 
elaborate in detail upon every item of the Belle- 
vue tests, I have prepared a table which shows 
the extent to which Conrad's desiderata of a 
valid general intelligence test are satisfied by 
the Bellevue scales. Table r contains this in- 
formation. In looking over this table, one is 
struck by the large number of criteria which 
are fulfilled by these scales in comparison with 
those doubtfully satisfied or not satisfied. How- 
ever, some of the most important criteria either 
are not met or are doubtfully met by these 
scales. Variations in IQ's, for example, may 
merely represent inadequacies in the statistical 
procedures ; the latter cannot be fully evaluated 
without certain data which Wechsler does not 
present. Adequate statistical procedures in them- 
selves, on the other hand, do not insure a con- 
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stant JQ. Much research will be needed before 
we can have some assurance that the Bellevue 
1Q is constant. For clinical purposes, the Bel- 
levue Intelligence Scales will have certain lim- 
itations as well as merits. They require only 
about an hour to give and score, and adult sub- 
jects are more favorably inclined to their con- 
tent than to that of tests designed for children. 
'The test will probably not be as popular as the 
Stanford-Binet, however, for several reasons. 
The Binet is particularly useful for commit- 
ting children to institutions for the feeble- 
minded and for ability grouping in the public 
schools. The Bellevue tests are not likely to 
have extensive usage in the educational field, 
except perhaps in adult education. Group tests 
are much less time-consuming for college stu- 
dents, who fall in the age group for which the 
Bellevue tests were designed, and group tests 
of intelligence are probably not much less val- 
uable than individual tests for personnel and 
guidance work among high school graduates. 
Use of the results in court work as evidence 
for commitment will not be as extensive as 
that of the Stanford-Binet results, primarily 
because there are fewer mentally defective 
adults than children who have not been com- 
mitted, and because in many states it is quite 
difficult to get adults committed. Courts will 
undoubtedly prove reticent about accepting re- 
sults from this test unsupported by other psy- 
chometric data, until it has had widespread 
application. In the meantime it would be well 
to present results from other tests in conjunc- 
tion with those from the Bellevue Scales. 
Clinics which examine adults and older adoles- 
cents should have this test in their repertoire; 
and it should be a standard item for study in 
the graduate training program for clinical psy- 
chologists. Despite several questions about the 
test, principally about the constancy of the /Q 
obtained from it, the test appears to be the 
best individual adult intelligence examination 
on the market. It will prove to be a valuable 
supplement to the Stanford-Binet intelligence 
examination for adolescents and should even- 
tually supersede it for adults. The performance 
scale appears to have been worked out better 
than most performance tests which are de- 
signed to measure general intelligence, but the 
fact that it has not been standardized on for- 
eign language and Negro groups limits the test 
where it would be most useful. The book, The 
Measurement of Adult Intelligence, should 


prove useful as collateral material in teaching 
courses in intelligence testing, test construction, 
mental growth and decline, and abnormal psy- 
chology. The chapter on the “Problem of Men- 
tal Deterioration" cannot be overlooked in any 
study of mental growth and decline. — . 

Psychiatric Q 15:593-4 Jl '41. * The first 
portion of this book is a lucid though cursory 
dissertation on the nature of intelligence with 
especial reference to its range, its aberrations, 
and its chronological changes in the adult. A 
discussion of this kind, augmented as it is by 
practical considerations, constitutes an excel- 
lent prologue to the second division of the 
book. * To the psychiatrist, the chapter on 
mental deterioration, another unsettled prob- 
lem, will prove interesting. * The book is....no 
exhaustive treatise on intelligence but makes a 
good manual for administration of the test and 
subsequent interpretation of the scores. It 
merits recognition, and the scales should fill a 
definite need. 

Psychoanalytic Q 11:111-2 Ja ’42. Clinton 
P. McCord. * The chapters dealing with the 
nature of intelligence, the classification of in- 
telligence and the concepts of mental age and 
mental deficiency constitute especially fine sum- 
maries of certain theoretical and practical con- 
siderations in clinical psychology which deserve 
to be better understood by psychiatrists who 
are too often handicapped because their train- 
ing in psychology has been limited in both ex- 
tent and character. In the last part of the book 
is given The Manual of the Bellevue Intelli- 
gence Tests....followed by a concise exposition 
of the special statistical methods which were 
used to treat the material and to make the nec- 
essary corrections in the process of standardi- 
zation. This is followed by a series of carefully 
prepared tables of intelligence quotients for 
the various age levels and for both perform- 
ance and verbal scales. The footnotes through- 
out are to the point and most illuminating. * 
At every point the author reveals that he is in 
touch with the work and accomplishments of 
other important investigators in this special 
field, and his evaluations exhibit a remarkable 
and refreshing degree of objectivity. Dr. Wechs- 
ler’s book represents a piece of painstaking 
research, well tabulated, concisely reported, 
and with valuable practical implications for any 
psychiatrist whose training has afforded a cer- 
tain degree of mastery of the techniques of 
clinical psychology. It should also be directly 
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or indirectly useful to judges, juries, educators 
and lay executives. 


For additional reviews, see 40:B1121. 
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*[Re  Wechsler-Bellevue Intelligence — Scale.] 
WecusLer, Davi». The Measurement of Adult In- 
telligence, Second Edition. Baltimore, Md.: Wil- 
liams & Wilkins Co., 1941. Pp. xi, 248. * (London: 
Bailliére, Tindall, and Cox.) (PA 16:1239) For latest 
edition, see 301. 


Am J Orthopsychiatry 14:553 Jl 44. Hen- 
riette Woolf. * essentially the same as the origi- 
nal published in 1939, except for the addition 
of a chapter titled "Diagnostic and Clinical Fea- 
tures.” In this the author formulates subtest 
patterns that are typical of the organic, the 
schizophrenic, the neurotic, the adolescent psy- 
chopath, and the mental defective. The material 
is based on clinical observation rather than ex- 
tensive statistical validation, The method is not 
intended as a psychometric shortcut to psychi- 
atric diagnosis but as a differential aid of value 
in proportion to the skill and experience of the 
examiner. There is a discussion of the per- 
sonality patterns reflected by the various tests 
and illustrative cases are given. The preface 
comments on the changes in wording of some 
test questions amd states that new examples 
have been added to the scoring directions for 
three tests. Detailed comparison of the two 
editions revealed six simple changes in direc- 
tions and approximately the same number of 
scoring additions, The appendix now has tables 
of means and standard deviations for ages seven 
to forty-nine and a formula for calculating 
IQ's between the ages of sixty and eighty. 

Am J Psychol 55:608-9 O "42. Anne Anas- 
tasi, * One may....question the wisdom of using 
the term IQ to refer to a ratio whose deriva- 
tion is basically different from that of the tra- 
ditional IQ. * Approximately the first third of 
the book contains a discussion of underlying 
theoretical problems involved in the construc- 
tion of the scale, such as the nature of intelli- 
gence, mental decline and the problems encoun- 
tered in adult testing, and the concept of mental 
deficiency. It is perhaps unfortunate that the 
reader should be introduced to the excellent 
piece of work which this book represents 
through the chapter on the nature of intelli- 
gence. In this chapter, for instance, the writer 
presents Spearman's two-factor theory as 
though it were the universally accepted theory 


of trait relationship, the reader receiving no 
hint of controversy or discrepant results in this 
connection. A number of statements in the 
chapter appear, furthermore, to bea combina- 
tion of circular reasoning and wishful think- 
ing. Thus Wechsler writes (p. 8), “but as 
regards the demonstration of the existence of 
‘g’ as a common factor, there seems to be no pos- 
sibility of doubt. Psychometrics, without it, loses 
its basic prop.” (p. 8). Hypostatization is found 
in his contention that “we may say that ‘g’ is 
a psychomathematical quantity which measures 
the mind’s capacity to do intellectual work” 
(p. 8). Wechsler’s own bias towards a single 
general factor is illustrated further in the book, 
when he outlines the various possibilities for 
treating marked discrepancies between verbal 
and performance scores. The possibilities he 
presents are: to accept the verbal IQ; to ac- 
cept the performance IQ; and to average the 
two—he ignores completely the fourth possi- 
bility of retaining both as two independent 
measures! In a pessimistic critique of test- 
validation, Wechsler writes, “teachers’ judg- 
ments which were first condemned were now 
used as supporting criteria for the validity of 
tests" (p. 130). This involves a basic confu- 
sion between follow-up judgments based upon 
observation of behavior over a long period of 
time, which should serve as criterion ratings 
for test validation, and judgments from first 
impressions, which the tests are designed to 
replace. On page 27 Wechsler presents a graph 
which shows a definitely curvilinear relation- 
ship between standard scores on the Wechsler- 
Bellevue Scale and chronological age between 
7 and 17. From such data he concludes that 
“any method of calculating IQ's which assumes 
a linear relationship between chronological age 
and mental age cannot possibly give constant 
values for any considerable portion of the 
growth period." This involves a confusion be- 
tween original test-score and mental age. If 
mental age is determined in the customary 
way, i.e., by the performance of the majority 
of subjects at each chronological age, then the 
relationship between mental age and chrono- 
logical age must be linear, the mental-age unit 
automatically shrinking to adjust for the de- 
creasing progress at the upper ages. In conclu- 
Sion, it may be said that the chief merit of 
Wechsler's book is to be found in its presen- 
tation of a well-constructed and carefully 
standardized scale which represents a. very 


~ 393 ] 


TESTS & REVIEWS: INTELLIGENCE—INDIVIDUAL 


300 


valuable contribution to adult intelligence test- 
ing. The author's excursions into psychological 
and testing theory, on the other hand, are open 
to criticism at many points and frequently 
weaken, rather than strengthen, his discussion 
of the test. 

J Assn Am Med Col 17:64 Ja *42. 

J Consulting Psychol 7:167-8 My-Je '43. 
Irving Lorge. The second edition....is not mate- 
rially changed from the first. The primary 
change is the inclusion of a chapter on the 
clinical factors of the Wechsler-Bellevue intel- 
ligence scales and their diagnostic application. 
* Wechsler undoubtedly has made a significant 
contribution to clinical practice in the organiza- 
tion of well-known tests into a composite scale. 
Clinically applied, the scale has considerable 
diagnostic as well as measurement value. The 
clinical usefulness of this test, of course, will 
depend on the examiner's familiarity with the 
test and with the meaning of the test scores 
from various populations. Basically, however, 
the point of origin is the basic philosophy of 
the test. Wechsler defines intelligence as "the 
aggregate or global capacity of the individual 
to act purposefully, to-think rationally, to deal 
effectively with his environment.” In this sense, 
global is intended to characterize the individ- 
ual’s behavior as a whole. * The first question 
that can be asked of the scales might be: Is the 
test measuring a unitary function? * In order 
to interpret Dr. Wechsler’s contribution, an 
independent analysis of Dr. Balinsky's data 
was made, using the Hotelling component 
analysis. * In the age-9 group, the first factor 
accounts for approximately 44 per cent of the 
total variance; in the 12-year group, 4I per 
cent; in the 15-year group, 31 per cent; in the 
25-29-year group, 27 per cent; in the 35-44- 
year group, 38 per cent and in the 50-59-year 
group, 50 per cent. These findings seem to cor- 
roborate those of Jones and Conrad that vari- 
ous sub-tests make differential contributions to 
the total score as a person grows older. It is 
particularly important to recognize that in some 
age groups only about one quarter of the total 
variance is accounted for in the. first factor, 
whereas in the other age groups almost one 
half of the total variance is accounted for in 
the first factor. If intelligence is to be consid- 
ered as a global whole, one would certainly ex- 
pect that the relative contribution of a general 
factor such: as g should make approximately 
the same contribution in all age groups. This, 


Wechsler's scale undoubtedly does not make. 
Since Dr. Wechsler implies that "Professor 
Spearman's generalized proof of the two-factor 
theory of human abilities constitutes one of the 
great discoveries of psychology," it should also 
be recognized that Dr. Wechsler's scale is not 
a two-factor scale by any-means. It is a multi- 
factor scale, in which the factors are inconsist- 
ent from age group to age group. * Wechsler's 
scale not only is not a test, but also is not a test 
of consistent factors. Combinations of the vari- 
ous factors into a total test-scale must be very 
difficult to interpret, partly because we do not 
know the genetic variation in subtest scores nor 
the genetic variation in factor scores. The clini- 
cian, therefore, must interpret cautiously the 
implication of the test score in terms of the age 
of the person. Undoubtedly, Dr. Wechsler's 
scale raises once more the important question 
of content and form in intelligence scales. * 
Wechsler purposely mixes up measures of 
speed and of power. The mixture....raises the 
important problem of what it is that the scales 
are measuring. Are they measuring the ability 
to learn, the ability to adjust in the world or 
the efficiency of behavior? As a person grows 
older, reaction time declines, visual and audi- 
tory acuity decline. Such physiological and 
sensory changes, in combination with the psy- 
chological adoption of tempo may penalize older 
persons on speed tests, Insofar as they do 
penalize a person, the interpretation of an in- 
telligence test score becomes complicated. Is 
the test measuring fundamentally physiological 
changes or an adopted tempo or is it measur- 
ing intellectual power to adjust to the world? 
Dr. Wechsler’s test measures neither well. The 
number of factors derived at each age level 
suggests that he is measuring many different 
things simultaneously. This measurement of 
many different things simultaneously is not 
global intelligence, even though a single score 
may be used to represent a person’s position. 
The test will reflect variation in educational 
opportunity. In tests of information, of arith- 
metic and even of comprehension, the environ- 
ment and educational background of the person 
undoubtedly influence his test score. It seems, 
therefore, that a general appraisal of Dr. 
Wechsler's scale is that he has made a signifi- 
cant contribution in pooling the ten or eleven 
tests he has used, but that he has not yet solved 
the problem of measurement of adult intelli- 
gence either globally or differentially. It is un- 
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fortunate that the many contributions to the 
measurement of adult intelligence that have 
been made available since 1937 have not been 
incorporated into Dr. Wechsler's book. It is 
more unfortunate that some of the typograph- 
ical errors in the first edition were not cor- 
rected in the second. Certainly the Williams 
and Wilkins Company are not to be congratu- 
lated on the second edition. The printing of 
tables, particularly, does not reflect favorably 
on a house whose printing is to be “sans tache." 
J Higher Ed 13:396-7 O "42. J. Wayne 
Wrightstone. * timely and important * This is 
the best and most carefully constructed of the 
present scales for measuring adult intelligence. 
* Several sections of Mr. Wechsler's volume 
are of general interest and value. These include 
his discussions of the need for an adult-intelli- 
gence scale which recognizes that test perform- 
ance varies with age, revision of concepts of 
mental age and intelligence quotient as applied 
to adults, concept of mental deficiency, and pre- 
liminary uses of the scale for diagnostic and 
Clinical purposes. Certain biases or prejudices 
of the author are implicit, especially in his dis- 
cussion of the nature of intelligence where he 
seems unduly to favor and present the theories 
of Spearman and Thorndike and to dismiss 
those of Thurstone, Kelley, and others. Cer- 
tain shortcomings, also, should be mentioned. 
The norms, for example, should be accepted as 
tentative until checked by use among more un- 
selected geographical and cultural groups than 
were originally used. The formula for predict- 
ing intelligence quotients for various age-groups 
needs to be checked and refined upon unselected 
samples. The diagnostic and clinical features 
need to be checked carefully and to be accepted 
critically by trained examiners until the validity 
of such diagnosis has been more fully demon- 
strated. This volume will be of particular in- 
terest to students of tests and measurements, to 
psychologists and psychiatrists in hospitals and 
correctional institutions for adults, and to clini- 
cal workers in colleges or in industries. 
Occupational Psychol 16:92-4 Ap '42. B. D. 
Misselbrook. This important contribution to 
the literature of mental testing is still insuffi- 
ciently known in this country. It has a dual 
claim to prominence.—First, on account of 
much original and valuable information and 
Some very pertinent criticism contained in it; 
secondly, as probably the last of the Scales, 
Wechsler has produced a work of compromise, 
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and in so doing has indirectly shown the inade- 
quacy of a scale. Something is offered to those 
who are interested in the battery as opposed to 
the scale, by the provision of separate verbal 
and performance I.Q.'s, and by the presence of 
tables giving the means and standard deviations 
for the sub-tests. In a short chapter on the 
clinical features of the scale, which appears in 
the second edition, patterns of performance are 
also considered. But the real estimate of the 
'global capacity' intelligence is to be found in 
the I.Q.’s for the full scale. Wechsler accepts 
Spearman's demonstration of the existence of 
‘g; but maintains that a test which endeavours 
to measure 'g' exclusively is a poor test of in- 
telligence. Other ‘salient factors’ must be in- 
cluded, and by these he means principally the 
‘K’ and ‘Z’ factors of W. P. Alexander. Refer- 
ence is also made to the ‘functional unities’ of 
Alexander, but we are given no information on 
the factorial content of the sub-tests ; this might 
have been of particular interest in the case of 
the performance tests, Psychologists who would 
be in agreement about the importance of the 
‘X? and ‘Z’ factors, and who, in fact, regard 
the performance at any test as the reaction of 
the total personality, will find little informa- 
tion about the nature of the clinical data to be 
obtained from the individual tests provided. 
Many of the tests, e.g. Comprehension, Simi- 
larities, Block Designs, do provide such data; 
but it is its analysis, and the construction of 
tests for the detailed study of aspects of a re- 
action, which are now most required. This is 
too large a project for one man or one book. 
The book contains important facts on the de- 
cline of mental efficiency with increasing age, 
and on those tests which ‘hold up’ with age and 
those which do not, The analysis of the mean- 
ing of the M.A. and the 1.Q., particularly when 
used in assessing adult intelligence, is excel- 
lent. It can be recommended to psychiatrists, 
The Bellevue Scale 1.Q.’s are calculated by a 
new and more valid method. It is, however, to 
be regretted that the term was not abandoned. 
Wechsler has not merely complained of the in- 
adequacy of.the existing material for clinical 
use with adults ; he has given a lead by produc- 
ing something else, Unfortunately it is a lead 
in the wrong direction, Although it may be use- 
ful to obtain English norms for certain of the 
sub-tests for immediate use, a standardization 
of the full scale is undesirable. The alternative 
is for psychologists working in the clinical 
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field in this country to collaborate to produce 
an adequate battery. 2 
Psychiatric Q 16:610 Jl '42. * The new edi- 
tion has given the author an opportunity to add 
a chapter on the clinical features and diagnostic 
applications of the scale. In addition, other 
changes have been made in the text which add 
to clearness and ease of understanding on the 
part of the subject being tested. * presents, in a 
readable form, all that is worthwhile in both 
theoretical and practical aspects of mental de- 
ficiency. During the present generation, much 
advancement has been made in the study of this 
subject and interest is steadily developing. 
There is much yet to be done in this field and 
Wechsler does not hesitate to point out what is 
lacking in attempts which are being made in 
various directions for perfecting the art of in- 
telligence testing in the case of the subnormal, 
the average and those more gifted children and 
adults who are often in need of guidance. 
Psychiatry 5:454-6 Ag '42. Winifred Rich- 
mond, * Chapter Three is an excellent discus- 
sion of the concepts of mental age and the in- 
telligence quotient * One's personality func- 
tions in everything that he does, and he may 
succeed or fail on tests not because he lacks 
learning ability or the capacity to comprehend, 
but because of traits or habits or temperamental 
qualities which help or hinder the function of 
his intelligence. The process of merely getting 
an I.Q. ignores this fact, and may do grave in- 
justice to a child classified on that basis alone 
—as is still the practice in many places. The 
inclusion of performance tests in the Bellevue 
scale does something to remedy the situation, 
and gives the subject a chance at practical con- 
crete situations as well as bringing out certain 
modes of behavior in attacking such tasks. In 
the chapter on "Diagnostic Features" the au- 
thor discusses the use of his test in pathological 
cases. On the whole it is an excellent presenta- 
tion and any psychologist at all capable of ex- 
amining such cases ought not to be misled by 
the tables and illustrative cases into thinking it 
is easy to diagnose psychiatric conditions by 
means of 'signs.' But one feels that the author 
does not stress sufficiently the fact that psy- 
choses occur at all levels of intelligence, down 
to the imbecile at least, and that the degree of 
intelligence possessed by the subject may in- 
fluence an intelligence test as much as, or even 
more, than the type of mental disorder. Many 
cases, even of dementia pracox, can pass ex- 


cellent intelligence tests, with little indication 
of abnormal functioning ; and there are no tests 
of this type that will discover the non-institu- 
tional psychopath—and often enough the insti- 
tutional one as well. Nothing is more revealing 
to the institution-trained psychologist than to 
be set to examining “normal” people. He finds 
all the peculiarities he has been accustomed to 
find in his pathological cases, if not in such 
number or such great degree, at least enough 
to show him that the tests are not yet specific 
for any particular mental state. They can offer 
leads, but are not yet refined enough for differ- 
ential diagnosis. When the Manual of Instruc- 
tions is reached, one will find that the poor 
placing of instructions, norms, criteria, and so 
on which made the first edition difficult to use 
has been remedied. One is obliged to turn pages 
here and there and turn back again in scoring 
and in interpreting. Something could be done in 
rearranging the individual tests on the blanks 
to make the order correspond to that in the 
book, but surely instructions for scoring should 
be gathered together in one place, and not scat- 
tered about throughout the test presentation or 
placed after the tables of weighted scores. Per- 
haps a word about experience with the Bellevue 
tests is in order. For normal people, nurses, at- 
tendants, and others who must be examined, 
they were found to be very useful, giving con- 
siderable information about the kind and de- 
gree of the subject's intelligence, and being less 
time-consuming than some other tests. For neu- 
rotics they are also useful, especially for the 
more intelligent. For psychotics, except the 
convalescents and those not very sick, they are 
not so useful. The verbal portion of the scale 
is overweighted with tests of abstraction, there 
are no memory tests except the repetition of 
digits, and the performance tests are fatiguing 
to many patients. Nor has it proved very re- 
liable with patients of low intelligence; it rates 
them higher than, by other criteria, they are 
known to be. Nevertheless, one feels that the 
Wechsler-Bellevue test is on the right track. 
The fetish of the I.Q. is disposed of, the 
growth and decline of intelligence with age is 
taken into account, and the combination of ver- 
bal and performance tests in one scale is. a 
time-saver for the busy psychologist. 
Q R Biol 17:283 S 42. 
[301 ] 


*[Re  Wechsler-Bellevue Intelligence Scale.] 
Wecuster, Davi». The Measurement of Adult In- 
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telligence, Third Edition. Baltimore, Md.: Williams 
& Wilkins Co., 1944. Pp. vii, 258. $2.50. * (PA 19:815) 

Am J Psychol 58:420-2 Jl *45. Quinn Mc- 
Nemar. There is no evidence in this edition 
that the author has profited from the critical 
reviews of the first edition by Cureton (this 
JOURNAL, 54, IQ4I, 154) and of the second 
edition by Anastasi (ibid., 55, 1942, 608-609). 
The present reviewer, who agrees with, and 
recommends to the reader, all the criticisms set 
forth by Cureton and by Anastasi, will confine 
his attention to the two main changes made in 
this third edition. These are concerned with a 
method of measuring mental deterioration and 
the use of ‘signs’ and interest variability in 
clinical work. It is proposed that mental de- 
terioration can be measured on the basis of the 
difference between an individual’s scores on 
four Hold subtests versus four Don’t Hold 
(decline more with age) subtests, allowance be- 
ing made for normal decline. The difference is 
expressed as a percentage, with the ‘hold’ score 
as a base or divisor, “in order to take into ac- 
count the absolute magnitude” (p. 65) of the 
scores being compared. This false notion of 
‘absolute’ leads to percentages varying as 
widely as 20 per cent to 80 per cent for dete- 
riorations which would be judged equal on a 
standard-score basis. But this is not our major 
criticism, Suppose ‘a 52-year-old with a ‘hold’ 
score of 36 (average for his age-group) has a 
‘don’t hold’ score of 22 (9 points below his age 
average). His deterioration would be 39 per 
cent, which becomes 25 per cent when allow- 
ance is made for normal decline with age, Does 
this really represent deterioration? If so, from 
what unknown earlier level of functioning? 
Straightforward consideration of the correla- 
tion (circa .74) between ‘hold’ and ‘don’t hold’ 
scores, and their standard deviations (Circa 
9.3), leads to the interesting fact that 7 per 
cent of normals with no deterioration whatever 
except normal decline would show as large a 
loss by merely maintaining their relative posi- 
tions as of age 20. If a 52-year-old scored 36 
on the ‘hold’ and 31 on the ‘don’t hold’ tests, 
no loss other than normal decline would be 
inferred ; but for all that the clinician can tell 
this 31 may have represented a 19-point loss, 
since the patient's Score on the 'don't hold' tests 
could easily have been so at age 20, even 
though his score at that time Was average on 
the ‘hold’ tests. Such are possible outcomes of 
Wechsler's differential-test-score method for 


measuring mental deterioration without knowl- 
edge of previous functioning. For clinical and 
diagnostic use, the author has elaborated a sys- 
tem of signs which depend upon an individual's 
intertest variability. When a subtest score devi- 
ates more than a specified number of points from 
the individual's mean subtest score, it is said to 
be ‘significant.’ The required amount of devia- 
tion varies with the magnitude of the total score 
which implies that the scattergrams for total 
score versus subtests are heteroscedastic. The 
same deviation is required for all ten subtests 
despite the fact that typically their correlations 
with total score range from the .50s into the 
70s. Surely a deviation of 2 points for Object 
Assembly cannot possibly have the same sig- 
nificance as a 2-point deviation for Block 
Design (r’s of .409 and .714 respectively, ages 
20-34). Furthermore, what is the real signifi- 
cance of, for example, a 2-point deviation ac- 
companying total scores of 80-110? Since the 
standard deviation of the distribution for any 
subtest is approximately 3.00, it follows that 
the standard errors of estimate, or normal 
variations of subtest scores for a given total 
score, will range from 2.0 to 2.5. Thus a devia- 
tion as great as 2 points will occur 32 (or 
more) per cent of the time as a matter of nor- 
mal variation, hence it seems gratuitous to 
claim ‘significance’ therefor. Wechsler next 
proposes to utilize certain of the ‘significant’ 
deviations as signs for use in clinical diagnosis. 
It is to be hoped that such usage can be objec- 
tively validated instead of being based on that 
vague thing called ‘clinical experience,’ which 
Wechsler (p. 157) crawls behind in discussing 
the well-controlled experiment of Magaret and 
Wright (J. Appl. Psychol., 27, 1943, 387-398). 
The findings of these investigators concerning 
the lack of large intertest variability for schizo- 
phrenics is rejected because it is not in agree- 
ment with his ‘own observations’ and the ‘clini- 
cal experience’ of others. This discrepancy is 
attributed to the method used by Magaret and 
Wright for calculating intertest variability, 
which he properly—considering his misunder- 
standing of the method—argues should have 
been done differently. Then he proposes a 
method of calculating intertest variability which 
is the exact method they used, so he has unwit- 
tingly placed himself in the position of having 
to accept their findings, or else again take cover 
in the clinical foxhole. 

J Am Med Assn 127:739 Mr 24-45. One of 
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the most serious criticisms of earlier editions 
of this well-known and accepted battery of in- 
telligence tests devised for adults rather than 
for children was that the tests given were not 
standardized on a sufficient number of persons. 
This situation has now been corrected to some 
extent, but the type of cases used leads to sta- 
tistical misinterpretations. In addition there are 
some defects in the author's concepts of test 
formation, particularly the idea that there is 
such a thing as mental alertness that can be 
measured. Wechsler argues that his test is a 
good measure of intelligence because experi- 
ence has shown it to be so, and the experience 
of the reviewer is that this is not an acceptable 
statement. In many instances the test results 
are too high when weighted against the psy- 
chiatric evaluation. The fact that the test is 
weighted for adults causes the author to make 
certain concessions which are not acceptable: 
the slightly deteriorated or elderly adult reacts 
higher than his actual mental function. Mental 
defectives are also rated higher than they would 
be on other tests, and most clinical psychiatric 
evaluations indicate that the Wechsler examina- 
tion rates defective persons too high. The test 
procedure and the criteria for evaluating the 
answers are included in the book. For the 
skilled examiner who can select tests effectively 
to suit various problem individuals, the Wechs- 
ler test is a very satisfactory one. For general 
use the psychiatrist might well be cautious in 
accepting results obtained by any but the most 
highly trained clinical psychologist. 

Personnel Adm 8:23 S "45. Irma Stark. * 
The Wechsler-Bellevue Scale has made psy- 
chological history. Since its first appearance, 
users of the Scale have recorded in volume and 
with enthusiasm its value as a clinical instru- 
ment, The literature, collectively reviewed, in- 
dicates that the Scale has accomplished for 
adult ‘intelligence testing what the Binet has 
done in the field of child testing. In addition, 
it is the first such scale which provides a scien- 
tific basis for analyzing the behavior of an in- 
dividual in terms of those elements which are 
characteristic of psychological disorders. Al- 
though the book is indispensable to clinical psy- 
chologists, psychiatrists, teachers of psychology 
and psychiatric social workers, it has value too 
for those personnel workers engaged either in 
test construction, test administration, or indi- 
vidual placement and rehabilitation. * Although 
the Wechsler—Bellevue Intelligence Scales will 


undoubtedly have most usefulness in those 
agencies where individual diagnosis and treat- 
ment are the primary concern, such as in Vet- 
erans’ Facilities, there is much in the book, in 
the excellent introductory section, in the care- 
ful discussion of the standardizing techniques 
and in the test items themselves that can offer 
leads for more scientific personnel administra- 
tion and merit examination test construction. 
Psychol B 42:186-7 Mr ’45. Ann Magaret. 
* Chapter 6 outlines a method for determining 
quantitatively...the degree of a subject’s men- 
tal deterioration. Based on the assumption that 
abilities which do not decline with age in nor- 
mal subjects are those least affected by dete- 
riorative processes in psychosis, the method in- 
volves comparing a subject’s mean score on the 
tests which hold up with age with his mean 
score on tests which do not hold up with age. * 
Whether the assumption that deterioration in 
psychiatric disorders follows that in senility is 
justified can be determined only through the 
application of Wechsler’s technique to psy- 
chotic groups and comparison of clinical with 
psychometric findings. With the exception of 
one case study, however, no test of this as- 
sumption is reported. Chapter 11, Diagnostic 
and Clinical Features, summarizes the author’s 
impressions concerning the significance of a pa- 
tient’s relative performance on single subtests 
of the scale in the differential diagnosis of neu- 
rosis, organic brain disease, schizophrenia, psy- 
chopathy, and mental deficiency. * Tables are 
given showing patterns of subtest scores for 
various clinical syndromes. The reader is told 
what constitutes a significant variation from 
subtest to subtest, and is provided with “rules 
of thumb” for determining diagnostically im- 
portant variations, but evidence for the va- 
lidity of such rules is not reported. Despite the 
facts outlined in Chapter 6, age is apparently 
not considered in this method of diagnosis. The 
validity of differential diagnosis must depend 
upon demonstration of correspondence between 
inter-test variations and particular psychi- 
atric disorders, preferably with large groups of 
carefully diagnosed patients, age controlled. 
No such analysis is presented by the author, 
who depends rather upon informal clinical ex- 
perience in correlating pattern with syndrome. 
A cursory inspection of recent research on the 
use of the scale in diagnosis (as listed in 
Wechsler’s bibliography) reveals that the im- 
pressionistic data found in the author’s diag- 
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nostic tables are occasionally at variance with 
reported research results. Despite this fact, and 
despite Wechsler’s own statement that “the 
data . . . are not intended, nor can they be used 
as psychometric shortcuts to psychiatric diag- 
nosis” (158), there are general statements 
throughout this chapter which strongly tempt 
the hurried psychometrician to succumb to such 
short cuts. For example, “The correlations are 
sufficiently high to be of value in vocational 
guidance . . .” (146) ; “The inconsistency here 
is what definitely shows this case to be schizo- 
phrenic” (163). * A convenient method of ob- 
taining IO's for subjects between the ages of 
60 and 8o is offered. * Psychological examiners 

. may regret the continued absence of precise 
“statements of subtest reliabilities, and of indi- 
cations of practice. effects, and will recognize 
the repetition of minor awkwardnesses of in- 
structions. The thesis of the major revisions, it 
seems to the reviewer is that an examiner ad- 
ministering the Wechsler-Bellevue Scale to a 
single patient may obtain considerable infor- 
mation besides a measurement of general intel- 
ligence. Clinicians would agree that this is true 
of any standardized situation in which a pa- 
tient is observed. In a scale which finds as wide 
application as the Wechsler—Bellevue, however, 
it seems essential that those cues which the ex- 
aminer uses in assaying the patient be made 
explicit through the reporting of exact research 
results wherever possible. Unless this is done 
systematically by the authors and users of 
Standard scales, the diagnosis of mental de- 
terioration or psychiatric disorder from in- 
telligence test results will remain inspired 
guesswork. 

Q R Biol 20:106-7 Mr '45. C. Hart West- 
brook. * One outstanding value of the Bellevue 
Scale is that it enables clinicians to measure 
and score separately by the Verbal and the Per- 
formance Scales the two distinct groups of 
abilities or capacities, verbal and performance, 
which in some other widely used tests are 
measured by single scale and indicated by a 
composite score, * Another value of the book 
is that the extension of the Scale to mental 
levels higher than those measured by any other 
but Thorndike's CAVD Scales makes possible 
the mental measurement of persons up to the 
age of 60 years, and by a process of extrapola- 
tion, “without serious error by the use of some 
simple multipliers which take care of the age 
factor,” of those between 60 and 80 years, * 


Wechsler has completely revised the table of 
“signs” for different clinical entities revealed 
by the tests and has elaborated a new quantita- 
tive method for the determination of mental 
deterioration. These changes will render the 
book much more valuable to the clinician, both 
for his own records and in his interpretation of 
the results for the guidance of other members 
of the clinic in their final diagnosis and in 
treatment and disposition of difficult cases. 
Alterations in the scoring of certain subtests 
and in the use of scores of subtests for further 
analysis of the mental deterioration or im- 
provement of the subject constitute another 
improvement in the Scale. * Chapter 11, on 
Diagnostic and Clinical Features, should be 
especially valuable, as it contains tables of test 
characteristics of the clinical groups, followed 
by comments on the groups and illustrative 
cases, each accompanied by a tabulation of the 
scores made on the subtests of both Verbal and 
Performance scales and the respective I.Q.’s. * 
In view of the well-adapted materials used and 
the evidences of standardization, validity, and 
reliability presented by the author, it seems that 
the new edition of the Bellevue Scale should 
prove to be highly reliable and valid for gen- 
eral use as well as for the special use intended 
with returning soldiers in the problems of their 
rehabilitation. 
[ 302 ] 

*Wechsler Memory Scale. Adults; 1945; individ- 
ual; Forms 1, 2; $1.75 per 50 record forms, postpaid ; 
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paid; David Wechsler; Psychological Corporation. 
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Kate Levine Kogan, Clinical Psychologist, 
154 Chesterfield Road, Pittsburgh 13, Pennsyl- 
vania. This simple, concise measure of memory 
function has already proved to be a valuable 
addition to the available clinical techniques. 
Administration time is brief , directions are clear, 
and scoring criteria are objective. Perhaps its 
most important use is in conjunction with the 
Wechsler-Bellevue intelligence scale, since the 
memory quotient is designed to be directly 
comparable to the intelligence quotient. Use of 
the test permits intra-individual comparison of 

E De niu. H 
the patient's memory impairment with his loss in 
other intellectual functions rather than only a 
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comparison with a general average or norm. 
Thus, it is possible to distinguish accurately be- 
tween the kind of memory impairment which is 
merely one aspect of generalized mental ineffi- 
ciency and that which represents a specific 
decrement in memory function. Certain other 
measures have had a somewhat similar goal. 
The Hunt-Minnesota Test for Organic Brain 
Damage, for example, is based on the relation 
between vocabulary level and scores on two 
types of immediate memory tasks. However, 
since vocabulary level is commonly accepted as 
being a better indication of native endowment 
prior to the onset of illness than it is a measure 
of functioning level, a different picture is ob- 
tained. The Wechsler Memory Scale, in con- 
junction with the intelligence scale, allows for a 
broader analysis of specific mental interference 
at the time of examination. Furthermore, allow- 
ance is made for memory variations with age. 
The test material is sufficiently varied to pro- 
vide qualitative observations which may have 
additional value. One can distinguish, for ex- 
ample, cases in which remote memory, data of 
personal identification, and extremely familiar, 
well-learned patterns are retained in contrast to 
faulty performance with the formation of new 
learning associations; such findings often pro- 
vide confirmatory evidence of disturbances of 
attention and concentration. One can compare 
retention of meaningful material and rote 
memory. Finally, one can isolate specific dis- 


turbance in the area of space-form perception 
and visual memory. These features make the 
test findings especially helpful in problems in- 
volving differential diagnosis. 

The test was designed for use with adults 
only, and the scoring involves the addition of 
certain constants to the raw score in order to 
equate scores at different age levels above 
twenty years. The test author has stated in a 
personal communication to the writer several 
years ago that the test might prove to be useful 
with adolescents by employing the same age- 
correction unit established for the 20- to 24-_ 
year group. This has not been subjected to ex- 
perimental verification, but the practice has 
seemed to yield meaningful and consistent re- 
sults with adolescents. 

Because of its advantages over other similar 
techniques, this scale is appropriate to a wide 
variety of research studies. However, the lack 
of an alternate or parallel form has been felt, 
especially for those experimental designs which 
require a “test-retest” method. One group of 
investigators has recently proposed an alternate’ 
form which was designed to provide equivalent 
material for each of the test items (2). This 
has been published so recently that there has 
not been sufficient time for widespread use. 
However, if it proves to be an accurately 
equated alternate form it will extend the sphere 
of usefulness of the Wechsler Memory Scale 
even further. 


MATHEMATICS 


Reviews sy Albert A. Bennett, William Betz, Paul Blommers, 1 " 
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Cooperative Test in Secondary School Mathe- 
matics: Higher Level. Grades 10-12; 1942; IBM; 
Form S; separate answer sheets need not be used; $2 
per 25; 25¢ per specimen set, postpaid; 4o¢ per 25 
machine-scorable answer sheets; 15¢ per stencil for 
scoring answer sheets; 80(85) minutes; Margaret P. 
Martin, William Mollenkopf, Radcliffe W. Bristol, 
William S. Litterick, and Carroll G. Ross; Cooperative 
Test Service. 


E. H. C. Hildebrandt, Associate Professor 
of Mathematics, Northwestern University, 
Evanston, Illinois. Students taking this test are 
classified into one of three groups: those hav- 
ing had less than three and one-half years of 
high school mathematics, those who have had 
three and one-half years or more of high school 
mathematics but not including advanced (col- 
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lege) algebra, and those who have had three“ 


and one-half years or more of high school 
mathematics including advanced (college) al- 
gebra. There are 150 questions, most of which 
give a choice of five answers. Students in the 
first group mentioned above start with Item 1 
and go as far as possible in the time allowed. 
Those with three and one-half years of high 
school mathematics begin with Item 51. Stu- 
dents who have not had the advanced (college) 
algebra omit questions marked with an asterisk. 

The questions are distributed as follows: 
algebra 58, plane geometry 40, solid geometry 
14, trigonometry 24, and college algebra 14. 
The first fifty questions are divided almost 
equally between elementary algebra and plane 
geometry. The last fifty questions are about 
equally distributed among trigonometry, solid 
geometry and college algebra. The early ques- 
tions rely on an understanding of fundamental 
definitions and formulas. Apparently the im- 

portant topics of all the secondary school 
-courses in mathematics have been included. 
^ Scores on the test are obtained by subtract- 
ing one-fourth (to the nearest unit) of the 
number wrong from the number right. An- 
_ Swers to questions are listed on a fan-type key 
to facilitate checking. 

The questions appear to be well chosen, 
graded, and distributed. They show careful 
consideration on the part of the editors, The 
type used is satisfactory, and important items 
in each question are given added emphasis by 
using either bold type or italics. 

The test could be given at the end of the 
high school course to obtain a measure of com- 
petence in high school mathematics, Colleges 
could use it effectively to determine what defi- 
ciencies might be present in the secondary 
school training or as a basis for entrance into 
more advanced courses in mathematics. 
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Cooperative College Mathematics Test for 
First-Year Courses. Grade 13; IBM; Provisional 
Form T; 1943; separate answer sheets need not be 
used; $1.75 Der 25; 25¢ per specimen set, postpaid ; 40e 
per 25 machine-scorable answer Sheets; 15¢ per sten- 
cil for scoring answer sheets; 40(45) minutes; Emma 
Spaney ; Cooperative Test Service. 

Albert A. Bennett, Professor of Mathe- 
matics, Brown University, Providence, Rhode 
Island. This test is designed for a 40-minute 
period of working time, and aims to compass 
dn its 55 questions most of the topics likely to 
be emphasized in any first-year college course 


in mathematics. It accomplishes this ambitious 
and discouraging task in a highly creditable 
manner. From the point of view.of any one 
person being examined, the test is likely to be 
judged unreasonable, touching as it does upon 
many topics that the particular individual may 
never have heard of while slighting others in 
which he has been drilled. As part of a “profi- 
ciency examination” for students of varied 
background who would waive a requirement 
of freshman mathematics or who seek transfer 
credit on the claim of extensive prior training, 
the test is particularly well designed. Necessarily 
attention is directed toward the recognition of 
technical terms and of their approximate mean- 
ing, and can call for little or no written com- 
putation. The test has been successful in incor- 
porating a.large number of important concepts 
and in avoiding unimportant verbal distinc- 
tions. 

Despite the freedom from misprints, and the 
care shown throughout, the zealous purist can 
find excuse for criticism in some of the for- 
mulated questions or answers. The words 
“best” in Item 49 and “simplest” in Item 50 
may be accepted grudgingly. In Item 1, a 
function should be called a "correspondence" 
rather than a "number"; in Item 12 one may 
ask in what argument is the equation supposed 
to be rational and integral, and even whether 
such a quality can ever characterize an equa- 
tion as a whole. Item 16 seems to be unfortu- 
nately worded. Item 32 makes no mention of 
what denominator is intended and cannot 
therefore ask one to identify a corresponding 
numerator. In Item 42, Choice 3, the second 
equal sign could better be a minus. In Item 2 kh 
one might question whether any proposition 
could be “the geometric significance of C." In 
Item 29, presumably "either" does not refer to 
"each." In the difficult task of selecting four 
erroneous answers to be offered as alternatives 
to the correct one, the authors have not always 
succeeded in making all five approximately 
equally plausible. Indeed, a clerical assistant 
not even understanding a problem could rule 
out many of the answers as probably erroneous, 
acting on the simple rule that any answer hav- 
ing no other close analogue in the list of five 
cannot be right. But, of course, such a narrow- 
ing of the list of plausible answers would not 
identify the single correct one. The test is to 
be particularly commended for the unambiguity 
with which the questions are phrased, 
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A scoring aid, provided for the examiner, 
requires folding and a little manual juggling 
but serves satisfactorily after some experience. 
The published norms furnished with this test 
do not apply to this 40-minute revision. The 
general manual of directions for using the 
Cooperative tests is all that could be desired. 

With increasing diversity of first-year mathe- 
matics courses, the primary objective of this 
test becomes increasingly futile. Whether math- 
ematical achievement is adequately measured 
in terms of recognition and one-step computa- 
tions is also open to debate. 


Nathan Morrison, Principal Actuary, New 
York State Division of Placement and Un- 
employment Insurance, New York, New York. 
It is likely that four out of five teachers of 
college mathematics have an automatically un- 
favorable reaction to multiple-choice tests in 
mathematics. It is this reviewer's experience 
that most teachers prefer the essay type of test 
with the student being asked to answer from 6 
to 10 questions, each involving a considerable 
amount of work. It may well be that this essay 
type of test is more reliable and more infor- 
mative concerning the ability of the student. 

Yet, in many situations, the multiple-choice 
test may be desirable or required because of 
the scoring speed and potentially wider range 
of coverage. In this reviewer’s opinion, mul- 
tiple-choice tests in mathematics would gain 
easier acceptance and would be greatly im- 
proved if certain principles were followed which 
would preserve some of the basic features of 
essay types of mathematics tests. 

First, the items should be carefully designed 
so that the student has a specific problem to 
work and has occasion to look at the answers 
only to see if the solution he has obtained is 
among them. As a general rule, it should not 
be possible to work the problem backwards by 
eliminating the incorrect alternative answers. 
It is unfortunate that many teachers have come 
to believe that multiple-choice tests are little 
better than guessing games requiring more 
luck and general intelligence than mathematical 
ability. 

Second, an average of about two minutes of 
working time per item should be allowed. This 
should be accomplished by having a range of 
items requiring a minimum of one-half min- 
ute to a maximum of possibly four minutes. 
Too often, multiple-choice tests are more tests 


of reading ability and speed than of mathe- 
matics. 

Third, there should be a minimum number 
of items which are only definitions of terms or 
of procedures or which require that students 
should have memorized long and complicated 
formulae. If a student is doing only six prob- 
lems and has 15 or 20 minutes for each, he 
may have time to develop a needed formula 
from more elementary ones, but this is not the 
case in multiple-choice tests. 

The Cooperative College Mathematics Test 
For First-Year Courses does not take into ac- 
count the principles stated above. Only 40 min- 
utes are allowed for 55 items, an average of 
less than 45 seconds per item. Almost half of 
the items are so stated that the student must 
read the entire item, including all five alter- 
native answers, before he even realizes what 
the problem requires of him. Many of these 
items together with the answers contain about 
100 words. It appears doubtful whether the 
average student would have sufficient time to 
read these items, even if he did not have to try 
to work the problem and obtain a solution. 

The test is intended to cover first-year col- 
lege mathematics, including the calculus, but 
the items do not give an adequate, balanced” 
coverage. It is this reviewer’s opinion that the 
14 items devoted to trigonometry are about 
twice the desirable number, and it appears 
doubtful whether two items devoted to simple 
and compound interest are needed in this type 
of test. 

[ 305 ] 
Cooperative Mathematics Test for Grades 7. 8, 
and 9. Grades 7-9; 1940-47; IBM; Forms Q, RO, X; 
separate answer sheets need not be used; $2 per 25; 
25¢ per specimen set, postpaid; 4o¢ per 25 machine- 
scorable answer sheets; 15¢ per stencil for scoring an- 
swer sheets; 80(85) minutes; Forms Q and RO: Alice 
H. Darnell, John C. Flanagan, Stevenson W. Fletcher, 
and Rose E. Lutz; Form X: Vernon Price; Coopera- 
tive Test Service. 
REFERENCES 

1-2, See 40:1433. 5 

3. THomas, Louis W. A Statistical Analysis of a Test in 
Iunior High School Mathematics, Unpublished master's thesis, 
Ohio University, 1940. Pp. 46. (Abstracts of Masters’ Theses 
++) I941, De 53.) 

M. L. Hartung, Associate Professor of the 
Teaching of Mathematics, The University of 
Chicago, Chicago, Illinois. [Review of Forms 
Q and RO.] If a single test must be used in 
evaluating the general mathematics program in 
the grades indicated, this is probably the best 
available instrument. Forms Q and RO are 
essentially equivalent and will be reviewed to- 
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gether. Each form consists of 130 items ar- 
ranged in parts as follows: I, Skills; II, Facts, 
Terms, and Concepts; III, Applications; IV, 
Appreciation. For every item the student must 
select the best answer among five choices. With 
few exceptions, the items are well formulated, 
and the possible responses offered are well 
chosen. Although the content is predominantly 
arithmetic, there is a better sampling of mate- 
rial from simple algebra and informal geometry 
than is common in the arithmetic tests for these 
grades, Numerical trigonometry is omitted, and 
volumes of other than rectangular solids are 
slighted. 

The only manual supplied the reviewer was 
the generalized set of "Directions For Using 
The Cooperative Tests." This contains careful 
directions for giving, scoring, and interpreting 
the tests, including general precautions to be 
observed in using tests. It is excellent as far as 
it goes, but it contains no data on the validity 
and reliability of these tests. The articles by 
Seder (1, 2) refer to the earlier Forms O and 
P; and since these are practically equivalent to 
Forms RO and Q, the data from them suggest 
that the coefficient of reliability of the total 
score is about .92. The part-score reliabilities 
are probably all above .70. Seder's data indi- 
cate that the coefficients of correlation between 
the parts range from about .46 to 72. On the 
key is a table for converting raw scores to 
"scaled scores" which equalize units and facili- 
tate the comparison of results between the 
forms and with other Cooperative tests. Tables 
of percentile norms of scaled scores for each 
grade and part are also provided on a separate 
sheet. 

The effort of the authors to Separate the 
items into subtests each of which relates pri- 
marily to an important type of objective is 
highly commendable. It seems, however, that 
many of the items could just as well have ap- 
peared in another part. This is particularly true 
of those classified under “appreciatiori.” The 
intercorrelations between parts are therefore 
surprisingly low, A more penetrating analysis 
of the behavior suggested by the subtest titles, 
and the exercise of greater ingenuity in the 
selection of items to measure them, would 
probably make the subtests not only more valid 
with respect to the objectives but also even 
more independent. It is probable that the neces- 
sity of restricting the technique to simple five- 
choice items made it difficult for the authors to 


get at some of the important concepts they had 
in mind. The other Cooperative mathematics 
tests were separated into parts because differ- 
ent testing and scoring devices were to be used i 
the separation into parts in terms of objectives 
in Forms Q and RO is a desirable step for- 
ward. 


For reviews by Richard M. Drake, Judson 
W. Foust, and G. M. Ruch of Form, P, see 
40:1433. 

[ 306 ] i 
*Foust-Schorling Test of Functional Thinking in 
Mathematics. Grades 9-16; 1942-44; korma A Bs 
$1.45 per 25; 35¢ per specimen set, postpaid; 45(50 
minutes; Judson W. Foust and Raleigh Schorling ; 
World Book Co. 


REFERENCES 


1. Foust, Jupson W. The Construction of a Standardized 
Test in Functional Thinking. Unpublished doctor’s thesis, Uni- 
versity of Michigan, 1938. Pp. 193. 


William Betz, Specialist in Mathematics, 
Public Schools, Rochester, New York. The 
test contains 80 questions, to be answered in 
45 minutes, divided as follows: (a) Twenty 
completion-type statements, each based on a 
typical case of dependence, distributed over 
the fields of arithmetic, geometry, and algebra. 
(b) Seventeen true-false items on the same 
subjects and also on graphs. (c) Fifteen items 
which require the student to express the sug- 
gested relationships “in mathematical language 
or symbols.” The illustrations chosen are non- 
technical and should occasion no difficulty to 
a student having completed a good course in 
general mathematics. (d) Seven items, each of 
which calls for an equation expressing a sug- 
gested relationship. (e) Twenty-one comple- 
tion-type questions based on basic geometric 
facts, simple formulas, and typical graphs. 

The test will undoubtedly be welcomed by 
experienced teachers; the items are sufficiently 
representative and varied to enlist the interest 
of students tested. If "functional thinking" is 
indeed of central importance in the teaching of 
mathematics, the norms on this test suggest 
that functional thinking must receive much 
more careful attention in the classroom than 
appears to have been the case. 


M. L. Hartung, Associate Professor of the 
Teaching of Mathematics, The University of 
Chicago, Chicago, Illinois. This test is designed 
to measure the following abilities: (a) to rec- 
ognize the relationships between varying quan- 
tities by telling on what the value of certain 
quantities depends; (b) to interpret a mathe- 
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matičal statement or expression of dependence 
by telling just how the one quantity depends 
upon the other, or varies as the other varies; 
(c) to express such relationships in mathe- 
matical language or symbols. In spite of the 
unquestioned importance of these objectives, 
no test specifically devoted to measuring them 
has hitherto been generally available, This test 
of 63 completion items and 17 true-false items 
is for grades 9 through 12, and might also be 
used with college freshmen. It demands little 
computational skill and consequently should be 
used along with other instruments by teachers 
who recognize the value of testing all the im- 
portant objectives of a modern mathematics 
program. In the opinion of the reviewer, this 
or a similar instrument should be a part of the 
testing program of every high school. 

The manual contains an unusually complete 
discussion of : the need, scope, and purpose of 
the test; preliminary tryout, validity, reliability, 
standardization; how to give the test; how to 
score the test; and interpretation of scores. 
Standardization based on nearly 5,000 cases 
provides percentile norms for each grade. The 
reliability found by the Kuder-Richardson 
method ranges from about .80 to .88 for dif- 
ferent grades, and the intercorrelations of 
Forms A and B, also a measure of reliability, 
are about .87. 

The critical comments which follow suggest 
ways in which the test can be improved. First, 
the completion technique used requires the stu- 
dent to write word answers to Items 1 to 20. 
In several cases (e.g., Items 50 and 52), a 
number of alternative forms of algebraic ex- 
pression are correct. In such cases the objec- 
tivity may be questionable. For example, even 
the second answer in the key for Item 55 is 
not acceptable to the reviewer. Second, a con- 
siderable number of items are so conventional 
in content that a formal or memoriter response 
may suffice, with little or no real functional 
thinking being involved. An example is Item 
7: "The perimeter of a square depends on 
." Although all of the content of the 
test is valid in the sense that it is convention- 
ally used in teaching, this feature reduces the 
scope of the objective. Ideally, the students 
should be expected to recognize, interpret, and 
express relationships new to them. Third, it 
would be desirable to have separate scores on 
each of the subabilities. In Breslich's experi- 
mental test for the same objectives, subscores 


were obtained, although they were not very 
reliable. Ultimately, teachers should have reli- 
able subscores on the various aspects of func- _ 
tional thinking if they are to discover whether 
redistribution of emphasis is needed. In the 
meantime, the use of this test is likely to re- 
veal one of the weakest parts of the mathe- 
matics program in many schools. This in turn 
should eventually result in an upward revision 
of the norms for the test. 


Clearing House 19:263-4 D '44. E(arl) R. 
G(abler). The values in mathematics that fre- 
quently are not attained are those beyond the 
mechanical, in the realm of the functional. This 
test gives an adequate measure of the students’ 
ability to see this relationship or dependence. 
However, it seems inadvisable for the authors 
to claim considerable transfer of the functional 
thinking gained in the study of mathematics to 
other fields and situations. The test has ample 
justification without the claim.* , 

Ed Res B 24:139-40 My 16 '45. Harold P. 
Fawcett. * All teachers of mathematics who 
wish to increase the power of their students to 
handle relationships will find this test helpful 
in measuring the extent to which they are 
achieving this purpose.* 
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*General Mathematical Ability: Tests of General 
Educational Development: High School Level, 
Test 5. High school; 1944-45; IBM; Form B; sepa- 
rate answer sheets must be used; $2 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
25¢ per specimen set, postpaid; worklimit (120) min- 
utes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education ; distributed by the 
Cooperative Test Service. (Also distributed by Sci- 
ence Research Associates: $2 per 25; 65¢ per 25 ma- 
chine-scorable answer sheets; 5o¢ per key; so¢ per 
specimen set.) 


For reviews by Herbert S. Conrad and 
Warren G. Findley of the entire battery, see 20. 
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Iowa Placement Examinations: Mathematics Ap- 
titude. Grades 12-13; 1925-44; IBM ; 2 editions; G. D. 
Stoddard and E. W. Chittenden; Bureau of Educa- 
tional Research and Service, State University of Iowa. 
a) SERIES MAI, REVISED. 1925; Forms A, B ; $4 per 100; 
50€ per specimen set including the other 10 tests in the 
series, postpaid; 40(45) minutes. 

) NEW SERIES MA-2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM; Form M ; separate an- 
swer sheets need not be used; $1.15 per 25; 30£ per 
specimen set including the other 4 tests in the series, 
postpaid; $r.50 per 150 machine-scorable answer 
sheets; 15¢ per stencil for scoring answer sheets; 
75(80) minutes; revised by E. W. Chittenden and 
D. B. Stuit. 
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Edmund P. Churchill, Assistant Professor of 
Mathematics, Antioch College, Yellow S. prings, 
Ohio. The 1941 edition of the manual for the 
Towa Placement Examinations states that the 
aptitude tests are designed for those who have 
taken little or no formal work in the subject. 
It will not be the purpose of the review to 
judge how well these tests fit this design, for 
other comments in the manual and the use of 
college freshmen for standardizing the tests 
indicate that it was expected that the aptitude 
tests, like the mathematical training tests, would 
be used primarily with college freshmen having 
the normal amount of mathematical training. 
The test items do, however, seem to depend 
less on formal training than do the items on 
the mathematics training test. On all forms 
there are sections devoted to number-series ex- 
ercises, to short verbal problems, and to ques- 
tions based on reading material. Forms A and 
B (copyright 1925) also include a section of 


reasoning problems, and Form M (copyright 
1942) has a section designed to measure the 
students’ range of information in the field of 
mathematics. 

The major differences between Form M and 
the two earlier forms are that the new form is 
nearly twice as long—75 minutes working time 
—as the other forms—40 minutes working 
time—and that, except for one set of true- 
false questions in each form, the items of the 
longer test are of the multiple-choice type rather 
than the free-answer type previously used. The 
increased length of Form M was required, 
according to the manual, to obtain improve- 
ment in the test's discriminatory power and 
reliability. The reliability of Form A is given 
as 87, of Form M as .97. 

The test items appear to be ones which 
should differentiate between very poor and very 
good students, and we are assured that such is 
the case. For the test as a whole, few data 
are given to indicate how well it will predict 
success in college mathematics either when 
used by itself or when used along with the 
mathematics training test. The manuals do 
state that the average correlation between test 
scores and course grades for the series of tests 
Which includes Forms A and B was .5 and that 
the correlations for the group of tests with 
which Form M was issued ranged from .61 to 
-72. Whether the courses involved were liberal- 
arts or engineering-physical-science-mathemat- 
ics courses is not stated. Nor are any data given 
as to the value of the test in selecting students 
who will profit from courses of more than the 
usual depth, in selecting students who need 
subfreshman work before beginning their col- 
lege courses, or in differentiating between stu- 
dents who have the ability to complete success- 
fully more than one semester of mathematics 
from those who have not. Information on these 
last matters would seem to be often of more 
importance than over-all test score-course grade 
correlations. 

Single sets of norms are given for Forms A 
and B; for Form M norms are given for 
liberal-arts and engineering freshmen. The 
large difference between these two sets of 
norms clearly indicates the need for separate 
norms for these groups, 

The reading selections have not been worded 
as carefully as they should have been. The 
directions for the true-false items are faulty ; 
judging from the scoring forms, a statement is 
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to be marked false if it is not necessarily true; 
but no directions to this effect appear in the 
tests. The most serious defect from a mathe- 
matical point of view is the heavy dependence 
on number-series exercises. It can be argued 
that the student who is bright enough to real- 
ize that for each of these exercises one can 
construct a simple rule which will agree with 
the given terms and give any desired values 
for the missing terms will be bright enough to 
know what the "correct" answer is. That this 
is undoubtedly true seems scant justification 
for encouraging the fallacious belief that the 
following terms of a series are determined by 
the first five terms (or, for that matter, by the 
first 500). Number-series exercises have been 
used, of course, in many other tests, but it does 
not seem unreasonable to suggest that the time 
has come to eliminate this type of item from all 
tests and to replace it with items which are 
mathematically sound. 

Without additional validity data it is difficult 
to pass judgment as to the usefulness of this 
test. Form M is undoubtedly a better predictor 
of freshman grades than are Forms A and B 
and is easier to score. Since no precise infor- 
mation is given as to how much better a pre- 
dictor Form M is, it is hard to decide how well 
this improvement compensates for the fact that 
Form M cannot be administered in one class 
period. Those faced with the choice of a test 
to predict freshman mathematics grades will do 
well to try out both Forms A and M, the 
Cooperative Intermediate Algebra Test, the 
Towa Placement Examinations: Mathematics 
Training, and the test * prepared by Fred Rob- 
ertson before making a decision as to which 
one to use. 


Paul L. Dressel, Chairman, Board of Ex- 
aminers, Michigan State College, East Lansing, 
Michigan. The most recent form, Form M, 
differs from the earlier forms in increased 
length, in the use of multiple-choice rather 
than completion items, in separation of infor- 
mation and problems, and in discarding the use 
of items involving inequalities. The increased 
length (and perhaps some of the other changes) 
has increased the reported reliability from .878 
to .968. The replacement of write-in comple- 
tion items by multiple-choice items increases 
the objectivity of scoring. True-false items are 


* Robertson, Fred. “Some Phases of the Mathematics Testing 
Trogus at the Iowa State College." Math Teach 36:296-302 
43. 


still used, and the illogical policy of using a 
minimum score of zero on this section despite 
the use of the scoring formula R-W is still 
continued. Since Form M is a definite improve- 
ment over the earlier ones, further comment 
will be directed entirely at this most recent 
form. 

The test does not solve satisfactorily the 
major difficulties presented in construction of 
a mathematics aptitude test. The number-series 
test situation presents an impossible problem 
and one which any critical person would dis- 
dain to answer since any answer can be jus- 
tified. For Items 32, 33, and 35 of the same 
test and part, the answers which seemed most 
"obvious" to the reviewer were not even given. 
Furthermore, the section is unduly long and is 
apt to become very tedious. 

The reading material and questions of Part 
3 may not test reading ability at all. Some stu- 
dents could answer all the questions without 
reading the paragraphs. It should be possible 
to select or write material which would be un- 
familiar and to ask questions on it which could 
be answered only after reading the material 
presented. 

The information section, Part 4, is not en- 
tirely consistent with its name. Items involving 
skills, knowledge, and reading ability are in- 
discriminately mixed. One item, “The shortest 
path between two points on the surface of the 
earth is a straight line," is mis-keyed. Unless 
the path must also be on the surface of the 
earth (and this is not stated), the statement is 
true, but it is keyed as false. The item 1/a +- 
1/b = 1/(a + b) (Item 22, Part 4) is true 
if values are properly selected for a and b, and 
this may bother some students—especially 
when they are familiar with imaginary num- 
bers. The authors have used the unspecific term 
"side" to mean both edge and face. 

No exact indication of the predictive value 
of this test is given in the manual, but it is 
mentioned regarding the entire battery of the 
Iowa Placement Examinations that “These 
correlations ranged from .61 to .72." Consider- 
ing the working time required (75 minutes), 
even the top figure is not remarkable when one 
takes into account that high reliabilities and 
predictive coefficients are relatively easier to 
attain in mathematics than elsewhere. 

The Iowa mathematics aptitude test would 
seem to be more satisfactory than any other 
test of the type now available, but it does not 
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separate aptitude from training and it utilizes 
no really new approach to the aptitude problem. 


: [ 309 ] 
Iowa Placement Examinations: Mathematics 
"Training: Series MTi, Revised. Grades 12-13; 
1925; Forms A, B; $4 per 100; 50 per specimen set 
including the other 10 tests in the series, postpaid; 
40(45) minutes; G. D. Stoddard and E. W, Chitten- 
den; Bureau of Educational Research and Service, 
State University of Iowa. 
REFERENCES dues 
. Stoppard, Grorce Dinsmore. Towa Placement Examina- 
Hons. oe of Iowa, Studies in Education, Vol. 3, No. & 
»owa City, Iowa: the University, 1925. Pp. 103. Paper. $1.00."' 
2, MPG A. jane sean SC of USE Exami- 
d 24:619-20 N 13 26, i 
Pakanen, m. K “Analysis of an bte crues Tests." 
Psych 1303-1 '26. 1:49! A. 
Lt Bron, [A “Towa Placement Examinations.” 
Soc 24:2126 "26, * j 
SeA SEM TA. tA CODE DE of ‘Aptitude’ and ‘Train- 
ing’ Tests for Prognosis.” J Ed Psychol 19:658-65 D '28. * (PA 
sada ILLER, Lawrence WILLIAM. An Experimental Study of 
the Iowa Placement Examinations. University of Iowa Studies 
in Education, Vol. s, No. 6. Iowa City, Iowa: the University, 
1930, Pp. 116, Paper. $1.00. * (PA 5: 99) í 
7 ieee A, P. The Prediction of cholastic Achievement 
‘or Freshman Engineering Students at Purdue University. 
urdue University, Division of Educational Research, Studies 
in Engineering Education II. Lafayette, Ind.: the Division, 
May 1942. Pp. 22. Paper. $0.35. * (PA 16:5020) 
8. Jounson, A, P. The Relationship of Lest Scores to Scho- 
lastic Achievement for 255 Engineering Freshmen Entering 


Purdue University in September, 1939. Unpublished doctor’s 


thesis, Purdue University, 1942. 


Edmund P. Churchill, Assistant Professor of 
Mathematics, Antioch College, Yellow Springs, 
Ohio. This test, now more than twenty years 
old, consists of simple problems in arithmetic 
and elementary algebra and of questions of 
information from the field of geometry. The 
test is 40 minutes long; the questions are of 
the free-answer type except for those dealing 
with geometry, which are true-false questions. 

Neither the manual nor an examination of 
the test itself makes clear the basis on which 
the test was constructed. Since the questions 
involve little understanding of the methods of 
mathematics and only the most elementary 
techniques of algebra, the test can hardly serve 
as a real measure of the adequacy of a stu- 
dent’s high school training in mathematics, 
One may guess that the test was really in- 
tended as a predictor of marks in college 
freshman mathematics courses and that the 
use of “training” in the test title was intended 
primarily to indicate that the answering of the 
questions requires more information than is 
needed on the corresponding aptitude test. 

A three-page test manual, which covers ten 
other tests as well, gives norms for both forms 
and a reliability value of .88 for Form A. No 
data are given as to what groups were tested 
to obtain the norms and the reliability, nor is 
any statement made as to how the reliability 
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was computed. The information given regard- 
ing the tests’ usefulness as a predictor of fresh- 
man grades is meager and vague. The manual 
is not dated but appears to be almost, though 
not quite, so old as the test. 

Not all the questions are so carefully worded 
as they should be. The instructions for the true- 
false questions are faulty. Fractional scores 
are to be converted into the "nearest whole 
number," but the manual does not say what is 
to be done when the fraction is one-half, the 
only type of a fractional score which can occur 
on this test. Since the scoring key contains at 
least one error and does not list all the correct 
answers (there are 288 such answers to one 
question), it may be difficult to get this test 
properly scored by routine clerical workers. 

Despite the age of this test and its faults, it 
may have value as a means of detecting stu- 
dents who need additional training before they 
can hope to profit from college work in mathe- 
matics. It is unfortunate that the manual con- 
tains no data on this subject. 


Paul L. Dressel, Chairman, Board of Ex- 
aminers, Michigan State College, East Lansing, 
Michigan. The authors of these tests indicate 
a variety of uses including prediction of suc- 
cess, selection, sectioning, guidance, and diag- 
nosis of class weaknesses. In presenting norms 
in the manual, no information is given concern- 
ing the nature of the group on which the norms 
are based. This may imply greater applicability 
of the norms than is actually justified since it 
is certain that norms for students with varying 
amounts of high school, mathematics would be 
materially different. The reliability quoted by 
the authors without any specification of the 
nature of the group is only .88. 

The directions for giving the test are clear, 
those for scoring less so. While it is suggested 
that scoring should be supervised by one famil- 
iar with the subject matter, there is inadequate 
instruction on the matter of equivalent answers. 
This is further complicated by inconsistencies in 
the answer keys. Thus, on Form A, the answer 
to Item ro of Part 2 is given as (b + a)/ab or 
(a + b)/ab, but in Items 15 and 16 only one 
order is indicated. Numerous inconsistencies in 
the keying of corresponding items on the two 
forms are to be found. For example, Item 18 
of Part 2 of both forms calls for a similar re- 
sponse, but the answer key to Form A specifies 
72ab* while the Form B key allows either 
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(6) (11)a?b* or 66a?D5. For this type of exami- 
nation, in which the student writes in his own 
answer, either much more explicit keys should 
be made or more definite instructions given to 
the scorers to consider the possibility of equiva- 
lent answers. 

While Form A and Form B are supposedly 
equivalent, with corresponding items of the 
two forms calling for nearly identical opera- 
tions, there are several inexplicable cases of 
lack of parallelism. For example, Item 16, Part 
I of Form A is “What per cent of 25 is 75?" 
while Item 16, Part 1 of Form B is “What is 
the sum of 4% and —525?" 

Each form contains some items which are 
either misleading or inexact. The term “aver- 
age" is used rather than a specific type of av- 
erage. Items such as 13 on Part 1 of both 
forms would be very confusing to a student 
who had been instructed in the matter of sig- 
nificant figures. Item 11 on Part 3 of Form A 
should read “any” instead of “a.” Item 11 on 
Part 4 of Form B requires the student to as- 
sume “a” as the ten's digit and “b” as the 
unit's digit. Actually as it is stated, “b” can 
be any of the numbers 1 through 9 rather than 
1 only as indicated on the key. Other items are 
subject to similar criticism. In scoring Part 3 
of both forms, the scorer is instructed that the 
minimum score must be o. This means that a 
student who gets 20 answers right and 20 
wrong on this section and the one who gets 
40 wrong (unlikely, but possible) get the same 
score. This is illogical and seemingly results 
from a feeling that the score on this section 
should not detract from scores made on other 
sections. If so, it is based on an erroneous 
concept of assumptions implicit in adding such 
Scores. 

These tests represent relatively early and al- 
most obsolete ventures in the field of objective 
testing. It is the opinion of this reviewer that 
any college mathematics department desiring a 
mathematics achievement test of this type could 
construct one which would be more satisfac- 
tory. 

[ 310 ] 
*Junior High School Mathematics Test: Acorn 
Achievement Tests. Grades 7-9; 1942; Forms A, B; 


$1.75 per 25; 15¢ per specimen set; 52(60) minutes; 
Harry Eisner; Acorn Publishing Co. 


William Betz, Specialist in Mathematics, 
Public Schools, Rochester, New York. The 
test consists of three sections: 1, Concepts; 


2, Problem Analysis; and 3, Problems. Section 1 
contains 21 items dealing with measurement, 
scale drawing, taxation, the equation, evalua- 


tion, signed numbers, symbolism, perimeters, 


similarity, graphs, ratio, checking, metric units, 
symmetry, congruence, trigonometric ratios, 
bonds, installment buying. Section II offers 15 
problem situations pertaining to motion, mix- 
tures, mensuration, buying wisely, percentage, 
commission, installment buying, the hypotenuse 
rule, and trigonometry. Section III contains 26 
“problems.” Of these, 6 are nonverbal or ab- 
stract, and 8 are based on graphic or geometric 
diagrams. The abstract exercises involve the 
solution of a simple equation, of a pair of simul- 
taneous equations, of a fractional equation, and 
of a quadratic equation (all of the most elemen- 
tary types), as well as the multiplication of a 
binomial by a binomial, and the division of one 
algebraic fraction by another (monomial de- 
nominators). There are two very simple geo- 
metric constructions. Of the bona fide verbal 
problems all but one are purely numerical or 
arithmetical. Algebraic verbal problems are ex- 
cluded. Briefly, the test represents a sampling 
of conventional materials of instruction. It may 
be admitted that within a total time allowance 
of 52 minutes it would be difficult to cover a 
larger territory. 

It remains to ask whether the test really 
“reflects” an up-to-date program in junior high 
school mathematics. In the light of such docu- 
ments as the Joint Commission Report of 1940 
and of the Second Report of the Commission 
on Post-War Plans, the answer cannot be affir- 
mative, and the following criticisms seem war- 
ranted: (a) It it impossible to compress within 
a range of 62 items the basic concepts, principles, 
skills, facts, modes of thinking, and types of 
understanding, which we associate today with 
“the four main divisions” of junior high school 
mathematics, namely, arithmetic, geometry, 
algebra, and numerical trigonometry. (b) The 
field of arithmetical computation certainly de- 
mands a far more extensive and thorough sam- 
pling. (c) A variety of interesting devices can 
and should be used in the testing of concepts. 
(d)A clear distinction should be made between 
manipulative techniques or mechanical skills 
and bona fide problem situations. (e) Within a 
period of less than an hour the average pupil 
cannot cope with 62 situations stated so largely 
in verbal form. The task of reading alone will 
cause many low scores; that is, the form of the 
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test militates against its reliability as a mathe- 
matical testing instrument. (f) In spite of a 
very vocal dogmatism in favor of multiple- 
Neu techniques and of machine scoring, teach- 
“ers of mathematics rightly maintain a robust 
Skepticism as to the significance of this type of 
testing procedure, at least for the purposes of 
mathematical diagnosis and evaluation. 


[311] 
Kentucky Mathematics Test. Grades 12-13; 1945; 
Form KM-45; 8é per test including scoring service; 
# per test without scoring service; 50 (55) minutes ; 
E J. Asher; Kentucky Cooperative Testing Service, 
University of Kentucky. 

[312] 
X*Mathematics for High School Seniors: A 
Screening Test in Basic Skills. Grade 12; 1946; 
Forms A, B; 3¢ per test; 45(50) minutes; H. C. 
Christofferson and others; Ohio Scholarship Tests, 
Ohio State Department of Education. 

[313 ] 
Mathematics Test: Ohio Senior Survey Tests. 
Grade 12; 1935-37 ; Forms A, B; 3¢ per test; 90(100) 
minutes; R. W. Tyler and F. P. Frutchey (Form B); 
Ohio Scholarship Tests, Ohio State Department of 
Education, 


William G. Mollenkopf, National Research 
Council Fellow, Department of Psychology, 
Princeton University, Princeton, New Jersey. 
The manual accompanying these tests declares 
that they “will be of use . . . for location of 

a students deficient as regards minimal essentials 
along the lines covered . . . [and] for the diag- 
nosis of specific student weaknesses." There are, 
however, only 41 items divided among common 
fractions (6), decimal fractions (8), percent- 
age (6), measuring (6), and elementary algebra 
(15). Since it is generally accepted that a meas- 
ure intended for individual diagnosis should 
contain many items on each of many topics, 
it is the opinion of the reviewer that the parts 
are much too short to possess reliabilities ade- 
quate for the diagnosis of individual difficulties. 
It is to be seriously doubted whether the specific 
troubles of a particular student will be accu- 
rately revealed by use of these tests. Though 
the manual also describes these tests as “Senior 
survey tests,” they contain no items in any 
course higher than elementary algebra. This 
would seem to limit their usefulness as a sur- 
vey measure, 

The format of the tests could be much im- 
proved if Tinker and Paterson’s findings on the 
readability of type were followed, The exercises 
of Part I are presented in such small type that 
the fractions are very difficult to read. Form B 


uses even smaller type than Form A, in which 
the type size is considered already too small. 
In Items 3 and 4 of Part I, Form B, the 
smudged, tiny fractions can hardly be made 
out. Reading Item 3 of Part III, Form B, was 
impossible. 

The printed key for Form A apparently con- 
tains a misprint. The answer for Item 2, Part V 
is given as —120?; it should be —12c?. Neither 
the key for Form A nor that for Form B lists 
some alternative answers which might well be 
given for a few of the items in the algebra sec- 
tion. Also, in several items the precision toward 
which the student should work in his answer is 
not indicated. 

No data are presented in the manual concern- 
ing the reliability either of the whole or of parts 
of the tests, nor are correlations with any cri- 
terion presented as validity data. Since the 
manual suggests that Form A be administered 
before remedial instruction starts, and Form B 
after it is completed, it would be desirable to 
have data concerning the comparabilities of the 
scores on the two forms and on corresponding 
parts of these forms, so as to permit accurate 
assessment of student improvement. However, 
no data of this type are provided. 


[314 ] 
XPurdue Industrial Mathematics Test: Purdue 
Personnel Tests. Adults; 1946; preliminary manual ; 
Forms A, B; $2.50 per 25; 25¢ per specimen set, post- 
paid; 25(30) minutes; edited by Joseph Tiffin; C. H. 
wshe, Jr. and Dennis H. Price; Division of Ap- 
plied Psychology, Purdue University. 


Clyde H. Coombs, Assistant Professor of 
Psychology and Chief of the Research Division, 
Bureau of Psychological Services, Institute for 
Human Adjustment, University of Michigan, 
Ann Arbor, Michigan. This test is designed to 
measure achievement in simple and basic arith- 
metic and computational skills required of men 
in the various skilled trades and also of men 
on such jobs as setup, layout, maintenance, and 
design. The test may also be used in apprentice- 
ship and formal trade school training programs. 

This test is in two forms and has 32 items in 
each, covering elementary arithmetic, trigonom- 
etry, algebra, and geometry. The items are ob- 
jective with four alternatives. According to one 
of the authors, “The original body of items was 
constructed on the basis of an extensive analy- 
Sis of the mathematical operations and kinds of 
problems that appeared in a half dozen different 
Shop mathematics textbooks. Secondly, a ques- 


409 ] TESTS & REVIEWS: MATHEMATICS 315 


tionnaire was submitted to a number of trade 
schools, and fifteen different schools reported 
on such matters as what proportion of the shop 
mathematics course was devoted to each of these 
areas.” 

An experimental form of 100 items was ad- 
ministered to 200 trade school students. This 
provided the data for an item analysis and the 
selection of the items for the two forms, The 
items in the two forms are matched as to diffi- 
culty, discrimination, and mathematical opera- 
tions. It is not clear whether the criterion for 
the discriminative index for the items was total 
test score or an external criterion obtained from 
the trade school situation. 

Only one study has been made of the reliabil- 
ity of the test. The two forms were administered 
to 125 Army Specialized Training Corps stu- 
dents in personnel psychology and showed a cor- 
relation of .82 between Forms A and B. This 
would be a rather low reliability except that the 
sample was restricted in range of talent. It is 
probable that the reliability will be higher when 
the test is used with the type of population for 
which it is designed. 

The appearance of the test is good and the 
directions curt and clear. The number of items 
on any one mathematical operation are too few 
to be of diagnostic value. 

The approach followed here of building a test 
on the basis of the content of textbooks and on 
what trade schools are teaching leads in the long 
run to a form of inbreeding. Courses, texts, and 
tests will gradually all tend to become more 
technical and academic. The ideal approach 
would be from the direction of job analysis. 
What kind of mathematics is being used by 
industrial workers in various sorts of jobs? On 
the basis of such inquiry, write texts and tests. 
The courses will follow suit. 


C. C. Upshall, Psychologist, Industrial Rela- 
tions Department, Eastman Kodak Company, 
Rochester, New York. This test contains 32 
questions designed to measure the basic compu- 
tational skills required of men in various skilled 
trades. A coefficient of correlation of .82 be- 
tween Forms A and B is reported in the pre- 
liminary manual with the comment that a higher 
coefficient might be expected when a more 
heterogeneous group such as is found in indus- 
try is used. The multiple-choice form of ques- 
tion is used, with four choices offered for each 
question, A cut-out stencil permits rapid scoring 


by counting the number of X’s that show 
through the holes. 
The questions have been carefully selected so 


that each is typical of the kind of computation 


that must be made by men in one or more of the 
skilled trades. Some of the questions are so 
worded that they are clearly related to some 
phase of an industrial job. The test would gain 
in acceptance by industrial workers if more ques- 
tions were so worded, The wide variety of com- 
putation, including such diverse elements as 
questions on discounts, sizes of angles in varied 
geometric forms, volumes of various containers, 
algebraic equations, properties of circles, and 
speed of gears, assures that some of the ques- 
tions will be applicable to almost any skilled in- 
dustrial job. However, such variety has the 
defect that many of the questions may not be 
applicable to the particular job for which the 
employee is being hired or trained. Conse- 
quently, the score on the test may not reflect 
very accurately the knowledge of mathematics 
required for a particular job. The score is more 
likely to show the extent of training that an 
individual has had. 

In industrial situations where knowledge of 
mathematics is essential, this test should prove 
a useful addition to the measuring instruments 
now available. Its greatest value may be found 
as a pretest in apprenticeship courses for such 
jobs as setup, layout, and design to find out the 
level of mathematics achievement. The results 
should be of some value in indicating areas 
where the group needs additional instruction. 
If both forms are given, useful suggestions for 
individual instruction may be obtained. 


[ 315 ] 
*Rasmussen General Mathematics Test. High 
school and college; 1942; 90¢ per 25, postpaid; 15¢ per 
specimen set, postpaid; 40(45) minutes; Otho M. 
Rasmussen and H. E. Schrammel ; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia. 


REFERENCES 


1. Rasmussen, Orno Mitts. A Standardized Test in Plane 
Trigonometry With a Brief Historical Introduction, Unpub- 
lished master’s thesis, Kansas State Teachers College, Emporia, 


1940. Pp. 58. 

William G. Mollenkopf, National Research 
Council Fellow, Department of Psychology, 
Princeton University, Princeton, New Jersey. 
Each of the two forms of this test consists of 
82 items. Of these, 35 are true-false, I9 are 
multiple choice, and the remainder are prob- 
lems requiring an answer to be written in. Pre- 
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dominantly the items deal with arithmetic and 
the simpler concepts of algebra and geometry 
which are included in the typical one-year math- 
ematics course given for noncollege preparatory 
students. This test would not be suitable for use 
as a general survey test in mathematics to be 
used at the end of the senior year in high school, 
for it covers very little of the material commonly 
included in courses in algebra, geometry, and 
trigonometry. The test does, however, appear 
to be suitable for use with students in a single- 
year terminal course in mathematics for students 
not intending to enter college. The manual states 
that the content was based on “a number of 
leading textbooks,” that the criticisms of teach- 
ers were obtained, and that results on prelim- 
inary editions were considered in building the 
forms. No further information is given as to 
the construction and selection of the items. 

The reliability of Form B is given as .86 for 
a group of high school students (N not stated), 
using the split-half technique. Since 92 items 
are to be answered in 40 minutes, the scores 
on the test may be influenced considerably by 
speed. If so, the stated reliability is spuriously 
high. Since there are two forms, it would seem 
desirable to have the correlation between them 
available, for this correlation would provide 
another indication of the reliability of the forms. 

A table of percentile scores is provided in the 
manual. These are based on the performances of 
477 high school students. Since only one table is 
provided, one must assume that the two forms, 
A and B, have practically identical score distri- 
butions, This, if true, would be a rare event in 
test construction. No evidence of such com- 
parability in the score distributions is, however, 
presented, 

While the test is, in general, well printed and 
readable, a somewhat larger type size and more 
leading between lines would improve legibility. 
One error was discovered in each form; in 
Form A, there is a typographical error in Item 
50, and in Form B, Item 79 uses “square” 
where “cubic” is the correct term. 
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Test of General Proficiency in the Field of 
Mathematics: Cooperative General Achievement 
Tests, Revised Series, Test 3. Grades 10-12 and 
college entrants ; 1942-47; IBM; Forms S, T, X; sepa- 
rate answer sheets need not be used; $2 per 25; 25¢ 
Der specimen set, postpaid; 4o¢ per 25 machine-scor- 
able answer sheets; I5é per stencil for scoring answer 
sheets; 40(45) minutes; Form S: Emma Spaney and 
Charlotte W, Croon; Form T: Margaret Martin; 
Form X: Paul J. Burke; Cooperative Test Service. 


John F. Randolph, Professor of M. athematics, 
Oberlin College, Oberlin, Ohio. [Review of 
Forms S and T.] REVIEW oF FORM s. No wonder 
mathematics is hard for students and no wonder 
they hate it. The student is supposed to parrot 
that “A is a function of B" means “a change in 
B produces a change in A." If this same student 
were unable to draw the graph of y = f(x) 
where f(x) — 2, the teacher would consider 
him hopeless. The examination asks the student 
to decide what is most appropriate, usually 
happens, most nearly corresponds to something, 
remains fairly constant, or probably happens 
but gives no criteria for determining such ex- 
trema. Such unheard of terms as "volume of 
friction" and "percentage of braking" (in con- 
nection with applying the brakes of a car) are 
used. If the description under the heading *Cal- 
culating Cop Rewarded" were given as an exam- 
ple of poor English, inaccurate statement, and 
commentary on the low intelligence of our police 
force, well and good, but please do not foist 
such drivel on students as mathematics. The only 
saving grace is that the “cop” comes at the end 
of the examination where only the best students 
will meet him. 

The examination is, however, not as bad as 
the above paragraph might indicate. The ques- 
tions in Part I, to be sure, seem too easy, but 
the problems about the flaked corn pie crust, 
budgets, and rates for sending money by various 
means are well chosen. Geometric intuition is 
adequately tested by problems on paper folding 
and cutting. Also, data displayed by bar graphs 
must be interpreted in the problems about acci- 
dents caused by drivers or pedestrians. 

REVIEW OF FORM T. This examination really 
tests not only algebraic manipulation and a little 
trigonometry, but familiarity with graphs (cir- 
cular, bar, and line), with tables (insurance and 
cash loan), with cameras, and with bird houses. 
The student must even use his judgment (which 
may be unfair) and extrapolate in one case. 

On a plane rectangular coordinate system it 
would be preferable to have the unit point on 
the X-axis given as (1,0) rather than as r. 
Even though textbooks speak of “The mid-point 
of the line joining A and B" wouldn't it be 
preferable to refer to the "line segment" join- 
ing A and B? Question 22 reads "Two lines 
intersect at right angles at a point on the circum- 
ference of a circle. If one of them is a radius of 
the circle, etc.” but might better read “Two 
lines intersect at a point on a circle. If one of 
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them passes through the center of the circle, 
etc." 

Question 27 is: "The ratio of the circum- 
ference of a circle to its diameter is (1) a con- 
stant (2) a dependent variable (3) an independ- 
ent variable (4) an unknown (5) an integer." 
The correct answer is supposed to be 1. Why 
not 2? Certainly the function (dependent vari- 
able) f defined by f(x) = x for x positive is 
a good function. Why not 4? The value of x is 
unknown. Of course if a student knew that 2 
and 4 are correct he would know that 1 is the 
expected answer, so no harm is done. 

The scaled scores on this examination should 
correlate with the degree of the students' knowl- 
edge and skill. 


For a review by John V. McQuitty of the 
entire battery, see 3. 


[317 ] 
XTest of Mathematical Fundamentals for Grades 
7-12, 1944; 1 form; 65¢ per 25; 15é per specimen set; 
specimen sets must be purchased to obtain the manual ; 
25(30) minutes; H. R. Beattie; Vocational Guidance 
Centre, University of Toronto. 
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*Ability for Algebra: Fife Tests of Ability, Test 
3. Entrants to secondary schools (England and Scot- 
land) ; 1947; 1 form; 5s. 6d. per 25; 4d. per manual ; 
60(65) minutes; Frank M. Earle; University of Lon- 
don Press Ltd. 
REFERENCES , 

1. EanLE, Frank M. Tests of Ability for Secondary School 
Courses. Publications of the Scottish Council for Research in 
prado At 19. London: University of London Press, 1936. 

. xiii, 138. 5s, 

2. EARLE, Franx M. Reconstruction in the Secondary School. 

ndon: University of London Press, 1944. Pp. 188. 8s. 6d.* 


For a review by James Maxwell of the 
entire battery, see 8. 
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*Algebra Readiness Test. Grades 8-9; 1947; 1 
form; $1.80 per 25; 18¢ per manual; 2¢ per class rec- 
ord sheet; 18¢ per key; 25¢ per specimen set, post- 
paid; 26(30) minutes; William R. Lueck; Public 
School Publishing Co. 
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XCalifornia Algebra Aptitude Test. High school; 
1940; 1 form; $1.25 per 25; 35¢ per specimen set, post- 
paid; 50(55) minutes; Noel Keys and Muriel Mc- 
Crum; Educational Test Bureau. 


David Segel, Specialist in Tests and Measure- 
ments, U. S. Office of Education, Washington, 
D. C. This test consists of three parts : (a) arith- 
metic problems, (5) the understanding of for- 


mulas, and (c) a number series. The authors 
report reliability coefficients of .95 and .96. 
Validity coefficients obtained by finding correla- 
tion coefficients between the aptitude test scores 
and scores on the Columbia Research Bureau 
Algebra Test at the end of the year for ten 
groups of students (totaling 349 students) in 
several different schools ranged from .54 to .85 
and averaged .76 through weighting each in- 
versely as the square of its probable error. This 
validity is the same as that reported for the 
Iowa Algebra Aptitude Test. Since these two 
aptitude tests have approximately the same 
validity, and the materials are much alike, the 
tests probably measure the same aptitudes. 

The validity coefficient indicates that the test 
results can aid materially in determining whether 
individuals should spend time on regular algebra 
and in classifying pupils in algebra for instruc- 
tional purposes. Norms are available on 2,079 
students tested in 20 communities in three states. 
(For limitations in the use of algebra aptitude 
tests, see the last paragraph of the writer's re- 
view of the Jowa Algebra Aptitude Test [327] ). 


[321 ] 


Cooperative Algebra Test: Elementary Algebra 
Through Quadratics, Revised Series. High school ; 
1937-43; IBM; Forms R, S, T ; separate answer sheets 
need not be used; $1.75 per 25; 25€ per specimen set, 
postpaid; 4o¢ per 25 machine-scorable answer sheets; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Form S: Leone E. Chesire, Margaret P. Mar- 
tin, and L. P. Siceloff; Form T: Margaret P. Martin 
and Marion F. Shaycoit; Cooperative Test Service. 


REFERENCES 


1. Lee, Hagorp L, A Statistical Analysis of an Elementary 
Algebra Test. Unpublished master's thesis, Ohio University, 
1940. Pp. 60. (Abstracts of Masters’ Theses . . ., 1941, p. 30.) 


W. C. Brenke, Professor of Mathematics, 
Emeritus, The University of Nebraska, Lincoln, . 
Nebraska. [Review of Forms R and S.] FORM R. 
The scope of this test as indicated by the sub- 
title, Algebra Through Quadratics, is not borne 
out by the questions. There is no trace of quad- 
ratics except the very special case, x? = a. The 
total number of questions is 63, and the time 
allowed is too short to yield a reasonable norm. 
Moreover, a substantial part of the score would 
be due to very easy or trivial questions. To cor- 
rect this situation I would reduce the number 
of questions to 45 by making changes such as 
those suggested below. There should also be 
some use of decimal numbers in connection with 
equations and formulas. In general, the ques- 
tions relating to the barest rudiments in a test 
covering the work of one year, let us say, are 
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superfluous, since they are automatically covered 
by later questions and only serve to pad the 


. Score. 


Part I should be reduced to 15 questions. 
Omit Items 1 and 2 which are trivial; omit 
either Item 7 or 8; cover Items 10 and 12 by 
one question somewhat like Item 12; omit Items 
17 and 20 as sufficiently covered by Part III. 
Incidentally, microscopic type like that in Item 
20 should never be used. 

Part II should be reduced to ro questions. 
"This part contains certain types of graphs and 
13 "word problems." I believe that 8 such prob- 
lems would suffice to test the ability of pupils 
to handle this material. I would omit Items 1 
and 2 which are trivial, and Item 8 which takes 
too long to read and to get orientation, and Items 
9 and ro, the material of which is covered in 
other questions. 

Part III should be reduced to 15 questions. 
Omit Items 1 and 2 which are trivial ; omit Item 
4 (covered by Item 12 of Part I); combine 
Items 13 and 15 into d/x = x/4; simplify the 
figure used for Items 20 to 24'by omitting line 
AA’ (badly drawn) and line BB’, and base 
three or four questions on the other three lines, 

ForM s. This test is superior in several re- 
spects to Form R. The total of 63 questions is 
too much for the time allowed and would result 
in an unreasonably low norm. Too much of the 
score would be due to very easy or trivial ques- 
tions which contribute nothing to the desired 
testing of skills and abilities. It is better to have 
fewer questions so formulated that each one 
makes an effective contribution to the test. I 
would reduce this test to 45 questions somewhat 
as indicated below. 

Part I should be reduced to 15 questions, Omit 
Items 1 and 2 which are trivial; omit Item 1o 
which is covered by Item 22 ; omit Item 13 ; omit 
Item 15 which is covered by Item 18 in Part I 
and 14 in Part III ; omit Item 17 which could be 
included in Item 20, Part III. 7 

Part II should be reduced to 10 questions, 
There are more word problems than necessary. 
I would omit Items 1, 3, 6, 8, and 10. The re- 
maining questions are sufficient to test abilities 
along this line and would well serve that pur- 
pose. The phrasing of Item 8 would be improved 
by replacing the words “each denoted by x” by 
the less vague phrase “each containing x de- 
grees.” 

Part III should be reduced to 1 5 questions. 
Omit Items 1, 2, 3, 5, 7, 8, and 12 which are 


sufficiently covered in Part I. Another question 
on quadratics should be included. 


[ 322 ] 
Cooperative Intermediate Algebra Test: Quad- 
fatica and Beyond. High school; 1941-43; IBM; 
Forms R, S, T; separate answer sheets need not be 
used ; $1.75 per 25; 25¢ per specimen set, postpaid ; 40 
per 25 machine-scorable answer sheets; 156 per stencil 
for scoring answer sheets; 40(45) minutes; Forms R 
and S: John A. Long, L. P. Siceloff, and Leone E. 
Cheshire; Form T: Margaret P. Martin and Marion 
F. Shaycoft; Cooperative Test Service. 
REFERENCES 


1. Jounson, A. P, The Prediction of Scholastic Achievement 
for REND Engineering Students at Purdue University. 
Purdue University, Division of Educational Research, Studies 
in Engineering Education II, Lafayette, Ind.: the Division, 
May 1942. Pp. 22. Paper. $0.35. * (PA 16:5020) 

2. Jounson, A. P. The Relationship of Test Scores to Scho- 
lastic Achievement for 255 Engineering Freshmen Entering 
Purdue University in September, 1939. Unpublished doctor's 
thesis, Purdue University, 1942. 


L. B. Plumlee, Head of Mathematics Section, 
Test Construction Department, College En- 
trance Examination Board, Princeton, New 
Jersey. Parts I and III of Forms R and S and 
Part I of Form T consist of formal algebraic 
problems. Part II of Forms R, S, and T con- 
sists of verbally stated problems. The majority 
of items, both formal and verbal, are typical of 
those found in algebra textbooks and test mem- 
ory of certain concepts and processes, rather 
than the use of these concepts and processes in 
solving problems with novel settings. The scores 
should indicate which students have learned the 
subject matter most thoroughly, but are less 
likely to indicate those superior students who 
will be able to apply their knowledge in ad- 
vanced fields of learning. There is considerable 
variety in the subject matter covered. The prob- 
lems are in multiple-choice form (five options). 

The timing of Forms R and S is such that 
speed is probably important. Part I of Form T 
seems adequately timed, but 15 minutes for Part 
II (23 verbally stated problems) seem inade- 
quate. Separate statistics for the two parts are 
not furnished, but a total raw mean of only 
20 out of a possible 68 and the ggth percentile 
point at a raw score of 57 indicate that the test 
is either too long or too difficult, since persons 
who answer all 68 questions by sheer guess- 
work would be expected to obtain, on the aver- 
age, a raw score of 14. The raw mean of Form S 
was 16 out of a possible 56, indicating that this 
form, too, is either too long or too difficult, The 
difficulty of Form R, with a raw mean of 21 out 
of 56, is somewhat more appropriate. 

There are no serious ambiguities in item 
wording, although the length of some of the 
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items in Part II, Form T, has been unnecessarily 
increased by the inclusion of irrelevant data. 
One or two items in Forms S and T give an 
advantage to the science student. 

The use of boldface type for numbers and 
other values in the problems is good; spacing 
and exponent sizes are poor in many instances, 
however. 

Directions for using the test are complete and 
clearly written, covering administration, scor- 
ing, tabulation, use of norms, and interpreta- 
tion of results. The instructions for scoring 
cover the use of machine-scored answer sheets. 
Suitable scoring keys are provided for use when 
answers are recorded in the test booklets. Ade- 
quate norms are supplied, including percentile 
distributions, means, and standard deviations 
for 4,000 12-grade-system students and for 
2,000 I1-grade-system students. Scaled score 
equivalents for all raw scores are given as a 
basis for comparing scores from test to test. No 
reliability or validity statistics are given. 
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*Examination in College Algebra. College; 1944; 
IBM; Form CAt-2-B-4 (usually called Form B); 
separate answer sheets must be used; $2 per 25; 40€ 
per 25 machine-scorable answer sheets; 154 per scor- 
ing key; 256 per specimen set, postpaid; 180(190) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed b 
Cooperative Test Service. (Also distributed by Sci- 
ence Research Associates: $2 per 25; 65¢ per 25 ma- 
chine-scorable answer sheets; so per key; 50¢ per 
specimen set.) 
REFERENCES 
1. Zant, James H. “The United States Armed Forces Insti- 


tute Examinations in Mathematics as a Means of Determin- 
ing Credit in Engineering Mathematics.” J Eng Ed 36:170-4 
48. 


Albert A. Bennett, Professor of Mathematics, 
Brown University, Providence, Rhode Island. 
This is an objective test of 70 questions designed 
for a 3-hour period. Each question has exactly 
one correct answer among five printed alterna- 
tives. In many cases, after listing four possibil- 
ities, the fifth reads, “None of these answers." 
The test is technically excellent. The questions 
in nearly every case call for thought and care, 
but none is unduly laborious and none demands 
special power or ingenuity. The wording is 
meticulously unambiguous and free from nota- 
tional provincialism. Only one error (the omis- 
sion of a minus sign in the figure of Item 58) 
was-found. A student only hazily familiar with 
a topic will find little sense of security in at- 
tempting to guess the correct answer tc a ques- 


tion by shrewdly inspecting the alternative pub- 
lished solutions, since each alternative will 
usually appear superficially plausible. The one- 
sheet IBM form, available for machine scoring, 
and the accompanying stencil make grading 
whether by machine or by hand a purely routine 
matter. The manual for scoring is a model of 
effective writing. A table of percentile norms 
based upon 33 or more schools accompanies the 
manual and certifies to the efficient spreading of 
scores in the cases tested. The median number of 
questions correctly answered is 27. 

The specific objectives aimed at in this par- 
ticular test are not announced here, but the 
general criteria adopted in constructing the test 
are clearly explained in the manual. The ability 
of the student to do definite things in several 
carefully limited fields is to be measured. One 
observes that superficial wide acquaintance with 
vocabulary or theory, ingenuity of high order, 
persistence in the face of complications, power 
to translate nonverbal statements into mathe- 
matical notation, recognition of analogues or 
symmetries, historical perspective—all of these 
qualities, whether good or bad, are scarcely 
touched upon here. Even on a prosaic level of 
computational proficiency the narrow scope of 
this test will prove a disappointment to many 
mathematicians despite its proclaimed sponsor- 
ship. While "algebra" is fundamental to all 
further work in mathematics, what is needed 
in later courses is seldom emphasis upon the 
details in the elementary theory of equations. 
This test practically ignores numerical approxi- 
mations, functional notation, logarithms, postu- 
lated properties of various number systems, 
identities, fitting a linear formula to approxi- 
mately linear data, properties of the binomial 
coefficients and of symmetric functions, the 
polar form of complex numbers, and elemen- 
tary statistical concepts. 


Edmund P. Churchill, Assistant Professor of 
Mathematics, Antioch College, Yellow Springs, 
Ohio. This test covers, in 70 multiple-choice 
questions, most of the standard topics in col- 
lege algebra: roots and exponents, linear and 
quadratic equations, the binomial theorem, pro- 
gressions, variation, complex numbers, elemen- 
tary theory of equations, determinants, permu- 
tations and combinations, probability, and some 
related subjects. By and large, the choice of 
questions is such as to constitute a fair sam- 
pling of the material which is covered in the 
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average college algebra course and which can 
be satisfactorily treated by multiple-choice 
questions. The most obvious omissions are 
logarithms and exponential functions. Undue 
emphasis, on the other hand, is placed on pro- 
gressions, about which five questions are asked, 
while homogeneous equations, DeMoivre's the- 
orem, and other topics of almost equal impor- 
tance are ignored. Simplifications, factoring, 
exponents, and radicals, which are often covered 
as review topics in the first chapter of the 
college algebra text, account for almost one- 
sixth of the questions on this test. 

The questions and alternative answers are 
well constructed and are, in general, so phrased 
as to make working the problem the simplest 
way of finding its solution. Graphs are satis- 
factorily used as the basis of one-tenth of the 
questions. In only one case is the solution of 
a problem tied to that of another, and in only 
one case does the solution of a problem de- 
pend on the knowledge of a principle from 
outside the field of mathematics. The authors 
state that critics nominated by the American 
Mathematical Society have checked the test 
for coverage, for accuracy, and for validity of 
exercises. 

Tf this test is judged on the basis of a care- 
ful study of its contents, credit being given 
for the use made in its construction of sound 
expert advice, there is no question—this is an 
excellent test. It is thus particularly unfor- 
tunate that the authors have provided neither 
statistical data on the test's reliability and valid- 
ity nor suitable norms. The norms which are 
supplied seem worthless as almost no informa- 
tion is given about the group on which they 
were obtained and what information is given 
suggests that the group was not ideal for this 
purpose. 

The test manual recommends that each col- 
lege establish local norms and make its own 
investigation of the test's validity. While this 
procedure is in many ways a desirable one, 
doing the job properly involves so much time, 
labor, and money that it would be feasible only 
as part of an extensive testing program. The 
test would be much more useful to the average 
test user had the authors, instead of arguing 
the inadequacy of a single set of norms, pre- 
pared several sets based on schools of varied 
Scholastic standards and differentiating between 
liberal arts and science-engineering students, 


324 

XExamination in ER Algebra—High- 
School Level Grades 9-10; 1944; IBM; Form 
SErA1-2-B-3 (usually called Form B) ; separate an- 
swer sheets must be used; $2 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
256 per specimen set, postpaid ; 120(125) minutes ; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates: $2 per 25; 65¢ per machine-scorable answer 
sheets; 50¢ per key; 50¢ per specimen set.) 

Richard M. Drake, Professor of Education, 
and Assistant Dean, College of Arts and Sci- 
ences, The University of Buffalo, Buffalo, New 
York. The purpose of this test is to' measure 
achievement in elementary algebra. This it 
should do very well. There is ample evidence 
that the test was very carefully constructed and 
that much research was involved in selecting the 
test items. 

The test manual contains much information 
relative to the use and interpretation of the 
results together with intelligently thought-out 
and unbiased limitations. The directions for 
administering are not explicit for this particu- 
lar test. However, the test is practically self- 
administering. The norms are said to be tenta- 
tive. They are based on 748 to 766 cases from 
I9 schools. In view of the relatively small num- 
ber of cases and the lack of evidence concern- 
ing the schools, this would appear to be the 
chief weakness of the test. Probably by the 
time this book appears in print this limitation 
will have been remedied, 

The test is divided into sections devoted to 
the topics commonly covered in first-year alge- 
bra. Norms are given for each section as well 
as the entire test. Thus, the test results can be 
used as a basis for remedial teaching, Two 
hours are allowed for taking the test. This ap- 
pears to be more than enough time for the 
average or better student. However, the ex- 
tended time and the fact that the test must be 
taken at one sitting do present an administra- 
tive problem. 

The test is composed of 53 items of the 
multiple-choice type. The pupil has a choice of 
six responses which is a larger number than is 
usually found. In all cases one of the possible 
choices is “None of these answers.” The dis- 
tractors are well chosen. 

The reviewer believes that this is an excel- 
lent test. When norms based on a more exten- 
sive sampling are available, it should prove to 
be very valuable either as an end-of-the-year 
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achievement examination or as a placement 
examination. 


John A. Long, Director of the Department of 
Educational Research, University of Toronto, 
Toronto, Canada. This test is the civilian form 
of the test in elementary algebra prepared by 
the United States Armed Forces Institute. The 
general manual for the Institute’s subject tests 
informs us that these tests were prepared by 
experienced specialists working in cooperation 
with subject teachers. They were prepared with 
clearly defined objectives in view. A prelimi- 
nary trial run permitted the elimination of un- 
satisfactory items, and the resulting examina- 
tion forms were submitted to one or more 
critics nominated by professional organizations. 
In the case of mathematics, the material appar- 
ently had to satisfy critics nominated by the 
American Mathematical Society. This appraisal 
led to the final edition of the test in two forms, 
one for use by the Armed Forces Institute and 
the other for use in schools and colleges. 
Norms, in the test under review, are based on 
roughly 750 students in 19 schools and are re- 
garded as tentative. The manual points out that 
the Armed Forces Institute would be greatly 
disturbed if the use of these norms resulted in 
the imposition of Institute standards on edu- 
cational institutions. Users of the tests are 
advised to set up their own local percentile 
norms, and the manual gives directions for car- 
rying this out. 

The examination in elementary algebra is 
multiple choice throughout. It is made up al- 
most equally of 5- and 6-response items. It is 
broken into five sections of 10, 12, 11, 10, and 
10 items each. Section I has to do mainly with 
the four fundamental operations; Section II, 
with the manipulation of formulas and the so- 
lution of simultaneous linear equations of two 
unknowns; Section III, with graphs and their 
interpretation; Section IV, with verbal prob- 
lems; Section V, with surds and quadratic 
equations, Percentile norms are given for indi- 
vidual subtests and for total scores. Since the 
subtests are not timed, one might question 
whether the subtest norms are comparable or 
valid; but since students are allowed 2 hours 
for the 53 items, it is probable that all but the 
most incompetent will have plenty of time to 
complete everything that lies within their 
power. 

The examination is devised both for ma- 


chine and manual scoring. In either case sepa- 
rate answer sheets are provided. Where the 
scoring is to be done by hand, it is facilitated 
by a perforated stencil. 

Attacks on the theory of formal discipline 
and on belief in the wide transfer of training 
have put mathematics on the defensive, and 
there appears today to exist a good deal of con- 
fusion on the objectives of high school mathe- 
matics. The fact that the Institute's mathe- 
matics examinations have survived the appraisal 
of the critics nominated by the American 
Mathematical Society should give at least some 
guarantee that the examination in elementary 
algebra meets reasonably worthy objectives. 
Necessarily many of the items are just such as 
one would find in the average algebra text- 
book; many, however, require a good deal of 
ingenuity and will not easily be solved by stu- 
dents who cannot rise above routine classroom 
procedures. 
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X*Examination in Second-Year Algebra—High- 
School Level. Grades 11-12; 1944; IBM; Form 
SSYA1-3-B-3 (usually called Form B) ; separate an- 
swer sheets must be used; $2 per 25; 404 per 25 ma- 
chine-scorable answer sheets; 15¢ per scoring key; 
25€ per specimen set, postpaid; 120(125) minutes; 
prepared by the Examinations Staff of the United 
States Armed Forces Institute; published by the 
American Council on Education; distributed by Co- 
operative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
Scorablk answer sheets; 50¢ per key ; 50¢ per specimen 
set. 

L. B. Plumlee, Head of the Mathematics 
Section, Test Construction Department, Col- 
lege Entrance Examination Board, Princeton, 
New Jersey. Forty-two of the test's 53 prob- 
lems are formal algebra; the remainder are 
verbally stated. There is little originality in the 
problems, the majority of items conforming to 
the traditional type of item found in most alge- 
bra textbooks. It is an achievement test with 
emphasis upon repeating learned processes 
rather than upon applying these processes to 
new situations. The range of subject matter 
seems broad, but concentrated on certain topics. 
Eight problems test handling of exponents and 
IO test simultaneous equations, but only one 
item tests logarithms, and one, the binomial 
theorem. Four items test the solution of quad- 
ratic equations by factoring; none require the 
use of the quadratic formula. Extraneous and 
zero roots are omitted. Knowledge of gears 
and levers is assumed in two of the ten verbal 


items. 
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David Segel, Specialist in Tests and Meas- 
urements, U. S. Office of Education, Washing- 
ton, D. C. This test is a revised edition of the 
test issued in 1931. It is available in a regular 
form and a machine-scored form. This 1942 

_ edition is the result of experimentation with 
six types of material: (a) mathematics mean- 
ing and vocabulary, (5) mensuration, (c) 
problem analysis, (d) equations and formulae, 
(e) numerical series, and (f) disarranged nu- 
merical equations. After statistical analyses of 
the experimental use of these six types of 
material, the final test was constructed. It con- 
sists of four parts: (a) arithmetic problems, 
(b) abstract computation, i.e., using letters in- 
stead of figures in computation, (c) numerical 
series, and (d) dependence and variation, 
which is a test of the understanding of the re- 
lationship of parts of the equation. In gen- 
eral, this edition of the test covers the same 
aptitudes as the original edition. 

The reviewer believes that the experimenta- 
tion back of this test and the results cited 
make it one of the best tests for prognosis of 
success in algebra. The relationship between 
the scores on this test and success in algebra 
is substantially higher than that existing be- 
tween success and any intelligence test using 
comparable test time. For the purpose of guid- 
ance into algebra classes and for classification 
within the algebra class, this instrument may 
be recommended. Something similar may be 
said about any good prognostic test in algebra. 
School people have found this type of test use- 
ful because of its superiority over an intelli- 
gence test for the purpose mentioned. The use- 
fulness of these tests will decrease more and 
more as multiple-aptitude tests, i.e., tests of 
various mental traits, are developed for this 
level, because so many different areas of school 
activity can be covered by such multiple apti- 
tude tests at the same time, and guidance for 
these different activities can thus be obtained 
through one measuring instrument. However, 
for sectioning purposes within an algebra class, 


an algebra aptitude test will probably always 
be superior. 


For reviews by Richard M. Drake and M.W. 


Richardson, see 40:1441. 
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1939-43; Form A; god per 25; 35¢ per specimen sct, 
postpaid ; 80 (90) minutes; Harl R. Douglass and Dale 
O. Patterson; Educational Test Bureau. 


John A. Long, Director of the Department of 
Educational Research, University of Toronto, 
Toronto, Canada. The 67 short-answer items 
do not appear to be arranged in order of diffi- 
culty; however, this may not be a serious fault 
in view of the liberal time allowance. The 
printing is not well done. In two or three in- 
stances fractions are in such small type as to 
force the student to draw on his imagination. 

There appears to be nothing about the test 
to recommend it strongly. For the most part, 
the items are of a very routine variety ; as such, 
they may suit the purpose of the routine 
teacher. Any reasonably competent teacher 
should be able to produce as good a measuring 
instrument by devoting four or five hours to 
the project. 


Daniel W. Snader, Professor of Mathemat- 
ics, University of Illinois, Galesburg, Illinois. 
This 8o-minute test consists of 67 problems 
which require complete solutions rather than 
mere matching or recognition such as is in- 
volved in multiple-choice exercises. This is a 
commendable feature of the test since examinees 
are required to actually demonstrate major 
manipulation abilities without suggestions from 
a list of multiple-choice responses. However, 
this feature makes scoring a bit more tedious. 

The test consists of two parts: Part I (42 
items) deals with fundamental operations and 
solution of equations; Part II (25 items) in- 
cludes problems on motion, age, number rela- 
tions, geometric relations, ratios, investment, 
coins, work, ete. Percentile norms are included 
for both parts. 

While the test recognizes somewhat the im- 
portance of the language and symbolism of 
algebra, its major emphasis is on the manipu- 
lative and mechanical aspects of the subject. 
To this extent the test may help to perpetuate 
the conventional type of algebra teaching. Part 
II attempts to measure ability to do relational 
thinking, but leaves much to be desired. Ap- 
preciations and attitudes are not measured at 
all. In general the test is strong in measure- 
ment of skills and processes but relatively weak 
on functional thinking and generalizations. 


[329 ] 
Votaw Algebra Test. High school; 1939; IBM; 
Forms A, B, C; Separate answer sheets need not be 
used; $1.25 per 30; 25¢ per specimen set; special IBM 
answer sheets not available; 45(50) minutes ; David F. 
Votaw, Sr. and David F. Votaw, Jr.; Steck Co. 
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Richard M. Drake, Professor of Education, 
and Assistant Dean, College of Arts and. Sci- 
ences, The University of Buffalo, Buffalo, New 
York. This test is designed to measure achieve- 
ment in first-year high school algebra. It is com- 
posed of 60 items of the multiple-choice type. 
The student has a choice of five responses which 
tends to reduce the probability of guessing. In 
all cases one of the possible choices is a “dif- 
ferent answer.” While this is a very desirable 
feature, it should not be the correct response 
when the distractors are poor (see Item 31, 
Form A). The reviewer has noted other similar 
limitations which would tend to indicate that 
more study should have been given to the con- 
struction of the test items. 

The author has included a good sampling of 
problems from typical ninth-grade algebra, and 
the test should therefore have rather high valid- 
ity for such courses. The reliability coefficient 
is reported as .88, but there is no indication as to 
the size or representativeness of the sample. 
Nor is this information available for the norms. 
The latter is a serious omission and casts some 
doubts on the usefulness of the results gained 
from such a measuring instrument. 

The typography of the test is also open to 
criticism. The print is small, and the general 
appearance leaves the impression that the work 
is crowded. In some cases there is work space 
on the test sheet, and for other questions there 
is not. This should be uniform. 

The directions for administering the test are 
explicit and complete. The time limit is given 
as 50 minutes in the directions and corrected 
as 45 minutes at the end of the manual of 
directions. This might cause some confusion. 
Even though the items are arranged in order 
of increasing difficulty, it is doubtful that 45 
minutes is sufficient. 

In conclusion, it should be pointed out that 
validity is the primary consideration in choos- 
ing a test, and it is the reviewer’s opinion that 
this test will do a fair job of measuring that 
which it is intended to measure. 


Nathan Morrison, Principal Actuary, New 
York State Division of Placement and Unem- 
ployment Insurance, New York, New York. 
This multiple-choice test covers the elements 
of first-year high school algebra. The time al- 
lowed for the 60 items is 50 minutes, which 
should be sufficient to permit most students to 


try every problem, since the items are rather 
simple and short. ` 

The items and the alternative answers appear 
to have been carefully prepared to discourage 
the student from trying to guess the correct 
answer by elimination of the alternative an- 
swers. However, the first 10 items are so sim- 
ple that it is doubtful whether they add any- 
thing except, of course, to the average score. 

Within the narrow range of material covered, 
the test appears to be adequate and fair. 


[ 330 ] 

Wisconsin Algebra Test. High school; 1939; Forms 
A, B; Form A has the title Wisconsin Achievement 
Tests: Algebra; $1.25 per 25; 50(55) minutes; pre- 
pared by the Bureau of Guidance and Records, School 
of Education, University of Wisconsin, and the Wis- 
consin Algebra Test Committee : Alvin G. Hellfritzsch, 
Clifford N. Mills, Orlando E. Overn, Roger L. Slo- 
cum, Paul L. Trump, and Bjarne R. Ullsvik; manual 
by T. L. Torgerson and G. J. Froehlich; E. M. Hale 
and Co. 


ARITHMETIC 


[331 ] 
*Arithmetic Reasoning. Grades 9 and over; 1941; 
1 form; $1.35 per 25; 35¢ per specimen set, postpaid ; 
30(35) minutes; B. V. Moore; distributed by Psy- 
chological Corporation. 


Jacob S. Orleans, Professor of Education, 
The City College of New York, New York, 
New York. This test measures a rather high 
level of arithmetic problem solving ability. The 
term "arithmetic" may be inappropriate as far 
as the last few problems are concerned since 
it is likely that most examinees will solve them 
by means of algebraic techniques. An inspec- 
tion of the problems themselves indicates the 
level of difficulty. The norms make this fact 
even clearer. For 554 high school graduates 
enrolled in the 1941 Introductory Engineering 
Subjects program at the Pennsylvania State 
College, the median number right was 11.5 out 
of a total of 20 problems. For 6,844 adults en- 
rolled in the Engineering, Science, and Man- 
agement Defense Training program the median 
number right was not quite 10. 

The test consists of 20 problems, most of 
them calling for a careful analysis of the prob- 
lem situation. Although technical knowledge 
of physics is not essential for the solution of 
the problems, it would undoubtedly be helpful 
in understanding some of the problem situa- 
tions. There is no question that most of the 
problems really require the use of a reasoning 
process, since they would hardly be recognized 
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by the examinee as the usual problem “types.” 

The time limit of 30 minutes is probably not 
adequate. The mimeographed manual accom- 
panying the test does not provide any infor- 
mation on the number attempted. Tt does state 
that the time allowed is "sufficiently ample to 
make it more of a power than a time test." 

No data are given on the reliability of the 
test. Some data are offered on the statistical 
validity of the test, the criteria being the inter- 
nal consistency of the items (correlations with 
total score) and the correlation of the items 
with scores on the Cooperative Physics Test 
for College Students. Since a single point of 
score represents (on the average) 8 percentile 
ranks between scores of 7 and 15 right for 
high school graduates (enrollees in the IES 
program), it would be comforting to know the 
reliability of the scores. A probable error as 
large as 2 points of score might, depending on 
the size of the score, displace the individual by 
as many as 20 percentile ranks, 

Norms are given as percentile ranks for the 
two groups referred to above. More detailed 
norms based on a larger and more widely dis- 
tributed sampling would be helpful, for in- 
stance in terms of background in mathematics 
and science as well as for different school 
levels. 

The test is characterized as providing a 
measure of aptitude rather than merely of 
achievement in arithmetic. However, no indi- 
cation is given of the field in which aptitude is 
measured or that it is an aptitude measure. 

The content is obviously ingenious, and the 
test may be a very good device for measuring 
arithmetic problem solving ability of rather 
high level. 

[ 332 ] 
Arithmetic Test: Fundamental Operations: Do- 
minion Tests. Grades 4-8; 1034-43; Forms A, B; soc 
per 25; 236 per specimen set; specimen set must bc 
purchased to obtain the manual ; 35(45) minutes; pre- 
pared by the Department of Educational Research, 


Ontario College of Education, University of Toronto; 
Vocational Guidance Centre, 


C. L. Thiele, Divisional Director, Exact Sci- 
ences, Public Schools, Detroit, Michigan. Ac- 
cording to the manual, "This test was designed 
to provide a reliable measure of achievement 
in arithmetic computation over a wide range of 


grades and ages." This purpose the tests seem 
to fulfill in a satisfactory manner. However, 


the authors make the following additional . 


claims which may be seriously questioned: (a) 


“The test was primarily intended for assist- 
ance in the placement of pupils, and for co: npar- 
ing pupil with pupil and class with class." (5) 
“Tt also has useful diagnostic possibilities. The 
examples have been so arranged as to make it 
readily possible for a teacher to discover to 
some extent the nature of individual deficien- 
cies in the fundamentals of arithmetic." 

In stating that pupils might be placed in their 
study of arithmetic by the use of a computa- 
tion test, it would seem that the Dominion 
authorities have taken the point of view that 
skill in computation is the chief goal of arith- 
metic teaching. The net result of focusing at- 
tention solely upon arithmetic computation can 
have only one outcome, and that is to make 
arithmetic a mechanistic, drill subject in the 
Dominion schools. In view of the advances that 
have been made during the past decade to make 
arithmetic meaningful to pupils and to give it 
significance through social application, the wide 
use of a test which measures arithmetic com- 
putation alone can be challenged. 

The diagnostic values of the Dominion test 
are limited for two reasons. First, the test in- 
cludes only a small number of test items on 
each major topic or skill. Second, certain im- 
portant ideas are not tested at all. 

The section of the test devoted to the four 
fundamental operations with whole numbers 
contains only twenty test items, i.e., five items 
for each operation. Ten test items are allotted 
to the other six divisions of the test in which 
is included weights and measures, addition and 
subtraction of common fractions, multiplica- 
tion and division of common fractions, addi- 
tion and subtraction of decimal fractions, mul- 
tiplication and division of decimal fractions, and 
percentage, respectively. Obviously, little diag- 
nostic use can be made of a test which, for 
example, contains only ten problems on addi- 
tion and subtraction of common fractions. It 
should be noted that the test aims to measure 
the arithmetical ability of pupils through the 
eighth grade. 

An arithmetic test which purports to meas- 
ure the fundamentals should include problems 
involving part-whole relations. The test con- 
tains no part-whole relation problems which 
require the whole to be found, given the part 
and its relative size, even in the section of the 
test devoted to percentage. Of the three kinds 
of part-whole relationships, only finding a part 
of a number is included under the heading of 
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common fractions. Ratio and proportion is 
omitted entirely from the test. With such omis- 
sions the test cannot give a valid measure of 
achievement in arithmetic computation. 

The test itself is well constructed. The test 
items were selected so as to represent a wide 
range of difficulty, the items being well graded 
in difficulty. A high reliability was established 
between the two forms of the test. Norms were 
developed to indicate the grade standing of in- 
dividual pupils. Tentative grade averages were 
also found for the test. The format of the test 
is one that makes it easy to score. All in all, the 
test conforms to the standards of good test con- 
struction and should be useful for broad survey 
purposes. 

In concluding this review, attention is again 
called to the dangers inherent in a Dominion- 
wide arithmetic testing program which tests 
only skill in computation. The inevitable result 
will be teaching aimed solely to prepare pupils 
to compute well without reference to the other 
important aims of arithmetic teaching. 


[ 333 ] 
Arithmetic Test (Fundamentals and Reasoning). 
Grades 3-6, 6-8; 1938-39; same as arithmetic tests in 
Municipal Battery; Forms A, B; 2 levels; $1.50 per 
25; I5é per specimen set of either level; 45(50) min- 
utes; Robert K. Speer and Samuel Smith; Acorn 
Publishing Co. 


[ 334 ] 


Basic Arithmetic Skills: Iowa Every-Pupil Tests 
of Basic Skills, Test D, New Edition. Grades 3-5, 
5-9; 1040-45; IBM, Form O, grades 5-9; Forms L, 
M, N, O; 2 levels; 44 per specimen set of either level; 
H. F. Spitzer in collaboration with Ernest Horn, 
Maude McBroom, H. A. Greene, and E. F. Lindquist 
(General Editor); Houghton Mifflin Co. 

a) ELEMENTARY BATTERY. Grades 3-5; $1.50 per 25; 
57(65) minutes. 

b) ADVANCED BATTERY. Grades 5-9; IBM, Form O; 
separate answer sheets need not be used for Form O; 
$1.60 per 25; so per 25 machine-scorable answer 
sheets; 30¢ per set of machine-scoring stencils; 63 or 
68(80) minutes. 


William A. Brownell, Chairman, Department 
of Education, Duke University, Durham, North 
Carolina, ELEMENTARY BATTERY. The four 
forms of these tests are directly comparable in 
content and organization. Part I consists of 
27 to 29 four-alternative multiple-choice items 
designed to measure Vocabulary and Funda- 
mental Knowledge. Part II, Fundamental Op- 
erations, contains 34 or 35 examples calling for 
computation with whole numbers and simple 
fractions. Part III, Problems, comprises 22 to 
27 verbal problems, about 10 of which relate 


to a single pictured situation, the rest being 
independent of each other. The test as a whole 
calls for 57 minutes of work and may be ad- 
ministered in a single sitting of 65 minutes or 
in two sittings of 35 and 30 minutes. 

Paper, format, and spacing are excellent. 
Scoring is made simple and comparatively free 
of error through use of peephole stencils. The 
front page of each test booklet is arranged with 
places for raw scores on the three separate 
parts and on the test as a whole, together with 
the corresponding grade equivalents. 

Part I is a departure from the traditional 
type of measurement in arithmetic which is 
usually restricted to abstract computation and 
“problem solving.” It represents a serious at- 
tempt to evaluate the understanding of ele- 
ments of arithmetical knowledge which are 
being steadily more widely accepted as out- 
comes of instruction—elements such as: read- 
ing and writing numbers ; common quantitative 
symbols; numerical facts, terms, and symbols ; 
common geometric figures; quantitative esti- 
mates; relative size of numbers and fractional 
parts. The number of such elements in arith- 
metic is of course very great. Undoubtedly the 
test authors regret that the sampling of this 
matter must be very limited because of the 
necessity of holding test time to a few min- 
utes. The items in abstract computation (Part 
II) are of the usual sort, and it may be as- 
sumed that they agree in their placement with 
common practice throughout the country. The 
problems in Part III are also typical of cur- 
ticular and measurement practice, though they 
are more “practical” and are perhaps better 
stated than is the rule. The use of centers of 
interest for organizing groups of problems is 
to be commended. Isolated problems, however, 
still serve an important function, and the au- 
thors are justified in retaining this type in their 
tests. 

Though it lacks the customary data on relia- 
bility and on the validation of the tests, the 
manual is otherwise complete and helpful. 
Simple, clear directions are given for adminis- 
tering and scoring the tests. Age norms, based 
upon grade equivalents, are provided, as are 
also percentile tables for grade equivalents, one 
table for each grade. A sample Class Summary 
is furnished and discussed. An Individual 
Cumulative Record, not supplied with the test, 
is also described. The last several pages con- 
tain a valuable discussion of points to be ob- 
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served in interpreting test scores, as well as 
suggestions both for initial and for remedial 
teaching. 

There probably is no better battery of arith- 
metic tests on the market. This judgment may, 
however, reflect the reviewer’s belief that eval- 
uation in arithmetic has for too long been nar- 
rowly conceived and that any competent attempt 
to broaden the base of evaluation is welcome. 

ADVANCED BATTERY. In form and organiza- 
tion the Advanced Battery resembles the Ele- 
mentary Battery. Part I contains 40 items on 
Vocabulary and Fundamental Knowledge; 
Part II, 33 to 41 computational examples (the 
number varying with the form) ; Part III, 25 
to 30 problems (the number again varying 
with the form). 

The Advanced Battery, however, differs at 
some points. (a) Being longer, it requires 63 
minutes of working time (hence, a testing 
period of 80 minutes, or two of 45 and 35). 
(b) Part II is divided into two sections, the 
first for whole numbers and fractions, the sec- 
ond for decimals, percentage, and denominate 
numbers. (c) More of the problems (Part III) 
are related one to another. (d) The pupils use 
an answer sheet. The answer sheet, which is 
the first page of the test booklet, includes a 
place for raw scores and grade equivalents for 
the three parts separately and for the tests as 
a whole. All conversion tables are relegated to 
the Manual, which in content and arrangement 
closely parallels that for the Elementary Bat- 
tery. 

Enough has been said to indicate that the 
four forms of this battery are closely alike. 
The items intended to test vocabulary and fun- 
damental knowledge are well selected. The ex- 
amples of Part II seem to agree with current 
school practice for the grades 5 to 9. And the 
verbal problems (Part III) relate to common 
experiences of children and to common social 
applications, 

The last of the four forms, Form O, differs 
somewhat from the other three, in that: (a) 
the number of examples dealing with decimals, 
percentage, and denominate numbers has been 
greatly reduced; (b) there is larger use of pic- 
tures and diagrams; and (c) more of the prob- 
lems are constructed around centers of interest. 
The reason for change (a) is not apparent; 
changes (b) and (c) seem to be in the right 
direction. 

Like the Elementary Battery, the Advanced 


Battery is to be recommended, and for the 
same reasons. 


Leroy H. Schnell, Department of Mathemat- 
ics, State Teachers College, Indiana, Pennsyl- 
vania. Each form of both batteries is in three 
parts: Part I, Vocabulary and Fundamental 
Knowledge; Part II, Fundamental Operations 
(in the Advanced Battery, this is headed : Sec- 
tion A: Whole Numbers and Fractions; Sec- 
tion B, Percentage, Decimals, and Denominate 
Numbers) ; Part III, Problems. ; 

Either battery can be given in a single period 
or in two shorter periods. Actual working time 
for the Elementary Battery is 57 minutes; Ad- 
vanced Battery, 63 minutes, except Form O 
which requires 68 minutes. Separate manuals 
are provided for each battery with minute in- 
structions for administering and scoring the 
tests. Tables are presented for converting (a) 
part and total scores into grade equivalents, 
(b) grade equivalents into age equivalents, and 
(c) grade equivalents into percentile rank. Co- 
efficients of reliability are not indicated. 

In the opinion of this reviewer, the most 
commendable feature of the arithmetic battery 
is to be found in Part I, Vocabulary and Fun- 
damental Knowledge. The authors have selected 
multiple-choice items involving: (a) reading 
and writing numbers; (b) knowledge of com- 
mon quantitative measures; (c) knowledge of 
numerical facts, terms, and symbols; (d) iden- 
tification (or knowledge) of common geo- 
metric figures (and terms) ; (e) ability to make 
quantitative estimates (or to "describe quanti- 
tatively") ; (f) ability to compare the size of 
numbers and fractions; (g) knowledge of the 
number system; (A) knowledge of common 
processes. This part of the test will be heartily 
approved by teachers who believe that much of 
the difficulty which children encounter in arith- 
metic is due to a lack of understanding of the 
basic meanings and concepts. They will not be 
disturbed by the fact that an item on this part 
of the test carries the same weight as compu- 
tational and problem-solving items. Tt is too 
frequently assumed that pupils who secure cor- 
rect answers in arithmetic drill and problem 
solving are capable of interpreting their an- 
swers and also that they understand the proc- 
esses and concepts involved. Unfortunately this 
assumption is not tenable. Many accurate 
"computers" secure correct answers merely by 
following, mechanically, procedures into which 
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they have little or no insight. For example, a 
pupil may determine correctly that the cost of 
one article has increased 3/8 and another 2/5 
but may be unable to decide which result indi- 
cates the greater increase, or fail to see the 
desirability for such a common-sense judg- 
ment, which may not be specifically required 
in the problem. Tests which require pupils to 
compare numbers, and which make an effort to 
determine their understanding of basic con- 
cepts, should have a salutary effect on arith- 
metic instruction. 

In connection with the above, one weakness 
of Part I should be noted here. A pupil's re- 
sponse on an item may indicate that he recog- 
nizes a given situation in which pi would be 
used. Or, he might demonstrate that he knows 
that 5/7, and not the product, sum, or differ- 
ence of two numbers, is a ratio. However, such 
recognition does not necessarily mean that he 
can apply these important concepts to problem 
situations. Consequently, such correct responses 
are of negligible significance unless supported 
by evidence that the pupil actually can apply his 
knowledge. Many important concepts included 
in this part of the tests are given specific at- 
tention in Parts II and III, but, unfortunately, 
many others are not. 

In the opinion of the writer, the Elementary 
Battery adequately samples the curricula of 
grades 3 to 5 inclusive, although the Advanced 
Battery is recommended for grade 5 because 
of a wider sampling of fractions. However, 
this reviewer does not regard the items of the 
Advanced Battery as an adequate sample of the 
curricula of grades 5 through 9. No problem 
(or drill-type item) deals with the mensuration 
of the circle, sphere, cylinder, or cone. The 
topic of simple interest is sparingly touched 
upon, as well as many other topics that are 
common to arithmetic and junior high school 
mathematics courses for grades 7 and 8. 

Only approximate answers are required in 8 
of the 31 problem-solving items in Part III, 
Form O, of the Advanced Battery. Since an 
exact answer to a practical problem often may 
be less meaningful than a good approximation, 
it is encouraging to find this type of item in 
standardized tests. Surely few, if any, edu- 
cators will deny the fact that if 21/32 of a 
farm's acreage is planted in corn, the most 
common sense interpretation is that "about 
2/3 is planted in corn." 

The authors are to be commended for their 


splendid discussion, in the manuals, on the 
“Need for Developing Better Understanding of 
Arithmetic Processes” and also for their ef- 
forts to assist users of their tests to plan a 
remedial program and to relate the results of 
the tests to local curricula. The analysis of all 
items of Part I of the tests should prove most 
helpful. The authors are urged to reconsider 
their assertion that since “The field of arithme- 
tic is already well defined, and there is relatively 
high agreement . . . as to what constitutes the 
essentials . . . no attempt will be made, there- 
fore, to offer here any suggestions for the de- 
velopment of the most commonly accepted 
basic arithmetic skills.” Analysis of the skills 
involved in items of Parts II and III would 
prove as helpful as this information for Part I. 

Responses to the multiple-choice items in 
Part I, in all forms of the Elementary Battery, 
are to be indicated in boxes appearing under 
the questions. Answers to items in the other 
two parts are to be written in boxes provided 
in the ample space reserved for computation. 
The scoring keys are made of heavy cardboard 
with holes cut so that only the pupil’s answers 
are seen when a key is properly placed on a 
page. Correct answers are printed under each 
opening. 

On the Advanced Battery, Forms L, M, and 
N, pupils are required to follow the question- 
able procedure of transferring their answers to 
a one-page answer sheet for Parts II and IIT. 
Responses to Part I are to be indicated by 
marking an X in tiny boxes. Forty rows, four 
to a row, of these tiny boxes are crowded into 
a space less than 1 by 10 inches. No device 
which sacrifices the comfort and assurance of 
the pupil, regardless of the resulting ease of 
scoring, can readily be defended. 

A single-page answer sheet is also provided 
for Form O (Advanced Battery), but four 
multiple-choice responses are provided for each 
of the 64 items of Parts II and III. Here, a 
new feature, definitely open to challenge, is in- 
troduced. In 54 items, “N” is one of the four 
choices. Pupils are instructed to place an X in 
the box by the answer that agrees with theirs ; 
but *if none of the three answers given is like 
yours, place an X in the box in front of the N." 
In approximately 20 per cent of these 54 items 
the correct response is “N.” Hence a pupil 
actually may solve one of these problems in- 
correctly and still be awarded a score merely 
because his incorrect answer was different from 
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the three incorrect answer choices! The pro- 
cedure is educationally unsound, in addition to 
the fact that grade and age equivalents for 
Form O cannot be given credence. 

Critics who analyze the test will be disturbed 
by inaccurate and misleading statements in sev- 
eral of the items. Some of these set an incor- 
rect pattern of reasoning while others lead to 
confusion of the meaning of basic concepts. To 
be specific: i 

(a) One item asks, “Which line is about 
four times as long as line A?” “Line” A is 
not a line but a shaded rectangle 2 by 8 mm. 
The “lines” with which A is to be compared 
are four rectangles of varying lengths. Con- 
fusing the concepts of line and area is to be 
deplored. 

(b) The question “How much of a price 
reduction has been made?” obviously indicates 
an answer in dollars and cents and, in the item 
now being criticized, would have been obtained 
by subtraction. Since the four multiple-choice 
answers (no N provided) are fractions, the 
question intended probably was “The reduction 
was what part of the original price?” In an 
item somewhat like the one above, children are 
asked “how much faster . . .” (speed of an 
airplane versus speed of an automobile). As 
the question is asked, the correct answer is 
345 m.p.h., but the answer sheet gives (no N), 
3 times, 4 times, 5 times, 6 times. Apparently 
the question intended was “How many times 
as fast... .” 

The above are samples of several instances 
in which meticulous care has not been taken to 
make an item precise and not subject to con- 
fusion or ambiguous meaning. 

It is this reviewer’s opinion that this battery 
of arithmetic tests can serve excellent educa- 
tional purposes if: (a) The answer-sheet diffi- 
culties noted above, including the procedure 
which may permit incorrect answers to be 
scored as correct, is remedied. (b) The Ad- 
vanced Battery is extended in scope to sample 
more of the work of the upper grades. (c) The 
inaccurate and misleading statements are cor- 
rected. 


For reviews by Frederic L. Ayer, Gustav J. 
Froehlich, and Ralph C. Preston of the complete 
battery, see 10. For reviews by William A. 
Brownell, J. Murray Lee, and Charles W. Odell 
of an earlier form of the complete battery, see 
38:872. 
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Apang S prion Tes, gade nu 
1045 Qs) mite Aili. Wrinkle, Juanita 
Sanders, and Elizabeth H. Kendel; Science Research 
Associates. 

Jacob S. Orleans, Professor of Education, 
The City College of New York, New York, 
New York. The authors posit 43 basic skills 
in arithmetic computation such as adding whole 
numbers, adding unlike fractions, and placing 
the decimal point correctly in multiplying. The 
purpose of the test is to measure command 
of these fundamental skills. To do this, the 
authors offer 68 examples. Since the list of 43 
basic skills is not given, it is not possible to 
determine how many examples are used for 
each skill. Apparently the average is slightly 
more than one example per skill. If the test is 
meant to sample the 43 skills, it apparently does 
so adequately. However, if the test is used for 
diagnostic purposes, it is obvious that the num- 
ber of tasks per skill is too meager. The test is 
accompanied by a diagnostic class record sheet. 
The suggestion is offered that the chart be used 
to determine both needed emphasis in group in- 
struction and individual needs for instruction. 
The latter purpose is hardly justified in view of 
the very small number of tasks for each skill. 

The reliability of the test is given as Kuder- 
Richardson coefficients of .86 for grade 6, and 
.96 to .98 for each of grades 7 to 12. Presum- 
ably these coefficients are based on the 354 to 563 
pupils per grade on whom the norms are based. 
The coefficient for the sixth grade is perhaps 
a bit lower than is typical of such tests. The 
other coefficients are uniformly so high as to 
raise the question of their reliability, particularly 
in view of the greater variability of the sixth- 
grade scores and the greater instability of the 
scores in the higher grades. The latter is implied 
by the instability of the norms. 

The norms are apparently significant only for 
grades 6 to 8. The norms show a raw score 
increase of 8 between the sixth and seventh 
grades; an increase of 5 between the seventh 
and eighth grades ; and an increase of 4 between 
the eighth and twelfth grades. The norms are 
given as percentile ranks for each grade. Above 
the eighth grade they have educational meaning 
only within the grade group, or perhaps for all 
pupils above the eighth grade considered as one 
group. Since the mean score for the end of the 
sixth grade is 40 out of a possible 68, it would 
seem desirable to obtain norms for the fourth 
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and fifth grades if the achievement levels for 
pupils of higher grades who are the poorest in 
arithmetic are to be accurately interpreted. More 
than a tenth of the students above the eighth 
grade have scores below the six-grade norm. 

The mechanical layout of the test allows 
room for improvement. In Items 20 and 21 of 
Form A many pupils will probably have to spend 
some time deciding what to do because the in- 
struction for Item 19 appears to apply also to 
the next two items. There is hardly enough space 
on the paper for the pupils to perform the neces- 
sary computations for some of the examples. 
The pupil is provided with the added burden of 
copying his answer in a column at the right of 
the page. The teacher’s time for scoring the 
papers should be reduced to a minimum but not 
at the possible expense of the accuracy of the 
score. The authors would have done better to 
arrange the items so that the answers as first 
obtained would appear in columns and would 
not need to be copied. It would take the teacher 
very little more time to score nine or twelve 
short columns than three long columns. 

The test seems hardly to offer any improve- 
ment over other long, detailed arithmetic com- 
putation tests. The one grade for which it ap- 
pears to have most significance presents the 
lowest reliability. Although the teacher may ob- 
tain a reliable measure of arithmetic computa- 
tion and data on emphases in teaching needed 
for the class, she should not assume that this 
information will enable her to correct the de- 
fects of individuals or those defects common to 
most of the class. 

There is too much evidence that mastery of 
arithmetic skills represents unreasoned, mean- 
ingless learning that is kept alive only through 
rote drill. A necessary step to improve the learn- 
ing is the analysis of the errors, the determina- 
tion of how and why they are made. The use of 
the test will provide a good basis for such 
analysis. 


F. Lynwood Wren, Julia A. Sears Professor 
of Mathematics, George Peabody College for 
Teachers, Nashville, Tennessee. This test “is 
designed to measure command of forty-three 
fundamental skills in arithmetical operations in- 
volving whole numbers, fractions, decimals, and 
per cents.” It is an outgrowth of a research- 
experimental program which began in 1939 in 
the secondary school of the Colorado State Col- 
lege of Education. The purpose of this program 


was the improvement of “instruction in those 
arithmetic skills frequently involved in every- 
day living.” In seeking to redefine the basic 
skills of arithmetic, “two basic criteria other 
than those of a statistical nature were desig- 
nated: (1) the basic skills should include only 
the useful computational skills needed in daily 
life and (2) the skills should be selected on the 
basis of social usage and child needs.” After the 
forty-three skills had been identified and a pre- 
liminary series of tests constructed, there was a 
two-year refinement period during which time 
the tests were used with different groups of stu- 
dents. Claim is then made by the authors that 
“since each item included in these preliminary 
tests was designed to measure one of the basic 
skills, the validity of the tests for use in the 
measurement of basic skills in arithmetic was 
established.” It is not altogether clear that this 
claim is justified. There is a real question as to 
how critically the two-year refinement program 
evaluated the effectiveness of each item in the 
tests as a “measure of one of the basic skills.” 
Also, one might well raise the question as to 
validity and significance of the number “forty- 
three.” 

The next step was to study “the difficulty and 
discrimination value of each item.” Here again 
different groups were used, and the findings 
with one group were checked against those with 
another. Items of zero difficulty and items which 
were “answered correctly by a higher per cent 
of students making low total scores on the test 
than by those making high total scores” were 
“discarded as lacking discrimination value.” 

The Kuder-Richardson formula was used in 
determining reliability coefficients. These co- 
efficients were .86 for the sixth grade, .96 for 
the seventh and tenth grades, .97 for the eighth 
and eleventh grades, and .98 for the ninth and 
twelfth grades. 

No time limit is specified for the test, but 
records have been kept in an effort to determine 
just how long it took students to complete the 
test. “On the basis of these experiences, it was 
concluded that (1) no class should be expected 
to complete, the test in less than 35 minutes, 
(2) most classes, regardless of grade level, may 
be expected to complete the test within 45 min- 
utes, and (3) those students who need the most 
time at any grade level will complete the test 
within 60 minutes.” 

There are two forms, each consisting of 68 
test items rather well distributed over the funda- 
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mental skills with whole numbers, common and 
decimal fractions, and per cents. There is only 
one verbal problem, and it is one using per cents. 
No attention is paid to computation with approx- 
imate data, to arithmetical appreciations, to the 
social uses of arithmetic, nor to basic under- 
standings. 

A scoring key is provided for each form. 
Although only one answer is given for each 
exercise, considerable latitude is given in the 
determination of correct answers through the 
following note (printed in the manual and in 
the margin of the key): “Any answer that is 
basically correct unless it violates the instruc- 
tions given in the test should be counted as a 
correct answer." The manual devotes two fairly 
lengthy paragraphs to indicating how this note 
should be used in scoring the test. 

Percentile norms are given for grades 6 
through 12, These scores are based on data 
from approximately 3,200 students from twelve 
school systems scattered over eleven states. The 
number of students at the different grade levels 
ranged from 354 to 563. 

A Diagnostic Class Record Sheet accom- 
panies the test. It is so arranged that individual 
student errors can be determined by analyzing 
horizontal entries, while class difficulties can be 
found by taking vertical totals of the errors in 
each type of problem. In this diagnostic chart 
the authors seem possibly to challenge their 
designation of “forty-three fundamental skills 
in arithmetical operations involving whole num- 
bers, fractions, decimals, and per cents.” The 
chart makes provision for the tabulation of 
errors in fifty fundamental skills distributed as 
follows: whole numbers, 11 ; fractions, 18; deci- 
mals, 13; and per cents, 8. 

The last sheet of the Record Sheet provides 
a Class Distribution Table, with instructions for 
making a class distribution. Instructions are also 
given for finding the median score of a class. 
The illustration used in implementing these in- 
structions is one of the simplest for finding the 
median of a distribution. No explanations are 
given as to what procedure to follow when a 
specific situation does not conform to the illus- 
tration, 

This test appears to be very well adapted to 
use as an achievement test or a diagnostic test 
on the fundamental skills with whole numbers, 
common and decimal fractions, and per cents. 
Since it is designed for use at the high school 
level it would seem that attention should have 


been given to the use of the same skills in exer- 
cises involving computation with approximate 
data. 

[ 336 ] 
XBrief Survey of Arithmetic Skills, Grades 5-12; 
1947; Forms A, B; 7¢ per test; 10(15) minutes; 
Arthur E. Traxler; Educational Records Bureau. 

[ 337 ] 
Chicago Arithmetic Readiness Test. Grades 1.5-2; 
1939; 1 form; $1.25 per 25; (30-50) minutes; Jeu. 
Johnson; E. M. Hale & Co. . 

REFERENCES 

1. JOHNSON, dons T. "The Chicago Arithmetic Readiness 
Test." Chicago Sch J 21:121-3 D '39. 

Foster E. Grossnickle, Professor of Mathe- 
matics and Head of the Department, State 
Teachers College, Jersey City, New Jersey. This 
arithmetic readiness test is a nonverbal test 
which samples such things as writing num- 
bers, counting, reproduction, recognition, and 
comparison of numbers. It also tests the most 
familiar common fractions. These are definite 
concepts which should be known before a pupil 
begins the study of combinations, sometimes 
known as the formal or systematic study of 
arithmetic. 

There is a total possible score of 51. Accord- 
ing to the manual there is no set norm for the 
test. The author concludes that a raw score of 
25 is a critical score. A pupil who scores below 
that point should be given a retest so that the 
teacher is able to detect points of difficulty. 

The author of the test makes no claim that 
the test is an instrument which will predict pupil 
success in the study of arithmetic. On the other 
hand, he maintains that the test has certain diag- 
nostic features which should show to the teacher 
where the pupil lacks an adequate background 
in the concepts tested. This view is consistent 
with the results obtained from the best research 
which has been made with readiness tests. In 
the light of this objective plus the nonverbal 
nature of the test and the well-selected concepts 
tested, the reviewer considers the Chicago Read- 
iness Test to be a good test in this field. 


[ 338 ] 
Chicago Arithmetic Survey Tests. Grades 3-4, 5-6, 
7-8; 1930-43; Forms 1R, 2R, 3; 3 levels; J. T. John- 
pu E. PED & e 
4) TEST A. Grades 3-4; $1.50 per so. 
b) test B. Grades 5-6; $2 S 
c) TEST c. Grades 7-8; $2 per 5o. 


William A. Brownell, Chairman, Department 
of Education, Duke University, Durham, North 


Carolina. Each of the three forms of this test 
consists of three tests, Test A for grades 3 
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and 4, B for grades 5 and 6, and C for grades 
7 and 8. Forms IR and 2R were revised in 
1941; Form 3, copyrighted in 1940, has not 
been revised. A manual is furnished for each 
form, along with a class record sheet for each 
test level. The tests and other materials are well 
printed. Answers are furnished on keys which 
lack the scoring aids found in the keys of most 
recent tests. 

Test A of each form has sections on number 
meaning, addition, subtraction, multiplication, 
division, measurement, and reasoning, with sec- 
tions on common fractions and decimals added 
for the more advanced children. Test B is simi- 
lar except that it excludes number meanings. 
Test C has sections on fundamental operations 
with whole numbers, common fractions, deci- 
mals, reasoning, measurement, and graphs. In 
several of these areas the number of items is too 
small to constitute an adequate sampling. For 
example, in Forms IR and 2R, Number Mean- 
ing is represented by only four items and Rea- 
soning by five in Test A. Measurement and 
computation with whole numbers appear to be 
well covered. 

^ Directions for administration are clear and 
simple. Each test has two parts, and two sittings 
are indicated. Testing time, long enough for 
three-fourths of the children to finish, is 60 
minutes for Test A and varies from 8o to 110 
minutes for the other two tests of the three 
forms. 

Relative difficulty of the test items was deter- 
mined by pretesting with approximately 675 
pupils in six grades of three Chicago schools. 
Revisions and standardization preceding the 
first printed edition were based on two adminis- 
trations of the tests to about 8oo pupils in five 
schools and then to some 25,000 pupils in thirty 
schools, all in the Chicago area. The results of 
this edition in 1938-1940 were used to make 
the 1941 revisions. No information on the valid- 
ity or the reliability of the tests is given. 

On the cover of each test is a tabulation of 
the test items by number, to show the half-grade 
level at which each skill should be taught. Total 
score “standards” and “latest norms” are given 
for each half-grade. On the revised tests the 
grade placement and the Chicago course of 
study reference are printed beside each item in 
the tests themselves. Tables of grade equivalents 
are to be found in the manuals. The Form 2R 
manual has Form 1 norms, and the norms in the 
Form 3 manual are headed “to be used with 
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results of Form 3, based on results of Form 2." 
The manuals also contain “Descriptive Tables” 
which describe each skill tested and give refer- 
ences to the units in the Chicago course of study 
where the skills are taught. 

The tests are said in the manual to be power 
tests. Yet, the authors can scarcely justify this 
claim, for the time limits permit only three- 
fourths of the children to finish. The tests are 
said also to give the teacher “complete diagnos- 
tic information.” Yet, most of the skills are 
represented by but a single item, thus affording 
an inadequate basis for individual diagnosis. 
The tests are survey tests and are probably valid 
when used this way by school systems which, 
like the Chicago system, have adopted the 
“stepped-up” curriculum advocated by some 
students of: arithmetic. 


Foster E. Grossnickle, Professor of Mathe- 
matics and Head of the Department, State 
Teachers College, Jersey City, New. Jersey. 
Each of the three tests is divided into two parts 
to be given at different sittings. Each part of the 
test for grades 3 and 4 requires 30 minutes; for 
grades 5 and 6, 50 minutes; and for grades 7 
and 8, 55 minutes. The norms, established from 
the results of approximately 25,000 pupils in 
schools near Chicago, are expressed in terms of 
grades rather than of some more objective 
measure, such as percentiles. A grade norm is 
almost meaningless, since universal promotion 
is now practiced in most school systems. Each 
item within the test is given a particular grade 
norm, There is considerable overlapping of the 
range for each test. The test for grades 3 and 4 
has grade norms from 3B through 5A; for 
grades 5 and 6, from 3B throygh 7A; and for 
grades 7 and 8, from 3A through 9B. The author 
maintains that the test shows at which particu- 
lar stage of a process the pupil’s work breaks 
down. He concludes in the manual that the test 
has diagnostic value because it shows the break- 
ing point within a given process. The limited 
sampling for each of the processes makes the 
test totally inadequate for diagnostic purposes. 
The name “survey” should indicate the use to 
which the test should be put. 

When a person studies the scoring key or the 
manual, he is faced with the question of how 
to mark the answer to certain problems, because 
of the way the results are labeled. Consider the 
following problem: How many square feet in 
a rectangle 6 feet long and 4 feet wide? The 
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answer given on the key is 24 sq. ft. If a 
pupil writes 24, is the answer right or wrong? 
There are many cases of this kind which make 
the final score subjective. Provision should have 
been made for these situations either in the test 
itself by writing the label, such as sq. ft., 
or by giving specific instructions on the key or 
in the manual. 

The Chicago tests appear to be good survey 
tests and should be used only in that capacity. 
They have no diagnostic value and are to be 
used primarily as administrative devices for 
comparing achievement of different groups. The 
selection of the subject matter in the tests is 
commendable. The tests do not perpetuate the 
kind of examples or materials which should be 
dropped from the curriculum. 


For an excerpt from a review by J. Wayne 
Wrightstone, see 40:1453. 


7 [339 ] 
Claj p- Young Arithmetic Test: Clapp-Young 
Self-Marking Tests. Grades 5-8; 1927-30; Forms 
A, B; 9of per 25; 8¢ per HERI set, postpaid ; 
35(40) minutes; Frank L. Clapp and Robert V. 
Young; Houghton Mifflin Co. 


Leroy H. Schnell, Department of Mathemat- 
ics, State Teachers College, Indiana, Pennsyl- 
vania. In spite of the fact that the test bears a 
1927 copyright date 4nd also that during the past 
twenty years there have been significant changes 
in the arithmetic curriculum for grades 5 
through 8 (for which the test was standard- 
ized), the majority of problems are sufficiently 
general that they do not "date" the test. How- 
ever, it is urged that certain problems be revised 
to conform with modern daily affairs. Even 
children probably will be impressed by the un- 
reality of having a car repaired for 80 cents 
an hour, selling rabbits at 23 cents apiece and 
bicycles at $10 each. 

Many of the items on the test are real prob- 
lem situations. Some are open to challenge, 
however. For example, a boy who wished to buy 
a pair of skates costing $12 would probably 
know how much he had saved toward their 
purchase. A problem that states that ro per 
cent of the amount required has been saved and 
that asks “How much has he saved?" asks a 
question which, in real life, would already be 
Known or would be ascertained easily by count- 
ing. 

The major purposes of the test apparently are 
to measure problem-solving ability, in relation 


to age and grade levels, and to help diagnose 
reasoning difficulties as well as computational 
errors involved in the solutions. While it would 
be unreasonable to expect this type of test to 
sample all topics in the curriculum of grades 5 
through 8, many users will undoubtedly note the 
absence of problems sampling such topics as the 
following: division of a fraction by a fraction 
(or mixed numbers) ; multiplication of a frac- 
tion by a fraction (or mixed numbers) ; division 
involving decimals in the divisor; finding an 
average; simple interest; the metric system; 
mensuration of plane and solid figures. (One 
problem asks for the number of acres in a field, 
the dimensions of which are given in rods.) 

Problems involving some of the above might 
well replace the four items on each form of the 
test that are drill exercises rather than problems, 
particularly the item requiring the addition of 
three numbers, the decimal parts of which have 
unlike denominators. 

No problem requires more than two steps 
to solve. Computational complexities, such as 
carrying, borrowing, zero difficulties, changing 
from cents to dollars and cents, etc., are ade- 
quately sampled but do not overcomplicate the 
solutions. 

Matched items on the two forms require iden- 
tical reasoning. While the complexities in com- 
putation for matched items on the two forms 
are not always the same, the differences are 
minor and the two forms are well balanced in 
this respect. 

In some items a wrong answer choice is due 
mainly to faulty reasoning, while in others, the 
difficulty is mainly computational. The diagnos- 
tic value of the test might be considerably en- 
hanced if the wrong answer "explanations" also 
indicated whether the main difficulty was rea- 
soning or computational. 

The junior author of this test reports that 
recent statistical study made of scores achieved 
on the test, as reported by users, revealed no 
significant changes in the grade and age stand- 
ards originally validated. 


[ 340 ] 
XCotswold Measurement of Ability I: Arithme- 
tic. Ages 10-11; 1947; 2 forms (Series 1, 2); 6s. per 
20; 1s. per manual; 6d. per single copy; 40(50) min- 
utes; C. M. Fleming; Robert Gibson & Sons (Glas- 
gow), Ltd. 


[341] 
XDiagnostic Tests in Arithmetic Fundamentals: 
Dominion Tests. Grades 2, 3, 4, 5, 6; 1945; 1 form; 
5 levels; 75¢ per 25 of any one part; 30¢ per specimen 
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set of any one part; specimen sets must be purchased 
to obtain the manual for any one part; prepared by the 
Department of Educational Research, Ontario Col- 
lege of Education, University of Toronto; Vocational 
Guidance Centre, the University. 

a) GRADE 2, ADDITION—SUBTRACTION. 87(110) minutes. 
b) GRADE 3, ADDITION—SUBTRACTION— MULTIPLICATION. 
73(95) minutes. 

C) GRADE 4, PART I, ADDITION—SUBTRACTION. I00(120) 
minutes. 

d) GRADE 4, PART 2, MULTIPLICATION—DIVISION, 90 (110) 
minutes. 

€) GRADE 5, PART I, ADDITION—SUBTRACTION—M ULTI- 
PLICATION—DIVISION. 80(100) minutes. 

f) GRADE 5, PART 2, FRACTIONS (ADDITION—SUBTRAC- 
TION). 90(110) minutes. 

9) GRADE 6, PART I, ADDITION—SUBTRACTION—MULTI- 
PLICATION—DIVISION. 60(75) minutes. 

h ) GRADE 6, PART 2, FRACTIONS—DECIMALS. 75(95) 
minutes. 


Leo J. Brueckner, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota, These tests are designed to measure end- 
of-the-year achievement and to diagnose weak- 
ness in the arithmetic fundamentals as outlined 
in the program of studies for the public schools 
of Ontario, Canada. Each test contains only the 
fundamental operations that are new for the 
grade; consequently, the test for a particular 
grade will not reveal weaknesses in the skills 
taught in earlier grades. 

Scores for each subtest (total test scores are 
not used) are expressed as percentages of pos- 
sible scores. The percentages are plotted on a 
cover-page profile on which the sth, 25th, and 
5oth percentiles are indicated; no other norms 
are presented. The last page of each test book- 
let presents diagnostic analysis charts for use 
in locating and analyzing the errors of the pupil. 

Since the same general testing procedure is 
used in all tests, a description of the addition 
subtests will be sufficient. The grade 2 test in 
addition contains three parts. The first part con- 
tains 64 addition facts (sums of 10 or less) 
which are presented orally by the examiner in 
order to reduce counting to a minimum. This 
oral presentation is sound if the pacing does not 
cause pupils to lose their places on the test 
papers. The second part includes 70 examples 
in vertical decade addition within the same dec- 
ade with sums under 100; oral pacing is again 
used. The third part consists of 60 examples in 
column addition with no carrying and no sum 
of a single column greater than ro. Single col- 
umns contain as many as five digits. No example 
contains more than two columns. Zero difficul- 
ties are included in the examples. The examples 
in these three parts are so constructed that each 
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addition fact is included five times. A plan for 
tabulating the combinations answered incor- 
rectly makes it possible to discover the specific 
facts that are not known. The manual states that 
errors on facts should be analyzed, including 
those in column addition examples. The latter 
procedure seems impossible unless the children 
work individually and give the steps in column 
addition aloud so that the examiner can record 
responses; this is, however, a group test. The 
test in addition therefore does not make possible 
the study of methods of work or of pupil under- 
standing of the process itself. In Parts 1 and 2 
the children work under time pressure, while in 
Part 3 ample time is allowed to complete the 
work, since the examples.in this part are not 
read aloud at a predetermined rate and all the 
work is done at the rate the pupil chooses. 

In the first part of the test for grade 3, the 
36 other addition facts are tested in the same 
manner. The second part includes 36 examples 
in higher decade addition involving bridging to 
the next decade. The third part includes long 
column addition examples. None of the skills 
or facts included in the test for grade 2 are 
included here, except for a few scattered facts. 
The intent of this test obviously is to test only 
new work for the grade. 

In grades 4, 5, and 6, the addition tests include 
more difficult examples. It is doubtful whether 
the diagnostic procedure which involves only the 
analysis of written work will result in a reliable 
diagnosis. The diagnostic value of the tests 
would be improved by including a section deal- 
ing with the steps taught in preceding grades. 
When serious deficiencies are revealed by test 
scores, the teachers should be encouraged to 
have the pupils do the work aloud so that their 
mental processes may be studied. Such a pro- 
cedure would yield diagnostic information of 
real value to the teacher. 

The procedures used to test knowledge of the 
facts for subtraction, multiplication, and divi- 
sion are similar to those used for addition. In 
the upper grades, the examples are long and 
difficult. The norms show that achievements are 
not at a high level; in fact, they are so low in 
some cases that the question might well be 
raised whether the examples are not too difficult. 

The tests are probably suitable for survey 
purposes, but the results for the upper grades 
will give no indication of specific weaknesses 
of individual pupils. The diagnostic value of 
the tests would be greatly improved if separate 
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tests were added on each of the major skills 
involved in the process. 

The diagnostic tests for addition and sub- 
traction of common and decimal fractions com- 
plete the series of tests. In grade 5, common 
fraction operations include like fractions and 
_unlike fractions in the same family, carrying and 
no carrying in addition, but no borrowing diffi- 
culties in subtraction. In grade 6 all types of 
examples in addition and subtraction of frac- 
tions, including borrowing, are tested. Very few 
examples involve mixed number types. No ex- 
amples in subtraction of fractions and mixed 
numbers from whole numbers are included. 
Many of the fractions include denominators that 
seldom if ever are used in daily life, such as 
2/15 and 13/16. The inclusion of these difficult 

i and socially useless examples is likely to require 
an undue amount of instructional time which 
could be spent more profitably on arithmetic of 
social value. The grade 6 tests include examples 
in addition and subtraction of decimals, Some 
of the examples involve ragged decimals ; since 
ragged decimals do not appear in social situa- 
tions, they should not be included in tests of 
essentials, 

The general approach to diagnosis used in 
these tests is of undoubted value. Moreover, the 
testing procedures are ingenious and suggestive. 
The use of three levels of standards is new and 
helpful. The tests have been carefully con- 
structed. Every effort has been made to make 
the results useful and meaningful. 


[ 342 ] 
[Essential Arithmetic Tests.] Ages 7-11 or 7-12; 
1945; 2 parts; Forms A, B; 5d. per test; 25-49 copies, 
4d. each; 1s. od. per manual ; 30(35) minutes per test; 
Fred J. Schonell ; Oliver & Boyd Ltd. 
$ ESSENTIAL MECHANICAL ARITHMETIC TEST. 
b) ESSENTIAL PROBLEM ARITHMETIC TEST. 


John Cohen, Lecturer, University of Leeds, 
Leeds, England. The problem test is homo- 
geneous throughout and consists of simple 
arithmetical problems ; the mechanical test re- 
quires simple mechanical operation of the four 
rules. The tests aim to measure arithmetical at- 
tainments in the primary school. They are stand- 
ardized on 2,100 pupils between 6 and 14+, with 
equal numbers of both sexes, and give results 
in terms of an arithmetic age, yielding an arith- 
metic quotient. The author suggests that in a 
four-year period, Forms A and B respectively 
should each be used twice in alternate years. 
Reliability coefficients range from .92 to .94. 


The tests are.not merely of use for routine 
record purposes. They are also of service in 
backward classes, and in transfers from class 
to class and school to school. 


Stephen Wiseman, Senior Lecturer in Edu- 
cational Psychology, University of Manchester, 
Manchester, England. These tests were designed 
for use with the age range from 7+ to ir 
(that is, in the English primary school), and in 
the handbook special emphasis is laid on their 
value when used in conjunction with record 
cards. The time limit of 30 minutes for each 
test was chosen so that it "would not fatigue 
pupils between the ages of 7 and II years." 
While agreeing with the desirability of such a 
criterion, one cannot avoid regretting the re- 
striction in sampling imposed by the resultant 
length of 50 items for each test. The effect of 
"fatigue" with an attainment test of this type 
operates differently at the two ends of the age 
range, since the test must contain many items 
which have not been taught at the 7+ level. 
Thus, if the test were lengthened, it could be 
designed so that "fatigue" of the younger pupils 
did not affect their scores on the relevant part 
of the test. With the present limitation in length, 
the norms show that for the mechanical arith- 
metic test an increase of I point in score corre- 
sponds to an average increase in age of approxi- 
mately 214 months; for the problems test it is 
nearly 3 months. The restriction on sampling 
is demonstrated by the fact that in mechanical 
arithmetic only one item on division of fractions 
is included (345 + 56). 

This criticism of length is the only major one 
to be made, and therefore appears more em- 
phatic than it would have if the tests had not 
been so satisfactory in other respects. There is 
no doubt that the tests will form a valuable addi- 
tion to the measuring devices available to the 
junior school teacher. The name of the author 
is almost in itself a sufficient guarantee that the 
items are soundly constructed and that they con- 
form to good teaching practice. The alternative 
forms are closely parallel—indeed the norms 
show them to be virtually identical—and teach- 
ers will find practical advice in the handbook 
on the ways in which the two forms may be 
used. Another valuable inclusion in the hand- 
book is information about the dependence of 
success in “problems” upon verbal ability, to- 
gether with the recommendation that the prob- 
lems test should not be used with children 
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having a reading age of less-than 8 years. 

Standardisation was carried out on “a group 
of 2,100 boys and girls (approximately equal 
numbers of each) and the "limitation of num- 
bers" is explicitly acknowledged in the hand- 
book. A larger-scale standardisation is planned. 
Wartime difficulties no doubt explain the small- 
ness of the sample; this was "compensated to 
some extent" by a "careful consideration of the 
pupils chosen." The familiar (English) “cream- 
ing" at 11+ has been taken care of by including 
the correct proportion of grammar school chil- 
dren in the higher age groups. Perhaps the 
handbook might have commented on the diffi- 
culty of achieving satisfactory norms on an 
attainment test in wartime even if a large sample 
is used. 

Reliability coefficients (test-retest with a 3- 
month interval) are reported in the handbook 
as ranging from .92 to .94. Curiously, no correla- 
tion is reported between parallel forms. 


[ 343 ] 

XExamination in Advanced Arithmetic—High- 
School Level. Grades 8-9; 1944; IBM; Form AdAr- 
1-B-4 (usually called Form B); separate answer 
sheets must be used; $1.75 per 25; 40¢ per 25 machine- 
scorable answer sheets; 15¢ per scoring key; 25 per 
Specimen set, postpaid; 120(125) minutes; prepared 
by the Examinations Staff of the United States Armed 
Forces Institute; published by the American Council 
on Education ; distributed by Cooperative Test Serv- 
ice. (Also distributed by Science Research Associates : 
$2 per 25; 65¢ per 25 machine-scorable answer sheets ; 
50¢ per key; 50¢ per specimen set.) 


Monica M. Hoye, Supervisor of Junior High 
AY chools, Providence, Rhode Island. This 120- 
minute test consists of 67 multiple-choice items 
arranged in four parts: Section I (23 items), 
basic arithmetical vocabulary, understanding of 
variously stated equivalent expressions, and 
knowledge of numeration; Section II (14 
items), computation with common fractions; 
Section III (19 items), computation with deci- 
mals and understanding of percentage; and 
Section IV (11 items), verbal problems. 

Section I requires no computation. The use 
of the multiple-choice items for testing vocabu- 
lary and understanding of arithmetical concepts 
and quantitative relationships is less satisfactory 
than completion items. The former procedure 
frequently makes possible the selection of the 
correct answer by suggestion or by elimination. 

The items on common fractions do not ade- 
quately sample the various processes. Emphasis 
is on the complicated phases of computing, and 
the abstract computation is frequently beyond 


practical application. There are but two sub- 
traction examples, both of the most difficult type 
and one with a common denominator of 56 and 
an answer of I 39/56 which could have little 
interpretative value. The two multiplication ex- 
amples are of the same kind, finding the product 
of 3 amounts, and important and much more 
common types of multiplication involving frac- 
tions are not represented. 

Obsolete and artificial examples are included 
in the fraction test. Four out of 13 examples 
deal with the reduction of complex fractions, at 
least 2 of which are extremely artificial and of 
manipulative value only. The test would be im- 
proved by omitting these examples and substi- 
tuting a more adequate sampling of subtraction 
and multiplication of fractions. 

There are significant weaknesses in the test 
of fundamental operations with decimal frac- 
tions. Addition of “ragged” decimals appears, 
finding the product of two decimals is the only 
type of multiplication given, and division of a 
whole by a decimal or a mixed decimal—the 
most commonly used as well as the most com- 
monly failed illustration of division of decimals 
—is omitted. The test of percentage is poorly 
graded, but examples are chosen to test under- 
standing of the principles of percentage as well 
as ability to manipulate with per cents, A notable 
omission is percentages less than one. 

There is no grading of the problem test, but 
the problem situations are practical and illus- 
trate the application of principles involved in 
area, volume, percentage, ratio and proportion, 
and distance-rate-time relationships. Correct 
solution of these problems demands critical 
thinking in the use of arithmetical knowledge 
and principles. 

The problem test considered as a measure of 
the functioning of worthwhile outcomes of the 
teaching of arithmetic is disappointing. The 
soth percentile, based on 1,262 cases for Form 
A, lies between 3 and 4 correct answers; the 
raw score of 3 falls at the 39th percentile and 
the raw score of 4, at the 61st percentile. Since 
more than 60 per cent of cases fall below a score 
of 4 (out of a possible 11), either the test is dis- 
couragingly difficult (which does not seem to 
this reviewer to be true), or major objectives 
of instruction are not being attained. The show- 
ing is even poorer for Form B: 60 per cent of 
1,270 cases fall below a score of 3. 

These tests give a measure of achievement in 
general and may be useful in determining the 
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classification of large groups of students. The 
sampling of items is too limited to make the 
tests adapted for diagnostic purposes, although 
achievement in the various sections of the test 
has some diagnostic significance. 


F. Lynwood Wren, Julia A. Sears Profes- 
sor of Mathematics, George Peabody College 
for Teachers, Nashville, Tennessee. This is a 
test designed to measure achievement in arith- 
metic at the high school level. It is adapted for 
civilian as well as military uses. There is every 
evidence that extreme care was taken in the 
determination and preparation of the test items. 
The procedure followed was briefly: (a) the 
determination of objectives through the cooper- 


. ative effort of examiners, specialists, and 


teachers; (b) the specification and collection 
of sufficient exercises testing each of the ob- 
jectives to provide two or more forms of the 
test; (c) tryouts of the selected exercises and 
subsequent elimination of undesirable items ; 
(d) submission of revised forms to a compe- 
tent jury for a check of “the tests for compre- 
hensiveness of coverage in both content and 
objectives, for accuracy of material, and for 
validity of exercises.” Such care would seem 
to give the test a rather high degree of validity. 
No statement is made concerning the reliability 
of the test. 

Tentative percentile norms are given for 
part and total scores, with an expressed prom- 
ise that they “will be improved as data are 
received from the high schools . . . using the 
civilian forms.” 

The test makes an effort to go beyond the 
mere memory of facts. Its sixty-seven exercises 
are distributed among four sections. One of 
the four sections is devoted entirely to verbal 
problems. The remaining three sections give a 
major emphasis to the fundamental skills with 
common and decimal fractions and per cents. 
There are no exercises designed specifically for 
testing the fundamental skills with whole num- 
bers. The only measures for such skills are 
those implied in the verbal problems and in the 
exercises designed for testing the correspond- 
ing skills with common and decimal fractions. 
While the major emphasis of Section I is on 
basic understandings, there is very little atten- 
tion paid throughout the test to arithmetical 
appreciations. All exercises are of the multiple- 
choice type. The specified time limit of 2 hours 
seems to be ample and should not interfere 


with the effectiveness of the test as a measur- 
ing instrument. ` 

"There is one error in the scoring key. Exer- 
cise 38 of Section III calls for the sum of 
0.045 + 0.0407 + 0.286, and the scoring key 
calls for the answer 0.738. This would be the 
correct answer if the exercise called for the 
sum of 0.045 + 0.407 + 0.286, a fact which 
would seem to indicate that the error is due to 
a misprint in the test. 


[ 344 ] 
*Hundred-Problem Arithmetic Test. Grades 7-12; 
1926-44; revision of Schorling-Clark-Potter Arithme- 
tic Test; Forms V, W ; $1 per 25; 35¢ per specimen set, 
postpaid ; 40(45) minutes ; Raleigh Schorling, John R. 
Clark and Mary A. Potter ; World Book Co. 


REFERENCES 
1. Woopy, Crtrrorp. The Sophomore and Freshman Testing 
Program in the Accredited High Schools of Michigan, 1939. 
University of Michigan, School of Education, Bureau of Educa- 
tional Reference and Research, Bulletin No. 152. Ann Arbor, 
Mich.: the School, September 1, 1939. Pp. viii, 160. Paper, 
0.75. 


Z5 Cewer, Braxe. “A School of Nursing Selection Program.” 
J Appl Psychol 27:452-7 O '43. * (PA 18:281) 

William Betz, Specialist in Mathematics, 
Public Schools, Rochester, New York. The 
100 items of the test are arranged as follows: 
addition, 10 items; subtraction, 10 items; 
multiplication, 15 items; division, 15 items; 
fractions, decimals, and percentages, 50 items. 
The items appear to have been carefully se= 
lected. Technicalities are avoided; only basic 
techniques are stressed. Hence the test as a 
whole may be regarded as a satisfactory instru- 
ment of evaluation. On the basis of the pub- 
lished median grade percentiles, one may won- 
der, however, whether the time allowance of 
40 minutes is really adequate for the average 
learner. 


Clearing House 19:264 D '44. E(arl) R. 
G(abler). * Because of its thorough standardi- 
zation and practical content it is valuable in 
pupil placement and diagnosis. * 


[ 345 ] 
X*Lee-Clark Arithmetic Fundamentals Surve: 
Test: High School Edition. Grades 8-12; di 
Forms A, B; 75¢ per 25; 25¢ per specimen set; 25(30) 
minutes; J. Murray Lee and Willis W. Clark; Calitor- 
nia Test Bureau. 


Monica M. Hoye, Supervisor of Junior High 
Schools, Providence, Rhode Island. This test 
is an analytical survey test of fundamental 
skills in arithmetic designed for high school 
pupils. It is a four-page booklet and covers 20 
different basic processes in arithmetic, whith 
are organized in pairs with 2 examples in each 
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process, making 40 examples in all. This or- 
ganization is in accordance with the Lee Main- 
tenance Drills and Tests in Arithmetic which 
are recommended for use by pupils receiving 
low scores on the test. The test is prepared for 
individual or class use and is hand scored. 

The examination covers fundamental opera- 
tions with whole numbers, common fractions, 
decimal fractions, and denominate numbers as 
well as changing common and decimal fractions 
and per cents to equivalent expressions, and 
two cases of percentage. It is claimed that high 
validity for the test was obtained by careful 
analysis of essential or basic processes required 
for computation in the fundamental operations 
in arithmetic. An average reliability for the 
test of .925 is given for grades 9-12. 

The emphasis of the test is wholly on me- 
chanical skill in the four fundamental processes, 
and no attempt is made to measure vocabulary, 
understanding of arithmetical concepts and re- 
lationships, or ability to apply principles and 
reasoning to verbal problems. Sampling usually 
is adequate for diagnosis of areas of pupil 
weakness and is reasonable with the exception 
of examples in computing with denominate 
numbers. The test would be improved by omit- 
ting examples involving operations with de- 
nominate numbers and substituting a more 
adequate coverage of subtraction, multiplica- 
tion, and division of common fractions and 
including division of a whole number by a deci- 
mal and the third case of percentage. 

The format of the test is good, items are 
well placed to allow work space, and directions 
are clear and concise. The test is simple to 
administer, score, and interpret. 

A single sheet headed Directions and Norms 
covers a brief discussion of the nature of the 
tests and their validity and reliability, includes 
directions for giving and scoring the test and 
interpreting results, and provides tentative 
norms, a table of percentile rank of norms, 
and answer keys. Also included with the test 
booklets is a percentile distribution record sheet 
for study of class results, which is in duplicate 
form for the purpose of reporting results to 
the California Test Bureau. 

Users of the test receive a copy of a sum- 
mary of data for 107 high schools in 25 states 
Which have used these tests. This includes a 
presentation of the data for over 13,000 stu- 
dents in grades 9, ro, rr, and r2 distributed 
to show the number of examples correctly 


solved and the corresponding percentiles. The 
summary also gives a rank-order distribution 
of median scores and equivalent percentile 
ranks achieved by individual high schools by 
grades. In the composite distribution of data 
for all schools and grades presented in Table 
I, the median score achieved for each grade 
was at the 6oth percentile indicating that the 
norms established by the original normative 
group were low. The presentation of the rank 
order of median scores by grades in Table II 
indicates that the greatest variations in medians 
in schools was found in the eleventh grade. 
From the data presented two conclusions 
seem to be: (a) high school attendance pro- 
vides no assurance of adequate mastery of 
arithmetic fundamentals, and (b) the median 
achievement of high school grade groups shows 
very slight variations. This may point to the 
need for testing achievement in fundamental 
computational skills in arithmetic of pupils in 
the upper high school grades and taking ad- 
vantage of a final opportunity for diagnosis of 
weaknesses and provision of remedial measures. 


F. Lynwood Wren, Julia A. Sears Profes- 
sor of Mathematics, George Peabody College 
for Teachers, Nashville, Tennessee. This “test 
is designed to provide an analysis of high 
school students’ difficulties in the fundamental 
processes of arithmetic and to guide remedial 
teaching." It is definitely restricted to a survey- 
of fundamental skills and no attention is paid 
to verbal problems, arithmetical meanings, or 
the social significance of arithmetic, Forms A 
and B are provided, and each form contains 
two problems in each of "twenty different basic 
processes of arithmetic." The split-half method 
was used in determining the reliability which 
"averages .925" for each grade, High validity 
is claimed to result for the test from "an 
analysis of essential or basic processes required 
for computation in the fundamental processes 
of addition, subtraction, multiplication, and 
division of whole numbers, fractions, deci- 
mals, or per cents." Tentative midterm percen- 
tile norms are provided for both forms “based 
on over four thousand tests distributed equally 
in grades 9, 10, 11, 12.” Subsequent data based 
on over 13,000 students in these same grades 
seem to show higher norms than those origi- 
nally established. “It is not known whether 
data submitted by the users tend to represent 
those districts which have a relatively favor- 
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. able outcome, or whether the original norms 
were slightly low." To the reviewer these facts 
seem to support the statement that probably 
the norms were established on data that was 
‘entirely inadequate. 

The test is timed for 25 minutes; a fact 
which would seem to affect very materially its 
value as a "guide to remedial teaching." To 
work 40 exercises in 25 minutes, and record 
the answers in a separate column where one 
must be careful to follow a specified code 
seems to place so much emphasis on speed that 
the efficiency of the test in detecting areas 
needing remedial teaching would be greatly 
diminished. 

Exercises 7a, 7b, and 17a of Form A are 
stated in symbols designating units of apothe- 
caries' weight, while the answers given indi- 
cate that units of avoirdupois weight were 
used. Inconsistencies occur in notation such as 
19 yds. and 6 in. These errors have been cor- 
rected in Form B, except that inconsistencies 
of notation occur in the answer sheets to both 
forms. In the work with decimals no attention 
is given to computation with approximate data. 
As a result purely manipulative answers, which 
are frequently meaningless, are given as cor- 
rect; for example, 42.0 X .084 — 3.5280. Simi- 
larly, in division the answer for the exercise 
84)3321 is given as 39 45r., or 39 15/28, or 
39.54. The student working this exercise would 
be wise to stop with the first answer. If he 
should attempt to give the remainder in frac- 
tional form, he would first encounter 39 45/84 
which would be scored as incorrect since the 
fraction is not reduced to its lowest terms, and 
this is not one of the simplest of fractions to 
reduce to lowest terms. If he should choose to 
give his answer in decimal form he must guess, 
for no guidance is given, as to where the au- 
thors desired the division to stop in the non- 
terminating decimal occurring as a quotient. As 
an answer to the question “4 is what per cent 
of 5?" either 80 or 8o per cent is accepted as 
correct, while as an answer to “What per cent 
of 8 is 2?" only 25 per cent is accepted as cor- 
rect. Similarly, 2.50 is given as the only correct 
answer to the exercise "Change 250 per cent 
to a decimal,” while 3.5 or 3%4 and not 3.50 
are given as acceptable answers to “Find 1 per 
cent of 350." 

The above exercises are taken from Form A, 
but similar cases are to be found in Form B. 
It seems highly desirable that all such incon- 


sistencies and inaccuracies be cleared up before 
the test can become very effective as a measur- 
ing instrument. 


[ 345a] SOME 
Metropolitan Achievement Tests [Arit metic]. 
Grades 3-4, 5-6, 7-9.5; 1933-47; Forms R, S; $1.40 per 
25; 35¢ per specimen set of any one level; Richard D. 
Allen, Harold H. Bixler, William L. Connor, and 
Frederick B. Graham; World Book Co. 
d) ELEMENTARY ARITHMETIC TEST. Grades 3-4; 65(75) 
minutes. 
b) INTERMEDIATE ARITHMETIC TEST. Grades 5-6; 80(90) 
minutes. 
C) ADVANCED ARITHMETIC TEST. Grades 7-9.5; 80(90) 
minutes. 


For reviews by Foster E. Grossnickle, Peter 
L. Spencer, Harry Grove Wheat, and Guy M. 
Wilson of an earlier edition, see 38:892 and 
40:1458.1. For reviews by Jack W. Dunlap, 
Richard Ledgerwood, Charles W. Odell, E. V. d 
Pullias, and Hugh B. Wood of an earlier edi- 
tion of the total battery, see 38:874 and 
40:1189. 


[ 346 ] 
Moray House Arithmetic Test. Ages 10-12.5; 1935- 
38; Tests 2, 7, 9; withdrawn from publication in 1947 ; 
Test 2, 37(42) minutes; Tests 7 and 9, 30(35) min- 
utes; Godfrey H. Thomson, W. G. Emmett, and A. S. 
Mowat; University of London Press, Ltd. 


John Cohen, Lecturer, University of Leeds, 
Leeds, England. Test 2 was prepared in re- 
sponse to a request from education authori- 
ties for a test to use at the age of 11+, chiefly 
for the award of scholarships. Norms are 
based on the results of testing entire age 
groups (126-137 months) in two areas and, in 
addition, a group of scholarship candidates 
aged 11+—in all, 4,980 children. The author 
admits that the norms suffer from two defects : 
first, the restricted age limits of the groups on 
which the norms are based; second, the com- 
petitive conditions under which the testing was 
held, Nevertheless, we may well share his con- 
fidence in the value of the test through the 
range from 120 to 150 months. 

A convenient “Ready Reckoner” is provided 
in the manual for converting raw into standard 
scores. A special supplementary table is also 
provided for borderline or doubtful scores. The 
standard scores allow for age and have a mean 
of 100 and standard deviation of 15. 

Test 9 differs from Test 2 in that it is di- 
vided into two sections with “mixed” items, 
instead of into homogeneous subtests. Section 
I contains 42 items on addition, subtraction, 
multiplication and division in random order; 
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. Section 2 contains 60 questions on arithmetical 
“reasoning. The manual is crystal clear and suc- 


ceeds in making the administration of the test 
practically foolproof. 

This test was constructed in 1936 to form 
part of the examinations held for entry to sec- 
ondary schools at age eleven. Norms are based 
on'scores of nearly 17,000 children forming 
complete age groups, ranging from IO years 5 


«months to 11 years 7 months, in four districts. 


Conversion tables are provided in the man- 
ual for calculating arithmetic quotients (not 
standard scores as in Test 2). In this, as in 
other Moray House tests, the quotients have 
a mean value set at 100 with a standard devia- 
tion of 15 points, so as to agree with Binet 
IO's. This would seem to require revision in 
view of the fact that the scatter of IQ's based 
on the Revised Stanford-Binet Scale is 16 
points or more. Like the other Moray House 
tests, this one is among the best of its type, 
soundly constructed, with high reliability and 
predictive value. 


[347 ] 
*Number Fact Check Sheet. Grades 5-8; 1946-47 ; 
IBM; Forms A, B; 75¢ per 25 machine-scorable test- 
answer sheets; 60¢ per set of stencils for scoring an- 
swer sheets ; 25¢ per specimen set, postpaid; nontimed 
(30-35) minutes; Roy Cochrane; California Test 
Bureau. 

[ 348 ] 
*Oral Diagnostic Test in Addition: Analysis of 
Errors in Addition: Dominion Tests. Grades 4-6; 
1945; individual; 1 form; 65 per 25; 10 per specimen 
set; (10-20) minutes; prepared by the Department of 
Educational Research, College of Education, Univer- 
sity of Toronto; Vocational Guidance Centre, Uni- 
versity of Toronto. 


Leo J. Brueckner, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota, This is an oral test for use with individ- 
ual pupils in grades 4, 5 and 6 who are weak 
in addition. It is designed to assist the teacher 
to obtain a detailed diagnosis of a pupil's spe- 
cific weaknesses in this operation. It supple- 
ments the Dominion Diagnostic Tests in Arith- 
metic Fundamentals. The pupil’s work sheet 
contains 12 addition examples, all involving 
column addition. The pupil works the exam- 
ples aloud. The teacher on a record sheet 
records the responses and errors made by the 
pupil. Spaces are provided for each step in 
each example. On the opposite page in the rec- 
ord blank, a check list of possible errors is 
given which the examiner can use to make an 
analysis of the pupil’s work. On this basis a 


diagnosis of the pupil’s deficiencies is then 


made. This general procedure is an excellent 
technique. The test would probably be im- 
proved if the examples covered a wider range 
of difficulties, including some of the more basic 
steps with easy operations. Similar tests for the 
other number processes might well be added to 
the series. 

[ 349 ] 
X*Renfrow Survey Tests of Mathematical Skills 
and Concepts. Grades 3-4, 5-6, 7-8; 1941; Forms A, 
B; $1 per 25; so¢ per specimen set, postpaid; (60) 
minutes; Omer W. Renfrow; C. A. Gregory Co. 
a) TEST I. Grades 3-4. 


b) test 2. Grades 5-6. 
c) TEST 3. Grades 7-8. 


C. L. Thiele, Divisional Director, Exact Sci- 
ences, Public Schools, Detroit, Michigan. The 
battery contains six tests with two forms for 
each two grades of the elementary school be- 
ginning with the third grade. Each test is com- 
posed of seven parts: (a) addition, (b) sub- 
traction, (c) multiplication, (d) division, (e) 
‘fractions, (f) denominate numbers, and (9g) 
concepts and background. One hundred and 
forty test items are divided equally among the 
seven parts of a test. For each part a separate 
time allotment is given. According to the au- 
thors, "each part of the test is suitable for 
survey and diagnostic purposes." To that end 
median scores are offered for the parts of the 
tests as well as interpolated grade-score equiva- 
lents for the whole test. The authors also state 
that “The content is based on all leading arith- 
metic series used in the United States." In that 
no copyright date appears on the tests, that 
statement has little significance. 

Although elaborate statistical devices were 
employed in the preparation of the tests, that 
alone does not guarantee their usefulness. That 
depends upon the way in which the tests fit 
the demands of present-day schools. What is 
basic in the teaching of arithmetic in the schools 
of today, therefore, should be measured in such 
a way that teachers can make a direct applica- 
tion of test findings to their teaching. The 
validity of the tests in terms of present-day 
arithmetic is questioned especially with refer- 
ence to the parts dealing with whole-number 
and decimal-fraction processes. At no point is 
a knowledge of place value tested directly. The 
section dealing with denominate numbers con- 
tains algorisms entirely out of date; conse- 
quently this part would be of little value to 
teachers in modern schools. Except for one 


te 
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section in each test devoted to concepts and 
background, verbal problems have been entirely 
omitted with the result that the social values 
of arithmetic are not measured by the test. 
Also omitted from the tests are problems which 
would indicate a sense of number relation- 
ship. In light of the above, it would be diffi- 
cult to substantiate the claim that "individual 
and class achievement in general mathematical 
achievement" can be determined by the tests. 

Another apparent weakness of the tests can 
be traced to an effort on the part of the authors 
to crowd too many problems on each page. As 
a result, there is insufficient room in some cases 
for the pupils to do all of their work on the 
test sheets, which they are instructed to do. In 
order to save space in the section devoted to 
fractions, the horizontal arrangement was used. 
Children familiar with the vertical form will 
find this confusing. 

What appears to be the best feature of the 

ts is the section entitled Concepts and Back- 
ground. Although many of the test items in 
this section could be labeled “definition,” there 
are problems which appear to test understand- 
ing and judgment. Better textbooks and courses 
of study stress these values. 

One conclusion seems to be in order with re- 
gard to these tests—they might with profit be 
brought up to date. 


[ 350 ] 
Schonell Diagnostic Arithmetic Tests. Ages 7-13; 
1936-46 ; 7d. per test ; 25-49 copies, 6d. each ; 45. 6d. per 
copy of Schonell's Diagnosis of Individual Difficulties 
in Arithmetic, Second Edition, which serves as the 
test manual; Fred J. Schonell ; Oliver & Boyd Ltd. 
REFERENCES 

LIEU uio T is of Individual 

i. SCHONELI E ivi ij ies i 
A kere, Second. Eaton "edel Id 3 Scotland Ole, k 


Boyd, Ltd., 1942, Pp. xi 115. 4s. 6d * : 
Clarke, Irwin &'Go., Ledy "1S 4^ 6% * (Toronto, Canada: 


For a review by C. Ebblewhite Smith, see 
40:1461. 


[ 351 ] 
XSchrammel-Otterstrom Arithmetic Test. Grades 
4-6, 7-8; 1945; Forms A, B; 2 levels; 9of per 25, post- 
paid; 20€ per specimen set, postpaid; 50(55) minutes ; 
H. E. Schrammel and Ruth E. tterstrom; Bureau of 
Educational Measurements, Kansas State Teachers 
College of Emporia. 

a) TEST 1I. Grades 4-6. 
b) rest 111, Grades 7-8. 


William A. Brownell, C. hairman, Department 
of Education, Duke University, Durham, North 
Carolina. "Test II, in two alternative forms, 
is intendi for grades 4, 5, and 6 and in- 


cludes 40 items under the heading, Computa- 
tion; 25 items under the heading, Comprehen- 
sion; and 30 Problems, for a total of 95 items. 
Test III, also in two forms, is for grades 7 and 
8 and includes 30, 24, and 24 items, respec- 
tively, under the headings just mentioned, for 
a total of 78 items. Testing, or work time, re- 
quires 50 minutes. l 

The test content is based upon an analysis 
of ten recently published textbook series and 
of various courses of study. The tests in their 
present form are a revision of an earlier test 
(1939), unpublished except as used in the 
Nationwide Every Pupil Scholarship Test. 
Norms were computed from 19,152 pupil 
papers, collected from more than 300 schools 
in 32 states, and are expressed in the manual 
in percentiles (midyear and end-of-year test- 
ing), grade equivalents, and school mark trans- 
lations. 

Reliability coefficients average .90 for total 
scores and .79 for part scores. The coefficients 
for the comprehension sections are as low as 
.56 for Test III and .66 for Test II. All coeffi- 
cients, except four of the 21 reported, were 
obtained with samples of pupils numbering 
fewer than 100, and fifteen from populations 
smaller than 45. 

The examples set for computation are ade- 
quately representative of the grades for which 
they are intended. The algorisms in a few in- 
stances are not those in common use. The items 
in the comprehension sections represent neces- 
sarily limited samples of the many which 
might be included (hence, the low reliability). 
The problems are well selected and cover many 
common uses of arithmetic. 

The tests are easily administered, and direc- 
tions are simple. Each test covers four pages, 
both sides of a single-sheet booklet. The scor- 
ing key includes optional methods of writing 
answers which are acceptable. The appearance 
of the test is not entirely satisfactory. The 
whole suggests crowding, especially in the sec- 
tions devoted to comprehension. 

For each problem in Test III, four possible 
answers are provided. The pupil solves the 
problems in the margin of the test sheet or on 
extra paper and then selects the answer which 
he considers correct. While this procedure has 
advantages from the standpoint of objectivity, 
it can easily lead to invalidity. Pupils could 
select answers without solving problems; and 
if their work is slightly inaccurate, they may 
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be given credit for correct solutions if they 
choose the alternatives nearest their answers. 

The authors’ claim that the tests’ “chief val- 
ues . . . lie in their diagnosis of individual 
strengths and deficiencies" is hardly tenable. 
Moreover, it is to be doubted that use of the 
test scores for "checking the efficiency of in- 
struction” and for “determining school marks" 
is warranted. The main merit of the test seems 
to be in connection with the survey function 
of measurement. For this purpose the test is 
acceptable when but a short period of testing 
time is available. 


[ 352 ] 
Southend Attainment Test in Mechanical Arith- 
metic, Ages 6-7, 7-8, 8-9, 9-10, 10-11, 11-12, 12-13, 
13-14; 1939; 1 form; 8 levels ; 3s. 2d. per 50 of any one 
level; rod. per manual ; rs. 7d. per specimen set; M. E. 
Hill; George G. Harrap & Co. Ltd. 


REFERENCES 

1. Hitt, M. E. The Education of Backward Children. Intro- 
ductory Sp by H. Boyes Watson, London: 
rap & Co. Ltd., 1939. Pp. 174. 6s. * (PA 14:3735) 

Stephen Wiseman, Senior Lecturer in Edu- 
cational Psychology, University of Manchester, 
Manchester, England. This test is designed on 
"Binet lines" and has a separate test for each 
age group from 6-7 years up to 13-14 years. 
“The child's basic arithmetic age is determined 
by the first [?] year-group in which he can 
pass every test item.” Fractions of a year are 
credited for additional single items passed. 
Since there are differing numbers of items for 
each year-group this produces some anomalies. 
For example, a child passing the item 870 + 11 
in the 9-10 test is credited with 249 of a year, 
but 905 — 6 in the 8-9 test adds only 149 of 
a year to his score. Perhaps more glaring ex- 


amples are 3% X 334 +3% (Mo year) in. 


year 12-13, and 8% — 415 — 34 (14 year!) in 
year 13-14. The inclusion of only three items 
for the 13-14 level, two of which are "prob- 
lems" involving percentages, is obviously open 
to criticism. 

The content of the tests is uneven in empha- 
Sis. Year 7-8 contains 7 subtraction items and 
3 multiplication items only. This particular 
point is commented on in the test author's Edu- 
cation of Backward Children (1) (which con- 
tains more information on the test than does 
the Handbook) : "there are six [?] subtraction 
sums in the 7-8 year, but these, though ap- 
pearing to test the same process, are graded 
in difficulty between 7 years 4 months and 8 
years 1 month." (My italics). Again, the in- 


rge G. Har- 


clusion of a high proportion of verbal problems 
in the years 12-13 and 13-14 is of doubtful 
validity in a “mechanical” arithmetic test which 
claims to measure “agility with figures.” 

The test was standardised on the complete 
elementary school population of Southend-on- 
Sea. In a communication to the writer the 
author gives the total number as approximately 
11,000. No children from grammar schools 
were included, so that the population above 11 
years did not include children in the upper 
ability levels. Thus each age group from 6 to 
II contained about 1,500 children, but the three 
upper age groups were only about 1,200 each. 
The handbook ought to include this infor- 
mation. 

No reliability coefficients are quoted in the 
handbook. The author has supplied the follow- 
ing figures for odd-even reliability coefficients 
for age-levels 8, 11, and 13 years respectively: 
-94, .87, and .9r. 

Instructions for administering the test afe 
very meagre, and one would expect a lack of 
uniformity of method as a result. The 6-7 year 
test, in particular, is virtually self-administer- 
ing, in the form of 4 + 2 = ——. This is open 
to much criticism. 

The test has not been tried out by the writer 
in competition with other tests of a similar 
nature, but it seems to possess no obvious ad- 
vantages over, say, the tests by Burt or Schonell. 


[ 353 ] 
Test on Arithmetic Meanings and Vocabulary. 
Grades 2-3; 1937; 1 form; out of print; prepared under 
the supervision of G. M. Ruch; Scott, Foresman and 
Co. 


Foster E. Grossnickle, Professor of Mathe- 
matics and Head of the Department, State 
Teachers College, Jersey City, New Jersey. 
This test is divided into two parts. Part 1 con- 
tains 21 questions for pupils in the second 
grade; Part 2 contains 32 questions which are 
intended for pupils in the third grade. Both 
tests are entirely verbal. Part 1 is labeled 
“Arithmetic Readiness," while the other part 
is captioned “Basic Arithmetic Abilities.” The 
meaning conveyed by the term arithmetic readi- 
ness depends upon the interpretation the maker 
of the test may choose to give the concept. The 
test may indicate readiness for some phase of 
arithmetic, such as the basic facts in each of 
the four processes, or readiness for some other 
phase of the subject. A test which purports to 
measure readiness for a subject as broad as 
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arithmetic must employ shotgun techniques. 

About one-fifth of the test items in Part 1 
pertain to a picture showing a board fence such 
as may be seen on many farms. The boards are 
of different widths, and the pupil is asked to 
identify the boards by their width. It is doubt- 
ful if a test is valid which devotes so large a 
part of the total time to a type of question that 
is outside the experience of a large part of the 
pupils in urban areas, 

The percentile norms given with the test 
were determined from scores derived from 
over 2,000 pupils who took the test in experi- 
mental form. There were two groups of ex- 
perimental subjects. The one group used a text 
entitled Number Stories, while the other group 
did not use the text. There is a significant dif- 
ference between the two groups in favor of 
the users of the text. To the reviewer it seems 
that the test is more of an achievement test 

ed on the content of the particular textbook 

d than a test which measures readiness in 
some phase of arithmetic. In other words, the 
test is too limited in its sampling of concepts 
to be a measure of readiness in arithmetic. The 
structure of arithmetic is so complex that it is 
impossible to have a general readiness test for 
the subject. The test under review does not 
specify the type of readiness implied; hence, 
its use may well bè confined to those classes 
which use Number Stories. In this case the test 
is one to measure or to predict the achievement 
which accrues from using this particular text- 
book in beginning number work. 


CALCULUS 
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*Examination in Differential Calculus, College; 
1944; IBM; 1 form (usually called Form B) ; 3 parts; 
Separate answer sheets must be used; 25¢ per speci- 
men set, postpaid ; prepared by the Examinations Staff 
of the United States Armed Forces Institute; pub- 
lished by the American Council on Education ; distrib- 
uted by Cooperative Test Service. (Also distributed 
by Science Research Associates : $2 per 25 of any one 
part; 65¢ per 25 machine-scorable answer sheets for 
any one part; 50¢ per specimen set of any one part.) 
4) BooK 1. Form CDfC-2-B-4; $2 per 25; 40é per 25 
machine-scorable answer sheets; 1 5€ per scoring key; 
one answer sheet and one key will serve for Book I and 
either supplement; 135(140) minutes. 

) SUPPLEMENT I, INFINITE SERIES AND PARTIAL DERIVA- 
TIVES. $1.75 per 25; 45(50) minutes. 
€) SUPPLEMENT 2, INTEGRATION. $175 per 25; 45(50) 
minutes. 


REFERENCES 


1. Zant, James H. “The United States Armed Forces Insti- 
tute Examinations in Mathematics as a Means of Determining 
Cont in Engineering Mathematics.” J Eng Ed 36:170-4 N 
45. 


[ 355 ] 
xamination in Calculus II: Integral Calculus 
peers Initial Course Containing No Inte- 
gration). College; 1945; IBM; Form CIC-2-B-4 
(usually called Form B) ; separate answer sheets must 
be used; $3 per 25; 40¢ per 25 machine-scorable an- 
swer sheets; 15¢ per scoring key; 25¢ d SIE 
id; 180(1 minutes; prepared by the Ex- 
aan Staff y United States Armed Forces 
Institute; published by the American Council on 
Education; distributed by Cooperative Test Service. 
(Also distributed by Science Research Associates : $3 
per 25; 65¢ per 25 machine-scorable answer sheets ; 50¢ 

per key; 50¢ per specimen set.) 

REFERENCES 


1. Zant, James H. “The United States Armed Forces Insti 
tute Examinations in Mathematics as a Means of Determining 
Credit in Engineering Mathematics," J Eng Ed 36:170-4 N '45. 


John F. Randolph, Professor of Mathematics, 
Oberlin College, Oberlin, Ohio. If an examina- 
tion tests what it is supposed to test, as this 
examination does, maybe nothing more should 
be said, but I cannot refrain from saying more. 
I understand this examination was made to 
cover only the topics of integral calculus com- 
monly taught in those colleges which answered 
a widely distributed questionnaire. Some col- 
leges are therefore leaving out a definition of 
the definite integral as the limit of sums, the 
fundamental theorem of calculus; methods of 
approximating definite integrals; force of at- 
traction between a mass and a. particle; mean 
value theorems for integrals; use of reduction 
formulas; parametric equations in connection 
with integration ; spherical coordinates ; and all 
theory. On the other hand, if the omission of 
topics from the examination indicates that these 
topics are not taught, then we are making prog- 
ress, for the word “infinitesimal” is nowhere 
in sight. 

The examination is formal from illustrative 
example to the end. The best preparation is to 
work the routine problems (do not bother with 
the text) in the most notorious old calculus 
book on the market. A good score on the ex- 
amination indicates proficiency in working such 
problems, but does not reveal the slightest 
knowledge of calculus. Do not blame the ex- 
aminers, they are merely examining what is 
taught ; they have shown considerable ingenuity 
in making problems that can be machine- 


graded. 
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[ 356 ] 
XAbility for Geometry: Fife Tests of Ability, 
Test 4. Entrants to secondary schools (England and 
Scotland) ; 1947; 1 form; 55. 6d. per 25; 4d. per man- 


ual; 45(50) minutes; Frank M. Earle; University of 
London Press Ltd 
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x 3 REFERENCES 
I. EARLE, Frank M. Tests of Abilit 
Courses. Publications of the, Ko 
Education, No. 1o. 
Pp. HE 138. 55. T 

2. EARLE, FRANK M. Reconstruction in the S. 
London: University of London Press, 1944. Pp. rea hart 


for Secondary School 
sottish Council for Research in 
London: University of London Press, 1936. 


For a review by James Maxwell of the entire 
battery, see 8. 
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Cooperative Plane Geometry Test. High school: 
1941-43; IBM; Forms R, S, T; separate sane sheets 
need not be used; $1.75 per 25; 25¢ per specimen set, 
postpaid; 40£ per 25 machine-scorable answer sheets; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Emma Spaney, Margaret P. Martin (Form S), 
Tom A. Long, and L. P. Siceloff; Cooperative Test 
service, 


Harold P. Fawcett, Professor of Mathe- 
matics, The Ohio State University, Columbus, 
Ohio. [Review of Forms R, S, and T.] Even 
though the specific content of these Coopera- 
tive Tests “is based on surveys of curriculum 
outlines and bulletins, textbook analyses, and 
consultation with specialists in the different 
subject-matter fields,” there are few items in 
the three forms of this test which reflect mod- 
ern trends in the teaching of demonstrative 
geometry. The test does provide an instrument 
by which to measure the student’s familiarity 
with the facts of geometry and serves this pur- 
pose well, 

In the ro minutes allotted to Part I, the stu- 
dent is required to determine the truth or falsity 
of 30 geometric propositions. To score high on 
this section the student must be able to read 
rapidly and with comprehension. His responses 
must be rapid for any time spent in sketching 
and thoughtful study of a diagram will seri- 
ously limit his achievement. 

There are in Part II 20 incomplete state- 
ments, and the student is to complete these 
statements by selecting one of five suggested 
responses, The abilities tested are familiarity 
with definitions, the recall and application of 
certain geometric principles, computational 
skill, and the visualizing of loci. From the 
standpoint of geometric facts, the items are 
widely distributed and the sampling is good. 
Time limitations, however, will encourage re- 
call and memorization with little opportunity 
for the careful and thoughtful consideration 
which most of the items should receive. 

The time for Part III is 15 minutes, and by 
means of the 15 items in this section the stu- 
dent is tested for his ability to distinguish be- 
tween hypotheses and conclusions, to select the 


missing step in an argument, to organize the 
steps of a proof, to support statements with 
appropriate reasons, and to select converses. 
These are ideas related to thinking inside the 
field of geometry, and this is undoubtedly the 
strongest section of the test even though some 
of the geometric principles considered have 
little if any practical value. 

The teaching of demonstrative geometry has 
more to contribute to the education of a student 
than a knowledge of geometric facts, and it is 
unfortunate that the authors have made no 
effort to evaluate some of these larger out- 
comes. The test does provide an effective means 
of measuring the extent to which a student has 
mastered certain geometric principles, although 
time limitations of necessity make the sampling 
rather meager. The test appears to ignore un- 
derstandings associated with the nature of 
proof, the development of which is one of the 
major purposes for teaching demonstrative 
geometry. * 


C. O. Oakley, Professor of Mathematics, 
Haverford College, Haverford, Pennsylvania. 
[Review of Forms R, S, and T.] Form r. This 
examination adequately covers the material 
usually given in a course in plane geometry. It 
is well designed with respect to range and dif- 
ficulty. 

In Part I, in Item 27, the use of the word 
"side" might allow a student to contend that 
the answer is "no" by arguing that the center 
lies on the hypotenuse and not on a side. Item. 
29 could be confusing to the student, for is a 
square to be considered a rectangle or not? 
The wording in Item 30 should be changed, 
since a line drawn parallel to one side of a 
triangle might not even cut the sides of the 
triangle. 

In Part II, the multiple-choice type ques- 
tions, among the possible answers should be 
those that would be found if the problems were 
worked in an "obvious," but wrong, way. For 
example, the answer 15 should be among those 
offered in Item 11; and something involving 
v/2 in Item 15. Again few answers should be 
used that can be discarded at a glance, such as 
2624 in Item 2. The first sentence of Item 17 
might read: “Each of two right triangles has 
OE since 

In Part III, in Item 5, it might be clearer 
if the last sentence read: “The conclusion of 
the theorem states that... ." The wording of 
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Item 9 could be objected to on the grounds 
that "the simplest method of finding" is a psy- 
chological variable which is, in general, a func- 
tion of the individuality of the student. What 
may be simplest for one person may not be the 
simplest for another. The same remarks apply 
to the statement of Item I5. 

FORM S. This examination is better than 
Form R. The sample example at the beginning 
of Part II is worth while. There seems to be 
an adequate coverage of the subject material. 
Most of the problems are clearly stated and 
very searching. In Part I, some other problem 
should be given in place of Item 26. It is 
doubtful whether many students have ever seen 
any proof of the impossibility of trisecting a 
given angle with unmarked ruler and compass. 

In Part II, the second sentence might read: 
“If the ratio of the altitude to the base is 8 
in. . . ." In Item 11, the fifth answer can be 
discarded almost instantly by the student. Some 
Other number should be used. In Item 18, the 
fifth answer should be changed since it was 
given that the larger arc was 120°. The smaller 
could not be larger than 120°. 

In Part III, in Item 6, the last sentence 
should read: “With respect to these triangles 
the conclusion of the theorem. . . .” The fig- 
ures associated with Items 9 and 10 have some 
unnecessary letterings which might tend to 
confuse a student. 

ForM T, This test is very much better than 
either Form R or S. The three categories of 

. answers in Part I are a real improvement over 
the two-category answers that were offered in 
R and S. The sample question at the begin- 
ning of Part II should be of aid to the stu- 
dent in the questions that follow. 

In Part II, the fifth choice in Item 1 should 
be revised, As it stands it would be thrown out 
immediately by the student. 

In Part III, in Item 2, the last sentence 
should be changed to read: “The following are 
the first five steps in a proof. . . ." Item 7, in 
Part III, has the sentence “A simple method of 
finding the area of triangle ABC makes use 
of the fact that. . . .” The word “simple” 
should not be used. What seems simple for 
one person may not seem simple for another. 
There is no need of the word “simple” here. 
In Item 12 there are two figures although the 
wording refers to one figure. 


For reviews by Leroy H. Schnell, Charles 


C. Weidemann, and S. S. Wilks of earlier edi- 
tions, see 38:993 and 40:1467. 


[ 358 ] 
*Examination in Analytic Geometry. College; 
1944; IBM; Form CAG-2-B-4 (usually called Form 
B) ; separate answer sheets must be used; $2 per 25; 
40£ per 25 machine-scorable answer sheets; I5 per 
scoring key; 25€ per specimen set, postpaid; 180(190) 


i f ions Staff of the 
United Mec lid et iatitute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
scorable answer sheets; 50¢ per key ; 50¢ per specimen 
set. 

: REFERENCES 4 
vt Een agus Tr hg Vale Sua dns of Deiermining 
DN in Engineering Mathematics.” J Eng Ed 36:170-4 N 

C. O. Oakley, Professor of Mathematics, 
Haverford College, Haverford, Pennsylvania. 
This examination is just about as good as one 
could reasonably hope to obtain. It covers the 
field of analytic geometry very adequately and 
stresses the important topics. 

There are only two small points to remark 
on: In Section II, Item 20 might be a little 
clearer if it read “Find the horizontal asymp- 
tote, or horizontal asymptotes. . . .” In Section 
TII, the figure associated with Item 44 is not 
quite correct. The axes which intersect at O' 
should be labeled X' and Y' instead of X and Y. 


[ 359 ] 

*Examination in Plane Geometry—High-School 
Level. Grade 10; 1943; IBM ; Form SPG-1-B-3 (usu- 
ally called Form B) ; separate answer sheets must be 
used; $2 per 25; 40€ per 25 machine-scorable answer 
sheets; 15¢ per scoring key; 256 per specimen set, 
postpaid; 100(105) minutes; prepared by the Exami- 
nations Staff of the United States Armed Forces Insti- 
tute; published by the American Council on Educa- 
tion; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates : $2 per 25; 
65¢ per 25 machine-scorable answer sheets; 50¢ per 
key; 50 per specimen set.) 


Hale C. Pickett, Chairman, Department of 
Mathematics, State Teachers College, West 
Chester, Pennsylvania. This roo-minute test 
consists of 20 items on geometric problems, 11 
items on constructions, and II items on nu- 
merical problems. Since 5-response multiple- 
choice items are used throughout, the test can 
be machine scored. 

The 42 items appear to be too few for the 
time allowed. An interesting feature of the test 
is the use of line segments of various lengths 
for the choices in the construction problems ; 
the correct choice is the length of the required 
line in the construction. The mechanics of the 
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test are progressive, but the subject matter is 
traditional. 


[ 360 ] 

Iowa Plane Geometry Aptitude Test, Revised 
Edition. High school; 1935-42; IBM; 1 form; sepa- 
rate answer sheets need not be used; $1 per 25; 15¢ per 
manual; 20¢ per specimen set, postpaid; 144¢ per ma- 
chine-scorable answer sheet; 15€ per set of stencils for 
scoring answer sheets; 44(50) minutes; H. A. Greene 
and H. W. Bruce; Bureau of Educational Research 
and Service, State University of Iowa. 


Philip H. DuBois, Professor of Psychology, 
Washington University, St. Louis, Missouri. 
This test, developed to predict achievement in 
high school geometry, appears to be a useful 
instrument since the reported validity coeffi- 
cients are high—approximately .6 with the 


average of first- and second-semester grades in* 


geometry and .7 with objective geometry 
achievement tests administered after training. 

In a test designed for use in schools it is 
unfortunate, however, that validity data are 
presented for only 146 cases. Surely, with many 
thousands of high school students studying 
geometry each year, it is not too much to ask 
a testmaker to arrange for validation statistics 
for a more substantial number of cases. Con- 
structors of educational tests, who cannot plead 
scarcity of available cases or difficulty in secur- 
ing validity information, might adopt an arbi- 
trary minimum standard of 1,000 cases for all 
validation groups. 

When validity coefficients are presented for 
relatively few cases (in one group reported in 
Section 3 of the manual, N = 86), it gives a 
spurious air of precision to report validity 
coefficients to three decimal places. 

Although exact statistics are not presented, 
an inspection of the table of norms in the 
manual indicates that there is a statistically re- 
liable sex difference on this test, as might easily 
be anticipated. It would therefore be advisable 
to determine and present validity information 
separately for the two sexes. If the validities 
for the sexes differ, that might have important 
bearing on the use of the test in a high school 
educational guidance program. 

The format of the test leaves much to be de- 
sired, Answer sheets are available for scoring 
with the International Test Scoring Machine, 
yet the cover has spaces for name and identifi- 
cation data, and instructions are entirely in 
terms of indicating answers in the test booklet, 
for which space is provided. A test designed 
for machine scoring should: (a) be reusable, 


thus saving a certain amount of money to pur- 
chasers; (b) employ a separate answer sheet 
for all subjects, whether or not a scoring ma- 
chine is actually to be used; and (c) present 
in the booklet appropriate instructions for the 
use of the separate answer sheet. Otherwise, 
the purchaser using a scoring machine is likely 
to find a certain number of booklets spoiled 
for reuse, and the subjects are apt to be con- 
fused as to what they are to do. Manual scor- 
ing time is increased when the graders have to 
score items on several different pages instead 
of only one. A further revision of this test 
which would eliminate the space devoted to 
answers could present a better paging plan, so 
that no subtest would begin on a page on which 
the preceding subtest ends. 

These criticisms of the Jowa Plane Geometry 
Aptitude Test should not obscure the fact that 
the content of the test appears to be pertinent, 
the typography good, and the general plan ex- 
cellent. In its field, it can be recommended as 
usable and useful. * 


For a review by Edward E. Cureton and 
Charles C. Weidemann, see 40:1469. 
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Orleans Plane Geometry Achievement Test. First, 
second semesters high school; 1929; Forms A, B; 
$1.70 per 25; 35¢ per specimen set, postpaid ; Joseph B. 
Orleans and Jacob S. Orleans; World Book Co. 
a) TEST I. First semester high school. 
b) TEST 2. Second semester high school. 


Harold P. Fawceit, Professor of Mathe- 
matics, The Ohio State University, Columbus, 
Ohio. Test r, the first-semester test, has five 
parts: geometrical reasons (14 items), compu- 
tation (17), completing proofs (12), diagrams 
(4), and proofs (8). Test 2, the second-semes- 
ter test, also has five parts: computation (24 
items), completing proofs (12), loci (5), dia- 
grams (4), and proofs (6). The time limits 
are inadequate for some of the parts which 
require careful and critical thinking. To prove 
six originals in 29 minutes, for example, even 
though "statements only" are to be presented, 
is an achievement which calls for responses 
almost automatic and leaves little time for in- 
vestigation, exploration, and study. 

The structure of the tests indicates the in- 
terests of the authors in evaluating the ability 
to think in terms of geometrical concepts; 
nevertheless, many of the items do not leave 
the student free to think according to his own 
pattern. The section dealing with completing 
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of proofs, for example, gives the proofs with 
one or more steps omitted, and the student is 
required to insert the missing steps. This as- 
sumes a uniqueness of proof which is not con- 
sistent with fact. People do not think in the 
same pattern, nor should they be forced to do 
so. One of the major reasons for teaching 
demonstrative geometry is to improve the 
quality of thought by which students reach 
conclusions: to indicate that there is one and 
only one way in which he can think about a 
given geometric situation is to stifle this very 
desirable outcome. 

This test is effective for measuring familiar- 
ity with the factual content of geometry. The 
student is required to recall important prin- 
ciples and apply them intelligently, and some 
of the items call for careful thinking in terms 
of geometric situations. The emphasis, how- 
ever, is entirely on subject matter, and the 
authors appear to ignore those larger values 
associated with the nature of proof which are 
considered to be one of the major contribu- 
tions of demonstrative geometry. 


[ 362 ] 
Wisconsin Geometry Test. High school; 1937-39; 
Forms A, B; Form A has the title Wisconsin Achieve- 
ment Test: Plane Geometry; $1.25 per 25; 50(55) 
minutes; prepared by the Bureau of Guidance, School 
of Education, and the Wisconsin Geometry Test Com- 


ittee: in G. Hellfritzsch, Clifford N. Mills, Or- 
per VA Roger L. Slocum, Paul L. Trump, 
and Bjarne R. Ullsvik ; manual by T. L. Torgerson and 
G. J. Froehlich; E. M. Hale and Co. 
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Cooperative Trigonometry Test. High school and 
Co 1938-44; IBM; Forms O, P, U; separate an- 
swer sheets need not be used; $1.75 per 25; 25¢ per 
specimen set, postpaid; 40¢ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 40(45) minutes; Forms O and P: John A 
Long and L. P. Siceloff; Form U : Staff of the Cooper- 
ative Test Service; Cooperative Test Service. 


For reviews by J. O. Hassler, G. E. Hawkins, 
and S. S. Wilks of earlier editions, see 38:1074 
and 40:1474. 
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XExamination in Plane Trigonometry. College; 
1945; IBM; Form CPT-2-B-4 (usually called Form 
B); separate answer sheets must be used; $2 per 25; 
40€ per 25 machine-scorable answer sheets; 15¢ per 
scoring key; 25¢ per specimen set, postpaid ; 180(190) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
scorable answer sheets; 50¢ per key; 50¢ per specimen 
d REFERENCES 

1, Zant, James H. “The United States Armed Forces Insti- 
tute Examinations in Mathematics as a Means of Determining 
Credit in Engineering Mathematics.” J Eng Ed 36:170-4 N 
45. 


MISCELLANEOUS 


Reviews sy Vera M. Amerson, Harvey A. Andruss, A. S. Barr, E. G. Blackstone, Ann Brew- 
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H, DuBois, Beatrice J. Dvorak, Elizabeth Fehrer, Norman Frederiksen, Albert J. Harris, 
Henry A. Imus, F. Nowell Jones, Harold E. Jones, Charles R. Langmuir, Orrel E. Little, Paul 
S. Lomax, Henry S. Maas, Agnes E. Osborne, Ray G. Price, H. H. Remmers, Mabel E. Rugen, 
Edward A. E David G. Ryans, Arnold E. Schneider, Verner M. Sims, Herbert A. 


Tonne, Herbert 


Toops, Arthur E. Traxler, C. C. Upshall, Goodwin Watson. 
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[ 365 ] 
Animal Husbandry Test: State High School Tests 
for Indiana. High school; 1932; Form A; 5%4¢ per 
test; I5¢ per sample test; 80(85) minutes; R. W. 
Gregory, K. W, Kiltz, W. A. Smith; with the assistance 
of S. S. Cromer and H. H. Remmers; State High 
School Testing Service, Purdue University. 


[ 366 ] 
XFarm Shop Tools: Recognition and Use: State 
High School Tests for Indiana. High school; 1943; 
Forms A, B; 5 per test; 15¢ per sample test; 50(55) 
minutes; Nelson M. Parkhurst; State High School 
Testing Service, Purdue University. 
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[ 367 ] 

Bookkeeping Test: State High School Tests for 
Indiana. First, second, third, fourth semesters high 
school; 1942-45; 4 levels; Form A (1st and 3rd semes- 
ters) ; Form N (2nd and 4th semesters) ; 4¢ per test; 
15¢ per sample test; 40(45) minutes; M. E. Stude- 
baker, B. M. Swinford, Vernal H. Carmichael, Frances 
R. Botsford, and Russell Burkhart; State High School 
Testing Service, Purdue University. 


[ 368 ] 
Bookkeeping Test: United-NOMA Business En- 


trance Tests. Schools and industry; 1939-47; new 
form usually available annually; prior to 1947, the 


443 ] 


TESTS & REVIEWS: MISCELLANEOUS—BUSINESS EDUCATION 


series title was National Clerical Ability Tests; $5 per 
25; $1 per test when scoring service and certificates for 
acceptable work are desired ; $2 per specimen set of the 
6 tests in the series; 120(130) minutes; prepared by 
the Joint Committee on Tests representing the United 
Business Education Association and the National Office 
Management Association ; National Office Management 
Association. v 


Harvey A. Andruss, President, State Teach- 
ers College, Bloomsburg, Pennsylvania. [Re- 
view of the 1946 Form.] This test assumes 
that a bookkeeper must be able to keep books, 
not merely to post or record sales, or enter 
cash items, or do other routine clerical work. 
Time allowance is two hours, but a bonus is 
given for doing the work in less time. 

The 1946 Bookkeeping Test seeks to deter- 
mine fitness for bookkeeping work through: 
(a) an understanding of principles and prac- 
tice; (b) ability to follow instructions; and 
(c) neatness. A large part involves correcting 
incorrect entries made in the Cash Book and 
Journal through careful reading of the expla- 
nations, also to correcting incorrect postings, 
after which a new trial balance of the General 
Ledger, with schedules of Accounts Receivable 
and Accounts Payable, is required. 

The test presents a handwritten set of books. 
However, it should be remembered that book- 
keeping is a matter of recording rather than 
reviewing (auditing) and reporting (account- 
ing). It would seem that the correction of in- 
correct entries smacks of the type of verifica- 
tion that is usually a function of the auditor, 
while the construction of a trial balance and 
the supporting schedules is certainly a very 
elementary accounting task and the making of 
a trial balance is the job of every bookkeeper 
—it is, in fact, the beginning of the reporting 
activity of the accountant. 

In the key to the bookkeeping test, point 
credits for finding errors vary from $5 to IO, 
while a 2-point bonus is credited for each full 
five minutes saved. No particular credit for 
recopying or taking a new trial balance ap- 
pears to be given. This would seem to be pro- 
vided for in the bonus given for completing the 
whole test in less than two hours time. 

The point of view of those making the 
Bookkeeping Test as it appears in the latest 
(1947) announcement of the United-NOMA 
Business Entrance Tests is as follows: 

Statements, adjusting, and closing entries are not 
complete bookkeeping tests . . . the ability to make 


these entries and statements is not expected of a be- 
ginning bookkeeper, as much as the ability to keep 
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records, and find and correct errors. . . . If a person 
can make and post entries, the facts do not prove that 
he can locate and correct inaccuracies, but if he can 
locate and correct inaccuracies, that is proof that he 
can also do the original work. Therefore, the U-N 
bookkeeping test is one involving locating and correct- 
ing inaccuracies. 


Ray G. Price, Associate Professor of Edu- 
cation, The University of Minnesota, Minne- 
apolis, Minnesota, [Review of the 1946 Form.] 
This test is intended to measure ability to do 
the work required of a bookkeeper. The testee 
is provided with a cash book, journal, general 
ledger, abstracts, trial balance, and subsidiary 
ledger. The original work is completely pro- 
vided for in the form of entries, postings, and 
a trial balance. However, there are several 
entries that are incorrectly made; others are 
incorrectly posted. These errors cause the trial 
balance and abstracts to be inaccurate. The 
testee is to locate and correct the incorrect 
entries and postings; and finally to rewrite the 
trial balance and abstracts. The test requires 
two hours. It is easy to administer. The form 
is excellent, meeting all the requirements and 
usual practices of such a test. Percentile norms 
are provided. 

The difficulty of adequately measuring a 
student’s ability in handling a set of books in 
a short time is recognized, yet the fact that this 
test requires two hours to administer presents 
a serious obstacle to its use in most high school 
classes 45 to 60 minutes in length. 

No validity is reported. The sponsors say 
that they “are fully aware of the fact that much 
additional work must be done before the full 
validity of these tests is established. . . . All 
that is claimed for the tests is that they are 
true samples of office work.” This reviewer 
would question somewhat the validity of the 
test from the standpoint of its adequacy to 
measure ability to do the ordinary routine work 
of a bookkeeper. Rather it tends to measure 
ability to do auditing. 

No reliability for the test is reported. An 
earlier form of the test (1941) had an esti- 
mated coefficient of reliability of .90. However, 
the form of the test was so changed that an- 
other statistical analysis will have to be made 
in order to estimate the reliability of the pres- 
ent form. 

Objectivity in scoring may be questioned ; 
for instance, 10 points of credit are allotted 
for “high grade of neatness,” while 5 points 
are allotted for “average neatness.” The basis 
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for the point system of scoring is not clear. 
How the weighting of points of credit was 


. determined is not explained. For example, for 


finding and correcting an error in the posting 
of the wrong amount, 5 points are allowed, 


while for finding and correcting an error in 


posting to the wrong account, a credit of 10 
points is awarded. How to handle part credit 
is another aspect of the scoring that is not 
explained. For instance, Item 8 of the key 
states; "Journal, March 11, Notes Payable 
debit should be on $2,750, instead of $2,777.50 
and a debit should appear to Notes Interest and 
Discount Expense, $27.50, 10 points." What 
should be the score for a student who corrected 

"the debit to Notes Payable but who did not 
make the entry to Notes Interest and Dis- 
count ? 

There is a total of 85 points on the test, not 
including a bonus of 2 points for each 5 min- 
utes saved under the two-hour time for taking 
the test. The minimum acceptable mark on the 
test is 31. No mention is made as to how this 
minimum score was determined. This seems 
unusually low since it would be possible for a 
testee to get 24 of the needed 31 points by the 
very simple expedient of handing in his paper 
at the end of one hour of time. 

Where the test is given on a national basis 
at properly sponsored centers as a means for 
issuing certificates to adequately qualified stu- 
dents the test may have some value. For the 
ordinary bookkeeping teacher to use in her 
class, the test has very limited value. 


[ 369 ] 
Business Fundamentals and General Information 
Test: United-NOMA Business Entrance Tests. 
Schools lustry; 1939-47 ; new form usually avail- 
able annually; prior to 1947, the series title was Na- 
tional Clerical Ability Tests; $4 per aj; $1 per test 
when scoring service and certificates for 


work are desired; $2 per specimen set of the 6 tests 


in th i inutes ; 
in the series 4G» minutes; xo by the Joint 
a 


e E 
agement Association; N: Office Managem 
Association. "M rca = ied 


Vera M. Amerson, Manager, Walsh School 
of Business Science, Miami Beach, Florida, 
[Review of the 1946 Form.] The 1946 Busi- 
ness Fundamentals and General Information 
Test is required of every one who takes one 
or more of the vocational tests included in 
the National Clerical Ability Test series. The 
fundamentals part is designed to measure 
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command of such skills and knowledges: as 
spelling, use of the apostrophe, plurals, arith- 
metic, and social science. The general in- 
formation part, in contrast with the school- 
taught knowledge covered in the fundamentals 
part, covers information gained from events 
reported by radio and newspaper and from 
common customs. / 

Thirty-five minutes is allowed for the Busi- 
ness Fundamentals Test and twenty-five min- 
utes for the General Information Test. Each 
part in the Business Fundamentals Test is 
timed separately in order to make sure that 
slow pupils do something in each part. If a 
person finishes a given part before the time is 
called, he is expected to go on at once to the 
next part. A five-minute recess is allowed be- 
tween the Business Fundamentals Test and 
the General Information Test. Those who fin- 
ish the General Information Test before time 
is called may hand in their papers. Most testees 
have adequate time to try all the items in both 
tests. 

Although each part restates the directions 
for recording answers on the separate answer 
sheet, the testee is given no opportunity for 
working out examples before taking any part 
of the test, In all parts of the Business Funda- 
mentals Test, except one, the testee is given 
only one example. Furthermore, the code for 
recording the answers changes from a b c for 
the first 10 items to C I for the next ro, to 
a b N for the next 6, to a b c N for the next 
5, to a b c for the next 9, and to 1 2 3 4 for 
the remaining 24. The arrangement of the ex- 
ample in Part A is different from the actual 
exercise, and in both Parts A and B the first 
example corresponds with the actual answer 
for the first item in each of these parts. “N” 
for the "none" answer is omitted in the exam- 
ple in Part E. These mechanical inconsistencies 
are supplemented by inconsistencies in capitali- 
zation, arrangement of items, and use of com- 
mas in amounts and after dates, indicating the 
probability of a hasty job of editing. Item 4 
in Part C is weak in that the spelling of both 
choices is correct while the instructions read 
"One word in each of the following groups of 
plurals is correctly spelled." 

A desirable characteristic of the test is the 
large proportion of functional information in- 
cluded in the test. Another desirable feature 
is the ease with which the test may be admin- 
istered and scored. 
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The reliability and validity of the test have 
not been checked since a careful analysis a few 
years ago resulted in the finding that the NCA 
tests were highly reliable and valid. The form 
of the test has been maintained over the years 
so as to retain the qualities of reliability and 
validity. Plans have been laid for another 
checking of these two qualities at an early date, 


C. C. Upshall, Psychologist, Industrial Re- 
lations Department, Eastman Kodak Company, 
Rochester, New York. [Review of the 1946 
Form.] The business fundamentals test includes 
short sections on (a) spelling (1o questions), 
(b) apostrophe (ro points), (c) plurals (6 
points), (d) arithmetic (5 questions), and (e) 
social science (9 questions). The general in- 
formation test has 24 questions; a majority of 
them are on subjects of current interest. 

The coefficients of reliability computed by the 
split-half technique for similar sections of the 
1941 edition of the test were: General Infor- 
mation, r = 84; Business Arithmetic, r = 75; 
and English, r — .84. 

The scores from these tests are not to be 
considered by themselves but are combined 
with scores from any of the five skill tests, to 
give a composite score for any particular skill. 
For example, the score for the filing test would 
be based on the score made on the filing test 
plus the scores made on the Business Funda- 
mentals and General Information Test. This 
procedure is justified on the basis that em- 
Ployers are reported to feel that English and 
arithmetic fundamentals and business and gen- 
eral information are things that should be 
taken into consideration when hiring a person 
for any of the jobs covered by the five tests. 
The weights given to the business fundamen- 
tals and general information test scores vary 
according to the skill test score with which 
they are combined. The weights are determined 
by the “pooled subjective judgments of ex- 
perienced business teachers and office mana- 
gers” made effective by appropriate statistical 
treatment. 

The questions under social science do not 
seem to be essentially different from those 
under general information except that one pro- 
vides three choices and the other four. Thus, 
the question “The Louisiana Purchase was 
——" (three choices) is placed under social sci- 
ence while “Blessed are the peacemakers ——” 
(four choices) is part of the general informa- 


tion test. General Information might serve as 
a title for both sections. 

Studies of the validity of the various skill 
tests combined with the fundamentals and gen- 
eral information tests have been promised, 
Objective on-the-job checks on the appropriate- 
ness of the weights given to the fundamentals 
and general information tests by the experts 
are also needed, How much, for instance, does 
knowledge of English, arithmetic fundamen- 
tals, and general information contribute to suc- 
cess on the job of file clerks? 

At present, certificates of proficiency are 
awarded on the basis of the test results, More 


3n- 


knowledge of the value of the tests for pre- - 


dicting success on the job should be obtained 
before the Certificate of Proficiency may be 
regarded as the main qualification for securing 
a position. This necessary knowledge can be 
gained more quickly if business firms will en- 
courage schools of business to give the tests 
and report the scores to the firms who employ 
the graduates. Follow-up studies should be 
made and reported as soon as possible, 


[ 370 ] 
*Commercial Arithmetic Test: State High School 
Tests for Indiana. High eet 1945746; Form A 
é per test; 15¢ per sample test; 40(45) minutes; 
urlotte id ts and Philip Peak i (A - High 
School Testing Service, Purdue University. 


[371] 

*Cooperative Commercial Arithmetic Test, 1-2 
semesters high school; 1944-47; IBM; Forms U, X; 
separate answer sheets must be used ; $1.75 per 25; 

per 2: Tepana a answer sheets 15¢ penaoa ing 
s ; 25¢ per specimen set, postpaid; 40. min- 
utes; ett by the staff of the Cooperative Test 
Service; Cooperative Test Service. 


[52] 
XE.R.C. St aphic Aptitude Test. Grades 9 and 
over; 1944; preliminary manual; 1 form; $365 per a5 
i alter 


Sof imen set; 33(45) minutes; 
Beemer, fes: Science EHE. Associates. 


REFERENCES 
" E [, e Testing in Shorthand,” 
Sg rg BRAK oY a 


Philip H. DuBois, Professor of Psychology, 
Washington University, St. Louis, Missouri. 
This test appears to be an excellent prognostic 
instrument, constructed on sound plan and pre- 
senting evidence of careful development and 
validation. 

Only one of the five subtests, speed of writ- 
ing, is a speed test. The others, word discrimi- 
nation, phonetic spelling, vocabulary and sen- 
tence dictation, have sufficiently long-time limits 


k 
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that the speed factor is considered of negligible 
importance. 

The material appears to sample adequately 
the skills required in learning shorthand. The 
validity coefficient of .70 between total score 
and accuracy of transcription of material dic- 
tated at more than 8o words per minute at the 
end of two years of shorthand study indicates 
that the test measures what it purports to 
measure. 

One paragraph in the manual of directions 
merits quotation even in a short review: "No 
reliability coefficients are reported for this test 
because it is felt that they add nothing to the 
reported validity coefficients. If the validity 
coefficient is satisfactory, the reliability coeffi- 
cient must be satisfactory. On the other hand, 
a test with high reliability may have low valid- 
ity, in which case the high reliability is of no 
value. The value of an aptitude test should be 
assessed by its validity coefficients, not by its 
reliability." 

While the position taken by the author is 
logical and may help to reduce our ritualism in 
regard to reliability coefficients (many of which 
are spuriously high anyway), this reviewer 
hopes that other test constructors will not fol- 
low Dr. Deemer's lead in omitting reliability 
coefficients from their reports. A properly com- 
puted reliability coefficient giving the consist- 

ency of the test as it exists is an inverse indi- 
cation of the extent to which further work 
with the approach used by an author is justified. 
The validity gives the usefulness of an instru- 
ment only as it exists. If the reliability by test- 
retest or by alternate forms is high, say .92 or 
over, little would be gained by further devel- 
opment along the same lines. If on the other 
hand, the reliability is in the .8o's, test con- 
sumers have the right to expect the author to 
perfect his instrument still further. A test con- 
structor who fails to report reliability infor- 
mation fails to provide the test user with in- 
formation essential to intelligent appreciation 
of the instrument. 


Edward A. Rundquist, Assistant Director of 
Personnel Research, Owens-Illinois Glass Com- 
pany, Toledo, Ohio. This test would be better 
termed a shorthand aptitude test since it is 
specifically designed to predict ability to learn 
and transcribe shorthand—which is but one 
phase of stenographic proficiency. It contains 
five subtests: I, Speed of Writing; II, Word 


Discrimination; III, Phonetic Spelling; IV, 
Vocabulary; and V, Sentence Dictation. It is 
similar in content and aim to the Turse Short- 
hand Aptitude Test. l 

The test has been carefully validated against 
objective measures of shorthand transcription. 
Errors in spelling, punctuation, and capitaliza- 
tion are not counted in scoring the transcrip- 
tion tests. This procedure emphasizes the aim 
of the author to predict shorthand transcrip- 
tion and not general stenographic proficiency. 
The test given at the beginning of a shorthand 
course correlates with accuracy of transcrip- 
tion measured at intervals of one and two years 
later from .54 to .70. These are substantial 
correlations and are of the order reported for 
the Turse test. Speed of accurate transcription 
is predicted less well, two correlations of .35 
being reported. 

The omission of some important statistical 
information detracts from an otherwise clearly 
written manual. It is known that 500 cases 
were involved in all the validity studies. But 
N's, means, and sigmas are not reported for 
the specific correlations obtained. Validity is 
reported only for total score. No validities or 
intercorrelations of the subtests are given to 
facilitate interpretation. 

The test score is interpreted by means of a 
table of predicted criterion scores (for each of 
the seven criteria) for twenty-point test score 
intervals. Percentile norms are given for the 
five subtests. Since norms usually vary widely 
according to age, selection, and previous ex- 
perience, the absence of total score norms is 
unfortunate. The percentile norms for the sub- 
tests would have much more value if the sub- 
test validities and intercorrelations were re- 
ported. 

With the exception of the dictation subtest, 
this test is easily administered and scored. It 
will take some practice to give this subtest cor- 
rectly. The problem is of greatest moment to 
the occasional user. The scoring of this test is 
laborious, involving the counting of syllables 
omitted, inserted, or substituted. While this 
subtest appears to involve an element impor- 
tant to stenographic proficiency, one wonders 
whether as good a prediction might not be 
obtained with a more easily scored test. Omis- 
sion of validities and intercorrelations of the 
subtests prevents any judgment on this point. 
Some modification to simplify scoring would 
be welcomed by the users of the test. 
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No reliabilities are reported, but this is un- 
important in view of the validities reported. 

These criticisms should not obscure the 
major fact that the test possesses substantial 
validity. In the reviewer's opinion it can be 
used with profit in the selection and classifi- 
cation of tenth and eleventh grade shorthand 
pupils. Whether its difficulty level will make 
it equally valuable for older persons in private 
secretarial schools is a matter for further 
study. 


[373] 
X*Examination in Bookkeeping and Accounting. 
First, second years high school; 1944-45; 1 form 
(usually called Form B); 2 levels; separate answer 
booklets must be used ; the test is not machine-scorable ; 
$1.75 per 25 answer booklets of either level; 1o¢ per 
scoring key for either level; 30¢ per specimen set of 
either level, postpaid; 180(190) minutes, either level ; 
prepared by the Examinations Staff of the United 
States Armed Forces Institute; published by the Amer- 
ican Council on Education; distributed by Cooperative 
t Service. (Also distributed by Science Research 
cs: $2 per 25; $3 per 25 answer booklets; 50¢ 
50€ per specimen set.) 


n 
Assoc 
per key; 
d) FIRST YEAR: SECONDARY SCHOOL. 1944; Form SBk- 
Ag-1-B-4; $2 per 25. 


b) SECOND YEAR: SECONDARY SCHOOL. 1945; Form 


SBkAg-2-B-4; $1.75 per 25. 


Harvey A. Andruss, President, State Teach- 
ers College, Bloomsburg, Pennsylvania. The 
first-year test consists of four subtests: (a) 
Accounting Terms, Facts, and Principles—40 
minutes; (5) Adjusting and Closing Entries— 
20 minutes; (c) Preparation of Accounting 
Entries—75 minutes; and (d) Preparation of 
a Work Sheet—45 minutes. Approximately 
two-thirds of the testing time is devoted to 
practical work as against knowledge and prin- 
ciples. No provision is made for the correction 
of bookkeeping entries. The scoring key does 
not show the year dates, only rulings. The Cash 
Receipts and Cash Payments Journals do not 
Show posting recapitulations. 

The second-year test consists of four sub- 
tests: (a) Accounting Terms, Facts, and Prin- 
Ciples—40 minutes; (b) Features of Various 
Forms of Business Ownership—ro minutes; 
(c) Preparation of Accounting Entries—65 
minutes; (d) Preparation of a Work Sheet— 
45 minutes. 

These tests seem to be comprehensive and 
worthy of consideration. 


[ 374 ] 
*Examination in Business. Arithmetic. High 
school; 1944; IBM; Form SBuAr-1-B-4 (usually 
called Form B) ; separate answer sheets must be used ; 
$175 per 25; 406 per 25 machine-scorable answer 


sheets; 15¢ per scoring key ; 25¢ per specimen set, post- 
paid ; 135(145) minutes; prepared by the Examinations 
Staff of the United States Armed Forces Institute ; pub- 
lished by the American Council on Education; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates : $2 per 25; 65$ per 25 
machine-scorable answer sheets; 50¢ per key; 5o per 
specimen set.) 
[ 375 ] 

*Examination in Business dmi School 
Level. Grades 11-12; 1944; IBM; Form SBuE-1-B-4 
(usually called Form B) ; separate answer sheets must 
be used ; $2 per 25; 40£ per 25 machine-scorable answer 
sheets ; 15¢ per scoring key; 25¢ per specimen set, post- 
paid; 120(125) minutes ; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by the American Council on Education ; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates: $2 per 25; 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; 50¢ per 
specimen set.) 


Orrel E. Little, Associate Professor of Eng- 
lish, Manchester College, North Manchester, 
Indiana. This is a comprehensive yet simple 
and therefore desirable examination, testing in 
its five separate sections the achievement of 
objectives usually set for high school courses 
in business English. As the publishers state, 
they do measure more “than memory of facts 
... Numerous exercises represent problem sit- 
uations, involving for their successful solution 
applications of knowledge and the functioning 
of critical thinking.” 

Section I directs the student to select the 
misspelled words in a well-chosen list of roo. 
While these are above the minimum-essential 
level for general spelling usage, they are essen- 
tial to ordinary business communication and 
include applications of troublesome spelling 
principles, such as doubling the final consonant, 
referred, and dropping a silent e before a suf- 
fix, wholly, choosing the correct prefix—dis or 
de, per or pre, perspiration, etc. 

The Word Usage section tests diction or 
word choice by giving in sentence situations 25 
pairs of words frequently confused in business. 
These range from the ever-present principal- 
principle team to such puzzlers as accede and 
exceed. 

Section III uses multiple-choice groupings to 
test 20 such matters of form and usage as the 
correct arrangement, punctuation, and wording 
of the typed letterhead, the inside address, the 
salutation, the complimentary close, and identi- 
fying information in letters. Excepting one 
question concerning hanging indentations, the 
situations used are typical of those met in most 


offices. 
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In the fourth part an effective column ar- 
rangement of error-packed sentences from the 
body of a business letter provides multiple 
choices in grammar, punctuation, and diction, 
from which the student is to select “from the 
point of view of accuracy, style, or tone." Most 
of these also are typical of lessons to be mas- 
tered by a high school student: word division 
at the ends of lines, run-on sentences, agree- 
ment of subject and verb, wordiness, and con- 
fusion of adjectives and adverbs. However, 
there are included some controversial problems 
often taught today but not tested, such as the 
rigid placement of commas and periods within 
quotation marks regardless of the thought and 
the determination of paragraphing and paren- 
thetical expressions, which are too often mat- 
ters of interpretation. 

'The last section, carefully designed to test 
the recognition of effectiveness, consists of 
three short letters—a complaint, a reply to a 
request for information, and a recommenda- 
tion, Each sentence in them is given in three 
different forms: one lively but crude; another 
affected and wordy ; another simple, direct, and 
sincere. The student is given good opportunity 
to apply his knowledge and critical reflective 
thinking in deciding which of the three is the 
best. 

The publishers state that the tentative norms 
determined from over twelve hundred cases in 
thirty-two schools will be improved as more 
data are received. Stressing the desirability of 
relating varying degrees of achievement on the 
tests to the scholastic standards of the local 
institution, the editors give specific suggestions 
for turning the scores into letter grades. The 
diagnostic values of the test are enhanced by 
the availability of norms for both part and total 
scores, Analysis of the parts of the tests and 
of the scores will help the teacher evaluate her 
course, plan remedial work, and counsel the 
students. 


[ 376 ] 


XExamination in Commercial Correspondence— 
College Level. Grades 13-14; 1944; IBM; Form 
CCmC-1-B-4 (usually called Form B); separate an- 
swer sheets must be used; $2 per 25; so per 25 
machine-scorable answer sheets; 30£ per scoring key; 
35¢ per specimen set, postpaid ; 120(125) minutes ; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Associ- 
ates: $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; 50¢ per key; so¢ per specimen set.) 


Orrel E. Little, Associate Professor of Eng- 
lish, Manchester College, North M anchester, 
Indiana. The first four sections of this examina- 
tion are planned to measure the mechanics of 
business communication, whereas the last sec- 
tion attempts the more difficult judgment of 
effective arrangement and style. 

Section 1 lists 100 words in the college busi- 
ness person's vocabulary, from which the stu- 
dent is to select the misspelled forms. The 
authors have included examples of the common 
types of errors, as omitting, dining, originally, 
and words which the correspondent uses freely, 
such as, competitors, notary, assessment. The 
Word Usage section tests diction by giving in 
sentences 25 pairs of words frequently con- 
fused, from each of which the student is to 
select the appropriate one. He must distinguish 
between personal and personnel, cessation and 
secession. One pair—further and farther— 
should have been omitted. 

Section 3 uses multiple-choice groupings to 
test 20 such matters of form and usage as the 
correct arrangement, punctuation, and wording 
of the typed letterhead, the inside address, the 
salutation, the complimentary close, and identi- 
fication information in letters. Some teachers 
will object to the fact that semiblocked style is 
disapproved in one instance. In the fourth part 
an effective column arrangement of error- 
packed sentences from the body of a letter pro- 
vide multiple choices in grammar, punctuation, 
and diction, from which the student is to select 
“from the point of view of accuracy, style, or 
tone.” While most of these 74 situations are real 
and essential, at least 7 are objectionable be- 
cause they are controversial, such as the illogical 
placement of the period within quotation marks, 
the substitution of this for which, and the use 
of a comma before and between subordinate 
clauses. 

The problems in Section 5 attempt by three 
different methods to lead the student to judge 
the effectiveness of several letters, First he is to 
rearrange the jumbled sentences of two letters. 
The method to be followed—selecting from 
the multiple choices numbers representing the 
sentences—is so highly confusing as to render 
"critical reflective thinking" impossible. Then 
the student is asked to label as true, false, or 
doubtful critical remarks about a given “effec- 
tive sales letter.” Again his thinking is blocked, 
this time by the fact that the letter is not really 
effective. The final exercise is clear and worth 
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while: The student, after comparing the tone 
of five versions of a business letter, is to label 
each as satisfactory, too negative, not concise 
enough, etc. 

The publishers state that the tentative norms 
for both part and total scores determined from 
465 cases in twenty-three schools will be im- 
proved as more data are received. Stressing the 
desirability of relating varying degrees of 
achievement on the tests to the scholastic stand- 
ards of the local institution, the editors give 
specific suggestions for turning the scores into 
letter grades. Although the weaknesses of the 
fourth and fifth parts render the norms for them 
of questionable value, analysis of the parts of 
the tests and of the scores will help the teacher 
evaluate her course, plan remedial work, and 
counsel her students. 


Herbert A. Tonne, Editor, The Journal of 
Business Education, and Professor of Educa- 
tion, New York University, New York, New 
York. This test consists of five sections. Sec- 
tions 1 (Spelling), 2 (Word Usage), and 4 
(Grammar, Punctuation, and Diction) are gen- 
eral in nature and deal with phases of ability 
to use English common to all communication 
situations rather than just to business corre- 
spondence. Sections 3 (Letter Form and Usage) 
and 5 (Recognition of Effective Letter Writ- 
ing) give specialized consideration to business 
letter writing. All sections of the test are well 
constructed, avoid trick questions, and deal with 
the most important elements of the subject 
covered. 

No evidence of reliability is given, but it is 
quite likely that it is high. Norms are given in 
terms of percentile ratings for almost five hun- 
dred students for each form. In addition, per- 
centile ranks are also given for each section of 
the test. As evidence of understanding of the 
essential elements of good letter writing, the test 
is quite evidently valid. As final evidence of 
ability to write effective business letters, its 
validity is far more open to question. It would 
be interesting to compare the scores on this test 
of a large number of writers of effective busi- 
ness letters with those of little ability. The de- 
gree to which ability to recognize good letter 
writing correlates with actual ability to write 
good letters is of great consequence, and some 
evidence would be secured from this type of 
comparison. This comment should not be con- 
sidered a condemnation of the test, for in the 


final analysis there is no means of testing ability 
to write good business letters or any other kind 
of good writing other than that of actually 
having testees write letters and subjectively 
determining their effectiveness. 

The test is, in this commentator’s opinion, the 
best objective measure of business writing abil- 
ity at present available. While it has been aimed 
at college level students, it has considerable value 
in measuring the results of work in the second- 
ary schools and in private business schools. In 
these cases, of course, the tentative norms given 
for the test would not be valid. 

It is to be hoped that the publisher will secure 
data on the reliability of the test and give some 
evidence of its validity, when and if another 
edition is produced. 


[ 377 ] 

XExamination in Gregg Shorthand. First year high 
school; 1944; Form SGS-1-B-4 (usually called Form 
B) ; $2 per 25; 5¢ per scoring key ; 25¢ per specimen set, 
postpaid; (120) minutes; prepared by the Examina- 
tions Staff of the United States Armed Forces Insti- 
tute ; published by the American Council on Education ; 
distributed by Cooperative Test Service. (Also dis- 
tributed by Science Research Associates: $2 per 25; 
sog per key; of per specimen set.) 


Agnes E. Osborne, Assistant Professor, 
The City College of New York, New York, 
New York. This examiriation is not intended to 
test a student’s ability for a stenographic posi- 
tion. In no way does the examination test an 
individual’s ability to typewrite in acceptable 
form at vocational speed when shorthand notes 
are the stimuli, This is the standard for a 
stenographic position. However, the test is fairly | 
representative of the materials covered in first- 
year shorthand classes in many schools and of 
the skills obtained in one year’s time. The gen- 
eral form of the test is good. The directions to 
the student are concise but explicit, and the 
instructions for administering the test are suffi- 
ciently detailed to insure expert administration. 

The test is divided into three sections. In 
Section A, 175 printed words and phrases are 
to be written in shorthand. Inasmuch as the 
value of shorthand lies in the writer's ability 
to record outlines when spoken words, not 
printed words, are the stimuli, this section of 
the test does not measure the knowledge and 
skill required in recording notes when sound, not 
sight, is the stimulus. Entirely different scores 
would no doubt be obtained if the words and 
phrases were written from dictation. This pro- 
cedure would give a truer measure of the stu- 
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dent's shorthand knowledge and skill. For the 
Shorthand Reading Test, which is Section B, 
the student is required to transcribe into long- 
hand a shorthand article of 300 words. Some 
device should have been worked out to test 
reading ability with far less transcribing to be 
done. For example, there is no advantage in 
having a student transcribe the word "the" the 
number of times it appears in the article. The 
method of scoring the reading test may also be 
criticized. As one of the directions for scoring 
the transcript is to count % point for each 
correct word, repetitions receive the same credit 
as the first form. The word "the," for example, 
receives credit each time it appears, and receives 
the same credit each time as the words “for- 
eign," "monuments," and "resources," which 
occur only once in the article. Section C con- 
sists of the dictation of three letters at different 
rates of speed—the first letter at 50 words per 
minute, the second at 60, and the third at 70. 
A. phonograph record is available for the letter 
dictation. If it is not possible to obtain the 
phonograph record, the administrator may read 
from the printed material, which has been 
marked for timing. Explicit directions have 
been given to insure natural rather than artifi- 
cial dictation. 

"The manual that accompanies the test gives no 
analytical or statistical data for the shorthand 
test, but deals with the general testing program 
of the United States Armed Forces Institute. 
Reliability and validity coefficients for the short- 
hand test are not given. The percentile norms 
and equivalence tables for Forms A and B are 
made available to those who purchase tests, The 
teacher who uses the tests would find it helpful 
if more information were given regarding the 
construction of the tests, especially regarding 
the criteria used for the selection of the vocabu- 
lary. An analysis made by the writer of this 
review shows that the test words in Section A 
(Shorthand Vocabulary) cover practically all 
the major principles illustrated in the Gregg 
manual. Therefore, the teacher who wishes to 
use Section A for diagnostic purposes will have 
a basis for specific remedial teaching. 

The shorthand outlines on the scoring key 
lack uniformity and fluency, and they are ex- 
ceedingly heavy and very large. It would be im- 
possible to evaluate a student's notes correctly 
if the notes in the scoring key were used as 
models of Gregg shorthand. Inasmuch as the 
Scoring directions are to "Allow credit even 


though size and slant of examinee's outline dif- 
fer from the outline in the key," the emphasis 
is evidently placed on the application of short- 
hand principles and not on quality of penman- 
ship. ; 
This test was constructed under the auspices 
of experts and has probably been subjected to 
more scientific treatment than any other elemen- 
tary shorthand test. It deserves further use and 
should be supplemented with results presented 
in a revised manual, which should contain more 
specific information regarding the construction 
of the test, the validation, the reliability, and the 
interpretation of scores for shorthand learners. 
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*Examination in Typewriting. First, second years 
high school; 1944; 1 form (usually called Form B) ; 
2 levels; separate answer booklets must be used; the 
test is not machine-scorable ; $2 per 25; 25¢ per scoring 
key for either form; 45€ per specimen set of either 
form, postpaid; prepared by the Examinations staff of 
the United States Armed Forces Institute; published 
by the American Council on Education ; distributed by 
Conner Test Service, (Also distributed by Sci- 
ence Research Associates: $2 per 25; $3 per 25 answer 
booklets; 5o£ per key; so¢ per specimen set.) 

@) FIRST YEAR: SECONDARY SCHOOL. Form STy-1-B-4; 
$2 per 25 answer booklets; (130) minutes. 

b) SECOND YEAR: SECONDARY SCHOOL. Form STy-2-B-4; 
$3 per 25 answer booklets; (115) minutes. 


E. G. Blackstone, Professor of Business Edu- 
cation, The University of Southern California, 
Los Angeles, California. This test is designed 
for use by educational institutions to enable 
them to become familiar with the levels of 
achievement required in the Armed Forces In- 
stitute, and to: facilitate the just evaluation of 
the educational experiences in typewriting of 
returning service men and women. The tests 
were constructed by having specialists meet 
with teachers of typing to identify the educa- 
tional objectives to be attained; to specify the 
kinds of exercises to be included in the test; 
by preparing enough exercises to Provide for 
two forms of the test; and by trying out the test 
with groups of students in high school and col- 
lege. The tests were then submitted to one or 
more critics nominated by a professional organ- 
ization of business teachers, who checked the 
tests for comprehensiveness of coverage, for 
accuracy of material, and for validity of exer- 
cises. Then the tests were revised and given 
another tryout in schools. One form was kept 
for use by the Armed Forces Institute and the 
other was made available to high schools and 
colleges. 

The test consists of two five-minute timed 
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writings with credit to be given for the higher 
score of the two; a typing of corrected rough 
draft; a letter to be set up in modified block form 
from running copy; a simple tabulation; and 
an objective test of 30 items covering punctua- 
tion and other information in connection with 
the use of the typewriter. 

A special scoring key is used with the timed 
writings which gives illustrations of the types 
of errors to be marked. Ten words per minute 
are deducted for each error. 

For the rough draft, the letter, and the tabu- 
lation exercises, transparent scoring keys are 
provided to determine if the placement is cor- 
rect and to indicate penalties for degrees of 
error in placement. Errors in typing are also 
penalized. The objective test may be marked 
with special pencil for machine scoring, or with 
ordinary writing instrument for hand scoring. 

The tests may be interpreted by means of a 
set of percentile scores taken from more than 
1,700 cases in more than 40 schools. No data 
are given as to reliability or validity. 

The tests appear to be fairly comprehensive 
in form; at least enough so to be usable for the 
determination of credit to be allowed. The ob- 
jective section is particularly ingenious and 
good. 

It appears from the distributions of raw 
scores that are supplied with the norms for this 
examination that the step intervals used are too 
wide to yield discriminative scores at the upper 
levels. 

[379 J 
Filing Test: United-NOMA Business Entrance 
Tests. Schools and industry; 1939-47; new form usu- 
ally available annually; prior to 1947, the series title 
was National Clerical Ability Tests; $5 per 25; $1 per 
test when scoring service and certificates for accept- 
able work are desired; $2 per specimen set of the 6 
tests in the series; 120(130) minutes; prepared by the 
Joint Committee on Tests representing the United 
Business Education Association and the National Office 


Management Association; National Office Management 
Association. 


Arnold E. Schneider, Head, Department of 
Business Education, Western Michigan College 
of Education, Kalamazoo, Michigan. [Review 
of the 1946 Form.] The introductory statement 
on this test to the effect that "this test is to 
measure ability to arrange and file letters cor- 
rectly under office conditions as nearly as those 
conditions can be reproduced" sets up the test- 
ing objective clearly. This is a performance test. 
It is true that the performance takes place under 
simulated conditions and is to some extent of 


a paper-and-pencil variety, but nevertheless, it 
does attempt to measure performance in this 
vocational area. This is an efficient test and 
represents a long step forward in helping to 
solve the problem of vocational clerical testing. 

"The test measures up well in terms of accepted 
testing practices, In the main the directions are 
clear and concise. The directions under Part 1 
could be restated with more clarity and effec- 
tiveness. It would be well if the sample item 
were separated from the main body of the test 
questions in Part 1. This would make it much 
easier for the student to grasp what is required 
of him in this section. 

A question could be raised as to the format on 
page 1. A long list comprising 85 names is pre- 
sented on three pages with the request that 
names from list B be entered between the names 
on the long list spread over three pages. This is 
an awkward testing situation, particularly since 
the time element is involved. It may be that the 
writers were attempting to check for “paper 
shuffling” ability, but this is a mere conjecture. 

A manual for administration accompanies the 
test and clearly defines the work of the test 
administrator. This is particularly good for this 
type of test as it aids in the standardization of 
the administration of the test. It gives to the 
scores a greater degree of validity since the 
conditions under which the tests are adminis- 
tered are, for the most part, predetermined. 

Parts 2, 3, 4, and 5 of the test employ the use 
of sample form letters. These letters are on 
sheets approximately 314 by 574 inches. Actual 
letters on standard-sized letterhead of 81⁄4 by 11 
inches were photographed and reduced to the 
size mentioned. This, of course, has reduced the 
size of the type considerably. This opens up two 
possible criticisms which may or may not be 
serious. One of them is that the individual who 
has less than 20/20 vision may be penalized more 
for eyesight than for ability. This may have a 
bearing on the test performance when one re- 
lates it to the high percentage of eye difficulties 
to be found in a normal population as shown 
by the figures among draftees during the past 
war. It is also possible that some individuals 
may find it difficult to shuffle, arrange, and re- 
arrange the small sheets of paper which simulate 
actual letterhead size. It might well be that 
actual letter-sized sheets would improve the 
efficiency of the tests and add no difficulty to 
the problems inherent in the construction of this 


type of test. 
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An answer key is furnished in the manual for 
administration. It is suggested in the bulletin 
describing the tests that the tests are to be re- 
turned to the central office for scoring. This in- 
sures uniform scoring and proper analysis of 
each paper. A statement is also made that the 
tests may be corrected by the teacher. The scor- 
ing key provided in the administration manual 
does not lend itself to efficient use on the part 
of the teacher who would attempt to correct the 
tests locally. 

No claim is made as to statistical validity. 
There is an a priori claim that the tests do 
measure actual office performance in this area. 
This claim has been substantiated by the busi- 
nessmen and business teachers who have aided 
in the selection of the test items. There is no 
reason to question the statement that the tests 
are valid in that they are sample performance 
tests for the specific vocational area of filing. 
Percentile tables are furnished, based upon a 
wide sampling. 

Tt must be remembered that this test is one 
of a battery of six tests which attempts to 
determine vocational clerical ability. Although 
each individual test is a separate measuring in- 
strument, nevertheless the composite effect or 
picture of the test scores might conceivably pre- 
sent a performance picture which would more 
accurately portray the abilities of the testee 
than could individual tests. This test should be 
judged, in the long run, in terms of the entire 
battery of the United-NOMA Business En- 
trance Tests, 


C. C. Upshall, Psychologist, Industrial Rela- 
tions Department, Eastman Kodak Company, 
Rochester, New York. [Review of the 1946 
Form.] This filing test is designed as a test of 
achievement of people who have studied filing 
or have had filing experience. It is not intended 
to be a test of aptitude or trainability, The scores 
from this test are not given apart from the 
scores on a test of general information and 
English and arithmetic fundamentals. These two 
tests are considered a valuable part of each of 
the six tests of office skills published by the 
National Council for Business Education and 
the Office Management Association, A compos- 
ite score for the filing test is based on the 
weighted scores of this test plus those from the 
general information and fundamentals test. The 
importance of each of these tests is determined 


by the “pooled subjective judgments of experi- 
enced business teachers and office managers. 
The standard deviation of the scores, the relia- 
bility coefficient of the tests, and the coefficient 
of alienation between the separate parts are 
used to assign numerical weights that will reflect 
the judgments of the experienced teachers and 
managers. 

The filing test is made up of five parts, four 
of which must be answered by the testee. The 
parts are: (a) filing each of a list of 50 names 
of people and companies in another alphabetized 
list; (b) filing 50 letters alphabetically accord- 
ing to name; (c) filing the same 50 letters alpha- 
betically according to location; (d) sorting for 
triple check automatic filing; and (e) labeling 
folders for filing. 

A choice is given between Parts (d) and (e). 
Fifty pages of correspondence, reduced in size 
to 334 x 534 inches, serve as the material to be 
alphabetized in Parts (b) and (c). Accuracy 
is stressed more than speed. A small bonus is 
allowed for each minute saved by finishing be- 
fore the two hours assigned to the test are up. 

It is claimed that the test will measure “pro- 
duction speed” rather than “spurt speed,” but 
little proof for the statement is given apart from 
the fact that a candidate may take two hours to 
complete the test. It is doubtful whether a test 
of two hours will show “production speed” 
much better than a half-hour or an hour of 
testing. “Production speed” is based on many 
more factors than can be included in a two- 
hour test. 

Most employment departments in industry 
will find the testing time of two hours for the 
filing test plus the hour or more for the general 
information and fundamentals test too long 
to devote to gathering information about file 
clerks, at least until clear evidence is given of 
the high validity of these tests, The use of the 
tests at the end of a training period is to be 
recommended. Employment departments might 
well encourage business schools to give these 
tests and report the results to the firms that hire 
their graduates. 

The filing test may well prove to be one of 
the most valid now available. 
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*General Test of Business Information. Grades 9- 
16; 1942-43; Forms A, B; $1.15 per 25, postpaid; 20€ 
per specimen set, postpaid; 40(45) minutes; Stephen J. 
Turille; Bureau of Educational Measurements, Kansas 
State Teachers College of Emporia. 
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Vera M. Amerson, Manager, Walsh School 
of Business Science, Miami Beach, Florida. 
This test is an effort to measure a high school 
or college student’s ability to recall informa- 
tion about consumer business education and to 
solve certain practical problems. The test in- 
cludes such fields as record keeping, business 
papers, business English, buying habits, basic 
legal principles, and business dress. 

The test consists of three major sections and 
a total of 112 items. Sections I and II, consist- 
ing of 37 and 12 items respectively, carry the 
traditional titles of “True-False Test” and 
“Multiple Choice Test” instead of titles indicat- 
ing the kind or kinds of information being 
tested. Since the test is named General Test of 
Business Information, some may question the 
inclusion of Items 1 and 38, which deal with 
club records. Some may also prefer “journal,” 
“ledger,” or some other business term to the 
use of the word “diary” in Item 23. No doubt 
the word “mailed” in Item 42 (“A package 
valued at $150.00 to be mailed third class should 
be (1) airmailed, (2) expressed, (3) freighted, 
(4) registered.”) will be changed in future edi- 
tions, since it weakens responses (2) and (3). 

Section III, Business Problems Test, consists 
of 63 short answers to be filled in by the student. 
Since only one example is given for this section 
on the front page of the test, the student is likely 
to gain from the problem given the idea that the 
entire section consists of mathematical prob- 
lems. When work is actually begun, the student 
will find that in addition to mathematical prob- 
lems he has to do such tasks as filling in a check, 
writing an endorsement, checking a list of items, 
and answering information questions such as 
“What do the letters C.O.D. stand for in the 
business world?” and “At least how often 
should personal property inventories be taken ?" 

Although the test is intended for college stu- 
dents as well as for high school students, statis- 
tical data are given only for high school students. 


Herbert A. Tonne, Editor, The Journal of 
Business Education, and Professor of Educa- 
tion, New York University, New York, New 
York. This comprehensive test of the minimum 
essentials of consumer business information 
covers such areas as check writing and endorse- 
ments, business papers, notes and drafts, basic 
legal principles, recording, and personal-use 
business information and buying practices. Each 
form is composed of 112 items. The reliability 


coefficient is listed as being .91 with low prob- 
able error. This would make the test unusually 
reliable as classroom tests of this type go. The 
validity of any test of this type is, of course, 
subjective, but care seems to have been taken 
to insure adequate coverage of subject matter by 
checking available texts and syllabi and by sub- 
mission of the material to thoughtful critics. 

The test seems broader in scope than the 
Thompson Business Practice Test (38:942) 
and about as reliable and useful for classroom 
and testing purposes. Some of the test items 
are open to dual interpretation and a few of the 
answers given in the key are not correct. This, 
however, is not at all unusual in teacher service 
material of this kind. 

A comparison is given of scores made by 539 
pupils on Form A at midyear with scores of 465 
pupils on Form B at the end of the year, No 
indication is given, however, of whether these 
pupils were the same persons, and whether they 
had taken a subject such as junior business 
training during this interval. This hardly makes 
possible the “intelligent interpretation” which 
the manual of direction suggests is provided. 
Moreover, a pattern is given for translating the 
scores into school marks without reference to 
subject taken, age of student, or intelligence. 
Scores which are satisfactory for low-intelli- 
gence students would hardly be adequate for 
those of higher intellectual capacity; scores 
which might be high for a high school freshman 
would be very unsatisfactory for a college 
senior ; yet the test description indicates that the 
test “covers the minimal essentials of consumer 
business information that a high school or col- 
lege student should possess.” It is possible, how- 
ever, that these weaknesses are the fault of the 
publisher rather than the author. As most class- 
room teachers at present must construct tests, 
the publisher has a serious responsibility for 
eliminating those weaknesses in a test which, 
while they are quite tolerable when used in the 
author's classes for his own use, make the test 
unsatisfactory when presented for nation-wide 
distribution. 

Nevertheless considering the dearth of stand- 
ardized materials in the subject matter of gen- 
eral business, this test must be considered a 
distinct contribution. If the teacher ignores 
some of the suggestions for translating test 
scores into marks and is careful in the use of 
the key, the test will be found a valuable aid in 
determining classroom achievement in a general 
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business class and will provide at least a partial 
basis for diagnosis for remedial teaching. 


[ 381 ] 
Hiett Stenography Test (Gregg). First, second se- 
mesters high school; 1938-39; Forms B, C; 2 levels; 
$1.15 per 25, postpaid; 15¢ per specimen set, postpaid; 
40(45) minutes; Victor C. Hiett and H. E. Schrammel ; 
Bureau of Educational Measurements, Kansas State 
"Teachers College of Emporia. 
a) TEST 1. First semester. 
b) test 1. Second semester. 


Agnes E. Osborne, Assistant Professor, 
The City College of New York, New York, 
New York. Test I covers the work of the first 
semester and Test II covers the entire year’s 
work. The same kind of test items is used in 
each test and each test consists of five parts. 
Part I consists of a list of 50 printed words 
which are to be reproduced in shorthand in 
blanks provided for that purpose. No criteria 
for the selection of the words are given for this 
part of the test or for the remaining parts. 
Part II consists of a list of 40 shorthand sym- 
bols which are to be transcribed on blank lines 
provided for that purpose. In Part II, not 
enough space has been provided for the stu- 
dent’s longhand transcript while more than 
enough has been provided for the shorthand 
outlines in Part I. The authors probably expe- 
rienced difficulty in arranging these two parts on 
the same page. Part III contains a list of 20 
shorthand sentences, and opposite each sentence 
is a group of 4 words, one of which is to be 
chosen to complete the meaning of the shorthand 
sentence. The groups of words opposite the first 
IO shorthand sentences are in longhand, and 
those opposite the last 10 sentences are in short- 
hand. In this section of the test, an attempt has 
been made to test the student’s reading ability 
without resorting to the requirement of a com- 
plete transcript of the material. Part IV is a 
printed article of 200 words. The student is to 
write the shorthand outlines on the line imme- 
diately above each word. This test requires a 
skill that has not been practiced in the average 
shorthand class, and it remains to be determined 
whether writing outlines directly above printed 
words that are in context is indicative of short- 
hand skill. Part V consists of an exercise in 
letter dictation and longhand transcription. This 
part is to be graded only for accuracy of tran- 
scription. As the dictation for Part V is to be 
given in 3 minutes at the beginning of the testing 
period and 40 minutes is allowed for the remain- 
ing parts of the test, including the longhand 


transcription of the letter, it is evident that no 
attempt has been made to find out how fast the 
student transcribes. Because of the way the time 
is allotted, it is quite possible that students may 
not reach this practical section of the test—the 
transcription test, which approaches more closely 
than any other part of the test the objective 
of the shorthand course. 

The manual that accompanies the test con- 
sists of 2 pages of information. The authors are 
vague concerning the construction of the test. 
Insufficient analytical and statistical data regard- 
ing the important phases of selection of test 
items, scoring, weighting of parts, length of 
parts, and timing are submitted. The only data 
available on the reliability of the test are stated 
in one sentence: “Various components of parts 
yielded reliability coefficients for the several 
forms of the test from .65 to .85, average .75. 
These figures would seem to indicate fairly low 
reliability. Further information on the number 
of cases and the distribution of the scores 1s 
desirable if one is to get a very precise idea of 
reliability. Norms for the test are given in the 
form of percentile scores and were computed 
from the scores made by 5,296 students in 358 
schools. 

The materials for this test have been assem- 
bled in a 4-page booklet. Parts I and II are on 
the front page—evidently it is immaterial 
whether all students begin working on the test 
at exactly the same time. Some may fill in the 
information in the heading more quickly than 
others do and begin the test immediately or, at 
least, see the test material before the others do. 

The scoring key and the directions for admin- 
istering the test have been reproduced by a 
duplicating process and are therefore more diffi- 
cult to read than print would be. Some of the 
shorthand outlines on the scoring key are not 
discernible, and on some copies the directions 
are difficult to decipher. The shorthand outlines 
on the test forms are fairly good. The directions 
for administering are brief and too indefinite to 
insure the same performance by each adminis- 
trator. It is quite possible that students would 
feel the necessity for asking questions on Part 
III, especially where the performance varies 
from the average activity in the classroom. A 
sample answer is advisable for this part. 


[ 382 ] 
Kauzer Typewriting Test. First, second, fourth se- 
mesters high school; 1934; 3 levels; 90¢ per 25, post- 
paid; 15¢ per specimen set, postpaid; 15(25) minutes; 


RENE ooo 
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Adelaide Kauzer and H. E. Schrammel; Bureau of 
Educational Measurements, Kansas State Teachers 
College of Emporia. 

a) TESTI, FORM A. First semester. 

b) TEST I, FORM B. Second semester. 

C) TEST II, FORM A. Fourth semester. 


E. G. Blackstone, Professor of Business Edu- 
cation, The University of Southern California, 
Los Angeles, California. There are three of 
these tests. Test I, Form A, designed for use 
at the end of one-half year of instruction, con- 
sists of a timed writing and a paragraph copying 
exercise. Test I, Form B, designed for use at 
the end of one year of instruction, consists of 
a timed writing exercise and a paragraph that 
is to be centered on the page. Test II, Form A, 
to be used at the end of two years, consists of 
a timed writing and a simple tabulation. The 
timed writings are 15 minutes in length, and are 
scored by International Rules. 

Percentile charts are provided for interpret- 
ing each part of each test. The timed writings 
of Forms A and B of Test I are supposed to be 
of equal difficulty, but no statement is given as 
to how they were equated. No statement is given 
on reliability or validity. 

It appears that an effort has been made in the 
timed writings to provide material of little diffi- 
culty. Very few numbers or special characters 
are included, hence such factors are not meas- 
ured. The accuracy grade is determined by mul- 
tiplying the errors by 3 and subtracting from 
100. Why this particular penalty is used is not 
clear. The speed grade, determined by Inter- 
national Contest Rules, is transposed by means 
of a table into scores from 100 down. 

Teachers using these tests will need to learn 
how to interpret such scores, for the old method 
of using net words per minute does not apply. 
Perhaps this is an advantage. The scores in the 
other parts of the tests than the timed writings 
are combined with the timed writing scores by 
adding the totals and dividing by three. These 
may then be interpreted by means of percentile 
charts. Why this particular weighting of the 
parts is used is not justified. 

The use of International Contest Rules for 
scoring seems to be a doubtful method, since 
those rules have been subjected to much criti- 
cism as being arbitrary, contrary to business 
office practices, and unscientific in nature. There 
seems to be no effort made in the test results to 
make possible any comparison of student scores 
and business standards. 

The tests are rather similar to those with 


which teachers are familiar but have the same 
faults that most previous typing tests.have had, 
namely, uncertainty as to the difficulty of copy, 
failure to test all characters on the keyboard, and 
an arbitrary grading scheme. 


[ 383 ] 
*Kimberly-Clark Typing Ability Analysis. Grades 
9-12 and adults; 1942; preliminary manual; 1 form; 
out of print; nontimed (60) minutes; Clifford E. 
Jurgensen; Science Research Associates. 
REFERENCES 

1. JURGENSEN, CLiFFORD E, “A Test for Selecting and Train- 
ing Industrial Typists.” Ed & Psychol Meas 2:409-25 O '42. * 
(PA 17:1350) 

E. G. Blackstone, Professor of Business Edu- 
cation, The University of Southern California, 
Los Angeles, California, These tests were de- 
signed either as an employee selection aid or for 
training purposes. The booklet contains four 
exercises: typing a corrected copy of a rough 
draft, typing a simple tabulation, typing a short 
business letter from penwritten copy, and a 
tabulation of three columns requiring the stu- 
dent to rearrange and alphabetize the copy. 

The grading is based on a time score and an 
error score. The time is the number of minutes 
required to complete the test. The error score 
is based on three types of errors that are penal- 
ized at increasing rates: corrected errors, minor 
errors, and gross errors. A chart is provided to 
aid the teacher in classifying errors. 

Speed and accuracy scores may be combined 
into one score by use of an Error Score Convert- 
ing Chart which seems to have been devised by 
approved statistical methods. Combined scores 
may be determined in three ways: one by giving 
equal weight to speed and accuracy ; one giving 
speed twice as much weight as accuracy; and 
one giving accuracy twice as much weight as 
speed. Percentile charts make it possible to 
interpret scores for speed alone, for accuracy 
alone, or for any one of the three combinations 
of speed and accuracy. The chart gives percen- 
tiles for office typists and for high school 
seniors. 

Careful attention has been given to statistical 
treatment. The items were determined by con- 
ferences with office supervisors, then prelimin- 
ary forms were prepared and tried out. Then 
comparisons were made of the test results and 
efficiency ratings of office typists. The test sec- 
tions retained were chosen on the basis of high 
correlation with the criteria and low intercor- 
relations. 

Reliability coefficients range from .72 to .84, 
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depending upon the method used for combining 
speed and accuracy. These are fairly high corre- 
lations. Validity correlations between total test 
Scores and office efficiency ratings were .957, 
which is almost unbelievably high and could be 
possible only by chance or by means of far bet- 
ter efficiency ratings than are ordinarily avail- 
able. The validity correlations of the various 
elements are: time, .79; converted errors, 71; 
raw errors, .65; gross errors, .45; minor errors, 
-55; and corrected errors, .33. 

"These tests attempt to measure elements of 
typing other than straight copy elements, al- 
though straight copy is involved to a degree. 
"They have been given careful and complete sta- 
tistical treatment of a type that may well be 
recommended to other testmakers. 


[ 384 ] 


Machine Calculation Test: United-NOMA Busi- 
ness Entrance Tests. Schools and industry ; 1939-47 ; 
new form usually available annually ; prior to 1947, the 
series title was National Clerical Ability Tests; $5 per 
25; $1 per test .. hen scoring service and certificates for 
acceptable work are desired ; $2 per specimen set of the 
6 tests in the series; 120(130) minutes; prepared by 
the Joint Committee on Tests representing the United 
Business Education Association and the National Office 
Management Association ; National Office Management 
Association. 


Elizabeth Fehrer, Associate Professor of 
Psychology, Michigan State College, East Lan- 
sing, Michigan. [Review of the 1946 Form.] 
This test, like the others in the series, is an 
achievement test intended for evaluation of 
the student at the end of a business training 
course and for selection and evaluation of cal- 
culating machine operators. The test is designed 
to be a production-speed test in contrast to a 
spurt-speed test. Two hours are allowed for 
administration, and consequently the ability to 
do sustained accurate work is required. Either 
an electrically operated or a hand-operated ma- 
chine may be used. 

The test measures proficiency in the four 
fundamentals and in normal fractions and per- 
centages. The majority of items are in the form 
of specific problems. Many are fairly long and 
involve considerable computation before the 
final value, which alone is scored, can be com- 
puted. The items were selected to sample actual 
office work, and they cover a wide variety of 
problems. On the basis of the length, complexity, 
and representativeness of the items, the face 

validity of the test seems to be high. 

A new form of the test is prepared each year ; 


and although “the exact steps taken in the con- 
struction of the tests vary somewhat from year 
to year,” the construction of the tests apparently 
adheres to sound procedures. A preliminary 
form is tried out on a sample of students and 
office workers. Analysis of the tryout results 
includes a study of the reliability of the scoring 
procedure and an item analysis with a discard 
of all nondiscriminating items. 

Experimental evidence concerning the valid- 
ity of the test is lacking at present although a 
follow-up study is in progress “to ascertain the 
business success of those who took the tests in 
1937 and 1938 and to determine whether or not 
the certificate holders are making good, whether 
or not those who failed to win a certificate had 
any great difficulty in making good in jobs for 
which the tests said they were not qualified, 
and whether or not the certificate was helpful 
in securing employment.” Because of the simi- 
larity of the successive forms, the results for 
the earlier editions would presumably apply to 
subsequent editions of the test. 

Reliability, although not determined for each 
edition of the test, is probably high. A study of 
the 1941 edition indicated a split-half reliability 
coefficient of .96. 

This test, like the others in the series, is ad- 
ministered only in certified test centers on speci- 
fied dates and under adequate supervision. Test 
papers are returned to the central office where 
they are scored and results later reported to 
schools or employers. Norms based on those 
taking the test are in terms of percentiles. A 
minimum acceptable score on each test is speci- 
fied, but the basis for determining this critical 
score is not described. Successful candidates are 
sent a Certificate of proficiency in the subject. 

Earlier forms of the test are available for 
practice or training purposes. 


[ 385 ] 
Parke Commercial Law Test. High school; 1933; 
I form; gog per 25, postpaid; 15¢ per specimen set, 
postpaid; 40(45) minutes; L. A. Parke; Bureau of 
Educational Measurements, Kansas State Teachers 
College of Emporia. 


Ray G. Price, Associate Professor of Educa- 
tion, The University of Minnesota, Minneapolis, 
Minnesota. This test was “originally con- 
structed for use in the Nation-wide Every 
Pupil Scholarship Test of April, 1933.” The 
test, consisting of 120 items, is easily adminis- 
tered. The typography and make-up are not 
superior; however, they are satisfactory even 
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though the print is rather small. Percentile 
norms are provided, based on 623 cases. 

No statistical evaluation is given of the valid- 
ity of the test. "The test is based on the content 
of a number of respectable textbooks used in 
high school classes. The distribution of items is 
based on textbook content and judgment of 
commerce teachers." 

The test does not use the preferred technique 
of arranging the items in the order of ascend- 
ing difficulty. The true-false section of the test 
contains 34 false items and only 16 true items. 
It would seem desirable to have a better bal- 
ance of true and false items. 

Numerous clues and suggestions occur that, 
by the alert student, could be used to determine 
the correct response even though he were not 
familiar with the subject matter. Two examples 
from the matching items will illustrate this 
point: (a) “A condensed history or synopsis of 
a title to real estate." Answer, "Abstract of 
title." (b) "Statutes limiting the time within 
which law suits to recover damages may be 
brought." Answer, "Statutes of limitation." The 
rather consistent use of such terms as "all, 
“not,” “always,” "never," and “must” in the 
negative forms of items are other recognizable 
clues. 

This is a rather comprehensive test covering 
the essential principles of an elementary course 
in commercial law. It is easy to give and to 
score. It would be of value to the teacher who 
wished to evaluate the attainment of her class. 
The test should, however, be revised in the light 
of the content changes in business law since 
1933 when the test was first published. 


[ 386 ] 
*Seashore-Bennett WO RE Proficiency 
Tests: A Standard Recorded Stenographic Work- 


sample. Adults; 1946; Forms B-1, B-2; "Transcription 
Style (3314 rpm): $16 per set, postpaid; Standard 
Record Style: $17 per set, postpaid; $2 per 100 sum- 
mary charts, postpaid; 35¢ per manual, postpaid; (50- 
60) minutes ; Harold Seashore and George K. Bennett; 
Psychological Corporation. 


Ann Brewington, Associate Professor of 
Business Education, School of Business, The 
University of Chicago, Chicago, Illinois. These 
phonographically-recorded proficiency tests are 
worksamples which can be used by business 
firms, schools, and employment agencies in set- 
ting their own standards, Business firms can 
use them advantageously in hiring stenographic 
employees, in promoting on a work-production 
basis, and in setting achievement standards in 


training programs within the firm. In fact, one 
of the two forms is sold only to business firms 
in order to guarantee that testees will not have 
had access to the tests in schools or at employ- 
ment agencies. 

While careful preliminary work is evidenced 
in the selection of subject matter, in the length 
and form of the letters, and in determining what 
constitutes mailability, the most significant work 
has been on the problem of measuring rate. The 
use of records controls rate and at the same time 
eliminates the human factors influencing the 
two people constituting the dictation situation. 
Scoring omitted, mutilated, and substituted 
words and limiting the transcription time meas- 
ure rate and make it possible for the weighting 
of each of the other five factors of mailability 
to vary from job to job. The “just graduated" 
are differentiated from the experienced; the 
unemployable, from the “can learn," the compe- 
tent, and the eminently competent. These pro- 
cedures should facilitate the solution of a diffi- 
cult problem—how to weight the various kinds 
of errors in stenographic tests which vitiate the 
significance of a single-number score. 

Total dictation time for the five letters is 
approximately 20 minutes; total transcription 
time, 30 minutes. From one to fifty people can 
take dictation at the same time. Norms are not 
yet available. New forms will be added to the 
series as the tests become well known. 


[ 387 ] 
Shemwell-Whitcraft Bookkeeping Test. First, sec- 
ond semesters high school; 1937-38; Forms A, B; 2 
levels ; 9o per 25, postpaid; 15¢ per specimen set, post- 
paid; 40(45) minutes; E. C. Shemwell, John E. Whit- 
craft, and H. E. Schrammel; Bureau of Educational 
Measurements, Kansas State Teachers College of Em- 
poria. 

a) TEST 1. First semester. 
b) test 1r. Second semester. 


Arnold E. Schneider, Head, Department of 
Business Education, Western Michigan College 
of Education, Kalamazoo, Michigan. Although 
the objective type of bookkeeping test has been 
in common use during the past two decades, ` 
nevertheless few innovations or new and chal- 
lenging objective-test devices or situations have 
been devised. It may be stated that the Shem- 
well-Whitcraft Bookkeeping Test follows a 
too often repeated pattern. The tests are, at 
best, mediocre measuring instruments of the 
subject matter at this level. 

This test is designed to measure the achieve- 
ment of pupils at the end of the first semester 
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and the second semester of the first year of 
bookkeeping. The tests are supplied in alternate 
Forms A and B for each semester. 

The four tests comprising the series are con- 
structed on the following pattern: 50 true-false 
items, 25 multiple matching items, 25 comple- 
tion, 25 matching, and 5 problems, a total of 
130 items with a score potential of 150. The 
faultiness of this distribution of test items lies 
in the fact that the measuring instrument is rely- 
ing heavily on a type of test item the validity of 
which, for purposes of measuring bookkeeping 
skill, is open to question. This is particularly true 
of the 50 true-false items and the 25 completion 
items, 

Apparently no attempt is made to test, in a 
comprehensive objective fashion, several of the 
basic concepts which the student of bookkeep- 
ing should grasp during his first year’s work. 
Certainly some means of measuring the stu- 
dent’s knowledge, skill, and abilities in dealing 
with adjusting and closing entries, the working 
- sheet, the profit and loss statement, and the bal- 
ance sheet should have been devised and in- 
cluded in the test. The only two specific areas 
of bookkeeping performance which are actually 
tested are recognition and selection of proper 
journal entries and recognition for the purposes 
of account classification. 

A careful reading of the test items leaves 
much to be desired in the way of sharp and clear 
word usage. There is a good deal of ambiguity 
in many of the questions, The phraseology is not 
sharp and clear cut. Many of the questions are 
highly irrelevant. The claim for validity is based 
upon “common content of several leading text- 
books and courses of study,” but a study of the 
questions used in the true-false sections and the 
completion sections would suggest that many of 
the test items were built on a basis of testing 
over those areas which lent themselves to test 
construction. 

The fact that forms are supplied, that reliabil- 

_ ity coefficients for the tests have been deter- 
mined, and finally that percentile scores for an 
adequate population are supplied makes this test 
of limited value to the classroom teacher of high 
school bookkeeping. It does provide a compara- 
tive measure of those areas which the test does 
test. It is doubtful whether the test does, as the 
writers claim, determine pupil achievement, 
check the efficiency of instruction, or analyze 
pupil and class weaknesses. The alert and com- 
petent classroom teacher, who is not too over- 


worked, should be readily able to construct a 
test which, for her purposes, would measure 
achievement more accurately than does this test. 


[ 388 ] 

rthand Test: State High School Tests for 
oes as second, third, fourth semesters high 
school ; 1945-46; Form A; 5¢ per test; 15¢ per sample 
test; 22(40) minutes; M. E. Studebaker, B. M. E 
ford, Vernal H. Carmichael, Frances R. Botsford, and 
Russell Burkhart; State High School Testing Service; 
Purdue University. 


[ 388a ] 
*SRA Clerical Aptitudes. High school and adults; 
1947; Forms AH, BH ; separate answer pads must be 
used; 43¢ per 25 profile sheets; 75¢ per specimen set ; 
25(35) minutes; prepared by Richardson, Bellow $, 
Henry and Co., Inc.; Science Research Associates. 


[ 388b ] 
*SRA Dictation Skills. High school and adults; 
1947; separate answer pads must be used; the Ac- 
curacy Album includes two 12-inch records (78 rpm) 
presenting I practice and 5 test letters for measuring 
dictation accuracy; the Speed Album includes four 
12-inch records (78 rpm) presenting 1 practice and 8 
test letters for measuring dictation speed ; $1875 per 
set of albums; 45 per test booklet with answer pad; 
$1.65 per 25 answer pads; (60) minutes; Marion W. 
Richardson and Ruth A. Pedersen; Science Research 
Associates. 

[ 388c ] 
*SRA Language Skills. PoredeRidh De I 
form; $1.90 per 25; 50¢ per specimen set; 15(20) min 
utes; Marion W. Richardson and Ruth A. Pedersen; 
Science Research Associates. 


[ 388d ] 
*SRA Typing Skills. High school and adults 7 1947; 
$1.90 per 25; Sof per specimen set; 10(15) minutes; 
Marion W. Richardson and Ruth A. Pedersen; Science 
Research Associates. 

[ 389 ] 
Stenogauge. Stenographers ; 1922; $3 per introductory 
set including ro tests, postpaid ; 4¢ per extra test, post- 
paid; introductory sets may be ordered on approval; 
(12-20) minutes ; Eugene J. Benge; Management Serv- 
ice Co. 

Beatrice J. Dvorak, Chief, Worker Analysis 
Section, U. S. Employment Service, Washing- 
ton, D. C. The Stenogauge is a proficiency test 
for stenographers and typists. It measures dicta- 
tion speed, transcription speed, transcription 
accuracy, and spelling accuracy for stenogra- 
phers; and typing speed, typing accuracy, and 
spelling accuracy for typists. These elements 
enter into a single composite score, which is con- 
verted to a percentile rank. The user is in- 
structed to test his stenographers and typists to 
find the average rank in his own organization, 
and to compare the scores or ranks of applicants 


with those of employees with whose work he is 
familiar. 
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A practice period is provided, if necessary, 
for the applicant to become familiar with the 
typewriter which is to be used during the test. 

The dictation is not read at a standardized 
speed, but at a speed “which presses the appli- 
cant slightly.” The material dictated consists of 
a business letter, about 200 words long. Its tone 
may appeal to a few employers. A typical sen- 
tence is: “In the second place you insinuate that 
I, personally, have a grudge against you.” The 
punctuation is also dictated. Following the dic- 
tation, a 50-word spelling test (requiring that 
the words which are misspelled—about 25—be 
identified) is interjected. The dictated material 
is transcribed after the completion of the spell- 
ing test. 

The test for typists is similar to the test for 
stenographers, except that the dictation test is 
omitted, and the typing is from a perfect copy. 
For typists, the composite score is the sum of 
the typing rate per minute, the percentage of 
typing accuracy, and the spelling accuracy. The 
norms, or percentile ranks, were established, ac- 
cording to the author, in the period about 1922 
after validating the test by comparing test scores 
with ratings of stenographic ability. A question 
arises regarding the weighting of the elements 
tested. For stenographers the composite score is 
the sum of the dictation rate (words per min- 
ute), the transcription rate (words per minute), 
the transcription accuracy (per cent of dicta- 
tion correctly transcribed), and spelling accuracy 
(per cent of correctly recognized spelling and 
misspellings.) By what criterion it was deter 
mined that each of these elements should have 
unit weight, in terms of the indicated units of 
measurement, is not indicated. It is very unlikely 
that these are the most valid or the optimum 
weights. In any event, norms for each of the 
elements separately might have been more ap- 
propriate for placement purposes. 


[ 390 ] 
Stenographic Aptitude Test. Grades 9-16; 1939; 
I form; $1.60 per 25, postpaid; 35€ per specimen set, 
postpaid; (25) minutes; George K. Bennett; Psy- 
chological Corporation. 
REFERENCES 
1. Barrett, Domoruy M. “Prediction of Achievement in 


Typewriting and Stenography in a Liberal Arts College." J Appl 
Psychol 30:624-30 D '46. PBA 21:1624) = i 


Philip H. DuBois, Professor of Psychology, 
Washington University, St. Louis, Missouri. 
Compared with other available tests in the 
same field (the Turse Shorthand Aptitude Test 
and the E. R. C. Stenographic Aptitude Test), 


Bennett’s Stenographic Aptitude Test does not 
attempt to include as wide a variety of func- 
tions in its measurement of the abilities re- 
quired to master shorthand. Consequently, per- 
haps, its reported validity is not as high as 
those stated by the authors of the tests men- 
tioned above. Actually no validity coefficient 
for the entire test is reported, the validities of 
the transcription score being of the order of .27 
and of the spelling score .48 (both point bi- 
seriale), with a negligible correlation. being 
found between the two parts. The multiple cor- 
relation between the test as a prognostic instru- 
ment and later success in a private secretarial 
school would appear to be of the order of .55. 

The manual states that an early study of the 
prediction of success in typing and shorthand 
showed that little reliance could be placed on 
standard intelligence test scores or on special 
manipulative tests. It is therefore somewhat 
surprising to find a number of correlations of 
the subtests with Alpha scores, based upon 
investigations in 1939 and 1945. Certainly cor- 
relations of any aptitude tests with other pub- 
lished tests are of interest to workers in the 
test field, but this information presented with- 
out interpretation under the heading of “Stand- 
ardization" appears to be gratuitous. 

A more workmanlike job could have been 
done in presenting the validity information. 
Since the point biserial is affected by the point 
of the dichotomy, some indication should have 
been given about the division of the group as 
to success or failure in training. As success in 
a secretarial school is probably not an all-or- 
none affair, the Pearsonian biserial instead of 
the point biserial could have been justified (and 
would have yielded higher coefficients !). 

The reliability reported for the transcription 
test (.975) is perhaps spuriously high because 
of the way it was computed, namely, by corre- 
lating the scores made in odd minutes with 
those made in even minutes. The test requires 
substituting arbitrary symbols for five digits 
and, on a second page, translating the symbols 
back to the original digits. A high degree of 
learning is probably involved, which could 
easily boost the reliability. A more conserva- 
tive estimate of the reliability could have been 
obtained either by separately timed halves or 
by development of an alternative form. 


Edward A. Rundquist, Assistant Director of 
Personnel Research, Owens-Illinois Glass Com- 


391 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 460 


pany, Toledo, Ohio. This is a well-constructed 
test of model simplicity in administration and 
scoring. In this respect, as well as in its brief 
time limits, it is markedly superior to euch 
tests as the E. R. C. Stenographic Aptitude 
Test and the Turse Shorthand Aptitude Test. 
Unfortunately, its validity cannot be accurately 
ascertained from the published data. 

There are two subtests—a substitution test 

involving the translation of numbers to sym- 
bols and vice versa and a test of ability to 
identify and correct misspelled words, Validity 
was obtained by testing some seven hundred 
entrants to a private secretarial school and ob- 
taining a measure of success or failure about 
nine months later. The criterion is broader but 
less objective than that of objective tests of 
shorthand transcription used to validate the 
tests mentioned above. The use of this criterion 
is in line with the author’s purpose of predict- 
ing success in learning both typing and short- 
hand. It probably includes more than this. It 
is probably more closely related to job success 
than to the narrower criterion of ability to 
transcribe shorthand. The effort to collect such 
a criterion is difficult but, if successful, pays 
dividends. The difficulty in this instance is that 
the resulting validity coefficients are not capa- 
ble of interpretation from the published data. 
Validity coefficients of .27 and .47 for the sub- 
stitution and spelling tests are reported in point- 
biserial terms. These coefficients are the same 
for groups with and without typing or short- 
hand experience. These coefficients vary mark- 
edly with the percentage in the success and 
failure groups. No information is given on this 
point, nor are success and failure defined in 
any way. A further barrier to interpretation is 
that of the 700 students tested some 200 were 
eliminated for lack of an unambiguous record 
of success or failure. Where these cases are 
located on the criterion distribution will make 
a considerable difference in the results ob- 
tained. No test scores for the eliminated group 
or reasons for lack of unambiguous criteria 
data are presented to permit evaluation of this 
point. 

No validity is presented for total test score 
although the norms are in these terms. The 
product moment correlation between the sub- 
tests is around zero, so the tests will combine 
to yield maximum prediction. Could the point- 
biserials be taken at face value, the multiple 


would be substantially greater than the .47 ob- 
tained for the spelling test, but in the absence 
of meaningful basic correlations, there is no 
incentive to make the calculation. 

The author makes moderate claims for the 
test. In the reviewer’s experience with the pre- 
diction of shorthand success from spelling 
tests, he would anticipate a correlation in the 
neighborhood of .4. From the published data, 
however, we cannot tell whether the validity 
is higher or lower than that reported. ; 

The test is highly reliable. The manual is 
clearly written and directions for administra- 
tion easy to follow. Norms are presented for 
a wide range of groups. In view of the many 
good features of this test further validation 
studies are well warranted. 


[ 391 ] 


Stenographic Test: United-NOMA Business En- 
trance Tests. Schools and industry; 1930-47; new 
form usually available annually; prior to 1947, the 
series title was National Clerical Ability Tests ; $5 per 
25; $1 per test when scoring service and certificates 
for acceptable work are desired ; $2 per specimen set of 
the 6 tests in the series; 125(140) minutes; prepared 
by the Joint Committee on Tests representing the 
United Business Education Association and the Na- 
tional Office Management Association; National Office 
Management Association. 


Ann Brewington, Associate Professor of 
Business Education, School of Business, The 
University of Chicago, Chicago, Illinois. [Re- 
view of the 1946 Form.] This test is one of a 
group of six worksample tests jointly con- 
structed and administered annually by the Na- 
tional Office Management Association and The 
United Business Education Association, in an 
effort to lessen the differences between school 
standards and office standards. It is an attempt 
to measure the degree of “readiness for work” 
of skilled but uninitiated workers. 

With no significant change in the structure, 
content, or scoring, the proportion of testees 
from approximately 200 high schools and col- 
leges during an eight-year period receiving cer- 
tificates has ranged from 24 to 76 per cent. 
Approximately 3,200 Stenographic Certificates 
have been issued to testees passing the test. 

The dictation time is 30 minutes plus 5 min- 
utes for redictation as necessary, at varying 
speeds, followed by a go-minute transcription 
period. The maximum score is 230 points; the 
minimum acceptable score is 69 points. The re- 
liability is .9o. The validity is yet to be estab- 
lished. 
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Elizabeth Fehrer, Associate Professor of 
Psychology, Michigan State College, East Lan- 
sing, Michigan. [Review of the 1946 Form.] 
This is a shorthand achievement test “built on 
the assumption that a stenographer must be 
able to take ordinary dictation for reasonably 
long stretches at a time and get out her notes 
promptly and acceptably. Hence the candidate 
is subjected to a 35-minute dictation test and 
à 90-minute transcription test.” 

. The test aims to approximate work under 
actual office conditions and appears tz do so. 
The items to be dictated consist of nine letters 
which are to be transcribed in mailable form 
and straight matter to be typed in the form of 
a first draft. Specialized vocabulary has been 
avoided. The dictation is given at an aver- 
age speed of 75 words a minute and simu- 
lates natural original dictation as distinguished 
from stenographically phrased, quarter-minute 
marked school dictation. At the end of certain 
items a short time is allowed for redictation. 
In transcribing, a dictionary is allowed. Tran- 
scripts must make the sense intended by the 
dictator, but the words dictated and those 
transcribed need not be identical in all cases. 

A new edition of the test is prepared each 
year. Determination of the reliability of the 
test is not made for each edition. For the 1941 
edition the reliability coefficient was .90 which 
is satisfactory. The very length of the test 
seems sufficient to insure comparable reliability 
for the more recent editions, although this is 
something that could easily be verified for each 
new form. 

The validity of an earlier form of the test 
was studied by comparing the scores of fifty 
high school students with those of fifty em- 
ployed stenographers. The results (which are 
not available in their details) indicated that 
“the high school young people did superior 
work in stenography in the matter of place- 
ment, neatness, and accuracy as far as they 
went; but by the time they reached the middle 
of the test they faltered and either did not fin- 
ish or jumbled their transcription so as to be 
unusable. The working young people had the 
stamina and finished the test easily but with 
a lower quality.” This brief validity study sug- 
gests that the test probably does in fact meas- 
ure proficiency on the job and that the test 
score will indicate the immediate usefulness of 
an applicant. Use of the test might also have 
value in making the business school course in 


shorthand more practical if, in order to pre- 
pare students for the test, office practices are 
more closely duplicated in the classroom. 

Like the other tests in the series, this test 
is given only on specified dates in approved test 
centers and under competent supervision. Pa- 
pers are returned to the central office for scor- 
ing. Percentile norms based on those taking 
the test are later reported to schools and busi- 
ness firms. A certificate of proficiency is sent 
to those students or employees passing the test. 

Earlier forms of the test are available for 
practice or training purposes. 

[ 392 ] 
*Turse-Durost Shorthand Achievement Test 
(Gregg). 1-2 years high school; 1941-42; Form A; 
$1.20 per 25; 35¢ per specimen set, postpaid; (60) min- 
utes; Paul L. Turse and Walter N. Durost; World 
Book Co. 

J Bus Ed 17:39-40 Je '42. * should prove 
valuable to teachers and administrators in as- 
signing grades, sectioning and diagnosing diffi- 
culties, as well as in determining how their 
students stand in relation to other high schools 
and which of their students are ready to take 
positions in business or industry. 


[ 393 ] 
Turse Shorthand Aptitude Test. Grades 8-10; 1940; 
1 form; $1.70 per 25; 35€ per specimen set, postpaid ; 
45(50) minutes ; Paul L. Turse; World Book Co. 


REFERENCES 
1. Turse, PauL L. “Problems in Shorthand Prognosis.” 


J Bus Ed 13:17-8+ My 38. E 
2. TUCKMAN, JACOB. “ SUM of the Turse Shorthand Apti- 
"43. 


tude Test.” J Bus Ed HA 3 
3. Barrett, Dosoruy M. "Prediction of Achievement in 
Typewriting and SEU a Liberal Arts College." J Appl 
Psychol So:6aa in D '46. * (PA 21:1624) - 
. Hosler, Russert J. "Aptitude Testing in Shorthand.” 
J Bus Ed 22:25 My '47. * (PA 22:834) 


Philip H. DuBois, Professor of Psychology, 
Washington University, St. Louis, Missouri. 
This test consists of seven subtests which in- 
volve activities that are reasonably close to the 
work of the stenographer: stroking, which 
involves manual dexterity; spelling of words 
commonly misspelled; phonetic association in 
which words spelled approximately as they are 
pronounced are to be transcribed into conven- 
tional spelling; symbol transcription, in which 
material involving 14 arbitrary symbols is to 
be deciphered; word discrimination; longhand 
writing from rapid dictation ; and word sense, 
or reconstruction of words from abbreviations 
and context. The material was devised after a 
job analysis of stenographic work and, as 
might well be expected, has been found to be 
definitely predictive of success in training. For 
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268 cases, the total score correlates .67 with 
~ the Durost-Turse Correction-Transcription 

Test of Stenographic Achievement, after two 
. years of shorthand. 

The reported odd-even reliability is .98. 
Since all the sübtests have short time limits 
and since there is no indication in the Manual 
of Directions of the difficulties of the items, 
this reliability may be spuriously high. How- 
ever, the fact that the validity is satisfactory 

^. indicates that the instrument is useful and may 
be recommended in screening applicants for 
Stenographic training and for educational 
guidance. 
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Typing Test: United-NOMA Business Entrance 
Tests. Schools and industry; 1939-47; new form 
usually available annually; prior to 1947, the series 
title was National Clerical Ability Tests; $5 per 25; 
$1 per test when scoring service and certificates for 

. acceptable work are desired ; $2 per specimen set of the 
6 tests in the Series; 120(130) minutes; prepared by 
the Joint Committee on Tests representing the United 
Business Education Association and the National Of- 
fice Management Association; National Office Man- 
agement Association. 


E. G. Blackstone, Professor of Business 
Education, The University of Southern Cali- 
fornia, Los Angeles, California. [Review of 
the 1946 Form.] This test includes exercises in 
typing a corrected rough draft, the setting-up 
of a letter from running copy, a simple tabula- 
tion on a form, a simple tabulation on a plain 
sheet of paper, and the typing of a form letter 
with parts to be filled in. Two hours are al- 
lowed for the test. The test is scored for 
form and arrangement of typed matter, accu- 
racy, time consumed, and ability to follow in- 
structions. Any uncorrectible error causes an 
exercise to be rejected; correctible errors are 
accepted but penalized. Neat erasures are ac- 
cepted. All scoring of papers is done at the 
central office; a teacher is not able to score 
them. 

The reliability of the test is estimated to be 
-90. What its actual reliability is has not been 
determined. No figures are available for valid- 
ity. Certificates of proficiency are awarded to 
those whose scores are “as high as, or higher 
than the score representing the minimum of 
office acceptability,” and whose “composite 
score is as high as, or higher than the com- 
posite score equivalent to the minimum ac- 
ceptable skill score." How these minima are 
determined is not announced. 

The test seems to be an improvement over 


the typical copying test because it covers a 
longer period of time and because it includes 
several types of rather typical typing tasks. It 
is unfortunate that the test does not include 
instructions that would enable a typing teacher 
to score and interpret test results. Perhaps the 
tests were not designed as instructional tests 
to be used in schools, but they would be more 
useful if they could be so used. 

It is believed by the reviewer that every hon? 
est effort possible has been made to make the 
tests objective, but they are still lacking in 
statistical evidence that they are reliable or 
valid, that the method of computing a com- 
posite score is statistically sound, or that the 
method of determining what scores shall be 
given the award of proficiency is justifiable. 
Perhaps such statistical justification may be 
attained in the future—but the tests have been 
on the market for a number of years, and such 
justifications seem to be slow in coming. 


Beatrice J. Dvorak, Chief, Worker Analysis 
Section, U. S. Employment Service, Washing- 
ton, D. C. [Review of the 1946 Form.] This 
test is a two-hour proficiency test, designed to 
measure "ability to do practical typing work." 
It consists of five specific typing jobs: typing 
from rough draft, typing numerical data in a 
table (the table form has been set up for the 
examinee), determining the ra: a 
letter on a page, tabulating lists of words, and 
filling in a form letter and preparing it for 
mailing. Credit is allowed for every minute 
less than two hours required by the examinee, 
The score is a composite based on the form and 
arrangement of the typed matter, the accuracy 
of typing, and the time consumed. Failure to 
follow directions is penalized heavily. A table 
of percentile norms, based on high school stu- 
dents, is provided to interpret the composite 
score. A critical score, which approximately 90 
per cent of the students exceed, is presented. 

According to the copyright holder, the valid- 
ity of the test was established by administering 
it to a sample of high school young people 
ready for work and to a sample of office em- 
ployees of from six to eighteen months ex- 
perience. It was found that the employed sam- 
ple did considerably better than the high school 
sample. 

The reviewer's preference is for separate 
speed and accuracy norms—appearance, to the 
extent that it can be reliably measured, being 
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thrown in with accuracy—rather than for com- 
posite-norms, Speed compared with accuracy is 
not important for many typing jobs—typing 
insurance policies, for instance. An insurance 
company might not want a low accuracy score 
to be obscured in the total score by a high 
speed score, 

The test is too time-consuming for wide- 
spread practical use in employment selection. 


[ 395 ] 

XTypewriting Test: State High School Tests for 
Indiana. First, second, third, fourth semesters high 
school; 1942-44; Form A (1st, 3rd semesters) ; Form 
N (and, 4th semesters) ; 4¢ per test; 15¢ per sample 
test; 30(35) minutes; M. E. Studebaker, B. M. Swin- 
ford, Vernal H. Carmichael, Frances R. Botsford, and 
Russell Burkhart; State High School Testing Service, 
Purdue University. 

$ [ 396 ] 
United-NOMA Business Entrance Tests. Schools 
and industry; 1939-47; 6 tests; new forms usually 
available annually; prior to 1947, this series of tests 
was entitled National Clerical Ability Tests; $5 per 25 
of.any one test; $1 per test when scoring service and 
certificates for acceptable work are desired; $2 per 
specimen set; prepared by the Joint Committee on 
Tests representing the United Business Education As- 
sociation and the National Office Management Associa- 
tion; National Office Management Association. 
d) BOOKKEEPING TEST. I20(130) minutes. For reviews, 
see 368. 
b) BUSINESS FUNDAMENTALS AND GENERAL INFORMA- 
TION TEST. 45(55) minutes. For reviews, see 369. 
C) FILING TEST. 120(130) minutes. For reviews, see 379. 
d) MACHINE CALCULATION TEST. 120(130) minutes. For 
a review, see 384. 
€) STENOGRAPHIC TEST. 125(140) minutes. For reviews, 
see 391. 
f) TYPING TEST. 120(130) minutes. For reviews, see 394. 


REFERENCES 
1-9. See 40:1476. 


Paul S. Lomax, Professor of Education and 
Chairman, Department of Business Education, 
New York University, New York, New York. 
The United-NOMA Business Entrance Tests 
are the same as the former National Clerical 
Ability Tests. The change in name is due to 
the discontinuance of the former National 
Council for Business Education and the taking 
over of its testing program by the United Busi- 
ness Education Association, a newly reorgan- 
ized department of the National Education 
Association. The production and management 
of these tests continue under the same joint 
committee representing the National Office 
Management Association and the United Busi- 
ness Education Association. 

The 1946 United-NOMA Business Entrance 
Tests are intended to measure the competencies 


of business-trained students for initial employ- 
mentir one or more of the following kinds of 
office work: typewriting, filing, bookkeeping, 
stenography, and machine calculation. Each 
person who takes one or more of these voca- 
tional tests is also expected to take the Busi- 
ness Fundamentals and General Information 
Test. The business fundamentals part of the 
test deals with spelling, the apostrophe, plurals, 
arithmetic, and social science, all of which are 
assumed to be representative of school-taught 
information, The general information section 
of the test deals with current events, which are 
assumed not to be school-taught information. 
Two hours is allowed for each of the voca- 
tional tests; one hour for the Business Funda- 
mentals and General Information Test. 

The five vocational tests are designed to be 
valid measures of actual office work. The tests 
may or may not be valid measures of actual 
office training programs in schools, since the 
curricular validity of the tests apparently has 
not been determined. We are not told how ade- 
quately each of the tests covers all important 
phases of relevant office work. Nor are we in- 
formed how widely accepted by office man- 
agers are the standards of competency set up in 
the.tests, as, for example, the dictation rate of 
not over 75 words a minute on an average for 
a period of thirty minutes in the stenographic 
test. 

Inadequate data are provided concerning the 
comparability of the various annual series of 
the tests and the norms of achievement result- 
ing therefrom. The criteria of objectivity, ad- 
ministrability, and scorability have been well 
observed in the construction of the tests. The 
utility of the tests still requires much further 
study. No extensive study has yet been made 
to determine how well persons, who have 
passed the previous series of the tests, have 
succeeded in their office work. Nor do we know 
what practical improvements have resulted 
from the use of the tests both in upgrading 
the vocational caliber of office-training pro- 
grams in the schools and in upgrading the 
quality of employment testing. Nevertheless, 
these tests, which were first initiated in 1937, 
are probably the best business entrance tests 
that are yet available; hence, they merit wide- 
spread use by schools and offices and. careful 
study for progressive improvement and stand- 
ardization of the tests. 
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COMPUTATIONAL AND 
SCORING DEVICES 
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International Test Scoring Machine. For scoring 
IBM answer sheets; 1937-38; rents for $480 per year 
to all except colleges and universities, to whom the 
rental is $400 per year; the Graphic Item Counter is 
available for rental at $240 per year and $200 per year 
to colleges and universities; standard answer sheets 
printed on one side vary in price from 114€ per sheet 
when less than 500 are purchased to 34¢ per sheet when 
5,000 or more are purchased ; the corresponding prices 
for standard answer sheets printed on two sides vary 
from 2¢ to 114¢; mechanical pencils (filled with elec- 
trographic lead) vary from 214 per pencil when only 
12 pencils are purchased to 15¢ per pencil when 144 
are purchased ; 8og per box of 72 electrographic leads ; 
International Business Machines Corporation. 
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1-314. See 40:1492, 


15. HERKELMANN, LEO Emir, A Study of the Use of the Sep- 
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revel Unpublished master’s thesis, State University of lowa, 
1938. Pp. iii, 49. 
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Arthur E, Traxler, Associate Director, Edu- 
cational Records Bureau, New York, New 
York. The enormous development of objective 
testing during the last thirty years has made 
imperative the invention of procedures that 
would insure rapid, accurate scoring. It is esti- 
mated that approximately sixty million stand- 
ardized tests were administered to some twenty 
million people in the United States in 1944. 
While use by military and civil-service agencies 
accounted for a large proportion of these tests, 
an estimated twenty-six million tests were taken 
by more than eleven million individuals in edu- 
cational institutions, business firms, and per- 
sonnel consultant agencies. 

The idea of scoring tests by mechanical 
means has, for many years, intrigued individ- 
uals of an inventive turn of mind. Experimen- 
tation has been carried on with various me- 
chanical procedures, including simple homemade 
devices, the photoelectric cell, and even an in- 
genious device for weighing the right answers. 

The only mechanical scoring method which 
has been successful on a large scale, however, is 
the International Test Scoring Machine, which 
was invented in the middle 1930’s by R. B. 
Johnson, now an engineer in the laboratories 
of the International Business Machines Cor- 
poration. Dr. Ben D. Wood of Columbia Uni- 
versity was the testing leader who saw the vast 
possibilities in Mr. Johnson’s first model and 
who brought it to the attention of Thomas T 
Watson, President of the International Busi- 
ness Machines Corporation. 

Two elementary facts are basic to the oper- 
ation of the scoring machine. One is that a soft 
lead pencil mark is a conductor of electricity, 
and the other is that ordinary paper is a non- 
conductor. Instead of writing in the test book- 
let, the subject marks his responses on a sepa- 
rate answer sheet. For each item, he pencils a 
heavy mark between printed lines in the posi- 
tion which represents the number of his choice. 

There are 750 possible choices on one side 
of an answer sheet. This is a sufficient number 
to handle 150 five-choice questions, 250 three- 
choice questions, and so forth. The scoring 
machine contains 750 sensing units correspond- 
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ing to the response positions on the answer 
sheet. A high resistance coil is connected with 
each sensing unit in such a, manner that it 
allows a very slight but definite amount of elec- 
tricity to flow through thé unit when a pencil 
mark is.present to make the connection. When 
the answer sheets for a certain test are to be 
scored, the machine is set by means of a mas- 
ter scoring key which divides the current into 
a circuit for the right responses and a circuit 
for the wrong responses. By means of test 
sheets marked with known numbers of right 
and wrong responses, the machine may be set 
empirically to give the score directly on the 
basis of any scoring formula. The answer 
sheets are dropped one at a time through a slot 
in the machine, and the score on each one is 
read immediately from a dial. 

The machine itself works with the speed of 
electricity. The rapidity of scoring is limited 
only by the time required for the pointer on the 
dial to register and the speed of the operator 
in reading and recording the scores. On tests 
yielding only one over-all score, an experi- 
enced operator can score upward of five hun- 
dred tests an hour, with no error larger than 
one point. 

The scoring machine, in its present stage of 
development, is not mechanically perfect. It 
gets out of adjustment rather easily and is 
sensitive to changes in weather conditions, par- 
ticularly humidity. Users of the machine have 
suggested a wide variety of improvements, too 
numerous to be summarized here. Nevertheless, 
with the present model of the machine, a care- 
ful operator can score test papers not only 
many times as fast as, but also more accurately 
than, they are scored manually by teachers or 
clerks in the usual hand-scoring situation in 
local schools. 

In addition to the scoring feature, there is 
available a graphic item counter by means of 
which it is possible to obtain, as a supplement 
to the scoring process, a graphic record of the 
number of right responses given by a class or 
other group to each item in the test. As Flana- 
gan (19) and McNamara and Weitzman (32) 
have shown, this item counter is an important 
contribution to research, particularly in test 
construction and improvement. 

The test scoring machine is not sold to users. 
It is installed on a rental basis at a charge suf- 
ficient to cover upkeep and servicing. The 
present annual rental fee is $480. 


The International Test Scoring Machine is 
probably the most important single contribu- 
tion thus far made to large-scale objective 
testing. The fact that various service agencies 
are equipped with this machine has brought 
comprehensive objective testing within the 
budgets of hundreds of public schools that 
previously had not been able to afford to use 
such tests. It has relieved teachers and coun- 
selors from countless hours of drudgery at- 
tendant upon scoring tests manually. As Wat- 
kins (28) has pointed out, it is possible through 
the use of the scoring machine for classroom 
teachers to give tests and obtain the results 
quickly while interest in the test is still keen. 
In addition to its use in educational institu- 
tions, the machine is widely used in industrial 
psychology and civil service, and it was indis- 
pensable to much of the testing and research 
done by the armed forces during the war. 

The scoring machine, nevertheless, has sev- 
eral possible limitations which test producers 
and consumers alike should recognize. Certain 
of these limitations are shared by all other 
testing procedures in which separate answer 
sheets are used. 

In the first place, the use of separate answer 
sheets introduces a new variable which may be 
dependent in part upon the motor skill and 
the space perception of the individual. When the 
answer sheet is closely coordinated with the 
test booklet, as it is in certain published tests, 
this variable may be of little importance; but 
where there is not equal spacing of items in 
booklets and answer sheets and when no printed 
guides to adjustment are provided, the me- 
chanics of adjusting booklet and answer sheet 
may influence the scores of some individuals 
significantly. 

The experimental evidence on this question 
is not extensive. The most thorough study is 
one reported by Dunlap (13), in which answer 
sheets were found to be as satisfactory as un- 
derlining answers in booklets, Dunlap recom- 
mended the use of an articulated, serially num- 
bered answer sheet, a recommendation which 
has by no means been followed by all test pub- 
lishers. McCullough and Flanagan (9) re- 
ported the results of a study indicating that 
the validity of the machine-scorable Coopera- 
tive English Test was as high as that of a form 
of this test in which the answers were recorded 
in the booklets. More research of this kind is 
needed. 
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Studies by Herkelmann (15) and by Loper 
(18) have shown, as one would expect, that 
more time is required in the taking of a test 
when the answers are recorded on separate an- 
swer sheets than when they are eütered in test 
booklets. Some test authors have taken the dif- 
ference in time required by the two procedures 
into account in the standardization of their 
tests, while others have made no allowance for 
the greater time needed when the answer sheets 
are used. 

A second limitation of answer sheets men- 
tioned frequently by teachers and counselors is 


- that scored answer sheets are less useful than 


Scored booklets in diagnosis, teaching, and 
counseling because they do not show what ques- 
tions each individual has answered correctly. 
It is possible to meet this criticism to a consid- 
erable degree by overprinting the correct an- 
swers on the pupils' answer sheets by means of 
a multilith or even a mimeograph. 

A difficulty in the use of machine-scorable 
answer sheets, as distinguished from those that 
are to be hand scored, is the necessity of using 
special pencils and of employing them in a 
carefully prescribed manner. In order to in- 
sure correct machine-scoring, the answer sheets 
must be marked with the recommended pencils, 
and the subjects must make their marks heavy, 
black, and glossy. Many examiners have not 
seen a scoring machine in operation and are not 
sufficiently aware of the need for proper marks 
to impress this need upon the examinees. Con- 
sequently, the great majority of the answer 
sheets coming to a machine-scoring department 
are very indifferently marked. Thus, the papers 
must be scanned and re-marked by clerical 
help before they are inserted into the machine, 
or incorrect scores will be reported. (36) Asa 
rule, far more time in a machine-scoring unit 
is used in scanning and re-marking answer 
sheets than in scoring them. When the answer 
sheets are so poorly marked that nearly every 


- one must be re-marked as a part of the scoring 


procedure, the over-all time used in machine 
scoring is likely to be greater than that re- 
quired by well-organized and efficient hand 
scoring of the same papers. 

It is recommended that all publishers of new 
tests allow time at the end of an examination 
and after the test booklets have been collected 
for the subjects to go over their answer sheets, 
to darken the response marks, and to erase 
stray marks and that specific directions for this 


activity be included in the manual of instruc- 
tions. 

Regardless of. whether the separate answer 
sheets are to be machine scored or hand scored, 
the administration of tests with separate an- 
swer sheets calls for much better desk space 
than is provided in numerous classrooms. 
There should be room for the individual to lay 
his booklet and his answer sheet on his desk 
in a comfortable working position. One study 
has indicated that there is a small but consist- 
ent difference in the results of tests adminis- 
tered with separate answer sheets at desks and 
in chairs equipped with desk arms. (25, 27) 
The difference, of course, favored the use of 
desks instead of armchairs. It is reasonable to 
expect that the difference would be consider- 
ably larger if the results of tests administered 
at desks were compared with those obtained 
from tests administered in assembly-hall seats 
in the manner that tests not infrequently have 
been given in the past. h 

Another limiting factor in machine scoring 
is that all responses are forced into a purely 
objective form. Under the stress of the limita- 
tions imposed by the machine, testmakers have 
in recent years become very ingenious in the 


production of multiple-choice items, but re- 


search has not established that this type of item 
is better than, or even equally as good as, the 
semiobjective item, particularly in certain fields. 
(26) It is clear that wholly objective items are 
well adapted to such areas as vocabulary, 
mathematics, and science, but the necessity of 
making tests in such fields as English usage 
and spelling completely objective has occasion- 
ally led to some rather bizarre testing situations. 
In summary, it should be stressed that avail- 
able evidence indicates that the limitations of 
the scoring machine are minor ones and that 
the advantages of machine scoring over hand 
scoring as usually carried on are indubitably far 
greater than its disadvantages. It is believed, 
however, that both test specialists and test users 
have been somewhat too ready to adapt ma- 
chine procedures quickly and uncritically to all 
types of testing situations. The use of machine 
methods has mushroomed at the same time that 
the research on these methods has been piti- 
fully small. Moreover, the fetish for the use 
of separate answer sheets in connection with 
both machine scoring and hand scoring will 
never be on a fully defensible basis until it is 
supported by a great deal more pertinent and 
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sustained research than has thus far appeared 
in educational literature. 


For reviews by John G. Darley and H. T. 
Manuel, see 40:1492. 


[ 398 ] 
*Thurstone Scoring Board. A device for holding 
one or two stencils in place to provide correct align- 
ment for scoring answer sheets; $7.50; L. L. Thurs- 
tone; C. H. Stoelting Co. 


EDUCATION 
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Coxe-Orleans Prognosis Test of Teaching Abil- 
ity. Entrants to teacher-training institutions; 1930; 
Form A; $3.20 per 25; 35¢ per specimen set, postpaid; 
170(180) minutes; Warren W. Coxe and Jacob S. Or- 
leans; World Book Co. 


REFERENCES 


I. Coxe, Warren W., AND Orteans, J. S. “Procedure in 
Analyzing Prognosis Tests.” J Ed Res 22:36-43 Je '3o. * (PA 


CE Ds M. R eA Se ing Aptitude." 

E ,R. udy of Teaching Aptitude." J Ed Res 
sd Pi (4 2940 uous F. “Peaicine Val 
of Various Tests of Émotionality and Adjustment Ia a Geivence 
TONG fore Prospective Teachers.” J Ed Res 32:653-9 My '39. 
j ja RATE * P ermanenee of Tort in Teaching." 

5. SEAGOE, dex vy p PA AE AA Teaching Suc- 
cess," J Ed Res 38:685-90 My '4s. * (PA 19:3184) 

Harl R. Douglass, Director of the College 
of Education, University of Colorado, Boulder, 
Colorado. "These tests, first published in 1930, 
are intended to yield measures for the purpose 
of predicting probable success in classroom 
teaching for students thinking about entering 
or just entering a teacher-training institution. 
The coefficients of validity are not with teach- 
ing success but with a comprehensive achieve- 
ment test in normal school work given at the 
end of the first year of work at ten teacher- 
training institutions in New York State. They 
range from .534 to 839, averaging .645—not 
high enough to indicate the possibility of very 
useful prediction by means of the test alone 
though higher than the coefficients of correla- 
tion between the scores in the comprehensive 
achievement test on normal school work and 
on the Terman Group Test of Mental Ability, 
Tressler English Minimum Essentials Test, 
Whipple's High-School and College Reading 
Test, and average high school mark. Validity 
data in the form of multiple correlation coeffi- 
cients between success in normal school work 
and combinations of scores on the subtests of 
the Coxe-Orleans test are not given. 

There are five parts, or subtests as follows: 
Part I, General Information—true-false and 


multiple-choice items; Part II, Professional 
Interest—true-false items, which really measure 
information about methods of teaching and 
management and measure interest only indi- 
rectly ; Part III, Lessons in Education—sample 
reading selections in education; Part IV, Read- 
ing Comprehension—measuring ability to an- 
swer questions over given paragraphs of reading 
material from subjects in education; Part V, 
Problems in Education—items which measure 
the ability to apply given principles of educa- 
tion to specific practices. 

The materials of the parts of the test are 
such as to indicate a much better prognostic 
basis for success in educational psychology and 
measurement than in other fields of education. 
It is very unlikely that scores in the test are 
correlated with success in teaching closely 
enough to furnish any very useful basis for 
individual prediction, since some of the most 
important factors in teaching success, e.g., per- 
sonal attractiveness, leadership, social judg- 
ment, patience, etc., are not measured. 


David G. Ryans, Director, National Com- 
mittee on Teacher Examinations, American 
Council on Education, New York, New York. 
This test is intended for the selection and 
classification of entering students in teacher- 
training institutions. It is made up of 410 
true-false, completion, multiple-choice, and 
matching items and requires approximately 
three hours for administration. The content is 
varied and includes: (a) general information 
materials; (b) items dependent upon the stu- 
dent's observation of classroom practices; (c) 
short presentations of educational topics, fol- 
lowed by questions intended to measure the 
ability of the student to learn the types of mate- 
rials used in teacher-training courses; (d) 
paragraphs dealing with educational reading 
matter, followed by questions intended to test 
the student's comprehension of such materials ; 
and (e) descriptions of educational problems, 
followed by questions to judge the student's 
reaction to each situation. 

Validity is reported in terms of correlations 
between the results of this test and scores on 
a comprehensive achievement test given at the | 
end of the second semester of study in normal 
schools. For ten different institutions the cor- 
relation coefficients vary from .53 to .84. The 
authors suggest that the test not be used alone 
but in combination with such additional meas- 
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ures as a group intelligence test, records ot 
high school achievement or records on stand- 
ardized achievement tests in high school sub- 
jects. 

Abbreviated percentile norms, based on 2,883 
cases in 13 teacher-training institutions, are pro- 
vided. The authors suggest that in view of the 
variation of curricula from school to school 
the published norms not be used for the inter- 
pretation of the results of individual students. 

There is no indication of the method used 
in selecting the items for the test. Due to the 
variety of items employed and the fact that no 
provision is made for use of an answer sheet, 
the test is difficult and time-consuming with 
respect to scoring. The manual, however, is 
complete and well written. 


[ 400 ] 
XEducational Aptitude Test: George Washington 
University Series. Entrants to teacher-training insti- 
tutions; 1940; Form 1; $9 per 100; 15¢ per specimen 
set, postpaid; 55(60) minutes; Thelma Hunt and 
James Harold Fox; Center for Psychological Service, 
George Washington University. 


REFERENCES 

1. Tart, ARTHUR T. The Factorial Structure and Stability of 
the Educational Aptitude Test. Unpublished doctor's thesis, 
University of Southern California, 1944. (Abstracts of Disser- 
tations... ME PP. 137-40.) 

2. Seacor, May V. “Permanence of Interest in Teaching." 
J Ed Res 38:678-84 My "as. * (PA 19:3183) 

3. SEAGOE, May V. “Prognostic Tests and Teaching Suc- 
cess.” J Ed Res 38:685-90 My '4s. * (PA 19:3184) 

A. S. Barr, Professor of Education, The 
University of Wisconsin, Madison, Wisconsin. 
This test is composed of six subtests: (a) vo- 
cabulary; (b) research judgment; (c) logical 
reasoning; (d) information; (e) comprehen- 
sion; and (f) arithmetic. Norms and directions 
for giving and scoring the test are provided. 
Data on the validity and reliability of the test 
are not available. 


Harl R. Douglass, Director of the College of 
Education, University of Colorado, Boulder, 
Colorado. This test is intended to furnish a 
measure of the capacity of an individual to 
master courses in education. Each of the six 
subtests measures somewhat different capacities 
and abilities. Test 1, Vocabulary, is for general 
vocabulary—not especially developed in the 
light of needs of students of education. The 
words included are of the upper levels of dif- 
ficulty. Test 2, Research Judgment Test, con- 
sists of exercises calculated to determine the 
degree to which the testee can select from ro 
to 12 given items of data or information those 
Which are pertinent to a given problem or in- 


* 


vestigation. Test 3, Logical Reasoning, meas- 
ures the ability ot the testee to determine f rom 
given data whether a given judgment is justi- 
fied or not. Test 4, Information Test, consists 
of a series of true-false items from a great 
many fields of knowledge—not too technical 
items but largely general information. Test 5, 
Comprehension Test, is intended to furnish a 
measure of reading comprehension. Multiple- 
choice completion exercises are included which 
require the testee to select from four alterna- 
tives the correct answer to a question or excr- 
cise given based upon a preceding paragraph of 
reading material. Test 6, Arithmetical Calcu- 
lation Test, measures arithmetical calculations 
of the types employed more frequently in the 
study of education. 

The materials of the test are selected very 
largely from the fields of psychology and meas- 
urement, and it is obvious that scores on the 
test would be more useful in predicting achieve- 
ment in those fields than in such fields as his- 
tory of education, philosophy of education, 
secondary education, educational sociology, 
methods of teaching, curriculum, and school 
administration. Neither on the test form nor 
on the mimeographed directions for giving the 
test are any data given to indicate the degree 
of validity of the test for predicting success in 
the study of any field. Mechanically the test 
is superior. 

[ 401 ] 
X*Examination in Elementary Psychology—Col- 
lege Level. College; 1945; IBM; Form CEP-1-B-4 
(usually called Form B) ; separate answer sheets must 
be used; $1.75 per 25; 40¢ per 25 machine-scorable an- 
swer sheets ; 15¢ per scoring key ; 25¢ per specimen set, 
postpaid; 120(125) minutes; prepared by the Exami- 
nations Staff of the United States Armed Forces In- 
stitute; published by the American Council on Educa- 
tion; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates : $2 per 25; 
65¢ per 25 machine-scorable answer sheets; soé per 
key; 50¢ per specimen set.) 

[ 402 ] 
*Ohio Teaching Record: Anecdotal Observation 
Form, Second Revised Edition. Teachers; 1940-45; 


100 or more, 15¢ each; 25¢ per copy; Ohio State Uni- 
versity Press. 


REFERENCES 
1. Ratus, Lours. "The Revised Ohio Teaching Record." Ed 


Res B 20:241-8 D "41. * 

J Ed Psychol 33:154-5 F '42. Thomas O. 
Marshall. Although it may bear some resem- 
blance to both, this record form is not primarily 
a rating scale for teachers nor an attempt to 
objectify observations of teachers. It “is in- 
tended for use in a cooperative inquiry carried 
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forward by teachers and competent, informed 
observers, whose purposes are concerned with 
the improvement of teaching.” The record 
booklet is organized into three sections, the 
first of which includes several pages of instruc- 
tion and cautions in the use of the instrument. 
The second section contains eight observation 
guides, all of which are similar in form, con- 
sisting of one page of suggested items which 
should help the observer to notice specific fac- 
tors of teacher competency and a facing page 
on which the anecdotal recordings may be 
made. The third section contains suggestions 
and questions which might lead to two differ- 
ent but complementary summaries of the anec- 
dotal records, one of them based upon a well- 
developed set of criteria of teacher success. * 
The present form leaves spaces for three ob- 
servations of a teacher and it is implied that 
at least six are necessary in order to get a true 
picture of the teacher’s work. While they feel 
that they have developed the instrument care- 
fully and well, the authors are ultra-conserva- 
tive in their claims for it, and hope to revise it 
further as criticisms and suggestions by users 
of the form come to their attention. In the 
opinion of this reviewer, the instrument is 
likely to be valuable only when used by a highly 
competent supervisor. It is not designed for 
use by novices, 


[ 403 ] 
*Purdue Teacher’s Examination: How I Teach. 
Teachers; 1942; booklet title is How I Teach; IBM; 
Forms A, B; 25 machine-scorable test-answer book- 
lets; $1 per manual; $1.35 per specimen set postpaid; 
nontimed (30) minutes; Ida B. Kelley and Keith J. 
Perkins; Educational Test Bureau. 


REFERENCES 

1, DAVENPORT, KENNETH, An Investigation Into Pupil Rating 
of Certain Teaching Practices. Foreword by Ida B. Kelley and 
Keith Perkins. Edited by H. H. Remmers. Purdue University, 
Division of Educational Reference, Studies in Higher Educa- 
tion, [No.] 493 Further Studies in Attitudes, Series 7. Lafa- 
yette, Ind.: the Division, January 1944. Pp. 61. Paper. $0.75. 

(PA 18:3282) 
. 2, Kerrey, Ina B., AND Perxtns, KertH J. Further Studies 
in Attitudes, Series IV: An biG OLA Teachers’ Knowl- 
edge of and Attitudes Toward Child and Adolescent Behavior 
in Everyday School Situations. Purdue University, Division of 
Educational Reference, Studies in Higher Education, No. 42. 
grate. Td the Division, June 1941. Pp. 99. Paper. $1.00. * 

16:344 


David G. Ryans, Director, National Com- 
mittee on Teacher Examinations, American 
Council on Education, New York, New York. 
This test is intended to provide a measure of 
a teacher's knowledge of child development 
and children's problems. 

Various educational practices and opinions, 
relating to behavior patterns and teacher-child 


relationships, are stated and the examinee is 
required to mark a choice indicating the de- 
gree of his approval or disapproval. Each form 
of the test includes 75 items of this type. 

The scoring is direct. One point is counted 
for each correct answer, with no correction for 
guessing. Choices are recorded on the test sheet 
which may be either machine scored or hand 
scored. Percentile norms based on 1,000 cases 
are available. 

The items were selected from relevant psy- 
chological literature and from descriptions of 
schoolroom situations. They were studied and 
checked by 84 classroom teachers and ro judges 
"highly trained in mental hygiene or allied 
fields." 

Validity was studied through the comparison. 
of various criterion groups. It was found that 
higher scores were made by such groups as the 
following : teachers from “progressive” schools ; 
grade and high school teachers rated as "best" 
by their principals or superintendents ; teachers 
holding positions of leadership and authority 
in their schools; teachers who had been in 
present positions five or more years. The cor- 
relation between Forms A and B, based on 845 
cases, is represented by a coefficient of .88. 

The manual is complete, describing in detail 
the development of the inventory and various 
analyses of the data including critical ratios for 
individual items based upon the scores of 
“good” and “poor” teachers. 
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Stanford Educational Aptitude Test. Entrants to 
teacher-training institutions; 1928; 1 form; $2 per 25; 
15¢ per specimen set; nontimed (40) minutes; Milton 
B. Jensen; Stanford University Press. 
REFERENCES 

1, Jensen, Mitton B. Trait Differences Between Three 
Groups in Education (Teachers, Research Workers, and Ad- 
ministrators). Unpublished doctor's thesis, Stanford Univer- 
it: 27. 
E uy gun Mitton B. “Objective Differentiation Between 
Three Groups in Education (Teachers, Research Workers, and 
Administrators)." Genetic Psychol Monogr 3:333-454 My '28. * 


(PA 2:3652) m 
3. Rosrkzm, L. E. "The Measurement of Teaching Ability: 


Study Number One.” 7 Exp Ed 14:6-51 S'45. * (PA E) : 

4. Rotre, J. F. “The Measurement of Teaching Ability: 
Study Number Two." J Exp Ed 14:52-74 S '45. * (PA 20:1268) 

A. S. Barr, Professor of Education, The 
University of Wisconsin, Madison, Wisconsin. 
This test is designed to indicate the field, or 
fields, in education in which persons may be 
expected to do their best work. Scores are» 
tained for position preference ratings, disci- 
pline case problems, high school activities, and 
for the battery as a whole. Three comparisons 
are provided: (a) teaching-research; (b) re- 
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search-administration; and (c) teaching-ad- 
ministration. 

The test was validated against a group of 
205 men selected on the basis of ratings by 
from one to seven judges. The correlation of 
test scores and the ratings of the judges for 
the criterion group was .85. The reliability for 
the entire battery was also .85 with slightly 
lower reliabilities for the three parts. Full data 
covering the construction and standardization 
of the test are given in a paper (2) by Milton 
B. Jensen. 

The test has been widely used. Rolfe (4), 
using measured changes of eighth-grade pupils 
in the social studies in attitudes, problem- 
solving ability, and information as a criterion of 
teaching efficiency, secured validity coefficients 
of .o8 (for the teaching-administration compari- 
son); —.15 (for the administration-research 
comparison); and —.r3 (for the teaching- 
research comparison). Rostker (3) secured 
similar results with reliabilities of .85. 

The ultimate validity of this test depends 
upon the judgment and outlook of the raters 
used in validating the test. It has been fre- 
quently shown that it is difficult to establish an 
adequate criterion of teaching efficiency, and 
that the correlations between supervisory rat- 
ings, measures of pupil growth, and pupil rat- 
ings are not high. The validity of this test 
cannot, accordingly, be said to have been estab- 
lished ; it should be used with great care. 


David G. Ryans, Director, National Com- 
mittee on Teacher Examinations, American 
Council on Education, New York, New York. 
The stated purpose of the Stanford Educa- 
tional Aptitude Test is to provide “a compara- 
tive measure of one’s ability as a teacher, a 
research worker in Education, and a school 
administrator.” 

Three types of items are included in this 
test: (a) two teaching positions are described 
in some detail and the examinee is required to 
check his preference; (b) a discipline case is 
described and the examinee is required to check 
the procedure he believes should be followed 
and the degree of confidence he feels in his 
choice; and (c) a school activity is described 
and the examinee is required to check the 
course of action he would follow and his con- 
fidence in the choice. 

The development of scoring keys appears to 
have been thorough and statistically sound. 


The scoring itself, however, is complicated and 
tedious, involving algebraic additions of part 
scores, their multiplication by regression 
weights, further summation and multiplication 
to obtain final scores for “teaching vs. research” 
and “administration vs. research,” and subtrac- 
tion of the latter from the former to obtain the 
"teaching vs. administration" score. 3 

Validity was studied through administration 
of the test to criterion groups comprising 205 
men “selected on the basis of ratings by from 
I to 7 judges." The correlation between test 
score and ratings is reported to be "about 
+.85.” Near zero-order correlation coefficients 
were obtained between test score and age, sex, 
amount of professional training, and amount 
of professional experience. “Split score” relia- 
bility reported for the different types of items 
and the different abilities measured ranged 
from .76 to .91. 
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Teaching Aptitude Test: George Washington 
University Series. Grades 12-16; 1927; Form 1; $9 
per 100, postpaid; 15¢ per specimen set, postpaid; 37 
(45) minutes; F. A, Moss, T. Hunt, and F. C. Wallace; 
Center for Psychological Service. 


REFERENCES 

2 ED THELMA. TESI Teaching Aptitude," Ed 

m & Sup 15:334-42 My '29. an 
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6. quinis J. E., AND Staton, Tuomas F, “Predictive Value 
of Various Tests of Emotionality and Adjustment in a Guid- 
ance Program for Prospective Teachers.” J Ed Res 32:653-9 
My '39. * (PA 13:4211) P 
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J Ed Res 38:678-84 My '45. * (PA 19:3183 X 

8. Seacoe, May V. “Prognostic Tests and Teaching Success." 
J Ed Res 38:685-90 My '4s. * (PA 19:3184) 


A. S. Barr, Professor of Education, The 
University of Wisconsin, Madison, Wisconsin. 
This test is composed of five parts: (a) judg- 
ment in teaching situations; (5) reasoning and 
information concerning school problems; (c) 
comprehension and retention; (d) observation 
and recall; and (e) recognition of mental 
states from facial expression. Norms and direc- 
tions for giving and scoring the test are pro- 
vided. 

Dr. Thelma Hunt (1) reports correlations 
ranging between .30 and .50 with teacher rat- 
ings; .60 with the Social Intelligence Test; and 
-57 with the Mental Alertness Test. 
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Tests such as this should be used with great 
care until validated against an adequate criterion 
of teaching efficiency. Teaching is a very com- 
plex activity and apparently conditioned by an 
equally complex matrix of abilities of which 
intellectualized verbal ability would appear to 
be one or one group of abilities. The American 
Council on Education Psychological Examina- 
tion for College Freshmen has been shown to 
possess certain predictive value; whether a test 
such as the Teaching Aptitude Test possesses 
more predictive value remains to be deter- 
mined. 


[ 406 ] 
XTests of Human Growth and Development. Psy- 
chologists and social workers and teachers in training 
and in service; 1946; 4 parts; 1¢ per key for any one 
part; $1 per 100 answer sheets for any one part; 15¢ 
per instructor's guide; 6o¢ per specimen set; John E. 
Horrocks and Maurice E. Troyer; Syracuse Univer- 
sity Press. 
@) TEST OF KNOWLEDGE OF FACT AND PRINCIPLE IN 
HUMAN GROWTH AND DEVELOPMENT. $1.75 per 25; non- 
timed (30) minutes. 
b) A STUDY OF CONNIE CASEY, $2.50 per 25; nontimed 
(50-90) minutes. 
C) A STUDY OF BARRY BLACK. $2.50 per 25; nontimed 
(50-90) minutes. 
d) A STUDY OF SAM SMITH. $2.50 per 25; nontimed 
(50-90) minutes. 


REFERENCES 

x, Horrocks, 
edge of Human Development and Ability to Use Such Knowl- 
edge. npublished doctor's thesis, Syracuse University, 1945. 

D. 279. 

2. Hoxocxs, Jonn E. "The Relationship Between Knowl- 
edge of Human Development and Ability to Use Such Knowl- 
edge." J Appl Ps: eE AEEA O '46. * (PA 21:604) 

3. Horrocks, Joun È., AND Troyer, MAURICE F. “Case Study 
Tests of Ability to Use Knowledge of Human Growth and De- 
velopment.” Ed & Psychol Meas 7:23-36 sp '47. * (PA 22:858) 


Harold E. Jones, Professor of Psychology, 
University of California, Berkeley, California. 
This series consists of an 8o-item multiple- 
choice test, involving “knowledge of fact and 
principle,” and three case-study tests, involving 
problems which are chiefly social-emotional in 
origin, physical-economic, and intellectual- 
academic. Each of the case studies is presented 
in three sections, with progressive additions of 
case material and with multivariate questions 
pertaining to diagnosis and remedial treatment. 

Item scores are weighted on the basis of 
judgments by a selected group of experts; and 
norms on from three to eight hundred college 
students are available. Each case study test, 
requiring from 5o to 9o minutes to complete, 
has a reliability of from .7 to .8. Intercorrela- 
tions range between .4 and .6. Correlations 
with the fact-principle test are somewhat lower. 
It is apparent that these tests measure not 


foun E. "The Relationship Between Knowl-. ; 


merely the ability to apply what has been 
learned but also the disposition to conform to 
the clinical and educational points of view of 
the authorities used in the standardization. For 
college courses in adolescence, they may be 
useful chiefly as instructional aids and as . 
sources of discussion material rather than as 
instruments of measurement. 


Goodwin Watson, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. The tests combine a measure 
of knowledge of facts and principles about 
human development with a measure of ability 
to apply this knowledge in three case studies. 

The knowledge of facts and principles test 
consists of 80 multiple-choice items covering 
items common to widely used texts and refer- 
ences. Seven psychologists cooperated in the 
selection of items and preparation of the key. 
The reliability is reported as .9r. This test is 
a good example of a standardized achievement 
test at college or graduate level. Most instruc- 
tors in courses in adolescence will find them- 
selves in disagreement with the key at a few 
points because the foils are close in value to 
the approved answer, but the test as a whole 
will undoubtedly discriminate high from low 
achievement to the satisfaction of any com- 
petent psychologist. 

It is in the second part that Horrocks and 
Troyer have developed their main contribution. 
Three case studies are presented. Each presen- 
tation is interrupted after a description of symp- 
toms, and the student is asked to respond to 25 
diagnostic items and a dozen proposals for 
remedial action. Response to each item is on a 
five-point scale, and each point of the scale is 
assigned a weight. An answer sheet fits over a 
key so that with carbon paper between, the 
student’s check mark can be read off directly 
on the key as a score value. Then the student 
is permitted to go on to Part II of the case 
study, giving additional data from teacher re- 
ports, home visits, etc. After this further in- 
formation comes a second testing, again includ- 
ing both diagnostic and remedial proposals. 
Then comes a third block of information and a 
third test on diagnostic and remedial procedures. 

Reliability of the responses to the case stud- 
ies runs from .73 to .79; that for diagnosis (.68 
to .72) is higher than that for remedies (.55 
to 64). Norms for each case study include 
mean, median, mode, and a percentile table 
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based on 300 to 800 students in four univer- 
sities. The correlations between case studies 
range from .39 to .62; the correlation of diag- 
nostic score with knowledge of facts and prin- 
ciples ranges from .24 to .49, and the correla- 
tion of remedial score with knowledge of facts 
and principles, from .26 to .30. These low re- 
lationships support the initial hypothesis of the 
authors that ability to do well on an achieve- 
ment test of the usual factual type is not a 
good indication of ability to apply the knowl- 
edge in a concrete situation. 

The three case studies were artificially con- 
structed to illustrate certain typical problems in 
emotional insecurity (Barry Black), physical 
and economic handicaps (Connie Casey), and 
low IQ (Sam Smith). The cases were reviewed 
by clinical psychologists and presented in vivid 
and lifelike form. An overload of questions 
was prepared in the initial work on each case. 
Those that remain in the present form have 
been weighted by the responses of ten experts. 
Disagreement among the experts was the usual 
rule. In some instances the distribution was 
bimodal. For example, nurses tended to inter- 
pret as a health problem what social psycholo- 
gists felt to be basically a problem of poverty. 
The weight assigned to an answer on which 
most experts agreed was higher than that as- 
signed if only a few approved the response. In- 
ternal consistency tests were applied, and items 
that did not discriminate between those receiv- 
ing high and low scores on the whole section 
were eliminated. 

The authors rightly stress the value of the 
tests as teaching instruments, for class discus- 
sion brings out many interesting insights which 
escaped the experts who worked on the key. 

A minor technical criticism is that students 
who read the later sections of "further infor- 
mation," despite instructions not to do so, may 
gain an unfair advantage on earlier test ques- 
tions over those who do as directed. Prepara- 
tion of each section in a sealed folder would 
have prevented this "peeking ahead" but would 
have added, of course, to costs. 

The case studies represent the best examples 
the reviewer has found of the application of 
objective techniques to the measurement of in- 
sight into psychological relationships in per- 
sonality. On this same model it should now be 

possible to develop cases involving other and 
less familiar patterns of maladjustment. 


ENGINEERING 
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*Examination in Advanced Engineering Elec- 
tronics. Grade 16; 1945; Form CAEE-1-B-4 (usually 
called Form B) ; no norms available; $2 per 25; 5¢ per 
scoring key ; 25¢ per specimen set, postpaid; 150(160) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 50¢ per key; 50¢ per 
specimen set.) 
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X*Examination in Diesel Engineering. Grade 15; 
1945; IBM; Form CDEg-1-B-4 (usually called Form 
B) ; no norms available; separate answer sheets must 
be used ; $2 per 25; 40¢ per 25 machine-scorable answer 
sheets ; 15¢ per scoring key ; 25 per specimen set, post- 
paid; 150(160) minutes; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by the American Council on Education ; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates: $2 per 25; 65¢ per 
25 machine-scorable answer sheets; so¢ per key; so¢ 
per specimen set.) 


[ 409 ] 

*Examination in Engineering Drawing. Grade 13; 
1944; IBM; Form CEgDr-1-B-4 (usually called 
Form B); separate answer sheets must be used; $3 
per 25; 40¢ per 25 machine-scorable answer sheets ; 
15¢ per scoring key; 25¢ per specimen set, postpaid ; 
120(130) minutes; prepared by the Examinations Staff 
of the United States Armed Forces Institute; pub- 
lished by the American Council on Education; distrib- 
uted by Cooperative Test Service. (Also distributed by 
Science Research Associates: $3 per 25; 65¢ per 25 
machine-scorable answer sheets; so¢ per key; sof per 
specimen set.) 


[410 ] 
*Examination in Engineering Electronics. Grade 
16; 1945; Form CEgE-1-B-4 (usually called Form B); 
no norms available; $1.75 per 25; 5¢ per scoring key; 
25¢ per specimen set, postpaid; r50( 160) minutes ; prc- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education ; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates: $2 per 25; 5o¢ per key; so¢ per specimen set.) 


[41] 

*Examination in Engineering Mechanics. Grades 
14-15; 1946; IBM ; Form CEgMe-1-B-4 (usually called 
Form B) ; no norms available; separate answer sheets 
must be used; $1.75 per 25; 40¢ per 25 machine-scor- 
able answer sheets; 15¢ per scoring key; 254 per speci- 
men set, postpaid; 180(190) minutes; prepared by the 
Examinations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates: $2 per 25; 
65¢ per 25 machine-scorable answer sheets; 50¢ per 
key; 50¢ per specimen set.) 


[ 412 ] 
XExamination in Fluid Mechanics. Grade 15; 1946; 
IBM; Form CFIMe-1-B-4 (usually called Form B); 
no norms available; separate answer sheets must be 
used ; $1.75 per 25; 40d pa 25 machine-scorable answer 
sheets; 15¢ per scoring key; 25¢ per specimen set, post- 
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paid; 150(160) minutes ; prepared by the Examination: 

s , s 
Staff oí the United States Armed Forces Institute; 
published by the American Council on Education; dis- 
tributed by Cooperative Test Service. (Also distributed 
by prince Ren Associates: $2 per 25; 65é per 25 
machine-scorable answer sheets ; key; 
specimen set.) RR EOS Es 


[413 ] 

*Examination in Machine Design. Grade 15; 1946; 
IBM; Form CMaD-1-B-4 (usually called TON Ey : 
no norms available; separate answer sheets must be 
used; $2 per 25; 4o¢ per 25 machine-scorable answer 
sheets ; 15¢ per scoring key ; 25¢ per specimen set, post- 
paid; 180(190) minutes; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by the American Council on Education; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates: $2 per 25; 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; 50¢ per 
specimen set.) 


[ 414 ] 
*Examination in Radio Engineering. Grade 16; 
1946; 1 form (usually called Form B); no norms 
available; $1.75 per 25 of either level; 5¢ per scoring 
key for either level; 25 per specimen set for either 
level; 150(160) minutes; prepared by the Examina- 
tions Staff of the United States Armed Forces Insti- 
tute; published by the American Council on Education ; 
distributed by Cooperative Test Service. (Also distrib- 
uted by Science Research Associates: $2 per 25 for 
either part; 50¢ per key for either part; 50¢ per speci- 
men set of either part.) 
a) [rest] 1. Form CRE-1-B-4. 
b) [rest] 11. Form CRE-2-B-4. 


[ 415 ] 

*Examination in Strength of Materials. Grade 15; 
1946; IBM; Form CSM-1-B-4 (usually called Form 
B) ; no norms available; separate answer sheets must 
be used ; $2 per 25; 40¢ per 25 machine-scorable answer 
sheets ; 15¢ per scoring key ; 25¢ per specimen set, post- 
paid; 180(190) minutes ; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by American Council on Education; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates : $2 per 25; 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; Sof per 
specimen set.) 


[ 416 ] 

XExamination in Surveying. Grade 15; 1945; IBM ; 
Form CSu-1-B-4 (usually called Form B) ; no norms 
available; separate answer sheets must be used ; $2 per 
25; 40f per 25 machine-scorable answer sheets; 15¢ 
per scoring key; 256 per specimen set, postpaid; 180- 
(190) minutes; prepared by the Examinations Staff of 
the United States Armed Forces Institute; published 
by the American Council on Education ; distributed by 
Cooperative Test Service. ( Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
B answer sheets; 50¢ per key; 50 per specimen 
set. 


ENVIRONMENT 


[417 ] 
XAmerican Home Scale. Grades 8 and over; 1942; 
preliminary manual; IBM; $1.65 per 25 machine- 
Scorable test-answer sheets; 35¢ per set of hand-scor- 
ing stencils ; $1.50 per set of machine-scoring stencils ; 


50¢ per specimen set; nontimed (40) minutes; W. A. 
Kerr and H. H. Remmers; Science Research Asso- 
ciates. 

REFERENCES 

1, Kerr, W. A., anp Remmers, H, H. “The Construction and 
Validation of a Group Home Environment Scale." Proc Ind 
Acad Sci 50:201-6 "at. * (PA 16:2767 

2. Kerr, W. A., AND REMMERS, i. . “The Construction and 
Validation of a Group Home-Environment Scale." Abstract. 
Psychol B 38:705 O "41. * (PA 16:267, title only) 

3. Kerr, W. A., AND REMMERS, H. H. “The Cultural Value 
of too Representative American Magazines.” Sch & Soc $4: 
476-80 N 22’41.* (PA 16:1083) 

4. Kern, WA. Measure of Socio-Economic-Cultural Level 
of the Home. Unpublished doctor’s thesis, Purdue University, 
194 Kune, W. A. “The Measurement of Home Environment 
and Its Relationship With Certain Other Variables,” pp. 7-43. 
In Further Studies in Attitudes, Series V. Edited by H. H. 
Remmers. Purdue University, Division of Educational Refer- 
ence, Studies in Higher Education, No. 45. Lafayette, Ind.: the 
Division, June 1942. Pp. 65. Paper. $0.75. * (PA 16:4919) 

6. Remmers, H. H., ann Kerr, W. A. “Home Environment 
in American Cities.” Abstract. Am J Sociol 51:233-7 N '45. * 
(PA 20:514) 

7. Govci, Harrison G. “The Relationship of Socio-Economic 
Státus to Personality Inventory and Achievement Test Scores." 
J Ed Psychol 37:527-40 D '46. * (PA 21:1554) 

Henry S. Maas, Research Associate in 
Group Psychotherapy, William Alanson White 
Psychiatric Foundation, Washington, D. C. 
This scale raises and fails to allay the same 
doubts as other group administered scales of 
“home environment,” intended to be answered 
by school boys and girls with a “Yes” or “No.” 
First, will undisguised direct questions about 
material possessions and parents’ education and 
membership in status-giving and labor organi- 
zations, etc. be answered honestly? Second, 
can any “home environment,” without trained 
field work observation, scaled or otherwise, of 
the social relationships within the family, be 
measured with sufficient validity through group 
administration to warrant the use of results in 
"individual guidance,” as the manual suggests 
as its primary purpose? 

In anticipation of the first question, volun- 
teers from “the colored women’s organization” 
in one city were sent to fill out scales in the 
homes of a sampling of 21 out of 292 Negro 
high school seniors to whom the scale had pre- 
viously been group administered. A coefficient 
of validity of .915 was obtained. One ques- 
tions both sociologically and psychologically the 
use of Negro high school seniors, who have 
proved their selectivity through their academic 
advancement, for the “validation” of a scale 
intended for much broader application than the 
“sample” used. 

For survey purposes and for research deal- 
ing with large numbers of subjects, the Ameri- 
can Home Scale seems to be of value. Its tests 
of reliability and its norm standardization upon 
eighth grade pupils in 42 American cities 
should insure its use for group studies. 


47 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 474 


But in considering the use of the scale for 
vocational, educational, and personal counsel- 
ing of individuals, one must be wary. Particu- 
larly questionable is the alleged “mental hygiene 
value" of the scale. There is absolutely noth- 
ing in the form to indicate, even indirectly, 
"knowledge of unsatisfactory home conditions" 
which “should help the teacher to gain insight 
into some of the basic underlying difficulties 
which are reflected in classroom behavior and 
adjustment" (manual, p. 7). No studies known 
to this reviewer have ever shown a significant 
relationship between a grouping of home fac- 
tors such as the number of its books, the pos- 
session of a piano or potted flowers in winter, 
or even the acceptance of government relief 
and an elusive, dynamic variable like the school 
adjustment of the family's children. One won- 
ders, therefore, why such claims for the use of 
this scale are made. The American Home Scale 
may provide only the barest clue (in Items 45 
and 46) but certainly is no substitute for the 
school social worker. 

Compressing the 48 scorable items within a 
single IBM form makes either machine or 
manual scoring relatively simple, but it im- 
poses some difficulty on respondents and would 
seem to limit use of the scale, roughly, to chil- 
dren of junior high school age and above. The 
visualization of numbers of books in the home 
is an improvement over, for example, the com- 
parable item in the Sim’s Score Card for Socio- 
Economic Status: “About how many books are 
in your home?" The purely quantitative aspect 
of this item (although an attempt is made to 
distinguish between adult and children’s books) 
may still, though, be questioned. Finally, while 
one ought not evaluate an instrument by a con- 
sideration of its potential abuses, there are 
some answers to items in the scale which, if 
given honestly by pupils and made available to 
the "wrong" teachers, might be used with un- 
favorable bias by the latter in their dealings 
with respondents. One assumes that results 
obtained from the use of the American Home 
Scale will be utilized scientifically, only by 
qualified personnel, and largely for survey 
rather than individual guidance purposes. 


Verner M. Sims, Professor of Psychology, 
University of Alabama, University, Alabama. 
This scale is, in this reviewer's opinion, easily 
the best instrument now available for measur- 
ing the social, cultural, economic levels of home 


environment (what is commonly called socio- 
economic status) through the medium of chil- 
dren’s responses to a questionnaire. In terms 
of the conventional criteria for judging educa- 
tional-psychological tests the scale rates high. 

The authors have taken advantage of previ- 
ous research in the field and have made use of 
recently developed statistical techniques for im- 
proved test construction. Not satisfied with a 
single check on reliability and validity, they 
have approached these questions through a 
number of ways. The norms, based upon ran- 
dom samples of eighth grade pupils from 42 
cities, representing an evenly spaced sample of 
Thorndike’s ranking of 310 American cities on 
their “goodness of living,” are as representative 
as any norms now available for educational- 
psychological tests. Norms are reported, also, 
for four fairly independent factors identified 
through a factor analysis as being components 
of socio-economic status. These norms make 
possible some diagnosis of home environment. 

The manual accompanying the scale ade- 
quately describes the instrument and its con- 
struction and furnishes the reader with the 
information needed to judge its worth. (Only 
one relatively minor question was unanswered 
for the reviewer. Previous research has indi- 
cated not only that the parent’s occupation is 
a very significant item in socio-economic status 
but also that adequate information concerning 
it is difficult to get. In this scale, however, an 
occupational classification is arrived at very 

simply by having the child himself do the clas- 
sifying. Personally, the reviewer would like to 
have seen the evidence relating to the relative 
efficacy of this procedure.) The manual con- 
tains excellent suggestions for use of the scale, 
and suggestions many of which are substanti- 
ated by some preliminary research. 

The scale has a number of clever features 
(such as reproduction of the test on a standard 
machine-scored answer sheet and the method 
used in helping children estimate the number 
of books in their homes) which make for ease 
of use and dependability of results. A few 
minor criticisms are, according to the authors, 
being changed in a revision which is now under 
way. 

One suggestion is offered to the prospective 
user. The scale is obviously a measure of urban 
home environment and, incidentally, might well 
have been so labeled. Not only are the norms 


based entirely on city children but it also seems 
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possible that the significance of several items 
in the test might be different with rural popu- 
lations. The scale should, therefore, be used 
with rural groups only in an experimental 
manner. 


HEALTH 


[418 ] 
X*Byrd Health Attitude Scale. Grades 10-14; 1940- 
41; I form; $1.75 per 25; 15¢ per specimen set; non- 
timed (35) minutes; Oliver E. Byrd; Stanford Uni- 
versity Press. 
REFERENCES 
r. Bvnp, OLıver Erasmus. The Study and Measurement of 


Health Attitudes. Unpublished doctor's thesis, Stanford Uni- 
eae 1940. (Abstracts of Dissertations . . . 1939-40, 1940, pp. 

Mayhew Derryberry, Chief, Office of Health 
Education, U. S. Public Health Service, Wash- 
ington, D. C. The test consists of roo state- 
ments, each of which state what should or 
should not be a policy, an activity, or a require- 
ment. To these statements the student responds 
on a five-point verbal scale varying from 
“strongly agree” to “strongly disagree.” Scor- 
ing is accomplished by adding the small index 
numbers (1 to 5) that appear opposite each of 
the five possible responses. The test is simple 
to administer and score. Full instructions are 
given on the interpretation of group and indi- 
vidual scores. 

The published instructions give no informa- 
tion on the validity, reliability, and objectivity 
of the test. It does state that the roo items in 
the test were chosen from an initial battery of 
400 items on the basis of their capacity to dis- 
tinguish the students who scored in the highest 
10 per cent from those who scored in the low- 
est IO per cent on the total battery. 

The printing of the numerical values along- 
side the responses offers a cue to the way in 
which the test will be scored. The author states 
that this cue operates in less than I per cent 
of the cases. Certainly the absolute data on 
such an important point should be presented. 

The consistent use of should or should not 
for every sentence results in many awkward 
statements as well as a number of obvious ones, 
eg., “A child with its first set of teeth should 
be kept away from the dentist." There are 
also a small number of statements which are 
not scored in accord with the best medical 
opinion, e.g., “A pregnant woman should drink 
alcohol if she so desires." 

Until data are collected and presented to de- 
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termine significance of low, average, and high 
scores in terms of possible health behavior or 
life decisions, the test has limited value. 


H. H. Remmers, Director, Division of Edu- 
cational Reference, Purdue University, La- 
fayette, Indiana. This “scale is designed to 
measure health attitudes of the group or of the 
individual" It was validated by choosing out 
of 400 the 100 items that best discriminated 
between the highest and lowest 10 per cent of 
a sample of 1,727 high school, junior college, 
college, and university students. Reliability is 
not reported. The items consist of declarative 
sentences with "should" as an auxiliary verb 
and with five constant alternative responses. 
*Norms," according to the author, "have not 
been compiled as they are apt to have little sig- 
nificance because of group differences or varia- 
tions in school and community environment." 
If such reasoning were valid, it would seem to 
apply to most educational and psychological 
measurements of the school population. 

While most of the items appear to the re- 
viewer to have good face validity, it should be 
noted that the method of validation used does 
not guarantee that health superstitions might 
not survive such an item-analysis criterion. It 
would seem desirable to validate the items by 
a jury of health experts as well as by the 
power of items to discriminate between the 
extremes of the distribution of total scores. 

A few items raised questions in the review- 
er’s lay (with respect to health) mind which 
would probably be rather widely shared. For 
example, Item 54, “Employers should have a 
recreation program for their workers,” will 
yield the highest score if one marks “Strongly 
agree” and the lowest score if one marks 
“Strongly disagree.” The implied paternalism, 
however benevolent, has not worked out too 
well in industry and is contrary to modern 
principles of mental hygiene with its strong 
emphasis on the psychology of participation. 
The item would be improved by revising it to 
read, “Every industrial organization should 
have a recreation program.” 

Again, Item 56, “Some form of financial 
assistance should be given to families in which 
there is tuberculosis,” should, it would seem, be 
qualified by the concept of need. A wealthy 
family might resent such “aid”! 

Although the instrument is labeled a “scale,” 
the word is used with literary rather than pro- 
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fessional license—a practice, to be sure, not 
at all uncommon but therefore not less unde- 
sirable in educational and psychological litera- 
ture. Since the word scale generally assumes a 
linear continuum, it would be of interest to 
subject the items to intercorrelations and factor 
analysis—a laborious job, to be sure. 

Despite these strictures it is likely that a 
teacher might profitably make use of this in- 
strument to evaluate progress as well as pos- 
sibly different teaching techniques and pro- 
cedures in his or her own classroom. 


Ed Res B 21:56 F 18 ’42. William J. Jones. 
* Statements selected for the final scale were 
those on which students who ranked in the 
highest ro per cent had the largest amount of 
disagreement with students in the lowest 10 
per cent. This process results in including very 
discriminating items. But it also seems to have 
resulted in items for which the "correct" an- 
swer is extremely obvious, namely, “The teacher 
should pay no attention to the health of the 
students,” “A person should return to work 
immediately after a serious illness,” “A child 
with its first set of teeth should be kept away 
from the dentist.” Solely on an a priori basis it 
seems to be true that the acceptance of dis- 
criminating power as the important criterion 
for the inclusion of items has resulted in a 
large number of unrealistic, psychologically in- 
valid items. The items are not grouped or 
keyed so that part scores in significant areas of 
health knowledge (or attitude) can be deter- 
mined. Only a total score can be determined 
and that by adding superscripts printed on the 
blank where the student can see them. The 
number of items for certain important areas 
of health knowledge seems to be out of pro- 
portion. Doctors and medical examinations 
appear to be overstressed while diet, prevention, 
safety, and social agencies are not given enough 
stress. No norms or data about the validity and 
reliability were included. Final judgments 
about the worth of the instrument should be 
reserved until evidence about the way in which 
it fits into the teaching-learning process is col- 
lected. 


[ 419 ] 
Gates-Strang Health Knowledge Tests. Grades 
3-8, 7-12; 1925-37 ; 3 forms at each level; $4.40 per 100; 
25¢ per specimen set ; Arthur I. Gates and Ruth Strang; 


Bureau of Publications, Teachers College, Columbia 
University. 


a) ELEMENTARY TESTS. Grades 3-8 i Forms A, B, C; 


minutes. : 
aes) ges tests. Grades 7-12; Forms D, E, F; 30 


(35) minutes. 


REFERENCES H 
1. Gates, ARTHUR I., AND STRANG, RUTH. “A Test in Health 
Knowledge.” Teach Col Rec 26:867-80 Je '25. * 


5 
in tartan ane High, Schols of Beaver County, Pennssive 
nia, Unpublished doctor’s thesis, University of Pittsburgh, 1941. 
(Abstracts of Theses . . . 1941, 1942, PP- 33-42-) 

Hygeia 17:364 Ap ’39. W. W. Bauer. The 
Gates-Strang health knowledge tests have been 
among the most widely accepted and used of 
all the devices for testing the health knowledge 
of children. In this revised edition their use- 
fulness should continue as in the past. They 
can be recommended without reservation for 
wide use in schools. They would also be excel- 
lent for party games of the mental “whoopee” 
type; adult scores would probably seldom equal 
the norms established for children! 


For reviews by Frederick Rand Rogers and 
A. H. Turney, and an excerpt from a review 
by Lois M. Shoemaker, see 40:1500. 


[ 420 ] 
*Health and Safety Education Test: State High 
School Tests for Indiana. First, second semesters 
high school; 1945-46; Forms A, N ; 4é per test; 15¢ 
per sample test; 40(45) minutes; Shelby Gallien and 
Hilda Schwehn; State High School Testing Service, 
Purdue University. 


[ 421 ] 
*Health Education Test: Knowledge and Appli- 
cation. Grades 7-16; 1946-47; Form A; $2.50 per 25; 
10¢ per manual; so per specimen set; 40(45) minutes; 
edited by Clifford L. Brownell; John H. Shaw and 
Maurice Troyer; Acorn Publishing Co. 


REFERENCES 
1. Suaw, Joun H., AND Troyer, Maurice E. “The Develop- 


ment of a New Test in Health Education.” J Sch Health 17: 
216-9 O "47. * 

H. H. Remmers, Director, Division of Edu- 
cational Reference, Purdue University, La- 
fayette, Indiana, The purpose of this test is 
stated to be that of testing “the students’ basic 
knowledge of health and his ability to apply 
that knowledge.” The publisher's catalogue for 
1946-1947 also states that it “covers the newer 
phases of health and includes problems with 
which secondary school and college students 
should be familiar. It also includes questions 
on physical education.” Concerning the series 
in which the test is published, the catalogue 
further claims that to insure curricular validity, 
“both the standard types of subject matter and 
new contemporary points of view have been 
included. The questions were checked against 
many courses of study . . . as well as the most 
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widely used textbooks in each subject. The 
content was then subjected to experimental use 
and was revised after consideration of sugges- 
tions contributed by more than one hundred 
and fifty experienced administrators. Efforts 
were made to include important types of ques- 
tions that had been ignored or under-emphasized 
in other tests.” If this procedure were compe- 
tently applied to this test, it is obviously more 
carefully constructed than most such tests are. 

The diagnostic elements of the test consist 
chiefly in that Part I, composed of 60 changing 
alternative multiple-choice items, purports to 
test basic knowledge and Part II, composed of 
four “problems,” each with from 9 to 12 true- 
false items (a total of 40) purports to test 
ability to draw conclusions. One may legiti- 
mately doubt whether either part is sufficiently 
reliable for acceptable individual diagnosis. No 
reliability data are reported either for the parts 
or for the total test. A form sheet labeled 
"Teacher's Directions, Class Record, Answer 
Form and Norms" is provided, but the norms 
are not given. The score key code for Part I 
is a word, some one letter of which precedes 
the correct response for each item. The re- 
viewer refrains from giving the word lest it 
invalidate the test if this review were to come 
under the eyes of prospective testees! While 
a rather engagingly ingenious gadget, such a 
key word, once in possession of one or more 
students, would obviously make the test worth- 
less. If ready-to-use scoring strips were pro- 
vided, it would have been an obvious conven- 
ience for those who use the test. Apart from 
this lack the mechanical arrangement of the 
test itself is excellent for convenience in scoring. 

A few of the items violate well-known and 
generally accepted principles of test-item con- 
struction. In several multiple-choice items, for 
example, the stem is not in itself meaningful 
and poses no specified problem—examples: 
“Most baldness,” “Height-age-weight charts,” 
“High heels are,” “Posture,” “Sleep,” “Emo- 
tional outbursts,” “The condition of the eye.” 
In each of these the items could readily have 
been improved in the direction of greater homo- 
geneity instead of leaving them in effect a series 
of relatively unrelated true-false statements. In 
Part II the ratio of false to true items is nearly 
2 to 1—25 to 15 to be exact; in one problem 
it is 5 to 2. The "problems" in this part appear 
to the reviewer to be well conceived. That all 
thinking about health problems, however, can 


be reduced to the dogmatic dichotomies of 
true-false propositions is not a proposition 
which would be generally accepted. For exam- 
ple, the testee is to conclude that “Shirley will 
be better off in later years because she has read 
so much and has ignored social activities" is 
false. Who knows? The test authors might 
profitably have considered the types of items 
made familiar by Ralph W. Tyler and his co- 
workers in the tests developed for use in the 
Eight Year Study of the Progressive Educa- 
tion Association for suggestions for getting a 
more adequate picture of the higher mental 
processes. 

The test gives indications of having been 
hurriedly done and rushed into the market 
before being adequately processed and refined. 


Mabel E. Rugen, Professor of Health Edu- 
cation, University of Michigan, Ann Arbor, 
Michigan. Finding satisfactory ways of meas- 
uring use and application of health knowledge 
is one of the most difficult problems confront- 
ing the health educator or teacher concerned 
with the healthful living of his students. It is 
what we do in matters of health that counts, 
and not what we know, nor even how we apply 
what we know in a paper-and-pencil test. 

This test, provided in two forms, for second- 
ary schools and colleges is composed of 60 single- 
response multiple-choice statements and 40 
true-false items related to four problem situ- 
ations. 

While the authors claim scientific accuracy 
for the items included, certain statements sug- 
gest incorrect interpretations, for example, 
“Improper formation of the teeth may cause 
speech defect"; "Sitting or sleeping in a draft 
reduces resistance to colds”; “A tourniquet 
should be used when other means of stopping 
the flow of blood have failed"; "John's diet 
[John is 17 years old] should include cod liver 
oil during the winter months" ; and a statement 
that cigarette smoking is an effective method 
of controlling weight. 

The items included in the test represent 
topics found in textbooks, courses of study, 
reports, and bulletins. (The number and spe- 
cific selection of sources are not indicated in 
the manual.) Areas sampled include the usual 
ones, i.e., nutrition, disease prevention and con- 
trol, health status, accidents and first-aid, to- 
bacco and alcohol, mental health, sleep and rest, 
posture, teeth, eyes, exercise and recreation, 
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etc. Little or no attention seems to have been 
given to studies dealing with student interests, 
concerns, and needs in the selection of these 
items, Items pertaining to an understanding of 
bodily changes, boy-girl relationships, growing 
up, sex education, social hygiene, home care of 
the sick, consumer problems associated with 
health products and services, and the use of 
community health resources, for example, are 
not included in the test. 

The tests (Forms A and B) have reliabili- 
ties of .go and .9r. Norms for the junior and 
senior high school and first year college are in 
the process of being developed. 

Suggestions for the use of the test are in- 
cluded in the instructor’s manual. Discussing 
the results of the test with the students and 
pointing out the relation between what they 
know (Part I) and how they apply knowledge 
(Part II) is recommended. This suggestion 
may be carried one step further by having stu- 
dents appraise their own health behavior in 
terms of the items being discussed. Health 
teaching to be effective in influencing behavior 
must be personalized. The paper-and-pencil 
test may be a useful teaching aid in directing 
discussion about health situations and use of 
knowledge, but the quality of the teaching and 
the critical self-appraisal that the individual 
may engage in as a result will be far more im- 
portant in determining his future health prac- 
tices than will be the scores made on the paper- 
and-pencil test. 


[ 422 ] 

X*Health Inventory for High School Students. 
Grades 9-12; 1942; IBM; Form A; 2 editions; $1.75 
per 25; 35¢ per specimen set, postpaid; 7¢ per copy, 
machine-scoring edition; 2¢ per machine-scorable an- 
swer sheet; 6o¢ per set of stencils for scoring answer 
sheets; nontimed (60) minutes; Gerwin Neher ; Cali- 
fornia Test Bureau. 1 


REFERENCES 


1. Newer, Gerwin CuarLes. A Study of the Health Knowl- 
edge, Attitudes, Status, and Practice of High School Pupils. 
Unpublished doctor’s thesis, University of Southern California, 
1942. (Abstracts of Dissertations . . . 1942, pp. 138-41.) 


Mayhew Derryberry, Chief, O fice of Health 
Education, U. S. Public Health Si ervice, Wash- 
ington, D. C. 'The inventory consists of 99 items, 
30 of which are devoted to recording a student's 
health status and his health practices; the re- 
maining 69 are multiple-choice questions on 
health information. The author states that it is 
the “purpose of the inventory to bring health 
practices in line with health knowledge.” This 
is certainly a desirable purpose but it is not 


altogether clear to the reviewer how the infor- 
mation tested in the last 69 items would affect 
such items in the health status part of the test 
as the number of times a person had been sick 
during the past twelve months or the number 
of colds one had had. . 

The multiple-choice type of question which 
is used throughout the test does not provide 
the best method for obfaining evidence on the 
frequency of occurrence of such symptoms 
as headaches, toothaches, etc. The interpreta- 
tion of the alternatives “frequently—seldom— 
never" or “frequently—occasionally—never” 
will vary widely for the individuals being tested. 

Part II of the inventory, designed to test 
health information, is divided into six sections 
covering public health, first aid, prevention of 
disease, proper health habits, diet, and mental 
hygiene. Norms are quoted for the test as a 
whole and also for each subject-matter section. 
Since the reported reliability of the total test 
is only .86, certainly the reliabilities of the six 
sections are not sufficient to justify individual 
interpretation of scores on each of the subject 
matters. Such a procedure is suggested both by 
the subject-matter norms and the profile chart 
printed on the front of the test. 

In the informational part of the test, the 
author has included relatively few items that 
test facts that have no direct functional value 
and has developed a number of excellent ques- 
tions designed to test the types of decisions 
boys and girls must make in their daily living. 
As an informational test the inventory seems 
a good tool. More data than are provided by 
the author are needed before the value of the 
first 30 items can be determined. 


Mabel E. Rugen, Professor of Health Edu- 
cation, University of Michigan, Ann Arbor, 
Michigan. In contrast to the health test which 
aims to measure accomplishment according to 
some established norm or standard is the health 
inventory which stresses student self-appraisal 
and the relating of health practices to health 
knowledge. 

This inventory is designed as an aid in help- 
ing the student relate what he knows to what 
he does and to what he is in matters of health. 
It is an outgrowth of several years of experi- 
mentation in the secondary schools of Los 
Angeles. 

The inventory is divided into three parts: 
health status, 10 questions; health practices, 20 


a 
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questions; and health knowledge, 69 single- 
response multiple-choice statements. Questions 
pertaining to health status include items about: 
frequency of illness, including the common 
cold ; fatigue; vision; teeth; athlete's foot; and 
use of laxatives. Questions pertaining to health 
practices include items on: sleep, dental care, 
nutrition, colds, vaccination, use of patent medi- 
cines, physical examinations, posture, hand- 
washing, and elimination. The questions per- 
taining to health knowledge center around 
such areas as: (a) public health, with emphasis 
on federal rather than state or local responsi- 
bilities; (b) first aid for accidents only; (c) 
prevention of disease, with emphasis on those 
that are communicable and spread through 
direct contact, water, or milk supply; (d) 
proper health habits, with attention to dental 
health as well as sleep, rest, and cleanliness; 
(e) diet, with emphasis on well-balanced meals ; 
and (f) mental hygiene. 

The basis for the selection of items included 
in the various parts of the inventory is not 
given. Items pertaining to sex education, hu- 
man relationships, child care, growing up and 
understanding oneself, usually of concern to 
the adolescent, are not included in the inven- 
tory. Likewise the advisability of including 
some items might be questioned, for example, 
including the removal of infected tonsils as a 
health habit, filtering water as a preventive for 
disease, improper diet as the chief cause of 
tooth decay. Most of the items, however, seem 
to be scientifically accurate. 

To assist the teacher in the use of the in- 
ventory, norms have been established for each 
of grades 9 through 12. Ratings and percentile 
scores, those for health status and health prac- 
tices on a weighted basis depending on the 
adequacy of the response, also have been de- 
veloped for each part of the test. No absolute 
scores are given to health status and health 
Practice responses. The health inventory as a 
whole has a reliability of .86. Both machine- 
scoring and hand-scoring editions of the test 
are available. Summary data sheets are in- 
cluded also. 

A comprehensive manual of directions which 
stresses the preparation of the students for the 
Inventory, scoring, interpreting the scores, and 
using the results is available. In the use of the 
inventory the instructor is urged to prepare 
the students thoroughly as to the purpose of 
the inventory, stressing the fact that it is not 


a test but a “means whereby he can find out 
how he can make himself healthier and there- 
fore happier in his daily living.” The health 
knowledge part of the test may be used to 
determine areas of study needing special at- 
tention. 

Properly used, this inventory should prove 
to be a helpful teaching aid in developing 
greater understanding for health and establish- 
ing improved health behavior because it assists 
the student in a critical analysis of his own 
status, practices, and information and their 
consistency. There is a need for more instru- 
ments of this kind to assist in functionalizing 
health teaching. 


[ 423 ] 
XHealth Practice Inventory. Grades 7-14; 1943; I 


form; $175 per 25; ISÉ per specimen set; nontime: 
(15-20) minutes; Ned B. Johns; Stanford University 


Press. 
REFERENCES 

1. Jouns, Enwaxp Burr. The Measurement of Health Prac- 
tices. Unpublished doctor's thesis, Stanford University, 1943. 
(Abstracts of Dissertations . . . 1942-43, 1943, PP. 167.70.) | 

2. PowELL, Vircinia M. An Analysis o; Relationships Exist- 
ent Between Current Health Practices, Personal and Social 
Adjustment and Physical, Performance. Unpublished doctor’s 
thesis, New York University, 1946. 

Thomas Kirk Cureton, Professor of Physi- 
cal Education and Director of the Physical 
Fitness Research Laboratory, University of 
Illinois, Urbana, Illinois. This test consists of 
36 questions to be checked (1) Never, (2) 
Rarely, (3) Sometimes, (4) Usually, or (5) 
Always, with points allotted as shown in the 
parentheses. Scores range from 36 to 180. 
Means, S.D.’s and percentiles are not given in 
the author's Information Manual. For rrr col- 
lege women Powell (2) found a reliability co- 
efficient of .73, a mean of 132.03 and a S.D. 
of 15.66. The range was 87 to 165 with skew- 
ness .07. The questions deal with the areas of 
nutrition, excretion, exercise, posture, use of 
scientific services, the evaluation of health in- 
formation, and defenses against diseases, acci- 
dents, drugs, and maladjustments. The author 
reports the reliability to be 87, 88, .86 on three 
retest experiments. Powell found .86 with 141 
freshmen women. 

Validation of the items and of the instru- 
ment as a whole is vaguely described but in 
general this included: (a) selection of items 
from health-education references; (b) com- 
parison of self-ratings with roommates’ ratings ; 
(c) judgments of specialists; (d) observation 
of health practices of 15 fraternity men over 
21, months; (e) "study of a high-school 
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hygiene class in which objective measures were 
applied to student responses on the Inventory” ; 
(f) determination of two “sets of discrimina- 
tion values” for each item. The findings on 
these studies are not given. No evidence is ad- 
vanced by the author that the inventory scores 
correlate with measured aspects of health and 
physical fitness or with objective appraisals. 
However, the Health Practice Inventory is a 
useful device for calling attention to some de- 
sirable health practices; although some of the 
questions included probably have little to do 
with actual health practices. 

There is certainly much question about 
whether such practices have any very funda- 
mental relationship to health. What living 
habits really build health? A serious question 
is the omission of specific items of importance 
which deal with the correction of remedial de- 
fects, positive physical fitness tests and rat- 
ings, daily outdoor exposure or artificial ultra- 
violet light, more adequate diet practices in 
specific areas, recognition of signs and symp- 
toms of unfitness, recognition and self-rating 
on desirable standards after appropriate testing 
experiences and the extent to which living 
habits are adjusted to temperament and con- 
stitutional body type. The very general rather 
than specific nature of the questions would 
lead a critic to doubt whether many of the 
specifics in health and physical fitness were 
touched at all. This makes many of the ques- 


tions hard to answer with any degree of exact 
meaning, 


[ 424 ] 

*Indiana University Motor Fitness Index. Boys 
and men in grades 10-16; 1943: the index is based upon 
three events: chins or straddle chins, push-ups, and 
vertical or standing broadjumps; so¢ per manual; 60 
or more examinees may be tested in a 50-minute period ; 
Karl W. Bookwalter and Carolyn W. Bookwalter ; 
Bureau of Cooperative Research and Field Service, 
School of Education, Indiana University. 


5 REFERENCES 
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2. BOOKWALTER, Kart W. Further Studies of Indiana Uni. 
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niversit ore, September 1943. Pp. Paper. 
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Bloomington, Ind.: Indiana University Bookstore, March 1943. 
Pp. 22. Paper. $0.50. * (PA 17:2652, 2653) 
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*Kilander Nutrition Information Test. Grades 9- 
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16 and adults; 1942-46; 1 form; » per ae 3430 per 
00; I er sample copy; nontimed (10-20) minutes ; 
Holes Frederick Kalender; the Author, 25 N. Edison 
St., Arlington, Va. 


REFERENCES 

1. Krrawpem, H. F. “What the Public Knows About Nutri- 

tion.” Res Q 15:188-97 My '44. * 
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XTrusler-Arnett Health Knowledge Test. Grades 
9-16; 1940; Forms A, B; 90€ per 25, postpaid; 156 per 
specimen set, postpaid; 50(55) minutes; V. T. Trusler, 
C. E. Arnett, and H. E. Schrammel ; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia. 


Thomas Kirk Cureton, Professor of Physi- 
cal Education and Director of the Physical 
Fitness Research Laboratory, University of 
Illinois, Urbana, Illinois. This test is presented 
as two equivalent forms, each consisting of 144 
responses divided into 86 responses of the plus 
or minus kind, 21 responses of the selection 
of the best of four possibilities, and 37 re- 
sponses in which the best word or double word 
is selected to fill in and thus complete a state- 
ment, The authors state "that there'is a need 
for more thorough and consistent educational 
guidance in this field is apparent from the large 
number of students who on this test, and on 
similar tests, make surprisingly low scores." 
The authors report a reliability coefficient of 
.86 between Forms A and B when given to 549 
college students with probable errors ranging 
from 2.2 to 3.4 for various grades. The as- 
sumption is that the test measures the knowl- 
edge of the facts and principles of healthful 
living and that the results can be used for in- 
dividual guidance, although the authors confess 
that there is a doubtful relationship between 
such knowledge and the actual practice of 
health habits. The authors state, “it is obvious 
that without knowledge of the basic principles 
and rules it is improbable that maximum effi- 
ciency in the application of such rules and 
principles would result in the practical life of 
the citizen." Correlation of the results on this 
test with the Gates-Strang Health Knowledge 
Test for 200 high school pupils is reported as 
.67. For the same group the coefficient between 
odd-numbered items and even-numbered items 
on the Gates-Strang test, when stepped up by 
the Spearman-Brown formula, is 68; and for 
the Trusler-Arnett test it is .77. 

Form A contains several questions of a 
doubtful character, such as (keyed answers are 
given in brackets) : Item 36, "The burden of 
establishing mental health rests with the indi- 
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vidual" (true); Item 58, "Butter is more 
healthful as a part of one’s diet than oleomar- 
garine” (true) ; Item 70, “The proper air for 
breathing should consist of about one-fourth 
water vapor" (false); Item 73, “As one ages 
he outgrows his emotional difficulties” (true) ; 
Item 78, "Young people need longer but fewer 
rest periods than adults to ward off fatigue" 
(false). 

Likewise, in Form B: Item 48, “Nervous- 
ness may be due to learning" (true) ; Item 55, 
"Weight lost during heavy muscular activity 
should be regained during the next twenty- 
four hours" (true); Item 65, “The sense or- 
gans can function almost indefinitely without 
any great amount of fatigue" (true) ; Item 72, 
"Persons who take part in strenuous athletics 
during their school days are more susceptible 
to fatty degeneration of the heart in later 
years than are non-athletes" (true); Item 79, 
"In so far as mental work is concerned, cir- 
culating air is considered to be pure air" 
(true). 

While the previous questions cannot be an- 
swered correctly according to the answers 
given with any degree of certainty, there are 
other examples of poor question-making which 
illustrate questions which are obvious lip 
service to an unrealistic belief or biased atti- 
tude, and questions with double answers or 
questions in the negative form. 

The more serious criticism of such a test is 
related to the poor relationship which appar- 
ently exists between actual health or fitness 
and health knowledge of the sort called for in 
this test. What is measured seems to be mostly 
"biological information of a shot gun variety." 
Hardy and Hoefer * failed to find any appre- 
ciable relationship in their extensive longitudi- 
nal study of the relationships between health 
habits and health information or between health 
information and health as appraised by expert 
physicians. Even health practice as self-rated 
in the Johns' Health Practice Inventory cor- 
related —.15 with physical performance in 
Powell's study} of 141 freshmen college women. 

Health knowledge may be something of con- 
siderable worth in itself, but the question may 
be fairly raised why there could not be a selec- 


* Hardy, Martha C., and Hoefer, Carolyn H. Healthy 
Growth: A Study of the Influence of Health Education on 
Growth and Development of School Children. Chicago, IN.: 
University of Chicago Press, 1936. Pp. xii, 360. * 

t Powell. Virginia M. An Analysis of Relationships Existent 
Between Current Health Practices, Personal and Social Ad- 
justment and Physical Performances. Unpublished doctor's 
thesis, New York University, 1946. 


tion of questions which would correlate more 
highly with health practices and actual meas- 
urements of physical fitness. Then, such de- 
vices as knowledge tests might be useful for 
effective guidance in obtaining higher levels of 
positive health and fitness. 


HOME ECONOMICS 
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xAssisting With Care and Play of Children: State 
High School Tests for Indiana. Grades 7-8; 1 
47; Form A; 4¢ per test; 15¢ per sample test; 28(33) 
minutes; Alice Stair and Muriel G. McFarland; State 
High School Testing Service, Purdue University. 

Helen C. Dawe, Associate Professor of 
Home Economics, The University of Wiscon- 
sin, Madison, Wisconsin. This test presents 
very brief descriptions of 17 situations per- 
taining to the care of young children; each 
situation is followed by 4 to 11 true-false 
items. The situations are practical ones that 
seventh and eighth grade pupils might reason- 
ably be expected to encounter in their care of 
younger children, The test should, therefore, 
appear to the pupils as a fair method for de- 
termining their knowledge of the subject mat- 
ter. Since items do not seem closely geared to 
any particular textbook, the test could well be 
used in other states. 

A disproportionate emphasis is placed upon 
the choice of suitable toys since 29 of the 112 
items in the test pertain to this topic. Three 
of the situations concerned with evaluating 
toys require the pupil to indicate whether a 
particular toy is a good or a poor addition to 
a child’s play equipment. Having made his an- 
swer, the pupil is then to check which of a 
number of possible reasons he used as a basis 
for his answer. In one of the three instances 
all the possible reasons are positive, and thus 
the pupil could conclude that the toy was a 
good addition. 

In at least three of the test situations the 
reviewer would not agree with the correctness 
of some of the choices as indicated by the 
scoring key. Particular exception is taken to 
the principle that if a child doesn’t like spinach 
and carrots, one should “make him eat all his 
dinner before he can have dessert.” 

The fact that it should not be too difficult 
for pupils to remember most of the situations, 
and many of the possible solutions, may allow 
for communication of the test content from 
pupil to pupil. However, if all classes in a 
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school are tested simultaneously and the pupils 
are then allowed to discuss the examination 
with their teacher, this test should provoke 
good discussion of the subject matter and allow 
for profitable evaluation of the suggested solu- 
tions to the situations. 
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Assisting With etnies d un State High 
School Tests for Indiana. Grades 7-8; » 1945-46; 
Form A; 4¢ per test; 156 per sample test; 28(32) 
minutes; Elizabeth Anderson, Muriel G. McFarland, 
and Kathleen McGillicuddy ; State High School Test- 
ing Service, Purdue University. 

[ 429 ] 
XChild Development: State High School Tests for 
Indiana. High school; 1945; Form A; 4746 per test; 
I5 per sample test; po, minutes; Roberta Kelly, 
Muriel G. McFarland, and Alice Stair; State High 
School Testing Service for Indiana, Purdue Univers- 
ity. 

Helen C. Dawe, Associate Professor of 
Home Economics, The University of Wiscon- 
sin, Madison, Wisconsin. This test is based on 
an Indiana course of study in the care and 
guidance of children and upon such references 
as Care and Guidance of Children by Good- 
speed and Johnson, Home and Family Living 
by Justin and Rust, and Child Care and Train- 
ing by Faegre and Anderson. However, the 
test items so succeed in stressing knowledge of 
basic principles that use of the test need not 
be restricted to Indiana high schools nor to 
users of those particular textbooks. 

Among the areas in which the test attempts 
to measure knowledge are: food requirements 
of young children, characteristics of a healthy 
child and means of promoting normal physical 
growth, the effects of heredity and environ- 
ment on the development of the child, suitable 
clothing, selection of games and stories, and 
desirable procedures to follow in guiding young 
children. 

There seems to be good balance between the 
different areas, and the number of test items 
concerned with each area seems to correspond 
in some degree to the relative importance of 
these areas in the guidance of young children. 

In only two or three of the 1 5I items did 
there seem to be possibility of disagreement 
with the answers indicated in the Scoring key ; 
however, the reviewer would certainly not 

agree with the statement that if a small child 
starts crying in the night, “it is wise to let him 
cry if you have made sure there is no physical 
reason for his crying." A real possibility for 
error in scoring the test is to be found in Part 


II, which consists of a list of phases of develop- 
ment, each of which is numbered. To the right is 
a list of play materials, after each of which the 
pupil is to write the number corresponding to 
the phase of development to which the play 
material contributes. Some satisfy two phases 
of development, and the test's items are num- 
bered separately to indicate that fact. Although 
it should make no difference which numeral 
the pupil enters first, the scoring key presents 
the lower numeral first, the high numeral sec- 
ond. If a pupil should reverse that order in 
each of the four sections requiring two an- 
swers, there would automatically be eight in- 
correct responses on his test sheet, although 
actually his answers would be quite correct. 


[ 430 ] 
XClothing I: State High School Tests for Indiana. 
High school ; 1946-47; Forms A, B; 4¥4¢ per test; 15¢ 
per sample test; 55(60) minutes; revised by Mary I 
Healey, Jeannette O. Parvis, and Muriel G. McFarland ; 
State High School Testing Service, Purdue University. 


[ 431 ] 
XClothing II: State High School Tests for Indi- 
ana. High school; 1946-47; Form A; 4V4£ per test; 
15¢ per sample test; 55(60) minutes; Mary I. Healey, 
Ruth Davis Moutoux, Jeannette O. Parvis, Louise 
Stedman, and Muriel G. McFarland; State High School 
Testing Service, Purdue University. 


[ 432 ] 
X*Foods I, Food Selection and Preparation: State 
High School Tests for Indiana. High school; 1944- 
45; Forms A, B; 4¥%¢ per test; 15¢ per sample test ; 
55(60) minutes; Mary T. Swickard and Muriel G. 
McFarland; State High School Testing Service, Pur- 
due University. 

[ 433 ] 


XFoods II, Planning for Family Food Needs: 
State High School Tests for Indiana. High school ; 
1943-44; Forms A, B; 4126 per test; 15¢ per sample 
test; 55(60) minutes; Mary T. Swickard and Muriel 
G. McFarland; State High School Testing Service, 
Purdue University. 
[ 434 ] 

*Helping With Food in the Home: State High 
School Tests for Indiana. Grades 7-8; 1945-46; 
Form A; 44 per test; 156 per sample test; 28(32) 
minutes; Elizabeth Anderson, and Muriel G. McFar- 
land; State High School Testing Service, Purdue Uni- 


versity. 
[ 435 ] 

*Helping With the Housekeeping: State High 
School Tests for Indiana. Grades 7-8; 1045-46; 
Form A; 4é per test; rsé per sample test; 28(35) 
minutes; Evelyn Swaim, Kathleen McGillicuddy, and 
Muriel G. McFarland; State High School Testing 
Service, Purdue University. 


[ 436 ] 
*Home Care of the Sick Test: State High School 
Tests for Indiana, High school; 1943-44; Form A; 
aé per test; ISÉ per sample test; 55(60) minutes ; 
Jeannette O. Parvis, Gleela Ratcliffe, Ruth Davis, and 
Muriel G. McFarland; State High School Testing 
Service, Purdue University. 
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[ 437 ] 
*Housing the Family: State High School Tests 
for Indiana. High school; 1943-44; Form A; 4/4 per 
test; 156 per sample test; 55(60) minutes; Jeannette 
Orem Parvis and Muriel G. McFarland; State High 
School Testing Service, Purdue University. 


[ 438 ] 
Minnesota Check List for Food Preparation and 
Serving, Revised Edition. Grades 7-16 and adults; 
1938-45; 1 form; $1 per 100; ro¢ per specimen set; 
Clara M. Brown; University of Minnesota Press. 


[ 439 ] 
Minnesota Food Score Cards, Revised Edition. 
High school and college; 1940-46; cards are available 
for rating 57 products prepared in laboratory work in 
foods; 25¢ per set of 57 different cards; 254 per 50 
of any one card; Clara M. Brown; distributed iy 
Cooperative Test Service. 


INDUSTRIAL ARTS 


[ 440 ] 

X*Examination in Mechanical Drawing. High 
school; 1945; IBM for Sections 1-3; Form SMD-1- 
B-4 (usually called Form B) ; separate answer sheets 
must be used for Sections 1-3 (Section 4 is a perform- 
ance test) ; no norms available; $2 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
25¢ per specimen set, postpaid; Sections 1-3, I 0 (160) 
minutes ; Section 4, 60(65) minutes; prepare by the 
Examinations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation ; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates: $2 per 25; 
65¢ per 25 machine-scorable answer sheets; Sof per 
key; so¢ per specimen set.) 


[441] 
*Mechanical Drawing Test: State High School 
Tests for Indiana. High school; 1944-45; Form A; 
5¥4¢ per test; 15¢ per sample test; 80(85) minutes; 
Justus Rising; State High School Testing Service, 
Purdue University. 


RECORD AND REPORT 
FORMS 


[ 442 ] 
*American Council on Education Cumulative 
Record Card for Grades 1, 2, and 3. 1946; 6¢ per 
copy ; 25¢ per specimen set; prepared by the Committee 
on Cumulative Records of the American Council on 
Education; American Council on Education. 


[ 443 ] 
XAmerican Council on Education Cumulative 
Record Card for Grades 4, 5, and 6. 1944; 6¢ per 
copy; 25¢ per specimen set; prepared by the Committee 
on Cumulative Records of the American Council on 
Education; American Council on Education. 


[44] 
American Council on Education Cumulative Rec- 
ord Card for Junior and Senior High Schools, 1941 
Revision. 1928-43 ; 6¢ per copy; 25¢ per specimen set; 
prepared by the Committee on the Revision of Cumula- 
tive Records: Richard D. Allen, Millard E. Gladfelter, 
W. S. Learned, John W. M. Rothney, Donald J. Shank, 


* 


Eugene R. Smith (Chairman), Arthur E. Traxler, 
E. G. Williamson, and Ben D. Wood; American Coun- 
cil on Education. 


Herbert A. Toops, Professor of Psychology, 
The Ohio State University, Columbus, Ohio. 
This two-sheet file-folder blank, bearing the 
date 1941, is in effect a considerably improved 
edition of the 1942 “revision” of the American 
Council on Education Cumulative Record Card 
for Colleges. Unlike the latter, the stub-head- 
ings are not invaded ; six columns are provided 
for the time-projection record, ample for a high 
school career by year-records but too short for 
eight or nine semester recordings. If the high 
school label of this form were replaced by the 
word "college," few would have any reason to 
suspect that this is not a greatly improved edi- 
tion of the college form. There would be little 
or nothing to give away the omission of the word 
“high school.” 

The remarks (see 445) regarding the high 
desirability of a line drawing in the college 
form apply equally to this form. The scholastic 
record is divided up by areas of specialization 
—English, language, mathematics, etc.—which 
is an improvement. The fault of the rating 
scale noted in the college form persists in this. 
The typography is greatly improved over the 
college form; more variation in type size and 
font give more emphasis where emphasis is 
needed. It is a fair guess that the reviser of 
this form went off to war before the college 
blank was "revised" a year later. 

A useful 30-page manual, with six pages of 
bibliography on research forms, is a valuable 
adjunct. It gives very concrete notions of what 
type of information is worth recording in the 
record form. 

[445] 
American Council on Education Cumulative Rec- 


ord Card for Colleges, 1942 Revision. 1028-42 ; 
6¢ per copy; 25¢ per specimen set; American Council 


on Education. 


Herbert A. Toops, Professor of Psychology, 
The Ohio State University, Columbus, Ohio. 
This is a time-projection record rather.than a 
real “cumulative” record; that is to say, it is 
provided with four columnar annual projec- 
tions to receive entries according to a common 
stub-heading in as many successive college years 
and one additional, too-narrow and space- 
invaded column evidently intended for a high 
school summary but simply labeled “High 
School.” In view of the fact that few students 
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graduate in four years, four such projections 
are generally not enough for a complete un- 
dergraduate academic career. 

The card was printed from linotype matrices 
and rulings. A carefully designed black-ink line 
drawing segregating subdivisions into well- 
defined areas, and generally breaking up the 
Space both from aesthetic and from utilitarian 
considerations might well replace the bad typog- 
raphy. The drab uniformity of type and line 
and the lack of geographic emphasis and accent 
make the form as uninteresting to the eye as 
a prairie. The kinds of records provided for by 
the headings, on the other hand, are potentially 
capable of being as delineated as are the fea- 
tures of a mountain landscape. It takes some 
little study without a manual to decide for one- 
self just where a bit of information should be 
recorded. This statement applies particularly 
to items such as: age in years and months as 
of September first, year and class placement, 
father, mother, and stepparent or guardian. 
The difficulty in interpreting the first two of 
the afore-mentioned items results from the use 
of the two-column caption “High School” to 
include a column of stub-heads (which are 


- general) and an all too narrow column for a 


Summary of the high school record; this latter 
column is invaded by a section "Faculty Judg- 
ments" where one would naturally expect a 
summary of the high school entrance units. All 
of this could easily have been obviated if a 
hand-drawn form, with a zinc engraving, had 
replaced an uncritical compositor's judgment in 
the design of the card. 

There is no justification for having a file- 
folder fold right in the middle of a record as on 
both sides of this record form, 

The items to be recorded have been well- 
thought-out or have evolved as the result of 
long experience and are indicated both by gen- 
eral and by subheadings, but unfortunately 
without that emphasis of type face, spacing, 
and the like which are needed to make them 
really function, The authors evidently hope 
that the card may serve as a brief alumni rec- 
ord and also function administratively in the 
recruiting of the entrant’s siblings. 

An inordinate amount of time would be re- 
quired to fill out the form completely for one 
student. The record card strikes a course in 

need-fulfillment between the records of the 
twenties when every personnel form contained 
the injunction “Write it in ink” and that of the 


present postwar period when much of such 
work of necessity in many institutions will 
need to be done on punched cards. The record 
is not designed for use with IBM cards. ; 

The form provides an abundance of identi- 
fying information including nickname and 
photo, necessary for large postwar enrollments, 
but neglects one of the most error-producing 
ones, that of whether or not the registrant is 
a twin. Footnotes and explanations, which 
would make the record intelligible to the novice 
recorder, are few in number. These footnotes 
are set in larger type than is necessary and 
employ space which might better be allotted to 
the recorded information. E 

A time-projection record for six traits is 
provided. Since this is evidently intended for 
administrative scrutiny, the bold-faced indi- 
vidual categories of the ratings might well have 
been printed next to the record spaces to which 
they refer without including in an intervening 
iour inches the fine-print promptings to the 
raters to define these qualitative degrees, This 
inclusion seriously jeopardizes the usefulness 
of this portion of the record. 

The blank does not suffer from a large num- 
ber of abbreviations unintelligible to the casual 
reader of such a record, a serious fault of 
many such records. As a whole the card shows 
all too little change from the record card de- 
signed by Colonel Coss after World War I. In 
particular, at this late date one would expect a 
profile of the test record rather than merely a 
tabulated record. The only profile on the form 
revealing growth, which alone justifies the ad- 
jective "cumulative," is in the case of the 
above-mentioned ratings; and the format here 
would lead one to think of it as a record rather 
than as a time-profile. The objections herein 
would no doubt become less and less with use; 
but turnover of recording personnel could be, 
or ought to be expected to be, large with this 
form. 


[ 446 ] 
*Guidance Summary Form for Use in Vocational 
and Educational Counseling. Adults; 1946; adapted 
from Aids to the Vocational Interview by P. S. 
Achilles, $2.25 per 25, Postpaid ; 35¢ per sample set of T 
postpaid; George K. Bennett and Charles E. Orbach; 


Psychological Corporation. 


Norman Frederiksen, Associate Professor o f 
Psychology, Princeton University; and Re- 
search Associate, College Entrance Examina- 
tion Board; Princeton, New Jersey. This form 


7" 


485 ] 


TESTS & REVIEWS: MISCELLANEOUS—RELIGIOUS EDUCATION 452a 


is adapted from Achilles Aids to the Voca- 
tional Interview. Many of the questions from 
the older blank have been retained, some with 
considerable revision, and together with a few 
new items they appear under the headings of 
general information, education, work experi- 
ence, health, family, interests, and self-evalua- 
tion. A “psychometric evaluation” form is pro- 
vided in which can be recorded names of tests 
together with scores, percentile ranks, stand- 
ard scores, norms used, and comments. No 
provision is made in this form for recording 
interest test scores. The “graphic record” form 
for showing percentile ranks has been aban- 
doned. The temperament check list and the 
page for self-ratings of special abilities have 
also been omitted from the new blank. Space 
is provided for counselor’s notes and for writ- 
ing a general summary. 

The Guidance Summary Form is shorter 
than its predecessor, and the questions are 
more direct and to the point. For example, 
instead of asking, “Any precautions or worries 
as to health?” the new question is, “Have you 
any health problems that affect the type of 
work you can do?" ; and “What are your inter- 
ests?” has been changed to “What do you do in 
your spare time?” The counselor may find this 
form a convenient way of collecting a good 
deal of personal data, much of which is poten- 
tially useful in vocational and educational 
guidance. 


J Consult Psychol 11:282 S-O "47. This 
guidance summary form...offers counselors a 
practical and systematic way of securing the 
client's cooperation in recording significant as- 
pects of his previous experiences and current 
aspirations to provide a basis for educational 
and vocational planning. * 


[ 447] 
XHamilton Cumulative Record Folder, Revised 
Edition. Grades 7-13; 1045; 95¢ per 25; 30 per 50 
forms (Hamilton Student Information Form, Revised 
Edition, 1945) for gathering information for the 
cumulative record folder; 7¢ per specimen set; Voca- 
tional Guidance Centre, University of Toronto. 

[ 448 ] 
Standard Profile Chart. 1947; 75€ per 50, any color 
(white, pink, or yellow) ; 25¢ per specimen set; Clif- 
ford E. Jurgensen; Educational Test Bureau. 


Charles R. Langmuir, Director, New York 
Office, Educational Testing S ervice, New York, 
New York. This chart is a graphic form for re- 
cording test scores, ratings, or other personnel 


measurements, The vettical axis is divided in 
units of standard scores (mean equated to 50, 
standard deviation to 10). Percentile equiva- 
lents, the nine categories of the stanine scale, 
and a verbal scale of five categories are indi- 
cated. Columns are provided for 18 variables, 
with adequate spaces at the top of the sheet 
for identifying the tests used, Raw scores, 
transmuted scores, and dates may be entered 
at the bottom of the columns. The chart is 
printed on heavy paper (light card stock), 
which will file easily and wear well. 

In situations where the volume of records 
does not justify the preparation of a special 
form printed for the purpose, the Standard 
Profile Chart may be useful. 

A four-page descriptive manual explains the 
need for transmuted scores and describes the 
four scales aligned for use with the chart, The 
naive user, for whom the chart appears to have 
been developed, will be seriously misled by the 
author’s statement on comparability: “when 
testing a single individual, test users frequently 
use tests accompanied by different types of 
norms. . . . Difficulties resulting from lack of 
uniformity in the matter of norms are elimi- 
nated by the use of the Standard Profile Chart, 
which automatically converts scores into any 
of the four most commonly used types of 
norms." Difficulties resulting from lack of uni- 
formity in norms will recur whether the test 
user makes profiles or any other record. 


J Consult Psychol 11:157-8 My-Je ^47. 


[449 ] 
*V.G.C. Anecdotal Record Form. 1943; 13¢ per 50; 
Vocational Guidance Centre. 


[ 450 ] 

*V.G.C. Cumnitiye Record ponen Grag ns 
-13; 1944; 2 levels; os¢ per 25 of either level; 7; 
TE specimen set of either level; M. D. Parmenter; 

Vocational Guidance Centre. 

[ 451 ] 
*V.G.C. Interview Record Form. Canadian schools ; 
1943-44; 20¢ per 50; M. D. Parmenter; Vocational 
Guidance Centre. 

[ 452 ] 


*V.G.C. Student Information Form, Revised. Ca- 
nadian elementary and secondary schools; 1043-44; 
30¢ per 50; M. D. Parmenter; Vocational Guidance 


Centre. 


RELIGIOUS EDUCATION 


[ 452a ] 
XHebrew School Attitude Test. Ages 8-13; 1047; 
Forms A, B; 5¢ per test; 15¢ per sample test; Noah 
Nardi; Jewish Education Committee of New York, Inc. 
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[ 453 ] " 
* Jewish Home Environment Test. Jewish schools; 
1946; separate booklets for pupils and parents; 5¢ per 
booklet; rof per sample test; Noah Nardi; Jewish 
Education Committee of New York, Inc. 


[ 454 ] 
XJewish Information Test. Jewish schools; 1945; 
Forms A, B; 5¢ per test; 15e per sample test; non- 
timed (30-45) minutes; Noah Nardi; Jewish Educa- 
tion Committee of New York, Inc. 


[ 455 ] 
*Religion Essentials Test. Grades 7-12 ; 1939; 
Forms A, B, C, D, E, F, G, H; $1 per 25; 25¢ per 
specimen set; 45(50) minutes; Austin G. Schmidt; 
Loyola University Press. 


REFERENCES 
+ abe Ur G. Tee the Teaching of Reli- 
sion,” J Religious Instr 13:32) 2 Ja '43. tee 
Vd Sarre, WALTER J. M Crie of the Schmidt Religion 
Essentials Test and Its Restandardization in Grades Five to 
Light. Unpublished master's. thesis, Fordham University, 1945. 


[ 456 ] 
X*Wilson Tests of Religious Aptitude. Grades 9-16 
and adults; 1942; 1 form; 6 parts; $1.15 per 25 of 
any one part, postpaid; $5 per 25 sets of the 6 parts, 
postpaid; 354 per specimen set, postpaid; nontimed 
(15-25) minutes per part; manual prepared by H. E. 
Schrammel; Matthew H. Wilson; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia, 
4) TEST OF RELIGIOUS APTITUDE CONCERNING PARENTS. 
b) TEST OF RELIGIOUS APTITUDE CONCERNING TREES, 
C) TEST OF RELIGIOUS APTITUDE CONCERNING FLOWERS, 
d) TEST OF RELIGIOUS APTITUDE CONCERNING PRAYER. 
¢) TEST OF RELIGIOUS APTITUDE CONCERNING FRIENDS, 
Í) COMPOSITE TEST OF RELIGIOUS APTITUDE, 


REFERENCES 


1, Wirson, MarrHEWw H. “The Wilson Tests of Religious 
Aptitude,” PP, 37-45. In The Third Yearbook of the National 
Council on Measurements Used in Education, Fairmont, W. 
Va.: the Council, Fairmont State College, 1946. Pp. v, 58, vi. 
Paper, mimeographed, $0.75. * 


Goodwin Watson, Professor of Education, 
Teachers College, Columbia University, New 
York, New York. Do you have helpful, neutral 
or harmful religious experience in connection 
with flowers “at dusk,” flowers “in the morn- 
ing,” flowers’ “hardihood,” flowers “at funer- 
als"? How about the “straightness” or “queer- 
ness” of trees or “trees covered with snow”? 
Draw a line through the number 0-10, indi- 
cating the proportion (sic) of your particular 
religious experiences in connection with “Moth- 
er’s courage in trouble,” “Father’s sacrifice,” 
"Friends giving to others," "physicians," 
"snakes," "tornadoes," and "prayer" which you 
have found helpful or harmful! 

This is an extraordinary jumble of items, 
conceived as a measure of "capacity for reli- 
gion.” The author has worked a quarter cen- 
tury, getting responses from some 13,413 
people and offers reliability figures (basis not 
stated) from .88 to -96, and percentile norms 
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for high school students, college students and 
adults. The directions are certainly confusing ; 
e.g., on the “Flowers” test, the directions refer 
to both helpful and harmful reactions, but the 
rating scale covers only helpful. The scale of 
responses ranges from o (meaning either lack 
of religious experiences or presence of reli- 
gious experiences felt neither helpful nor 
harmful) through 1 (meaning "few") to 10 
(meaning “many.”). No more accurate indi- 
cation of the precise meaning of the numbers 
2, 3, 4, 5, 6, 7, 8, 9 is given. Score is the total 
value of. digits crossed out; i.e., people who 
check in connection with “friends” that “su- 
perior being recognized” (whatever that may 
mean) stimulates helpful religious reactions 
“8” have twice the religious capacity of those 
who on that item check “4.” 

The reviewer has seldom seen a more strik- 
ing example of statistical care (P.E. of r to 
three decimal places) lavished upon almost 
meaningless material. 


SAFETY EDUCATION 


[ 457 ] 


XInstructional Test on Safety. Grades 5-9; 1942; 
Forms A, B ; $1.50 per 25; 25¢ per specimen set ; 45(50) 
minutes; David F. Votaw, Sr.; Steck Co. 


[ 458 ] 
XSiebrecht Attitude Scale. Grades 9-16 and adults ; 
1041 ; $1 per 25; 10¢ per copy; nontimed (10) minutes ; 
Elmer B. Siebrecht; Center for Safety Education, 
Division of General Education, New York University. 


REFERENCES 

1, Stemrecut, Ermer B. The Construction and Validation of 
@ Scale for the Measurement of Attitudes Toward Safety in 
Automobile Driving. Unpublished doctor's thesis, New York 
University, 1941. Pp. 142. J 

2. SIEBRECHT, ELMER D. Measuring Driver Attitudes. New 
York University, Division of General Education, Center for 
Safety Education, Research Contributions to Safety Education, 
Vol. 2, No. 3. New York: the Center, 1941. Pp. 29. $0.45. 
Paper. * (PA 16:738) 


SENSORY-MOTOR 


[ 459 ] 
XA-B-C Vision Test for Ocular Dominance. Ages 
5 or 6 and over; 1927-47; individual; $2.00 per set of 
testing materials; (2-5) minutes; Walter R. Miles; 
published in 1947 by Psychological Corporation. 


REFERENCES 


1. Mites, Warrer. "Ocular Dominance—Methods and Re- 
sults." Abstract. Psychol B. 25:155-6 Mr '28, * 

2. Mites, Watter R. "Ocular Dominance Demonstrated by 
Unconscious Sighting." J Exp Psychol 12:113-26 Ap ‘29. * 
(PA 3:3481) 

& frLES, Warrer R. “Ocular Dominance in Human Adults.” 
J General Psychol 3:412-30 1130, * (PA 4:4631 : 

4. Urpranarr, Rutu. “Ocular Dominance in Young Chil. 
dren.” J Exp Psychol 15:758-66 D "32. * (PA 7:4793) " 
.5. WasnsuxN, M. R.; Faon, [on AND Scott, R. “Studies 
From the Psychological Laboratory of Vassar Vollege: LXV, 
A Comparison Between the Miles A-B-C Method and Retinal 
Rivalry as Tests of Ocular Dominance.” Am J Psychol 46:633-6 
O "34. * (PA 9:96) 
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[ 460 ] ^ 
XAO Sight-Screener. Adults; 1946; targets are avail- 
able for both readers and nonreaders of English letters; 
the instrument is available only as a part of the Indus- 
trial Safety and Visual Efficiency Program which in- 
cludes “consultative services with respect to the follow- 
ing: (1) Installation of the AO Sight-Screener and 
training in its operation, use and maintenance; (2) 
Development of an individual visual efficiency program 
aimed to meet the specific needs of the lessee; (3) De- 
velopment and execution of an educational campaign 
to secure and maintain the cooperation of employees, 
supervisory staff, safety committees, employee groups 
or others in the visual safety and efficiency program ; 
(4) Establishment of eye protection standards for vari- 
ous classes of operation and types of work, programs 
for fitting and maintaining eye protection devices; (5) 
Service on safety goggles and corrective-protective 
materials; (6) Recommendations on the improvement 
of environmental conditions affecting vision—illumina- 
tion, glare, color application, use of optical aids, and 
other means for increasing visual perception; (7) 
Recommendations on the establishment of job visual 
placement standards for various occupations or job 
classifications aimed to bring about better utilization 
of individual visual capabilities”; less than 300 em- 
ployees, $250 fee for the first year, $50 fee for the 
second year; 300 or more employees, $350, $550, and 
$700 fees for the first year for I, 2, and 3 instruments 
respectively, $100 fee for the second year; (3-5) min- 
utes; developed by the Bureau of Visual ience, 
American Optical Co.; American Optical Co. 


REFERENCES 


, 1. Freeman, Erus. “A Basis for Testing and Improving vi- 
sion o pndnsuyzs J Consult Psychol 9:35-44 Ja-F 45. * (PA 
9:202! 
2. Stausacn, Fren W. An Investigation of the Visual Prob- 
lems of Industry. Unpublished master’s thesis, Toledo Univer- 


sity, 1945. 
3. Staunact, Fren W. “Industrial Visual Screening Case 
Mr 14 '46. * (PA 20:2265) 


Report." Optom Wkly 37:339-42 
4. SULZMAN, JOHN nie ton ErrswoxrH B.; AND BARTLETT, 
Commercial 


Nutt R. Visual Acuity Measurements With Three 

Screening Devices. Progress Report No. 2 on Bureau o! Medi- 

cine and Surgery Research Project X-493. (Av-263-p),. “Com- 
parison of Various Screening Devices With Standard Medical 
rocedures.” New London, Conn.: Medical Research Depart- 

ment, U. S. Submarine Base, February 7, 1946. Pp. xii, 43- 


Paper, mimeographed. Gratis. 
$ AND BARTLETT, 
Test for Lat- 


5. Suzman, Joun H.; Coox, ELLSWwoRTH B.; 
Neit R. Part 2, “Comparison of the Maddox Rod 
eral and Vertical Phoria With Phoria Tests Incorporated in 
Three Visual Screening Devices ioe *Telebinocular,” 
American Optical ‘Sight Screener,’ and Bausch & Lomb ‘Ortho- 
Rater, ” pp. 16-36. In their Comparative Study of Measures. of 


Heterophorio. Progress Report No. 3 on Bureau of eue 
m- 


and Surgery Research Project No. X-493 (Av-263-p 
parison of Various gph Project No. Aapa, Qrradard Medical 

rocedures." New London, Conn.: Medical Research Depart- 
ment, U. S. Submarine Base, February 22, 1946. Pp. iii, 39- 
Paper, mimeographed. Gratis. * 

6. Bemis, OWARD ALBERr, “An Experience With Indus- 
trial Vision Problems." Am J Optom 24:74:9 F '47. * 

J. H.; Coox, E. B.; AND BARTLETT, N. R. "The 


7. SULZMAN, 
l; Three Commer- 


Reliability of Visual Acuity Scores Yielded by T P551 CP:38572) 


cial Devices.” J Appl Psychol 31:236-240 Je '47- 

Henry A. Imus, Head, Psychophysiology 
Branch, Medical Sciences Division, Office of 
Naval Research, Washington, D. C. An indus- 
trial visual classification device, the AO Sight- 
Screener, has been developed to test quickly 
certain visual functions in order to select those 
individuals in need of a professional eye ex- 
amination. An effort is made to specify visual 
requirements for particular jobs in industry. A 
total of seven tests is repeated for both distant 


and near vision, including acuity, eye muscle 
balance, stereopsis, and simultaneous binocular 
vision. A test of color perception is an op- 
tional accessory. 

Some evidence concerning the reliability and 
validity of this device is available. An impar- 
tial test-retest study of three vision screening 
devices, as yet unpublished, shows that the 
reliability and validity of the Sight-Screener 
are comparable to those for the Ortho-Rater 
and are better than those of the Keystone Tele- 
binocular. The coefficients of reliability of the 
Sight-Screener are .70 for binocular acuity far 
and near; .84 for far and .77 for near monocu- 
lar acuity; .61 for far and .55 for near vertical 
phoria; .80 for far and .83 for near lateral 
phoria. The agreement with clinical tests is fair 
for acuity and low for phoria. The validity co- 
efficients are .71 for far and .64 for near binocu- 
lar acuity ; .74 for far and .71 for near monocular 
acuity; .28 for far and .34 for near vertical 
phoria ; .37 for far and .54 for near lateral phoria. 
No data are available for the test of depth 
perception. 

No evidence of the validity of these tests 
is presented by the manufacturer except the 
statement that three clinicians and two psy- 
chologists investigated both the reliability and 
validity of these tests on an early model. Evi- 
dence is presented showing a greater tendency 
for defective visual functions among accident 
repeaters than among control groups matched 
for type, age, and sex distribution, although 
the number of subjects involved is not stated. 
The tests are administered by nontechnical 
personnel, but the analysis of the results is 
made by professional eye men. The methods 
of referring subjects for professional eye ex- 
amination and eye care depend upon the policy 
or facilities of the industrial plant subscribing 
to the service. 


F. Nowell Jones, Associate Professor of 
Psychology, The State College of Washington, 
Pullman, Washington, This instrument is simi- 
lar to the Bausch and Lomb Ortho-Rater. The 
AO Sighi-Screener measures various aspects 
of visual efficiency at two optical distances. In 
evaluating the comments made below, the 
reader should keep in mind the great difficulty 
of obtaining information about the Sight- 
Screener. A very rigid policy has been set up 
which requires that the Sight-Screener be used 
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only in industry and only under the direction 
of an ophthalmologist or optometrist. 

As far as this reviewer is concerned, the re- 
liability of this instrument is completely un- 
known. The reliability of the acuity scales is 
probably somewhat higher than that of the 
Ortho-Rater, since five discrimination forms 
instead of one are used at each level. Letters 
are used, however, which require the testee to 
be literate. 

There is no reason to suppose that this in- 
strument is less useful than the Ortho-Rater, 
except insofar as the policy mentioned above 
prevents widespread use by those most quali- 
fied to evaluate the results, i.e., those trained 
in testing theory and technique. On the other 
hand, there is no reason to suppose that it is 
any more useful. 


[ 461 ] 
XBinocular Reading Test. 1043 ; $1 per set of printed 
testing materials; George Spache; the Author, Gree- 
ley School, Chappaqua, New York. 
REFERENCES 
P 1, ani oy “A Bisocular ae Test.” 
1109-I "43. * 17: 
2» Stach, Gronce, A eeili Roding Test" J EJ Psy- 


J Appl 


2, SPACHE, GEORGE, “A 
chol esata S "43. * (PA 18:899 

3. SPACHE, GEORGE. "Case Studies in Binocular Reading.” 
Am J Orthopsychiatry 13:723-6 O ^43. * (PA 18:1907) 


4, SPACHE, GEORGE. ji Preference, Visual Acuity, and 
lity.” El Sch J 43:539-43 My 43. * (PA 17:3974) 

Albert J. Harris, Assistant Professor of 
Education, The City College of New York, 
New York, New York, This test is an ingen- 
ious device, as yet not completely standardized, 
for estimating the relative participation of each 
eye in the act of reading. A simple reading 
selection is printed on both sides of a stereo- 
graph, with some words omitted on the left 
side and other words omitted on the right side. 
The selection makes good sense if only the 
words exposed on the left side are read, if 
only those exposed on the right side are read, 
or if all the words presented to either eye are 
read. There are two selections, one of approxi- 
mately beginning second grade difficulty and 
one of about fifth grade difficulty. 

Copies of the test cards, mimeographed rec- 
ord sheets, and a mimeographed manual of 
directions may be obtained from the author. 
The present stereographs have a mechanical 
defect which will probably be rectified in the 
next printing. The present cards need to be 
trimmed on the right side, and can be seen 
better in a Keystone Telebinocular than in an 
ordinary hand stereoscope. Correlations with 
final diagnosis are -709 for the easier selection 


and .813 for the more difficult selection, indi- 
cating satisfactory reliability for an instrument 
of this type. The test would seem to have value 
in measuring eye dominance and as a supple- 
mentary method of detecting cases of poor 
visual fusion or poor peripheral vision. The 
author has presented his preliminary results in 
five research papers. He deserves commenda- 
tion not only for an original and useful idea, 
but also for the conservatism of his claims and 
for his patience in keeping the test in an ex- 
perimental form for several years while sub- 
jecting it to careful study. 
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*Dvorine Color Discrimination Screening Test. 
Ages 7 and over; 1947; a set of 2 pseudo-isochromatic 
charts (selected from the Dvorine Color Perception 
Charts) for sclecting color-weak individuals ; 2 forms 
(Groups A, B) ; $2.50 per copy of either form; (1-3) 
minutes; Israel Dvorine; the Author, 2328 Eutaw 
Place, Baltimore 17, Md. 
REFERENCES 


1, Dvortne, IsraeL, "Dvorine Color Discrimination Screen- 
ing Test.” Am J Optom & Arch Am Acad Optom 25:13o:3 Mr 
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*Dvorine Color Perception Testing Charts, Vol. 
1. Ages 7 and over; 1944; a diagnostic set of pseudo- 
isochromatic charts; Vol. 2 is Dvorine Color Percep- 
tion Training Charts; $25 per set of 2 volumes ; $3 per 
100 record forms; (1-10) minutes; Israel Dvorine; the 
Author, 2328 Eutaw Place, Baltimore 17, Md. 

REFERENCES 

1. Dvoniwz, Ismazt. “A New Diagnostic Method of Testing 
and Training Color Perception." Am J Optom 21:225-35 Je 
"44. * {A 18:3391) * k 

Ia. DVORINE, ÍsmAr, “Reconditioning the Color Blind: A 
Case Report.” Am J Optom 21:508-10 D ^44. * (PA 19:1149) 

2. DvontNE, IsraEL. “The Use of Volume Two of the Dvorine 
Color Perception Testing and Training Charts for Grading 
Mos Rereeption: Difficulties.” Am J Optom 22:478-9 O "4s. * 

20:673 

3. DVORINE, ISRAEL. “Improvement in Color Vision in Twenty 
Cases.” Am J Optom 23:302-21 Jl '46. * (PA 21:36 

n McCorp, FrrcHER. “Report on the Effect of 
Col or Blindness.” J Clin Psychol 3:197-8 Ap '47. 
2900 


Am J Optom 21:427-9 O "44. Carel C. Koch. 
Dvorine..is the author of “Analytical Re- 
fraction and Orthoptics,” which is one of the 
most widely publicized and read of the recent 
books on ocular refraction, and in common with 
his book on examination techniques Dvorine's 
new color testing and training charts will also 
win wide acclaim and use, * This new testing 
and training set of charts represents a step in 
the direction of improving visual skills as these 
relate to color vision. It is of considerable im- 
portance as it initiates a new technique which 
will do much toward improving the color vision 
of many patients who are now in part color 
deficient. Dvorine has again added to the value 
of optometric procedure, as these charts and 


raining in 
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the training routine which accompanies them 
will enable optometrists to further broaden 
their services to the public. The new set of 
color perception testing and training charts be- 
longs in every orthoptic treatment room in 
America today. 


Am J Psychol 58:276-9 Ap '45. Elsie Mur- 
ray. This work by an optometrist lies—to speak 
strictly—outside the province of a JOURNAL 
reviewer; but the publisher's claim that it is 
"the first diagnostic set of pseudo-isochro- 
matic charts developed by an American" chal- 
lenges attention. In 1936 Milton B. Jensen of 
the American Psychological Association issued 
a practise-proof four-chart color test of the 
Stilling-Ishihara type, designed to segregate 
four types of color deficiency. Clockhands 
were cleverly substituted for trouble-making 
digits, but perhaps because of the unhandy dis- 
tance prescribed (ten feet), Jensen's test never 
came into very general use. * The designer of 
these new charts, setting out, not from the 
standpoint of psychologist, physiologist or 
physicist, but from that of the practical color- 
printer, selects his eight basic hues accord- 
ingly. The printing-ink hues red, yellow and 
blue become his three "primaries"; their three 
derivatives, by "subtractive" mixture or super- 
imposition, orange, green and violet (illustrated 
strikingly but none too well in a recent number 
of Life), his three “secondaries,” with brown 
(“a mixture of all three primaries") and grey 
thrown in for good measure. Unfortunately 
absent are the two critical hue-ranges in the 
commoner types of color deficiency, carmine to 
red-purple (2.5RP to 2.5R in the Munsell 
notation) and bluish green 2.5BG to 5G), pos- 
sibly because of the difficulty they offer to the 
color-printer. Dvorine's green is nearer 5YG; 
his red, 5R, his violet, 10PB. Misled by the 
ability of average color-deficients to use, par- 
rotlike, with a moderate degree of accuracy, a 
number of conventional color-names (often by 
mechanical verbal association merely), Dvorine 
infers possession of the corresponding normal 
color-experiences. Hence he deduces that the 
term *color blindness' is in most cases a misno- 
mer, though he does admit that certain heredi- 
tary or neurological conditions may operate 
adversely on color perception (a suggestion 
borrowed from K. Dunlap). Colors seen singly, 
in Dvorine's opinion, offer little difficulty to 
the individual who fails in the current tests. It 


is only when they are juxtaposed that trouble 
springs up, a kind of "visual confusion." The 
logical course is therefore to try out all the 
twenty-eight permutations and combinations of 
his eight "basic colors," and discover which 
pairs present difficulties in the individual case. 
The conventional pseudo-isochromatic chart is 
adopted * For the "vanishing pattern" the 
digit introduced by Stilling is adopted. Eccen- 
tric position on the disk and doubling of digits 
is employed to baffle wouldbe memorizers, 
otherwise there is little subtlety of design. * 
A number of the plates in which the two hues 
are near-lying on the color-continuum are al- 
most illegible to the normal, but many others 
are easy for all types of eyes, giving the red- 
green weak an encouragingly high percentage 
accomplishment (probably 70 per cent). A 
blank is provided, with a space for “diagnosis.” 
The latter has no relation to the accepted cate- 
gories, but consists primarily in rating the sub- 
ject as color-capable but weak in nomenclature 
(i.e. as “color ignorant") or actually deficient 
in “color perception" (i.e. in the discrimination 
in certain pairs of hues). No hint is given in. 
the instructions as to which color-combinations 
prove more difficult in typical cases. Volume 
TI is the direct outcome of Dvorine's conviction 
of “the susceptibility to training of the entire 
visual function." * Since the individual labelled 
color-deficient has difficulty only with colors 
in combination, according to Dvorine, five 
pairs are arbitrarily selected for practise: red 
and green, red and brown, red and grey, green 
and orange, blue and violet. On most of the 
charts they are printed in a single saturation 
only, with a color-key in the corner providing 
a cue to the hues used in digit and ground, re- 
spectively. The 14 charts in each of the five 
groups grow progressively more difficult, 
through equation of the concentration of dots 
in ground and figure, through decrease in satu- 
ration, and, finally, through the use of two 
degrees of saturation in both of the hues. In 
spite of the easy start and the cues afforded, 
the initial score is lower in this set tham in 
the 60 of Dvorine's first volume. In three of 
the five groups, the brightness-values of the 
two hues used in figure and ground are not 
equated for the normal eye,—as checked by the 
Munsell neutral brightness-scale. The red and 
reddish hues are lighter, suggesting that they 
may have been chosen by someone with a 
slightly scoterythrous eye. To the average nor- 
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mal eye or the color weak they are legible by 
brightness-contrast independently of hue. Only 
for the rarer scoterythrous type (with darken- 
ing of the red) do they become difficult (by 
laboratory test). This and other matters sug- 
gest that the reputed training effected by re- 
peated reading of these charts is not in the 
field of color-discrimination at all, but rather 
in ease of discrimination of contour and con- 
figuration. Black and white and grey digits and 
ground might have served equally well. No evi- 
dence is presented of improvement in color- 
discrimination in general, ie. aside from this 
specific type of test. Nor is there proof that 
the practise holds permanently, or is transfer- 
able to other than digit patterns. The helpful- 
ness of such training in view of the growing 
tendency to discard digits for other less memo- 
rizable and dubious configurations is therefore 
questionable. There are no cautions in the In- 
structions as to distance, timing, illumination, 
angle of plate—all of recognized importance 
in psychological technique. Without any train- 
ing whatever, mild cases of defect, if allowed 
plenty of time and a dry brush to fill in tenta- 
tively the gaps where weak red or green dots 
have fallen out of the digits, might score 75 
per cent or more in the standard tests, as every 
Color psychologist knows. Even individuals 
with color-fields narrowed almost to the fovea 
might succeed under these conditions. Yet 
what is needed in the military services is the 
ability to spot instantly, in a fairly extended 
visual field, a colored object on a slightly dif- 
ferent ground, in mist, snow or fog. Luckily 
cases coached to pass a certain type of test are 
likely to fall foul of a battery of newer tests 
before they become a hazard to themselves and 
others—other than financially. Dvorine makes 
no statement upon the purity of the inks used 
to print his charts. The materials are substan- 
tially bound, with links to allow the user to 
open the page flat, Whatever its utility, how- 
ever, the place of the test is obviously not in 
the psychological laboratory and not in the 
physician's office. 


Am J Psychol 58:397-9 JI "45. Israel Dvorine. 
"Authors Comments Upon the Review of 
Color Perception Charts,” [A reply to a re- 
view by Elsie Murray.] (i) The reviewer ques- 
tions my publisher's claim that [my charts are] 
"the first diagnostic set of pseudo-isochro- 
matic charts developed by an American" * 


Jensen's contribution has little or nothing to do 
with the publisher's claim, especially in view 
of the fact that during the past twenty-five 
years the only pseudo-isochromatic charts avail- 
able in this country were the Stilling and the 
Ishihara charts. If further proof is needed that 
this set is the first to be published by an Amer- 
ican, we need only to look at the reproduced 
Stilling's and Ishihara's charts which the navy 
department was obliged to issue at the begin- 
ning of this war because there were no other 
pseudo-isochromatic charts available in this 
country. (ii) To say that the author “bor- 
rowed" from Dunlap..is casting aspersions 
unnecessarily when due credit was given to Dr. 
Dunlap (Arch. Amer. Acad. Optom., June, 
1944, page 226); a fact well known to the 
reviewer because the author sent her a reprint 
of the article prior to the publication of the 
review. (iii) The reviewer states that “For 
economy's sake, a single unvarying dot-distri- 
bution pattern is used throughout (the color 
charts).” The more obvious reason for uni- 
formity of dot-distribution was the desire on 
the part of the author to prevent malingering 
through the recognition of a dot-arrangement 
Which is possible in the Stilling and Ishihara 
charts. The fact that the testing volume alone 
contains 60 charts, not to mention the 70 charts 
of the training volume, while the maximal num- 
ber of charts in the Ishihara book is only 32, 
should be sufficient evidence that the author was 
not restricted by the question of “economy.” 
(iv) To say that “five pairs [of color-combina- 
tions] are arbitrarily selected for practise" is 
a misstatement of a known fact. These color- 
combinations were not arbitrarily selected but 
only after careful study of the various color- 
combinations which proved difficult to the group 
of color-blind individuals used as controls dur- 
ing the preparation of these charts. The follow- 
ing quotation is from page 20 of the instruction 
booklet which accompanies the charts. “The 
selection of these color-combinations has been 
made because they are the ones that usually 
confuse patients with poor color perception,” 
With this explanation clearly expressed, it is 
also puzzling to read the reviewer's protest that 
"No hint is given in the instructions as to which 
color-combinations prove more difficult in typi- 
cal cases." (v) The reviewer says that “In spite 
of the easy start and cues afforded, the initial 
score is lower in this set [the training volume] 
than in the 60 of Dvorine’s first volume.” In 
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view of the author's statement in the instruc- 
tion booklet that the color-combinations in the 
training volume were limited to those colors 
which confused the majority of the color-blind, 
it should be obvious that the initial score would 
be lower in Volume ii than in Volume i, which 
includes a number of charts readily perceived 
by the color-blind. (vi) The Waverley Press of 
Baltimore will vouch for the following: The 
author was present in the press-room during 
the entire production of the charts and per- 
sonally tested the accuracy of each sample page 
as it came off the press, using a group of color- 
blind patients for this purpose. Seventeen thou- 
sand impressions were discarded and the pages 
were reprinted because upon drying they did 
not measure up to the standard set by the 
author. The vast amount of personal labor and 
expense that went into this project and the 
fact that the author designed, executed the draw- 
ings, and supervised the engraving of the 
plates, the mixing of the inks and the color 
printing is sufficient proof to those informed 
that it is a lack of scientific objectivity on the 
part of the reviewer to go out of the way in 
a critical review and to speak in terms of ‘finan- 
cial' gain without any knowledge of the facts 
and the personality involved. 

Am J Psychol 58:399-402 J1*45. Elsie Mur- 
ray. “A Reply to Dvorine’s Comments.” [A 
rejoinder to the reply by Israel Dvorine.] Let 
me reaffirm the accuracy of the first statement 
questioned above by Dvorine. No evidence to 
the contrary has been adduced. I am, however, 
uncertain whether it is the citizenship of Dr. 
Jensen which is disputed, or the availability of 
his test, or whether the difficulty lies in the term 
‘pseudo-isochromatic,’ possibly misconstrüed by 
the objector. * his tests for color-blindness were 
listed as early as 1936 in the catalogue of the 
Psychological Corporation * They are of the 
pseudo-isochromatic type. * this term refers to 
the use of ‘confusion hues’ in figure and ground ; 
i.e. hues from colored inks so chosen as to appear 
alike in certain typical cases of color deficiency. 
Substitution of the less ambiguous term 'vanish- 
ing-pattern' charts has lately been suggested by 
the writer * Of Dvorine's testing charts, only a 
limited number are pseudo-isochromatic. Jensen 
was, I believe, the pioneer in America. His 
scheme is unique, dispensing with digits entirely, 
and his directions for administration scientific 
and explicit. * In contesting the above point, 
Dvorine introduces some new errors. The Navy 


department did not issue the assemblage of Ishi- 
hara and Stilling reproductions now in use in 
military circles. The Beck Engraving Company 
produced them for the American Optical soci- 
ety, hence they are known as the A.O. Tests. 
This was in 1940, before Pearl Harbor and our 
entrance into the war. * (ii) As to Dvorine’s 
second objection, I discover nothing either in- 
exact or derogatory in my use of the term 
‘borrowed.’ * In scientific circles, the borrowing 
of terms and concepts is a matter of course. 
Without it, scientific progress would indeed be 
slow. Before launching into test-production, 
Dvorine might profitably have ‘borrowed’ some 
of the ideas of the British color-experts R. A. 
Houstoun and Mary Collins. (iii) The reason 
turned up by the designer for the adoption of 
a uniform dot-pattern throughout his bulky set 
of charts—the prevention of malingering—is 
interesting, though experience leads me to doubt 
its soundness. It does not, however, alter the 
fact that such uniformity reduces to a minimum 
the time, labor and ingenuity to be expended on 
a set. A score or more of Dvorine's charts could 
be evolved in the time required to produce one 
or two of Ishihara’s. (iv) If the designer has 
accumulated sufficient scientific data to support 
his choice of five pairs of colors for his ‘train- 
ing’ charts, why not bring them into the open? 
The loosely worded assertion quoted merely 
raises doubts as to the whole scheme, based 
apparently on guesswork from the start. What 
certainty is there that these five pairs tap all 
types of color deficiency ? Obviously the color- 
pairs used in the testing charts of D's first 
volume are chosen without reference to the 
known facts of color deficiency ; hence few of 
the charts are pseudo-isochromatic. If the term 
‘diagnostic’ is to be applied to the set, tables of 
percent success or failure in different types of 
deficiency should be appended to this volume. 
Correlation with results from other kinds of 
tests as well as a reference to current classifica- 
tion of cases should be made or some new divi- 
sion attempted. It is inconceivable that the con- 
testant means to set up his R-G, R-Br, R-Gy. 
G-O and B-V discriminations as each the pivotal 
feature of a distinct type of deficiency. A typical 
red-green weak individual may have difficulty 
with four) if not with all five, of them with 
proper modification of color-tone, while other 
distinctive confusions for other types are 
omitted. There is no recognition of the impor- 
tant if less frequent variant with shortening in 
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the red end of the spectrum or of blue-yellow 
deficiency. Dvorine's primary error of course 
lies in his choice of hue-combinations in his 
first volume, which is mechanically, not empiri- 
cally, determined. (v) This point can be satis- 
factorily discussed only with the book of charts 
in hand. It was brought up by the reviewer 
merely as part of the argument that the charts 
train—not in color-perception but—in discrim- 
ination of form. (Note the admission here by 
the ‘designer that the test charts are not all 
pseudo-isochromatic.) (vi) The amount of time 
and labor expended by the designer on the actual 
printing of his charts is no guarantee of the 
‘purity,’ suitability, or stability of the inks em- 
ployed. It appears, indeed, to this reviewer a 
regrettable waste, quite apart from the conten- 
tion that charts printed in a series of greys 
would probably have served as well, since con- 
tour and not color-discrimination is what actu- 
ally receives practice in this set. My criticism 
at this point is based on Practical experience, 
since in 1939 and earlier in the course of pro- 
jecting a practise-proof color vision test of the 
vanishing-pattern order, the complete process 
was thoroughly analyzed and discussed in all its 
aspects with artists, photographers, engravers, 
Paper manufacturers and color printers. The 
plan included trying out a preliminary series of 
ten or more charts with a hundred cases repre- 
senting different types and degrees of deficiency. 
The hues to be used were selected on the basis 
of ten years’ clinical experience and a careful 
study of existing tests. The scheme fell through 
only because the printing-ink company ap- 
proached failed to furnish inks of the required 
critical hues, of assured permanence, and of the 
desired natrow-range spectral reflectance: as 
tested by the Hardy Spectrophotometer. The 
conclusion was reached 
Nazis were still controlling the essential pig- 


field of ‘training charts’ is entered, we also sug- 
gest follow-up work on 25 to 50 of those helped 
‘over the bars,’ for one to three years, in order 
to ascertain whether the alleged ‘cure’ is perma- 
nent. Actually, an emphasis on backwardness 
in color-nomenclature as a cause of failure in 
pseudo-isochromatic tests is about a hundred 
years out of date. Finally, as to “financial gain, 
I carefully refrained from imputing any mer- 
cenary motive to the designer, in the absence 
of specific knowledge. The term ‘financial’ was 
used only in reference to the loss to the govern- 
ment or the ‘color graduate’ of months expended 
in military training for a type of work for which 
nature had not equipped the individual, 


J Appl Psychol 29:175 Ap "45. Miles. A. 
Tinker. These charts are of the pseudo-isochro- 
matic type. There are 60 perception and 70 
training plates. Tests for naming colors and 
for training in color naming are included. The 
colors employed are based on the subtractive 
color theory. The author erroneously lists the 
three primary colors as red, yellow and blue 
instead of red, green and blue. The perception 
test is not limited to investigation of red-green 
and blue-yellow deficiencies but yields informa- 
tion on perceiving both primary and secondary 
colors. The author considers that some individ- 
uals who are listed as color blind are only 
confused by certain color combinations. These, 
although classified as color blind, readily im- 
prove when given color perception training, It 
is claimed that these charts will avoid uncer- 
tainty and error in measurement of color per- 
ception. The only cassifications attempted 
from responses are: normal in naming and 
perceiving colors, correct naming with defec- 
tive perception, incorrect naming and normal 
perception, incorrect naming and defective per- 
ception. There is no attempt to designate a 
person as red-green color blind, etc. The train- 
ing charts "educate" or coach people to name 
colors and to pass pseudo-isochromatic color- 
blind tests such as for the Navy or the Air 
Corps. It would seem that the widespread use 
of these training tests would tend to nullify 
the usefulness of all pseudo-isochromatic tests 
of color vision, 
Q R Biol 20:300-1 S 45. Knight Dunlap. 

wo features of Dvorine’s volumes merit 
attention, The first feature is inclusion of mate- 
rials for testing the accuracy of naming of 
colors; and color naming is undoubtedly im- 
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portant. In highway traffic, for example, the 
light which is seen by drivers with "normal" 
color-vision as green, may be seen by a color- 
blind driver as a slightly greenish blue; if the 
driver, however, has learned to call the color 
"green" he knows that it means "go." Simi- 
larly, if the light which “normals” see as red 
is called “red” by the “color-blind” driver, al- 
though he may see it actually as orange, he 
knows the light means "stop," and is no dan- 
ger. A color-blind driver who names the lights 
as do persons of "normal" color-vision may 
even be a safer driver than are many “nor- 
mals.” There are two difficulties involved in 
Dvorine's charts for training color perception. 
One is the omission of the names by which the 
colors should be called. It is well known that 
persons of "normal" color-vision differ from 
one another in the names they apply to colors, 
especially to colors of low sauration. Hence 
the validity of the training depends upon the 
names applied by the particular trainer. The 
second difficulty is that a really color-blind per- 
son would be trained to call the colors by 
names of colors he does not actually see; this 
would confuse him and cause him trouble in 
traffic. The second important feature of 
Dvorine's presentation is the assumption that 
color-vision can be improved by training. We 
admit that color-naming can be improved by 
training; but that the fundamental features of 
so-called *color blindness" can be modified by 
training is merely an hypothesis. We know 
that a “color-blind” person can learn all the 
charts in a set; calling off the numbers cor- 
rectly, although remaining unable actually to 
see the numbers on many of the charts. One 
young man memorized all the charts in the A. 
O. C. set, and then admitted that there were 
many numbers which he could not see. After 
a course of treatment with Vitamin-A and 
cobra venom, his color perception improved 
greatly. That training may produce a slight 
improvement initially, due to adjustment to the 
test situation, assisted by better attention, is 
true; but beyond that point the usefulness of 
training in color perception is conjectural. 
However, Dvorine is to be commended for 
publishing the first set of charts by the use of 
which his theory can be experimentally tested. 
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Eames Eye Test. Grades 1 and over; 1938-40; in- 
dividual ; $4.40 per examiner's kit containing all needed 


materials except a stereoscope ; 75¢ per 25 record cards; 
Thomas H. Eames; World Book Co. 
REFERENCES 
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Farnsworth Dichotomous Test for Color Blind- 
ness: Panel D-15. Ages 6 and over ; 1947; indiyidual ; 
1 form; $22 per set of materials, manual, and 100 
analysis sheets; $2.50 per roo analysis sheets; (2-5) 
minutes; Dean Farnsworth; Psychological Corpora- 


tion. 
des REFERENCES 

1. FarnswortH, Dean. "The Farnsworth-Munsell roo-Hue 
and Dichotomous Tests for Color Vision." J Opt Soc Am 33: 
568-78 O AE (PA 183999) p 

2. Jupp, Deane B. "Standard ag Functions for Pro- 
tanopic and Deuteranopic Vision.” J Opt Soc Am 35:199-221 
Mr "45. * (PA 19:3284) v 

J Consult Psychol 11:339-40 N-D '47. For 
industrial, military, and other uses, a color blind- 
ness test is needed to detect the functionally 
color blind, without the excessive number of 
false positives obtained by the Ishihara and 
similar tests. In the Farnsworth test, the sub- 
ject arranges 15 small color samples "in order," 
beginning with a reference sample (blue). The 
score is either passing or failing. In a sample 
of 2146 males, 119 or 5.5 percent were detected 
as color blind. By means of a chart, some differ- 
entiation between protanopes (red blind), deu- 
teranopes (green blind), and tritanopes (blue 
blind) is possible. The test does not intend to 
select workers for excellence in color vision, 
as required for some specialized jobs, but only 


to detect the grossly deficient. 


[ 465 ] 
*Farnsworth-Munsell 100-Hue Test for Anoma- 
lous Color Vision. Mental ages 8 and over; 1943; 
individual; $55 per testing outfit including manual and 
25 score sheets; $3.50 per 100 score sheets; (10) min- 
utes; Dean Farnsworth; Munsell Color Co., Inc., 10 
East Franklin St, Baltimore 2, Md. 

REFERENCES 
1. FanNswonTH, Dean, "The Farnsworth-Munsell 100-Hue 
and Dichotomous Tests for Color Vision.” J Opt Soc Am 33: 
568-78 O "43. * (PA 18:399) 
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XHarris Tests of Lateral Dominance. Age 6 and 
over ; 1947 ; individual ; $2.90 per manual and 25 forms, 
postpaid ; $1.75 per 25 forms; the actual test materials, 
including the 4-B-C Vision Test for Ocular Domin- 
ance (see 459), must be purchased or assembled sepa- 
rately; (10-15) minutes; Albert J. Harris; Psycholog- 
ical Corporation. 


J Consult Psychol 11:224 Jl-Ag '47. This is 
a uniform series of tests of hand dominance 


1623-6 Je "39. d 
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—"and-of eye dominance that can be given rou- 
tihely in the examination of reading disability” 
cases, and of others in whom lateral dominance 
may be a significant factor. The battery con- « 
sists of seven tests of manual dominance 
(knowledge of right and left, hand preferences, 

simultaneous writing, handwriting, tapping, 
dealing cards, and optionally, a dynamometer 
test), three tests of ocular dominance (monoc- 
sular: sighting, binocular tests, and optional 
stereoscopic tests), and one test of foot domi- 
à ce (kicking). The tests are clinical rather 
than quantitative, since the results are expressed 
as a rating on a five-point scale from strongly 
right to strongly left, and no overall numerical 
score is obtained. No data are given concerning 
senapility or standardization. Required ma- 

„terials»include a record blank, the Miles ABC 

Vision Test, and a number of easily obtainable 

toy objects. 


[ 467 ] 
Keystone Telebinocular. Ages § or 6 or 7 and over; 
1930-47 ; three 1947 models of the instrument are avail- 
able; according to the Keystone View Company, these 
< -three models are entirely new instruments and are not 
merely revisions of the 1938 models; (34-5) minutes; 
Keystone View Co. 


placement; $495 for the Service (sold outright, not a 


rental), 
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F. Nowell Jones, Associate Professor of 
Psychology, The State College of Washington, 
Pullman, Washington. [Review of the 1938 
industrial model.] This instrument is the old- 
est of the three leading industrial vision stereo- 
Scopes (the others are the Ortho-Rater and the 
AO Sight-Screener). As compared to these 
others, it is more simple in design and provides 
somewhat less well controlled conditions for 
testing, since the test figures are not surrounded 
by a screen to eliminate extraneous stimulation. 

The Keystone Telebinocular provides tests 
of acuity at both near and far points, of lateral 
muscular balance, of vertical muscular balance, 
of stereopsis, and of color vision. The test 
figures for color vision are particularly exten- 
sive and include blue and yellow as well as red 
and green. 

The reliabilities of the different tests are not 
included in the instruction booklet and are not 
known to this reviewer. Since the test cards 
are open to the general illumination of the 
room, special care would be advisable in regu- 
lating external illumination and insuring its 
constancy. As far as validity is concerned, if a 
clinical test is to be considered as the criterion, 
the Telebinocular fared less well than the 
Ortho-Rater in a study of lateral phoria. 

Some industrial studies have shown the 
Keystone Telebinocular to be useful as a 
screening device, although completely adequate 
procedure from the scientific point of view is 
not the rule, Other studies have shown some 


: y available 
.than either of its competitors bitdbrob bly less 
precise. In common with all tes designed for 
use in industry, only careful study\ef ead ol 


; ihr 
under consideration can determine its useful- . 
ness. 2 


J Am Med Assn 123:558 O 30743. The Joint- 
Committee on Industrial Ophthalmology has 
participated in careful analysis of visual records 
made in industry employing the Keystone Tele- 
binocular. These records have been correlated 
statistically with subsequent job performance, 
incidence of accidents and with conventional 
professional aptitude tests. Evidence at present 
permits the following appraisal of the industrial 
use of the Telebinocular: (1) The Telebinocu- 


lar is a sturdy, well-made instrument, originally - 3 & 


designed as a hand stereoscope. In 1933 it was 
modified by attaching a special shaft and slide 
holder, mounted on a stand. The slides are 
modifications of the original Ready to Read 
Tests designed by E. A. Betts to measure the 
visual performance of school children. (2) The 
record forms designed to classify employees 
for jobs in accordance with visual ability are - 
based on arbitrary standards not justified by 
actual experiment on thousands of industrial 
employees. A simplified form has been devel- 
oped by a member of the Joint Committee which 
has proved to be more practical. The Keystone 
View Company is also working on a simplified 
record form. (3) The Telebinocular visual 
acuity tests at distance have, through the work 
of the Joint Committee, been correlated with 
Snellen equivalents. (4) The depth perception 
test, while not accurately graduated, can be 
used to advantage in screening out examinees 
in the lowest quarter of performance who are ~ 
under consideration for special or dangerous 
assignments. (5) The phoria tests will identify 
most examinees with muscle imbalance lying 
outside the general range of normality. These 
measurements cannot be expressed in prism 
diopter equivalents. (6) The telebinocular has 
no adequate near point test of acuity in its 
series. Because of the importance of near point 
acuity in many jobs, special provisions will 
need to be made. (7) The color vision test, 
while inadequate as a whole, is useful in identi- 
fying certain types of color deficiency. (8) Sev- ` 
eral of the tests show little or no relationship 
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to success on the job or with standard clinical 
practice in ophthalmology and are likely to cre- 
ate an impression of significant visual defi- 
ciency. Actually, the tests detect what might 
better be called substandard visual function. The 
Joint Committee believes that the Telebinocular, 
used wisely by a properly trained tester, will 
uncover subnormal visual performance not de- 
tected by the common industrial practice of 
testing for central visual acuity alone. However, 
the tests employed for determining muscle bal- 
ance, stereopsis and color sense have definite 
limitations, The existing standards for accept- 
ance or rejection of examinees in industry or 
for the visual classification of employees are 
not accurate enough for dependable use. In- 
stead, individual standards should be created in 
each industry based on available data and the 
practical experience of associated medical and 
ophthalmologic staffs. 


For reviews by I. H. Anderson, David 
Kopel, Marion Monroe, and Guy Wagner of 
the Betts Ready to Read Tests, see 38:1097. 
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* Massachusetts Vision Test. Grades kgn-12; 1942; 
individual; $98.50 per vision test kit; (1-5) minutes; 
developed by the Massachusetts Department of Public 
Health; Welch Allyn, Inc. 


d REFERENCES 
1. SLOANE, ALBERT E. "Massachusetts Vision Test: An Im- 
proved Method of Testing Eyes of School Children.” Areh 
TIR 24:924-39 RR #531235) | 

2. OAK, "The Massachusetts Vision Test: - 
proved Method for School Vision "Testing." Am J Pub Health 


: ild ‘Fails’ 
School Vision Test.” El Sch J 437478 8a Ap A SC 


J Am Med Assn 122:37 My 1 "43. Council 
on Physical Therapy. * The Massachusetts 
Vision Test is a method of school vision test- 
ing which provides for the Screening of chil- 
dren to determine the need of detailed ocular 
examination and possible treatment. A notable 
feature of the test lies in the fact that it can 
be applied to young children. Five and 6 year 
old children needing ocular attention can be 
detected when the test series is applied by those 
skilled in the handling of young children. 
Errors which occur in the testing by nonpro- 
fessional persons are said to result in more 
children being referred than actually require 
correction by the specialist. The test series is 


composed of three parts: Part One screens out 
cases of low visual acuity. The charts are con- 
structed to minimize the possibilty of memoriz- 
ing. Illumination of the charts is standardized. 
The Snellen illiterate E is employed through- 
out because of the variation in difficulty be- 
tween different letters and different numbers. 
Part Two detects those who may pass the vis- 
ual acuity tests but who have a hyperopic error 
which is latent and may require correction. 
Part Three indicates cases in which there is 
difficulty in using the two eyes together. * The 
arrangement of the distance vision chart ex- 
pedites rapid and reasonably accurate elimina- 
tion of eyes with approximately normal vision. 
* The Council reached the following conclu- 
sion concerning the apparatus: (1) It is a rela- 
tively simple and complete test, especially 
adapted for school children of the lower grades. 
Distance vision with elimination of lower hy- 
permetropias and myopias as well as muscle 
balance are roughly measured. (2) It is pur- 
chasable as a single unit, which increases its 
desirability for schools and factories. (3) In 
this test the same distances are customarily 
used for measuring vision and muscular anom- 
alies which are employed in office and hospital 
practice. Thus it differs from stereoscopic 
tests. (4) The instrument is apparently a pio- 
neer in grouping several fundamentals into a 
single unit, adapted for mass testing of visual 
function by nonmedical examiners. The Coun- 
cil on Physical Therapy voted to accept the 
Massachusetts Vision Test for inclusion on its 
list of accepted devices. 


469 
*New Test for the esM of Colorblindness. 
1941; for a short form of this test, see Test for Color 
Blindness (see 474) ; individual; $4; P. B. Wiltberger; 
College Book Co. 

J Am Med Assn 118:1168 Mr 28 *42. The au- 
thor believes that the Ishihara test, now in com- 
mon use for color blindness, is mainly a test of 
color "intensity" and cannot be entirely relied 
on. He presents an Objective test consisting of 
a set of plates made up of nonfading color 

chips" of high chroma, high value and accu- 
tate hue. The subject need not know the name 
of a single color. The test is based on the fact 
that when one gazes at a given color for an 
appreciable length of time and then shifts the 
gaze to a blank sheet of white or neutral gray 
paper the complementary color will appear on 
the paper as an “after-image” and will be of 
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the same size and shape. The tests can be easily 
made but will fail in a certain percentage of 
cases because of lack of concentration on the 
part of the subject tested. The price of the 
book seems entirely out of line with the con- 
tents. 

Q R Biol 17:94 Mr ’42. Many people who 
are classified as color blind by the usual tests 
are not truly color blind but have varying de- 
grees of weakness in color perception. The 
writer of this book objects to the use of such 
tests for this and other reasons. He recom- 
mends the use of color "chips," or rectangles, 
on which the subject fixes for a short period 
of time, Upon shifting the fixation point the 
normal person (or a person who is only color 
weak) sees the complementary color after- 
image, whereas the totally color blind person 
sees a white rectangle. The book contains a 
brief description of the physiology of color 
perception, directions for administration of the 
test, and several pages of color "chips." For 
some purposes this method may be useful, but 
in general people who administer tests of color 
perception are interested in detecting color 
weak individuals as well as those who are 
totally color blind. 


[470 ] 

Ophthalmograph. Grades 1-16 and adults; 1936-40; 
a portable camera, using 35 mm. standard motion pic- 
ture film, for photographing eye movements while the 
examinee reads? developed films may be obtained 
within 8 minutes; 110 A.C., 60 cycles ; $325 per instru- 
ment (No. 1522) ; 15¢ per 25 record cards; developed 
by the Bureau of Visual Science of the American Op- 
tical Company and the Research Division of the East- 
man Kodak Company ; American Optical Co. 
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*Ortho-Rater. Ages 8 and over; 1943; the instrument 
is available only as a part of the Bausch & Lomb 
Industrial Vision Service; a limited number of instru- 
ments may be purchased by psychological laboratories 
in colleges and universities for $365 ($90 additional for 
tuition, board, and room at a 5-day seminar covering 
instruction in the use of the instrument, methods of 
setting visual standards for various jobs, and relation- 
ships between the functions of psychologists and pro- 
fessional eye men) ; (6-8) minutes; Bausch & Lomb 

Optical Co. 
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Otol 48:194-6 Ja-F '44. 19:2732 

8. EC) J. HA rhe Visual Skills of Precision Instru- 
ment Assemblers.” J Consult Psychol 9:165-70 Jl-Ag '45. * (PA 


N, Franx, “Industrial Safety and Visual Func- 
O '4s. * (PA 20:549) 
. “A Statistical Study of Visual Func- 
err and rai Safety.” J Appl Psychol 29:467-70 D '45. * 
20:12. 
s 10. SERIEM Joun H.; FamwswonrH, Dean; Coox, ELLS- 
wortH B.; Barrett, NEIL R.; AND KrNpnzp, Mary I. Compari- 
son of Various Screening Devices With Standard Medical Vis- 
ual Procedure. Progress Report No. 1 on Bureau of Medicine 
and Surgery Research Project No. X-493 (Avar p) New 
London, Conn.: Medical Research Department, U. S. Subma- 
nae Base, November 22, 1945. Pp. iv, 50. Paper, mimeographed. 
ratis. 
10a. TrzrIN, Joser, “Match the Eye to the Job: Industrial 
Vision Institute Tests Eyesight Values and Skills.” Safety Eng 


Visual 


E. "Determinin 


170-4 Mr-Ap '46. * 
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13, Davis, C. Janz. “Correlation Between Scores on Ortho- 
Rater ‘Tests and Clinical Tests." J Appl Psychol 30:596-603 D 

3°46. * (PA 21:1400) REM 

14. EMMEs, ArTHUR B. “Predictive Significance of Ortho- 
Rater Tests." 4m J Optom 23:3-15 Ja '46. * (PA 20:2636) 

15. KzPHART, NEWELL C. "An Analysis of Professional Eye 
Care and Industrial Efficiency.” Trans Am Acad Ophthal & 
Otol 50:166-70 Mr-Ap '46. * 

16. Luecx, Irving B. “Vision in Industry." Am J Ophthal 
29:63-72 Ja '46. * (PA 2021649) JS 

17. Stump, N. Frank. “How Inefficient Vision Causes In- 
SUMA eee Optom Wkly 37:915-9+ Jl 4 '46. * (PA 
20:430. 

18: Stuur, N, Franx, “How Ophthalmic Bye-Care Can Pre- 
Tae rident" Trans Am Acad Ophthal & Otol 50:219- 
25 Ji. A 

P TOME, N. Frank, “How to Increase Productivity 
Through Visual Tests." Ind Relations 4:28-31 O d^ d y 

18b. Stump, N. Franx. “A New Concept of Ophthalmic 
Service." Bausch & Lomb Mag 22:4-6+ [My '46]. * 

19. Stump, N. Franx. “Vision Tests Predict Worker Capa- 
bility.” Factory Mgmt 104:121-4 F '46. * 

20. SULZMAN, Joun H.; Coox, ELLswortH B.; anp BART- 
LETT, Nei R. Part 2, “Comparison of the Maddox Rod Test for 
Lateral and Vertical Phoria With Phoria Tests Incorporated in 
Three Visual Screening Devices (Keystone ‘Telebinocular,’ 
American Optical ‘Sight Screener,’ and Bausch & Lomb ‘Ortho- 
Rater,’ " pp. 16-36. In their Comparative Study of Measures of 
Heterophoria, Progress Report . som Bureau of Medicine 
and Surgery Research Project No. 43 (Av-263-p), “Com- 
parison of Various Screening Devices With Standard Medical 

rocedures." New London, Conn,: Medical Research Depart- 
ment, U. S. Submarine Base, February 22, 1946. Pp. iii, 39. 
Paper, mimeographed. Gratis. * 

21. SULZMAN, Joun H.; Coox, ELLswortH B.; Anp BART- 
LETT, New R. Visual aay Measurements With Three Com- 
mercial Screening Devices. Progress Report No. 2 on Bureau of 
Medicine and Surgery Research Project X-493 (Av-263-p) 
“Comparison of Various. Screening Devices With Standar 
Medical Procedures." New London, Conn.: Medical Research 
Department, U. S. Submarine Base, February 7, 1946. Pp. xii, 
43. Paper, mimeographed. Gratis. * 

22, TIFFIN, JosePu, anp Lona, W. F. “Visual Standards and 
Joh Requirements.” 4m J Optom 23:463-76 N '46. * (PA 21: 
1620 

23. Wirt, S, Encar. “Eye Care Increases Earnings of Elec- 
tric Poe Optom Whiy 37:1091-3-- Ag 8 '46. T (PA 20: 
4323, title o; 

PAIS: ©’ Encar, “Statistical Laboratory for Vision Tests 
4t vea miversity." J Appl Psychol 30:354-8 Ag '46. * (PA 

24a. Woopaury, Davi O. “Better Eyes for I. v 2 
m 48:159-62 a? 6. * j^ ndustrq Peck 

24b. “Modified Stereoscope Measures Visual Skills." Mech 
IUS Me . 

, 24€. Brown: » P. “Vision Fact Finding Discloses Rela- 
PUMP f Visual Capacity and Job Dern e Safety Eng 

H 47. 

^ 5 [TORIA RicHARD, AND Wirt, S, Encar. “Visual Acuity 

in Relation to Illumination in the Ortho-Rater.” J Appl Psychol 
DRE Ag 470 GERE AD) 

26, Kepnart, N, C. he Importance of Phoria re- 
ments in Industrial Vision." Optom Wkly 38:45:50 Tae 

s1614, 

26a, KrrHART, NEwELL C. “Visual Skil T; 
UNE dt me Age CAAA rora LAE Us 

260, STUMP, N. Franx. “Values of Professionally Pr 
Eyewear Shown Through Improvement of Visual Hn 
Scores.” Optical J & R Optom 84:40-14 Jl 15°47. * 

26c. Stump, N. Frank. “Visual Functions and Industrial 
Management With References to Production and Safety." Mod 
M 62 ADAE * (PA 21:3256) A 

» » N. FRANK, AND Carter, DonoTHEA. “Vi: - 

formances by Ophthalmic Services.” Optom Wily yernya ae, 


VAL Soudan J Es Coor, E. B. awo B 

8, S AN, J. H.; Coox, E. B.; ann Bartterr, N. R. “ 

Reliability of Visual Acuity Scores Yielded by Three Commit. 

cial Devices.” J Appl Psychol 31 3236-240 Je'47. * (PA 21:3872) 

, 29. „TIFFIN, Joserm. Chap. 7, “ suak Sins and Vision 

A pp. 18; E In his Industrial Psychology, Second Edi- 

tion. New York: Prentice-Hall, Inc., 1947. Pp. xxi, 553. Trade 

edition, $5.35; edition, $4.00. * (B4 21:2442 

ie fat Sos focoupatiónal Vision.” Optom Wkly 
H " -8, 1224-7, 1265-7, 1299-1302, 2 

31, Ag 7, 14, 21, 28, S 4 ^47. * (PA 22:2374 TET 


Henry A. Imus, Head, Psychophysiology 
Branch, Medical Sciences Division, Office of 
Naval Research, Washington, D. C: An indus- 
trial visual classification device, the Ortho- 
Rater, has been developed by means of a co- 
operative research program conducted by the 


Division of Education and Applied Psychology 
at Purdue University and the Bausch & Lomb 
Optical Company. Although defective vision is 
revealed by this device, its primary purpose is to 
aid in selecting and placing personnel in indus- 
trial jobs. Twelve tests of near and far vision 
are administered in approximately six minutes 
of testing time per subject. These tests include 
acuity, eye muscle balance, stereopsis, and color 
perception. 

A number of test-retest studies with indus- 
trial, college, and military populations have 
shown that the various Ortho-Rater tests 
sometimes more reliable than the correspond- 
ing clinical tests. For visual acuity at distance 
and lateral phoria at near, however, the clini- 
cal tests are more reliable. The coefficients of 
reliability range from .62 to .9o for the various 
Ortho-Rater tests. The agreement between the 
Ortho-Rater and clinical methods of measuring 
visual acuity is satisfactory, the coefficients 
ranging from .83 to .go. In the measurement 
of lateral phoria the agreement is not as clo 
the coefficients of correlation being .70 for di 
tant and .77 for near vision. For vert 
phoria, the agreement is .50 for both distant 
and near vision. The reliability coefficients of 
both the Ortho-Rater and clinical tests of ver- 
tical phoria are significantly lower than those 
for lateral phoria, which attenuates the rela- 
tionship between the two methods. 

The test of stereopsis in the Ortho-Rater is 
fairly reliable (r —.83) and agrees with the 
less reliable (r= .62) Howard-Dolman Test 
of Depth Perception to the extent of r = .60. 
The test of color perception contains only six 
items, and most subjects are able to discrimi- 
nate four of these. 

Various industries which have used this sys- 
tem of visual screening for job placement re- 
port increases in production, efficiency, and 
employee earnings. The corollary use of the 
instrument in detecting visual defects has re- 
sulted in increased efficiency in production and 
in a reduction of accidents when proper glasses 
or corrective procedure are furnished. An im- 
partial comparison of the Ortho-Rater, AO 
Sight-Screener and the Keystone Telebinocu- 
lar shows that the first two are approximately 
equal in reliability and validity, with a slight 
preference for the Ortho-Rater, and that both 
tests are better in both reliability and validity 
than the Telebinocular. 


are 


5€ 
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F. Nowell Jones, Associate Professor of 
Psychology, The State College of Washington, 
Pullman, Washington. This is a self-contained 
instrument designed to measure certain aspects 
of visual efficiency. These aspects are acuity at 
both far (26 feet) and near (13 inches) posi- 
tions; the postural balance of the eyes, both 
horizontally and vertically, at the two distances ; 
stereopsis at the far position; and color per- 
ception for red and green. As the manufacturer 
emphasizes, these tests are not intended as 
clinical diagnostic instruments but are designed 
to measure visual performance without regard 
to the underlying optical conditions. 

Users of the Ortho-Rater must purchase the 
Bausch and Lomb Industrial Vision Service. 
This includes the training of the test techni- 
cian and the consulting facilities of a central- 
ized statistical bureau located in the Depart- 
ment of Psychology at Purdue University. 
This latter point is most useful for the smaller 
plant which does not hire an industrial psy- 
chologist. 

The reliabilities of the various measures, 
with one exception, do not seem to have been 
published. The exception is that for lateral 
phoria, where test-retest reliability (using two 
slightly different forms of the Ortho-Rater) 
ranges from .84 to .87. There is no reason to 
believe that reliabilities would be unduly low, 
although the AO Sight-Screener might. be 
more reliable on the acuity scales because of 
more test forms at each level of difficulty. 

The validity of the instrument, using clinical 
tests as criteria, has been tested for lateral 
phoria. The r’s were .76 to .84 when the Steven- 
son test was the criterion. The Ortho-Rater 
fared better in this experiment than did the 
Keystone Telebinocular. This is probably due 
to the better screening of the Ortho-Rater from 
extraneous visual stimulation. 

A fairly large number of studies have been 
published which show the usefulness of the 
Ortho-Rater in selecting employees for various 
occupations and in reducing accidents. Prob- 
ably the greatest contribution of the Ortho- 
Rater has been to point up the rather low inter- 
correlations among various visual measures, 
with the resulting realization that the usual 
Snellen Chart screening procedure at 20 feet 
may not only be useless but actually mislead- 
ing. Unfortunately, however, most of the pub- 
lished studies report only a posteriori results ; 
that is, they show what would have happened 


if the group under consideration had been:se- 

lected by means of the Ortho-Rater rather than . 
what actually happens to a group whose per- 

formance or accident records are predicted 

before they start to work. Also, the signifi- 

cances of group differences are very seldom 

reported. 

There is no doubt that the Ortho-Rater adds 
to the test batteries for many occupations a 
more or less unique variable and is therefore 
useful for purposes of selection and place- 
ment. As compared to its competitors, the 
Ortho-Rater is more widely available than the 
AO Sight-Screener and probably not less effec- 
tive; it is more expensive and less readily 
available than the Telebinocular, but undoubt- 
edly a more accurate instrument. 

[472 ] 
*Oseretsky Tests of Motor Proficiency: A Trans- 
lation from the Portuguese Adaptation. Ages 4-16; 
1946; originally published in Russian in 1923; Portu- 
guese adaptation by Maria Irene Leite da Costa was 
published in 1943; individual; $1 per manual; $1.25 per 
25 record sheets; nontimed (20-30) minutes; English 
translation by Elizabeth Joan Fosa; sponsored and 
edited by Edgar A. Doll ; original test by N. Oseretsky ; 
Educational Test Bureau. 
REFERENCES 

1. DA Costa, M. I. Lerre. "The Ozeretzky Tests: Method, 
Value and Results (Portuguese T EGG UM "Translated by 
eats Joan Fosa. Training Sch B 43:1-13 Mr '46, * (PA 
20:2920, 

. LU s Tests, Part IL" 
Transition by Elisabet Joan Fosa. Training Sch D. agia7-38 
Ap "46. * (P. 20:3332) 

» DA Costa, MARIA Irene Lire. “The Ozeretzky Tests, Part 
111.” Translated by Elizabeth Joan Fosa. Training Sch B 43: 
50-9 My '46. * (PA 20:3333) 

4. DA Costa, MARIA IRENE Leite, "The Ozeretzky Tests, Part 
IV.” Translated by Elizabeth Joan Fosa. Training Sch B 43:62- 


"46. * (PA 20: 
Don, AY x "dye Ozerteky Scale.” Letter, Am J 
t 1485- a 
é Korr, fads Tbsyehosomatie Study of Fifty Stuttering 
Children: Round Table: II, Ozeretsky Tests,” Am J Orthopsy- 
chiatry 16:114-9 Ja '46. * (PA 20:2523) 


J Consult Psychol 11:157 My-Je '47. * The 
tests are organized by age levels from 4 to 16, 
with 11-12, 13-14, and 15-16 combined as two- 
year levels. At each stage there are 6 tests, 
which measure, respectively, general static’ 
coordination, dynamic coordination of hands, 
general dynamic coordination, motor speed, 
simultaneous voluntary movements, and syn- 
kinesia (associated involuntary movements). * 
Ingenious small drawings clarify the nature of 
the tasks, but some of the directions are de- 
fective and will have to be revised by further 
studies, * In their present form, no claims are 
made that the Oseretsky test norms are appro- 
priate for American children, or that the test is 
a finished clinical instrument. It is, however, a 
challenging basis for research in an important 
and hitherto neglected area. 
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[ 473 ] 
XPseudo-Isochromatic Plates for Testing Color 
Vision. Ages 7 and over; 1940; a 46-plate test de- 
signed to duplicate the most useful of the Ishihara and 
Stilling plates ; individual n to per testing outfit; (2-3) 


minutes; American Opti 
REFERENCES 

1, Murray, Erste. “ ‘Color Blindness’: Current Tests and 

Ms Charting of Cases," Psychol B 39:165-72 Mr '42. * 
16:2591 

a Gactaoiter, E ROSWELL; GALLAGHER, CONSTANCE D.; AND 
Stoane, ALazxr E. “A Brief Method of Testing Color Vision 
With Pseudo-Isochromatic Plates." Am J Ophthal 26:178-81 F 


"43. * (PA 17:18; 
EAE 5 BERT E. "The Pseudoisochromatic Plate 


3. SHORMA! 
Test of Color Vision: Practical Application." Arch Ophthal 


2909-18 Je '43. * (PA 17: 
eRe fons fL “Conparioon of the Ishihara and the 
í udoisochromati 


American Optical Company Series of Pse: ic 
Plates.” Arch DIR 1:163-4 F '44. * (PA 18:2379) 
5. Harpy, LeGranp H.; Rano, GERTRUDE; AND. RITTLER, 


M. 
Caturrine.’ "Tests for the Detection and Analysis of Color- 
Blindness: II, The Ishihara Test: Comparison of Editions.” 
J Opt Soc Am 35:351-6 My '45. * (PA 20:60) 
6. StoAN, Louise L. “An Improved Screening Test for Red- 
Green Color Deficienc; Composed of Available Pseudo-Isochro- 
matic Plates." J Opt Soc Am 35:761-6 D '45. * (PA 20:1818) 


7. Fostex, Harriet., “A Comparative vé of Three Tests 
for ind Vision." J Appl Psychol 30:135:43 Ap '46. * (PA 20: 
302 


8. Harpy, LeGnaND H.; RAND, GrarRUDE; AND RrrrLER, M. 
Catusnine, “A Screening Test for Defective Red-Green Vi- 
sion." J Opt Soc Am 36:610-4 O '46. * (PA 21:1020) 
LrGzAND; RAND, TRUDE; AND RirTLER, M. 
Test for esie Red-Green Vi- 


Based on Ei Pseud, romatic Plates From 
» American Opal Company's Compilation.” Arch Ophthal 

J Am Med Assn 116:86 Ja 4°41. This con- 
sists of a paneled folder containing forty-six 
charts of the general type devised by Ishihara 
but presenting many improvements, evident 
only after a comparative test on the same sub- 
jects. The panels are not numbered and are so 
arranged that any desired pair may be exhib- 
ited alone and in any order. These provisions 
will facilitate the detection of malingerers. In- 
serted in a pocket in the folder, which is of 
substantial construction, is a pamphlet giving 
a brief discussion of color blindness, descrip- 
tion of types, and instructions for making the 
tests and interpreting the results. There is also 
a brief bibliography and a tabular key for in- 
terpreting each plate. Most of the plates are 
applicable only to red-green defects, but there 
are several especially designed for the detec- 
tion of blue-yellow defects. On the whole the 
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equipment makes possible a somewhat more 
comprehensive examination but the technic is 
greatly simplified as compared to some of the 
other devices that have been used. 


[ 474 ] 
XTest of Color Blindness. 1944; a short form of 
New Test for the Detection of Color Blindness (see 
409) ; individual; $1; P. B. Wiltberger; College Book 
jo. 


[45] 


*Western Electric Audiometer. Grades 1 and over; 
1924-47; models 1A and 2A (pure tone instruments) 
and 3A and sA (warble tone instruments) are no 
longer available; manufactured by Western Electric 
Co.; distributed in the United States by the Graybar 
Electric Co. through local dealers; distributed in 
Canada by the Northern Electric Co.; distributed 
throughout the rest of the world by the Westrex Cor- 
poration. 

à) AUDIOMETER 4CA. Grades 1 and over ; 1926-47 ; super- 
sedes models 4A, 4B, and 4C ; individual in grades 1 
40 persons may be tested at one time in grades € 
over; $585 (suggested list price) per audiometer with 
4 receiver trays and a receiver tray case. 

b) AUDIOMETER 6BP. Grades 1 and over; 1936-47 ; su- 
persedes model 6A ; individual ; $325.80 (suggested list 
price) per audiometer equip ed with an air conducticn 
receiver; $375 (suggested list price) per audiometer 
arranged for both air and bone testing. 


REFERENCES 
MA vue. Haroto M. “An Audiometric Test for Young 
Children." Child Develop 2:237-41 D '31. * (PA 6:1987) 
a. Ciocco, Antonio. "Audiometric B 
dren: I, The Consistency and Significance of Tests Made With 
udiometer.”” 


tudies on School Chil- 


a yA Audi Pub Health Rep $1:1402-16 O 9 '36. * (PA 

1153090, title only; A 
3 co, ANTONIO; “Audiometric Studies on School Chil- 

d of Audiometric Curves." Pub Health Rep si: 


ren: IL, Types 
1609-21 N 20°36. * (PA 11:5494, title o 
ency 


1936. Pp. 162. Paper. $1.50. (PA 11:3609) 

$. Ciocco, Antonio. “Audiometric Studies of School, Chil- 
"ren: IIl, Variations in the Auditory Acuity of 543, School 
Children Re-Examined After an Average Interval of Three 
Mar Ann Otol Rhinol Laryn 46:55-68 Mr '37. * (PA 12: 


6. SrtvexMAN, S. Rrewarv. “Inadequacies of Current Hear. 
ing Tests for Schoot Children.” J Estep Child 6:225 0 '39.° 
7. Bunce, C. C. "Usab m i 
THE y poche le Hearing." Ann Otol 49:359-67 Je 
8. NIO, AND Parmer, Carrot E. The Hearing 
hildren: A Statistical Study of Audiometric and 
Clinical Records. Monographs of the Society for Research in 
acd Development Vol: 6 No. 3, Serial No. 29. Washington, 
Toe (ESO DER nal Research Council, 1941. Pp. v, 77. 
in CNA ead Guanes H. TM Auditory ,Behavior Rather 
86:1-6 Janar e (PA AAS ae on of Age." Am Ann Dea 
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READING 


Reviews mv Irving H. Anderson, Robert Murray Bear, Paul Blommers, Ivan A. Booker, 
Marion Monroe Cox, Frederick B. Davis, Robert A. Davis, Ethel M. Feagley, William S. Gray, 


W. E. Hall, Albert J. Harris, James R. Hobson, Dorothy E. Holberg, 
Lowes, James M. McCallister, Robert L. McCaul, Constance M. 


lice N. Jameson, Ruth 
cCullough, Margaret G. 


McKim, Eason Monroe, C. Robert Pace, Holland Roberts, David H. Russell, Douglas E. 
Scates, Virginia Da OEE Spache, Herbert F. Spitzer, J. B. Stroud, Edith I. M. Thomson, 
LU 


Robert H. Thouless, 


les A. Tinker, T. L. Torgerson, Francis Oralind Triggs, William W. 
Turnbull, Carolyn M. Welch, Paul 


. Witty, J. Wayne Wrightstone. 


[ 476 ] 
*Achievement Tests in Silent Reading: Domin- 
ion Tests. Grades 1, 2, 2-3, 4-6; 1941-46; 65¢ per 25 
of any one test; specimen set must be purchased to 
obtain the manual for a given test; prepared by the 
Department of Educational Research, Ontario College 
of Education, University of Toronto; Vocational 
Guidance Centre. 
a) PRIMARY: TYPE I, WORD RECOGNITION. 1941; Forms 
A, B ; 30€ per specimen set; 10(20) minutes. 
b) PRIMARY: TYPE 2, PHRASE AND SENTENCE READING. 
1941; Forms A, B; 30 per specimen set; 20(30) min- 
utes. 
C) PRIMARY: TYPE 3, PARAGRAPH READING. 1043-45; I 
form; 30f per specimen set; nontimed (35) minutes. 
d) PRIMARY : TYPE 4, DIAGNOSTIC TEST IN WORD RECOGNI- 
TION. 1943-45; 1 form; 30¢ per specimen set; non- 
timed (35) minutes. 
€) ADVANCED PRIMARY: TYPE.I, VOCABULARY. Grades 
2-3; 1043-46; Forms A, B; 30 per specimen set; 20 
(30) minutes. 
f) ADVANCED PRIMARY: TYPE 2, DIAGNOSTIC TEST IN 
PARAGRAPH MEANING. Grade 2; 1943-46; Forms A, B ; 
30¢ per specimen set; nontimed (40) minutes. 
g) JUNIOR: TYPE I, VOCABULARY. Grades 4-6; 1943-46 ; 
Forms A, B; 30¢ per specimen set; 20(30) minutes. 


Margaret G. McKim, Assistant Professor 
of Education, University of Cincinnati, Cin- 
cinnati, Ohio, These tests are the first in a 
series designed for use in Canadian schools. 
Norms are based on the performance of rural 
and urban children in selected Canadian schools. 

The first-grade tests are similar to the three 
subtests in the Gates Primary Reading Test. 
The phrase and sentence reading test consists 
of fifty items, each containing four phrases or 
sentences, of which the child selects the one 
matching an accompanying picture. The para- 
graph reading test provides 28 paragraphs, 
each accompanied by four pictures, one of 
which correctly illustrates it. This test seems 
well adapted to the measurement of paragraph 
comprehension in the primary grades. The for- 
mat of the first-grade tests is excellent. The 
pictures are unusually clear. The directions and 
the means of indicating the correct answer are 
simple. 


Reliability coefficients for comparable forms 
of Types I and II of the primary tests are 
.848 and .909 respectively. Internal estimates 
of reliability, based on analyses of variance for 
Types III and IV, for which there are no 
comparable forms, are .957 and .950. Correla- 
tions between Types I and II and the equiva- 
lent types of the Gates Primary Reading Test 
were .895 and .884. As the manual of the 
Dominion Tests indicates, these two types 
could be considered equivalent forms of the 
corresponding Gates tests or vice versa. 

The Diagnostic Test in Word Recognition 
is a significant venture, Similar in form to the 
Word Recognition test, it is so constructed that 
errors can be classified as related to the be- 
ginning, middle, or ending of the word, as 
reversals, or as choices based on general con- 
figuration. A table shows the typical error pat- 
tern for a given number of mistakes and the 
points below which errors might be considered 
either significant or abnormal. No information 
is provided as to how these error levels were 
determined. Furthermore, the proportion of 
typical errors in each category shows practi- 
cally no change as the number of total errors 
decreases. Inasmuch as teaching emphasis in 
the first stages of word analysis tends to be 
upon left-to-right orientation and ability to re- 
spond to the beginnings of words, one would 
expect the proportion of errors in these cate- 
gories and those related to the general con- 
figuration of the word to decrease as reading 
ability improves. If the children who took this 
test did not give evidence of this growth, it is 
perhaps because it is geared to a stage before 
that in which there is much emphasis on word 
analysis and is therefore not correctly graded 
to be of greatest service to teachers. Further 
research seems indicated before teachers are 
encouraged to rely upon the error scores. 
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The vocabulary tests at the two upper levels 


use a key word and four responses. Reliabili-- 


ties, based on comparable forms, are .927 for 
grades 2 and 3 combined and .925 for grades 
4, 5, and 6 combined. 

The Diagnostic Test in Paragraph Reading 
contains eight paragraphs, each followed by 
three questions, one related to the general sig- 
nificance of the paragraph, a second calling for 
an inference, and a third asking for a detail. 
Error scores are indicated as they were in the 
Diagnostic Test of Word Recognition, and the 
same general questions can be raised regard- 
ing their validity. The problem of accurate 
diagnosis is further complicated by the fact 
that the total test allows for only eight errors 
of each type. The value of a group test which 
provides diagnostic information cannot be 
overemphasized. It is to be hoped that the re- 
search will be continued by the authors of the 
Dominion Tests to the point where the analyses 
of errors are backed by careful study. The 
reliability of this test, based on comparable 
forms, is .830. 

Those concerned with the technical aspects 
of test construction will appreciate the care 
with which these tests have been built. In 
addition to the reliabilities, an analysis of 
variance provides evidence as to the internal 
consistency of each test, the discrimination be- 
tween persons, the equivalence of forms, and 
the effects of the difficulty of the separate 
items. This careful attention to the statistical 
bases of test construction seems to be resulting 
in tests designed to cover only one or two 
grades instead of the range which is more fre- 
quently attempted in one form. This allows for 
norms which go well below and above the 
grades for which the test is recommended. 
There are still gaps in the series, but the tests 
which have been published are well done and 
give promise of a strong complete battery. 


[477 ] 
Buffalo Reading Test for Speed and Comprehen- 
sion, Revised. Grades 7-16; 1933-41; Forms A, B; 
5 Per 100; 10¢ per specimen set; 30(35) minutes; 
Me E. Wagner; Foster & Stewart Publishing Corpora- 
ion. 


Holland Roberts, Educational Director, Cali- 
fornia Labor School, Inc. San Francisco, 
California. This 1941 revision follows the tra- 
ditional t3xtbook pattern set by the 1936 edi- 
tion. It is a general survey test of certain 
abilities needed in understanding the dull mate- 


rial of unattractive texts, under the pressure 
of timing. The major revision is the shortening 
of the test from 40 to 30 minutes. 

Each of the two test forms has 16 subdivi- 
sions, grouped into two similar 15-minute 
parts. Each of these two parallel parts consists 
of 8 selections from text material nearly all 
published before 1930. The 32 individual selec- 
tions in the two parallel forms are all essay- 
type material, distributed as follows among 
these subject matter fields: geology, paleon- 
tology, and biology 9; astronomy 3; ph 5 
2; history 5; sociology 1; economics 2; poli 
cal science 1; philosophy 3; literary criticism 
2; biography 2; and the informal essay 2. As 
the biographies are both of men of science, the 
subject matter is divided among the fields in 
these proportions: (a) biological and physical 
sciences, 16 out of the total of 32 selections 
in both forms of the tests, or 50 per cent; 
(b) social sciences, 12 or 37% per cent; (c) 
English, 4 or 1274 per cent. Fiction, poetry, 
drama, dialog, and mathematics are not repre- 
sented, and popular nonfiction is included only 
through token selections. As the material cho- 
sen does not represent these five important 
divisions of the school curriculum, the ability 
to read in these fields is not directly tested. 
The absence of fiction which represents such 
a large part of the reading in the secondary 
school is particularly noticeable, especially as 
the abilities needed for reading stories differ 
in important particulars from those used in 
understanding heavy textbook content. 

Perhaps the key to the many difficulties 
which this test presents is the author’s state- 
ment that “Every endeavor was made to select 
unusual topics so that those tested would be 
unlikely to have read them.” To achieve this 
standard, the following key features of an effec- 
tive test have been sacrificed : 

(a) The miscellany of textbook scraps which 
constitute the framework of the test has no 
unity, and the content consequently has no 
over-all meaning or purpose and so lacks so- 
cial significance. 

(b) In striving to find material with which 
young people are unfamiliar, the possibility of 
giving them a vital learning experience and 
direction in their lives has been lost. Much of 
the content chosen could be useful, but it is 
not assembled and focused on the solution of 
any particular problem and therefore cannot 
be assimilated and put to work. 
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(c) The selections are generally uninterest- 
ing, lacking in variety, and difficult to read. 
They deal with subjects that are nearly all out- 
side the experiences of young people, especially 
students in the high school. The assumption 
that the reading of informative material, irre- 
spective of its appeal to the reader and of his 
ability to connect it up with previous experi- 
ences in reading or daily living, is an adequate 
test of reading ability has no established foun- 
dation in fact. Motivation has been proved to 
be a powerful factor in learning, and testmakers 
who wish to measure reading power cannot 
disregard it. 

(d) The selections are without exception 
humorless and unimaginative. 

This test reflects the structure of the educa- 
tional system and the philosophy of the school 
systems for which it was designed. In its 
method and selection of content it is assumed 
that fact gathering is the business of the school, 
much as a squirrel garners nuts, and that the 
student who stores away the most facts is the 
most highly educated. That society today faces 
the postwar problems of an atomic age and 
that it is the key responsibility of the school to 
help the student learn to solve the basic prob- 
lems of peace and survival are ignored. In- 
stead, young people who handled machine guns 
at Bataan and flew bombers over Berlin yes- 
terday and who may, if we fail to build the 
peace, meet death rays and germ warfare to- 
morrow are asked to grapple with the social 
problems of Chinese mandarins in the days 
before Alexander the Great. 

Each of the 16 selections in each form of the 
test varies from one to three paragraphs of 
from 200 to 300 words printed on a single 
right-hand page. The four to six multiple- 
choice test questions accompanying each selec- 
tion appear on the following page and are 
clearly and simply stated. They are primarily 
fact questions depending upon direct recall of 
items just read; i.e., “Fossil birds have (1) no 
teeth (2) reptilian teeth (3) front legs (4) 
scales rather than feathers.” In answering 
these questions students are not permitted to 
turn back to the selection, but must recall the 
facts from a single reading. The ability to re- 
member facts from one reading is not of cru- 
cial importance in normal study, and in this 
particular, as in the choice of subject matter, 
the test situation does not correspond to actual 
life conditions. The condensed nature of the 


material selected and the bad writing which 
characterizes much of it set up additional hur- 
dles that complicate an already specialized, un- 
natural reading situation. 

The test manual for the first edition was 
carefully prepared and presents directions for 
giving the test clearly. 


William W. Turnbull, Secretary of the Board 
and Head of the Test Construction Depart- 
ment, College Entrance Examination Board, 
Princeton, New Jersey. DESCRIPTION. The test 
consists of sixteen passages (300 to 400 words), 
each followed by four to seven multiple-choice 
questions based on its content. The first eight 
passages (Part I of the test) deal with history, 
English, economics, sociology, and psychology ; 
the last eight (Part II) with astronomy, 
physics, geology, biology, and paleontology. 
The selections, and the questions on each selec- 
tion, appear in ascending order of difficulty. 
The student is not allowed to refer to the pas- 
sage while answering questions about it. The 
difficulty of the task seems appropriate for stu- 
dents at the senior high school or college fresh- 
man levels. 

scorING. With the exception of one question 
in each form, the test is scored objectively 
throughout. The student writes the number of 
his answer choice beside the question; the 
scorer checks the number against a list of right 
answers. This is a decidedly tedious process 
which could easily have been eliminated by ar- 
ranging the test for stencil scoring, eg., by 
use of a separate answer sheet. Three scores 
are obtained: total right (“raw score”), num- 
ber of questions up to and including the last 
question answered (“speed score”), and “raw 
score” divided by “speed score” (“comprehen- 
sion score”). Extra credit is given for finishing 
either part of the test in less than the time 
allowed (15 minutes). 

varprty. No validity data are reported. 

RELIABILITY. No reliability data are reported. 

norms. Percentile norms for the three scores 
are provided for “college freshmen” on Form 
A only. Since there is no indication of the 
number or characteristics of the group on 
which these norms are based, they are of little 
or no value. No standardization data are given 
for Form B, nor is there any indication of the 
relation between scores on the two forms. 

MANUAL. The manual gives directions for 
administering and scoring the test, and pro- 
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vides the norms mentioned above. There is no 
statement of the test's purpose, method of con- 
struction or standardization, reliability or valid- 
ity, or of the manner in which the scores 
should be interpreted. 

EVALUATION. This test suffers by compari- 
son with others that are available (e.g., the 
lowa Silent Reading Test, the Nelson-Denny 
Reading Test, the Traxler High School Read- 
ing Test). It has already been noted that avail- 
able data about the test are wholly inadequate. 
The material itself does not suggest that the 
data, if available, would be reassuring. Many 
of the questions deal with unimportant details 
mentioned in the passages, and a considerable 
number of items violate common principles of 
test construction (eg. clarity, grammatical 
accuracy, etc.). The proportion of questionable 
items is unusually large, indicating a need for 
a thorough re-editing and, most of all, an item 
analysis. The test user's task is made unneces- 
sarily difficult by use of an awkward scoring 
System. 


[ 478 ] 

Chicago Reading Tests. Grades 2-4, 4-6, 6-8; 1939- 
40; 4 levels; 60 per specimen set; Max D. Engelhart 
and Thelma Gwinn Thurstone; E. M. Hale and Co. 

4) TEST A. Grades 1-2; Forms 1, 2; 31(40) minutes ; 
$2.50 per 50. 

b) TEST B. Grades 2-4; Forms 1, 2, 3; 42(60) minutes ; 
$2.50 per 5o. 

C) TEST C. Grades 4-6; Forms 1, 2, 3; 45(55) minutes; 


b per 50. 
d) TEST D. Grades 6-8; Forms 1, 2, 3; 45(55) minutes ; 
$3 per so. 


REFERENCES - 


1. Encerwart, Max D., AND THURSTONE, THELMA G, “Chi- 
cago Reading Tests," Chicago Sch J 20:748: N^ 3 € oO 


Ed Res B 20:264 D 1741. William J. Jones. 
The Chicago Reading Tests, purporting to 
measure comprehension of words, sentences, 
stories, maps, and paragraphs, and a pupil's rate 
of reading, follow the ustial testing procedures 
in such measuring. The samples of materials 
provided in each subtest are inadequate to meas- 
ure either validly or reliably the status of a 
pupil in any of the aspects of reading. The sam- 
ples of materials for measuring comprehension 
of paragraphs are limited almost exclusively to 
getting the main idea and noting details in the 
paragraphs. Other aspects of reading compre- 
hension obviously cannot be measured without 
an extension of the test. The phases of reading 
comprehension measured in the test are not 
adequately defined or explained. No data are 
provided upon the validity and reliability of 


these tests. Such tests, however, might well be 
used in a survey of undifferentiated aspects of 
reading. It may be presumed that the relia- 
bility of scores on subtests are reliable for 
group interpretation purposes only. For indi- 
vidual diagnosis more carefully constructed 
reading tests are recommended. 


For reviews by Robert Lawrence McCaul 
and W. J. Osburn, see 40:1531. 


[ 479 ] 
DeVault Primary Reading Test. Grades 1-2; 1931; 
Forms 1, 2; 6o¢ per 25 ; 25¢ per specimen set, postpaid; 
(15-20) minutes; Nellie M. DeVault; California Test 
Bureau. 


Alice N. Jameson, Assistant, Child Place- 
ment Department, Public Schools, Brookline, 
Massachusetts. This test purports to measure 
reading ability in the first and second grades 
It contains three tests. The first test measures 
word recognition. Responses for this first test 
are registered in four different ways: a cross 
over a word, a line under a word, a box or a 
circle around a word or phrase. In the first and 
third sentences of Test 1 all four marks are 
employed. By using so many different marks, 
the child is obliged to concentrate upon both 
his reading and the type of mark he must use 
to indicate his response. The use of just one 
of these marks throughout the test would allow 
him to concentrate on reading exclusively. 

The second test includes 15 short questions 
which are answered by underlining either Yes 
or No. 

The third test is made up of 15 items. The 
first seven are sentences giving specific direc- 
tions to be followed. For example, “Make a 
ball in the box.” The other items are stories 
of increasing difficulty, each of which is fol- 
lowed by two or three questions to test com- 
prehension. 

The DeVault Primary Reading Test is to be 
commended for the following features: (a) The 
content of this test “is based upon a detailed 
analysis of various courses of study, research 
concerning word lists, test criteria and reports 
on children’s reading.” (b) It is well graded. 
(cy Tables in the manual indicate that the 
DeVault test is "strictly comparable with the 
Ingraham-Clark Tests, and Gates I, and II in 
the second grade." (d) The test booklet is com- 
pact and easy to use. (e) The print is clear and 
the spacing excellent. (f) The score sheet con- 
tains the answers to all three tests on one 
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page, thereby facilitating correcting and saving 
the teacher’s time. 

The DeVault test was “designed to meet the 
needs of the first and second grade.” It accom- 
plishes this purpose. 


[ 480 ] 
Diagnostic Examination of Silent Reading Abili- 
ties. Grades 4-6, 7-9, 10-16; 1939-40; IBM in part; 
3 levels ; 2 parts; 50¢ per specimen set of any one level, 
postpaid; August Dvorak and M. J. Van Wagenen; 
Educational Test Bureau. 
Q) PART I, RATE OF COMPREHENSION TEST AND INDIVID- 
UAL DIAGNOSTIC READING PROFILE. $1 per 25; 5(15) 
minutes, 
b)PARTS II AND 11r. IBM; separate answer sheets must- 
be used; $2 per 25; 2¢ per machine-scorable answer 
sheet; nontimed (105-135) minutes. 


REFERENCES 
. HAvTER, WALTER HENRY. wrvey of Silent ng 

1, H. W.: H A Survey of Silent Readi; 
Abilities in the State of Oregon. UNIES master's thesis, 
University of Washington, 1941. (Abstracts of Theses..., 
1941, pP- 46-7.) 

2. TRAXLER, ARTHUR E, “A Study of the Van Wagenen- 
Dvorak Diagnostic Examination of Silent Reading Abilities,” 
pp. 33-41. In 1940 Fall Testing Program in Independent Schools 
and Supplementary Studies. Educational Records Bulletin, No. 
31. New York: Educational Records Bureau, January 1941. Pp. 
Xi, 41. Paper, lithotyped. $1.00. * (PA 15:3580) 


Frederick B. Davis, Professor of Psychology, 
George Peabody College for Teachers; and 
Director, Test Research Service; Nashville, 
Tennessee. This test constitutes an ambitious 
effort to measure ten aspects of silent reading 
ability for pupils in each of grades 4 through 12 
and among college freshmen. Test 1 is a well 
designed and highly speeded test of ability to 
comprehend easy material, but the directions 
do not make it sufficiently clear that the time 
limit is very short and that individual scores 
depend mainly on speed. If they did, this test 
would be one of the most useful of its kind. 
Test 2 is a verbal analogies test. Specific clues 
to the answers for some items are provided by 
differences in the parts of speech of the choices. 
Test 3 and 4 measure word knowledge. Test 5 
includes items of general information, such as 
“Sugar comes from 26. Louisiana, 27. Hawaii, 
28. Florida, 29. Brazil, 30. China.” The keyed 
answer to this item is “Louisiana.” According 
to the 1946 Statesman’s Yearbook, in 1944 all 
these areas produced sugar, and all of them ex- 
cept Florida produced more than Louisiana. The 
item, as worded, has no one correct answer ; the 
better informed the examinee, the worse the 
item seems to him. These details are cited merely 
to illustrate the careless construction and keying 
of some items in the examination. In Part III, 
for example, several items (e.g, Item 54, 
Senior Division) begin: “The paragraph is 
mainly about.” The examinee cannot tell 


whether the main point of the paragraph or the’ 
topic discussed at greatest length is wanted, 

Tests 6 to 10, inclusive, are composed of 
items based on reading passages and are ad- 
ministered without time limit. The reading level 
score derived from Tests 6 to 10 is a weighted 
combination in which Test 7 (weighted the 
heaviest) has about twice the weight of Test 
6 (weighted the lightest). Whether this rela- 
tive weighting of the tests, which is determined 
by the magnitudes of the standard deviations 
and intercorrelations of the parts, was planned 
and has any rationale is not stated in the man- 
ual. In any case, the writer judges the reading 
level score to be a reasonably valid measure of 
level of comprehension. It is unfortunate that 
the manual makes some inappropriate state- 
ments about the means of establishing the valid- 
ity of any test. What reservations the writer 
has about the validity of the reading level score 
are due to the fact that he suspects Tests 6 to 
10 (especially Tests 6 and 8) of measuring to 
an unacceptably great extent the pupils' ability 
to guess the answers the item writer had in 
mind in addition to the five skills specified in 
the manual. 1 

Intercorrelations and reliability coefficients 
are not presented in the manual, but they have 
been reported by Traxler (2) on the basis of 
small, selected samples. In a sample of 116 
tenth-grade, independent-school pupils, the in- 
tercorrelations of Tests 6 to 10 range from .43 
to .70. Corrected for attenuation, the lowest of 
these becomes .75, and half of them exceed .90. 
Taken by themselves, these high intercorrela- 
tions do not necessarily reflect unfavorably on 
the validity of the part scores or of the total 
score. But they do indicate the presence: of 
only small amounts of unique variance in each 
test. 

The reliability coefficients for Tests 2 to 10 
in the Senior Division booklet range from .43 
(for Test 6) to .79 (for Test 4). Experience 
with items of these types warrants the conclu- 
sion that individual item -efficiency in the case 
of tests like Test 6 is not as high as can be 
obtained from well-written and well-selected 
items. However, the standard error of meas- 
urement of an obtained reading level score is 
only about two C-score points at the public 
school, twelfth-grade level. 

All things considered, the examination pro- 
vides useful reading level and speed of com- 
prehension scores. Its diagnostic value is lim- 
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ited by the unreliability of differences among 
the separate test scores and the fact that each 
of them measures little that is unique. 


W. E. Hall, Associate Professor of Educa- 
tional Psychology, The University of Nebraska, 
Lincoln, Nebraska. It is a quite widely ac- 
cepted fact among reading experts that the ade- 
quacy of a diagnostic reading test rests on three 
requirements: (a) careful analysis of reading 
skills involved in a reading situation; (b) 
reading material as nearly like a usual reading 
situation as possible (this requirement is im- 
portant because we are not sure of our knowl- 
edge of what are the reading skills) ; (c) care 
in selection of criteria of validation and in the 
subsequent validation. A careful study of the 
test seems to reveal that it was cleverly con- 
Structed, is easy to score, and purports to 
measure some very important skills in a read- 
ing situation. 

The reading skills tested are: rate of com- 
prehension, perception of relations, vocabulary 
in context, vocabulary—isolated words, gen- 
eral information, ability to grasp the central 
thought, ability to note clearly stated details, 
interpretation, integration of dispersed ideas, 
and ability to draw inferences. 

All the tests but r, 2, and 5 are probably 
measuring skills used in ordinary reading situ- 
ations. Test r, rate of comprehension, is not a 
typical reading situation because it is broken 
up into small reading units and uses the method 
of checking extraneous words to determine the 
rate of comprehension. It would seem that this 
checking device is very similar to a proofread- 
ing type of test rather than to rate of compre- 
hension. Test 2, perception of analogous rela- 
tionships, contains a type of item common in 
intelligence tests; and one wonders immedi- 
ately how frequently this skill occurs in read- 
ing and how it is related to reading. Test 5, 
range of information, appraises the achieve- 
ment in' gathering information, but just how 
this skill is related to reading would seem to 
baffle any reading analysis. The only way to 
justify Tests 1, 2, and 5, or any of the tests, 
would be validation. The only validation given 
in the manual is a table of distribution of 
medians that fall within certain grade levels. It 
is difficult to see just how such a table can 
validate these tests as a diagnostic instrument 
related to the reading tasks that are required 
of the pupil in school or in our society. 


There is no statement of reliability in the 
manual, which leads one to question reliability 
in terms of the length of each test. It is now 
generally conceded that the validity of a meas- 
uring instrument is best indicated by its dis- 
criminative capacity. It is stated in the manual 
that this examination has a high degree of dis- 
criminative capacity as evidenced by a study 
of 9,000 cases, The problem still remains—does 
this examination discriminate between individ- 
ual skills important to reading and other skills ? 


J. B. Stroud, Professor of Education and 
Psychology, The State University of Iowa, 
Iowa City, Iowa. Part I (Test 1, rate of com- 
prehension) consists of three- and four-line 
paragraphs, in which some word in the last 
half disagrees with the meaning of the first 
half. This word is to be crossed out. The 
method is similar to that used in the Chapman- 
Cook Speed of Reading Test. The method dif 
fers from that used in the reading comprehen- 
sion part of the Cooperative English Test 
wherein the rate score is the number of items 
correctly responded to in a given period of 
time minus a correction; and from the method 
of computing rate by the number of words 
read in a length of time, as in Pressey’s Read- 
ing Speed and Comprehension, the Traxler 
High School Reading Test, and the Iowa Sil: 
Reading Test. Each of these three methods 
corrects some fault in the others but in so do- 
ing creates some of its own. The last-named 
method is the most natural, yet gives no evi- 
dence of the care with which the piece is read. 
However, the other two methods mentioned do 
not separate the time spent in not comprehend- 
ing from that spent in comprehending. Inci- 
dentally, a procedure developed by Blommers * 
avoids this objection by computing rate scores 
only from exercises done correctly. On the test 
here reviewed the student may successfully 
mark most of the paragraphs by reading a por- 
tion of the first and last lines only. It would 
be helpful to know how students actually react 
to these different testing procedures, 

Part II includes Tests 2 through 5. Test 2, 
perception of relations, is identical with the 
analogies test commonly found in intelligence 
tests. Other tests of Part IT (vocabulary in 
context, vocabulary, and general information) 
are also similar to tests found in intelligence 

* Blommers, P. J. “Rate of Comprehension of Reading: Its 


Measurement and Its Relation to Comprehension.” J Ed 
Psychol 35:449-72 N "44. 
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test batteries. These are excellent tests and 
may be used as an index of aptitude for com- 
prehension. This is one of the chief diagnostic 
features of the battery. 

Part III (Tests 6-10) in form consists of 
100 items based upon reading selections, ad- 
ministered without time limit. By differential 
scoring, five part scores based upon 20 items 
each are obtained. The reading exercises are 
ample in number and variety, and the test items 
are constructed with care and competence. The 
scores on Part III should reflect reading power, 
uncomplicated by rate in so far as this can be 
controlled by testing procedure. Part III yields 
the following part scores: grasping central 
thought, retention of clearly stated detail, in- 
tegration of dispersed ideas, drawing inferences, 
and interpreting contents. There is little doubt 
that the processes suggested by these labels are 
implicated in effective reading. Utilization of 
items appropriate to each of these processes 
certainly should give a more comprehensive 
account of a student's reading ability than would 
the use of a smaller number. As a general test 
of reading, Part III deserves a ranking with 
the best. 

The merit of the diagnostic feature of a 
reading test depends, among other things, 
upon the extent to which abilities measured 
vary independently and upon the extent to 
which these abilities can and should be de- 
veloped separately by subsequent instruction. 
No statistical evidence of the disparateness of 
the part scores is given. The instructor will 
wish to decide for himself whether any special 
use is to be made of the separate scores of 
Part III in teaching. No information is given 
about reliability. In using profiles for diagnos- 
tic purposes, it is of the greatest importance to 
keep before teachers the question of the extent 
to which deviations may be expected by chance. 
Scores on the three parts of the test—rate, in- 
telligence, and power of comprehension should 
have considerable diagnostic significance. 


For an excerpt from a review by Worth J. 
Osburn, see 40:1532. 


[ 481 ] 

*Diagnostic Reading Tests. Grades 7-13; 1947-48; 
IBM except for Section IV, Part I; prepared by the 
Committee on Diagnostic Reading Tests, Inc.: Frances 
Oralind Triggs (Chairman), Robert M. Bear. Ivan A. 
Booker, Daniel D. Feder, Constance M. McCullough, 
A. Eason Monroe, George D. Spache. and Arthur E. 
Traxler; distributed by Educational Records Bureau. 
@) SURVEY SECTION. 


b) SECTION I, VOCABULARY. 
c) SECTION II, COMPREHENSION. 
1) Part 1, Silent. 
2) Part 2, Auditory. 
d) SECTION III, RATES OF READING. 
1) Part 1, General. 
2) Part 2, Social Studies. 
3) Part 3, Science. 
€) SECTION IV, WORD ATTACK. 
1) Part 1, Oral. 
2) Part 2, Silent. 
REFERENCES 
1. Triccs, Frances Oratinp. “Diagnostic Reading Tests as 
Aids to Remedial Instruction.” Sch E Soc 66:42-5 Jl 19 '47. * 
[482 ] 
Elementary Reading: Every Pupil Test. Grades 
4-6; 1946-47; new form usually issued each April and 
December; forms December 1946 and April 1947; 2¢ 
per test; 1¢ per scoring key; Ina Sigworth, Ethel Rud- 
ler, and D. S. Gruey; Ohio Scholarship Tests, State 
Department of Education. 
d) GENERAL ABILITY. 
b) SPEED AND COMPREHENSION. 
[ 483 ] 
Garrison First Year Reading Test. Grades 1B, 1A; 
1938; 1 form; 2 levels; $1 per 25; 20¢ per specimen 
set; (30-35) minutes; Marie Garrison; C. A. Gregory 
o. 
a) TEST I, MID-YEAR. 
b) TEST II, END OF YEAR. 


REFERENCES 
1. Evans, Marte Garrison, The Revision and Standardiza- 
tion of a First Grade Reading Test. Unpublished master’s thesis, 
University of Michigan, 


nV TONE, CLARENCE, R. “Validity of Tests in Beginning 
Reading.” El Sch J 43:361-5 F '43. * (PA 18:2605) 

Ruth Lowes, Associate Professor of Educa- 
tion, The West Texas State Teachers College, 
Canyon, Texas. This is a series of two tests 
for use in the first grade. One is to be given 
at midyear and the other at the end of the 
year. The tests are designed to measure visual 
discrimination, auditory-visual discrimination, 
phrase and sentence reading, and comprehen- 
sion reading. The last-named skill refers to 
sentence and paragraph comprehension. The 
present revision is the fourth and was worked 
out by the author in a course taught by Dr. 
Arthur I. Gates at Teachers College, Columbia 
University. 

Words used in the first test are those found 
to be common to twelve primers which were 
considered leading books at the time the test 
was published. Words in the second test are 
those common to ten first readers. Both tests 
were checked with the vocabulary lists of Thorn- 
dike, Gates, and Wheeler and Howell. For pur- 
poses of standardization, scores from almost 
four thousand pupils in schools in from fifteen 
to seventeen cities were used. For the two tests, 
reliability coefficients of .84 and validity coeffi- 
cients of .83 and .80 respectively are reported. 
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No explanation is made in the manual as to 
how these coefficients were obtained. 

The directions for administering the test are 
clear-cut and to the point. A strong feature is 
noted in the directions for giving the part for 
testing auditory-visual discrimination. Here a 
row of words is given, and the pupil circles the 
one pronounced by the examiner. In the trial 
exercise, precautions are taken to see that the 
pupil looks at each word before marking one. 
The test is given in two sittings. 

The pictures in both tests are disappointing, 
and one wonders if the size of the type is not 
small in comparison with that used in primers 
and first readers, 

The age table showing frequency of individ- 
ual scores for different chronological ages may 
be of some value. Grade equivalents and per- 
centile norms would have been helpful. The 
two tables showing the distribution of scores 
in the cities in which the tests were standard- 
ized contain valuable data but are of little help 
to the test user. 


[ 484 ] 
Gates Advanced Primary Reading Tests. Grades 
2 5-3; 1926-43 ; revision of Gates Primary Silent Read- 
ing Tests; Forms 1, 2, 3; 2 parts; $2.70 per 100 of 
either part; 30¢ per specimen set; 40(50) minutes; 
Arthur I. Gates; Bureau of Publications, Teachers 
College, Columbia University. 
d) TYPE I, WORD RECOGNITION. 
b) TYPE 2, PARAGRAPH READING. 


REFERENCES 

gam of Diagnosis ond Remy Menea PRG Eng Re 
York: Macmillan Co., 1947. Pp. xxi, 657. $4.25. * (PA 22:3195) 

Virginia Seavey, Assistant, Child Placement 
Department, Public Schools, Brookline, Mas- 
sachusetts. [Review of Form 2.] The Gates 
Advanced Primary Reading Tests, which are 
recommended for use in the latter part of the 
second grade and at any time in grade 3, are 
a carefully constructed and well standardized 
team of two tests, “designed to measure gen- 
eral competence or power in each of two as- 
pects of reading.” Skill in word recognition is 
tested by the frequently used method of select- 
ing one of four words that accompany a pic- 
ture. The 48 items increase in difficulty gradu- 
ally from primer words such as "mother" and 
"door" to words on an adult level of com- 
prehension like “palatable” and “vaporous.” 
Ability to read with understanding is measured 
by the paragraph reading test of 24 items di- 
recting the child to mark an accompanying set 
of pictures in certain designated ways, such as 


D 


“Draw a line under," or “Put an X on," etc. 
Both the level and range of vocabulary and 
comprehension can be determined as there is a 
very wide range in the difficulty of the items 
in both tests, 

There is much to recommend these tests. 
They are valuable primarily for survey pur- 
poses and as a supplement to the Gates Pri- 
mary Reading Tests. As they have such a high 
ceiling, they are especially well adapted for use 
in classes containing a number of superior 
readers. For second-grade pupils who still have 
limited reading ability the easier Gates Primary 
Reading Tests are more suitable. 

The tests are easily administered and scored. 
Directions are clear and simple. The time lim- 
its are ample enough so that the pupils do not 
have to work under pressure. The test mate- 
rial is presented in an attractive form with ex- 
cellent type and spacing. Except for one or 
two instances, the pictures are free from the 
ambiguity for which the Gates primary tests 
have been criticized. Many teachers have re- 
marked that children enjoy these tests, and this 
has been proved true in the case of a few chil- 
dren with reading difficulty and a high degree 
of resistance to taking any test. A more tech- 
nical advantage is the fact that the reading 
grade and age norms are based on over 5,000 
cases. Reliability figures are moderately high. 
It is also commendable that there are three 
equivalent forms for each type test so that re- 
testing can be done when necessary. 

It must be remembered that the paragraph 
reading test measures only one narrow reading 
skill—the ability to follow directions in mark- 
ing a picture. While total comprehension of 
reading matter involves more than bearing in 
mind such details, this kind of exercise is a 
satisfactory measure of comprehension on the 
Second and third grade levels. We cannot tell, 
however, whether failure in any specific in- 
stance is attributable to lack of power in the 
mechanics of reading, to carelessness in mark- 
ing and poor work habits, or to limited mental 
ability. 


George Spache, Rohrer, Hibler and Replogle, 
New York, New York; formerly Psychologist, 
Public Schools, Chappaqua, New York. Meas- 
ures of word recognition and paragraph reading 
are offered for use in the latter half of the sec- 
ond grade and through the third grade. The 
word recognition involves associating words and 


ll 


$09] TESTS & REVIEWS: READING 


pictures; the paragraph meaning, marking pic- 
tures according to instructions inserted in the 
paragraph. 

It is doubtful whether the grade norms given 
for these tests, which extend from 1.7 to 7.0 
or 8.2, represent actual pupil performance 
above the third grade. Judged by their diffi- 
culty, the tests are most effective and discrimi- 
natory if used in the range suggested by their 
author. 

The reliabilities for word recognition are 
given by the author as .89 and .90; those for 
paragraph meaning as .86 and .88. These are 
high enough to yield "reasonably reliable re- 
sults in the individual cases and highly reliable 
scores for a class," if it is remembered that 
differences between individuals of less than 
seven-tenths of a grade are only fairly reliable 
differences. The norms are expressed in terms 
of the "average" child, and no cognizance 
taken of the influence of mental ability upon 
scores or of the practice effects of utilizing two 
forms consecutively, as advised by the author 
when more precise individual appraisal is de- 
sired. 

Teachers are prone to interpret the grade 
scores derived from such reading tests as indi- 
cating the level of the reading materials that 
their pupils are competent to handle. When 
artificially derived norms for grades far above 
the level where the testing is usually done are 
given, as here, these grade scores become even 
more misleading. In a group of 8o third grade 
children, with median IQ of 103, thirty-one 
per cent made scores more than a year above 
their grade placement, implying that they were 
capable of reading materials more than a year 
above their actual reading levels. Taken at face 
value, the inflated scores achieved by these 
children would have been misinterpreted by 
the average teacher using the tests preparatory 
to organizing reading groups. Thus, the norms 
for these tests tend to give exaggerated esti- 
mates of the reading ability of bright children 
or good readers. 

The tests will prove serviceable for screen- 
ing and diagnostic purposes if used with chil- 
dren of low mentality or poor reading ability. 
With these pupils they may be interpreted in 
the common manner as indicating the general 
level of appropriate reading materials. 

[ 485 ] 


Gates Basic Reading Tests. Grades 3.5-8; 1926-43; 
revision of Gates Silent Reading Tests; Forms 1, 2, 3, 
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4; 4 parts; $2.70 per 100 of any one part; 3o¢ per 
specimen set; (15) minutes per part; Arthur I Gates; 
Bureau of Publications, Teachers College, Columbia 
University. 

G) TYPE A, READING TO APPRECIATE GENERAL SIGNIFI- 
CANCE. 

b) TYPE B, READING TO PREDICT OUTCOME OF GIVEN EVENTS. 
C) TYPE C, READING TO UNDERSTAND PRECISE DIRECTIONS. 
d) TYPE D, READING TO NOTE DETAILS. 
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George Spache, Rohrer, Hibler and Replogle, 
New York, New York; formerly Psychologist, 
Public Schools, Chappaqua, New York. The 
four tests are intended to measure different 
types of reading such as that for securing a gen- 
eral impression, to predict outcomes, to follow 
detailed directions, and to recall details. Each 
consists of 18 to 24 paragraphs and their ques- 
tions arranged in ascending order of difficulty. 
As Traxler * has shown, in his survey of reading 
tests, there is a distinct tendency among testmak- 
ers to devise and label reading tests with no evi- 
dence whatsoever of their validity. This tendency 
is present here in that the author gives no proof 
that his tests actually measure four different 
types of reading, or that these types even exist. 
The 57 intercorrelations given by the author 
range from .66 to .92 with a median of .831. 
There is certainly little support here for con- 
sidering the tests as disparate measure of four 
distinct reading abilities. The self-correlations 
or reliability coefficients range from .76 to .96 
with a median of .889, implying almost as 
much identity between any two of the four 
tests as there is between two forms of any one 
test. 

We would seriously question the effective- 
ness and discriminatory power of these tests in 
the grade range from 3 to 8 as recommended 
by their author. There is evidence * from pri- 
vate school testing that above the fourth grade 
the norms give exaggerated estimates of read- 
ing ability. In our own public school experi- 
ence, 31 per cent of a group of 71 third graders, 
with median verbal IQ of 110, achieved scores 
more than a year above their actual grade 
placement in Type A, Reading for General 
Significance. Forty-seven per cent were simi- 
larly overestimated in Type C, Reading to Un- 


* Traxler, Arthur E. The Nature and Use of Reading Tests. 
Educational Records Bulletin, No. 34. New York: Educational 
Records Bureau, 1941. Pp. 64. Paper. 
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derstand Precise Directions. None of these chil- 
dren was actually accelerated more than six 
months in his classroom reading level. These, 
in our opinion, are typical of the results ob- 
tained with these tests in ordinary testing situ- 
ations. 

Norms are also offered for speed in terms of 
the number of paragraphs attempted and for 
accuracy in per cent of paragraphs correct. 
These yield ratings in such terms as very high, 
medium, very low, etc. Apparently they serve 
to give an estimate of these aspects of reading. 

In our opinion, the tests cannot be recom- 
mended for general use for such purposes as 
Screening, organizing reading groups, measur- 
ing growth, etc. unless the average score from 
the use of all four tests is the only result em- 
ployed in comparing groups and individuals. 
The tests will also function in individual diag- 
nosis among children of low mentality or poor 
reading skills, if we accept the author’s premise 
that they are indeed measuring different types 
of reading. 


Herbert F. Spitzer, Associate Professor of 
Education, The State University of Iowa, Iowa 
City, Iowa. These tests consist of four separate 
forms and four types or parts for each form: 
Type A, Reading to Appreciate General Sig- 
nificance; Type B, Reading to Predict the Out- 
come of Given Events; Type C, Reading to 
Understand Precise Directions; and Type D, 
Reading to Note Details. Each type of the test 
is printed on a four-page 84 by 11-inch book- 
let—one page for title, directions, and pupil’s 
name and three pages of test material. This use 
of a separate title page for each type seems a 
waste of paper and test time, for the child in 
taking the complete tests has to write his name, 
age, birthday, school, grade, and date of tak- 
ing the test four times. Since the total working 
time for the four types is only 38 minutes for 
grades 3 and 4 and 28 minutes for grade 5 and 
up, the waste of time resulting from this writ- 
ing of name and other information appears 
obvious. 

Type A, Reading to Appreciate General 
Significance, consists of 24 brief paragraphs. 
The subject is to underline the one of five 
words which tells something significant about 
the paragraph. Although this is supposed to be 
a measure of easy reading, the subject has to 
move along at a pretty rapid rate, for approxi- 
mately 1,800 words of reading are presented in 


only 6 minutes for upper grades and 8 min- 
utes for lower grades. In fact, 3 pages of test 
material is a lot of reading for only 24 test 
items. However, the paragraphs are for the 
most part interesting and challenging, and the 
test items bear directly on the paragraph. A 
few of the paragraphs plunge the reader rather 
abruptly into a situation in which it is difficult 
to get the setting, e.g., in Item 17, Form 4, the 
first sentence is “It was fourteen years before 
the New World was named.” “Fourteen years 
before what” is the question that normally 
arises. 

In Type B, Reading to Predict the Outcome 
of Given Events, 24 paragraphs are presented. 
Four statements follow each paragraph. The 
subject is to select the statement that repre- 
sents what is most likely to happen next. Ap- 
proximately 2,400 words are presented for the 
8 or 10 minutes of testing time. The paragraphs 
are interesting, and the good reader should 
certainly be able to select more of the best 
“what happened next” statements than the poor 
reader would be able to select. This type of 
test has the very attractive feature of getting 
away from word matching and other clues. The 
subject may consult the paragraph but will not 
find any clues in the form of similar words 
that will help in selecting the best answer. 

In Type C, Reading to Understand Precise 
Directions, a drawing and word description or 
explanation is provided. The subject is given 
directions to draw a line, to make a circle, 2 
the like. There is nothing particularly interest- 
ing or challenging about the various test situ- 
ations. A few attempts to play for interest ap- 
pear rather weak. For example, after the child 
has been instructed to draw a circle for the 
wheel on a water tower, this statement ap- 
pears, "Now it can begin to pump." The use 
of such a term as "water tower" rather than 
"windmill tower" would probably be disturb- 
ing to children in the land of windmills, “Water 
tower" usually refers to the towers and ele- 
vated water storage tanks in villages and 
towns. A similar disturbing misuse of terms is 
used in connection with kinds of.lights. An oil 
lamp is called a lamp, whereas an electric bulb 
is called an electric light. Many children refer 
to their father's desk lamp as a lamp even 
though it is electric. 

In Type D, Reading to Note Details, 24 
paragraphs each followed by three incomplete 
statements or questions about the paragraph 
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are used. Each incomplete statement or ques- 
tion can be answered by underlining one of 
four words. A minor inconsistency occurs in 
this test in that the directions state that ques- 
tions are asked about the paragraph, and then 
(on Form 4) the first paragraph is followed 
by three incomplete statements. About one- 
third of the items of the test consist of incom- 
plete statements rather than questions. The 
paragraphs used in this test are for the most 
part interesting, and therefore the best efforts 
of children should not be as difficult to secure 
as would be the case for Type C. 

A scoring key for the Gates Basic Reading 
Tests is provided in the manual, and the sug- 
gestion is made that “a copy of each of the 
tests should be marked to accord with the key 
and used to facilitate scoring.” Such a key 
would, of course, require comparison of the 
key with the paper being scored, a slow and 
tiring process. Since the tests could easily be 
arranged so that marks could be counted 
through use of a stencil-type answer key, the 
system of scoring recommended in the manual 
must be considered very poor. Much of the 
time and energy teachers use in scoring the 
Gates Basic Reading Tests could easily be 
saved and might be used in doing something in 
the way of remedial or improved instruction 
based on the test results. 

The manual accompanying the Gates Basic 
Reading Tests provides an excellent discussion 
of the place and purpose of these tests in an 
instructional program. There is also a brief 
description of the different tests and more ex- 
tensive sections on diagnosis based on raw 
scores, diagnosis by use of age and grade 
norms or scales of average achievement, sam- 
ples of recommended type of diagnosis, apprais- 
ing a class as a whole, and other uses to be 
made of the tests. 

The notes on reliability in the manual are 
excellent. The self-correlation on the tests are 
nearly all within the range .85 to .94. Some 
of the intercorrelations between two types are 
as high as .92, an extremely high figure for 
tests of supposedly different phases of a subject. 

Three types of norms are provided in the 
manual: (a) reading age and reading grade, 
(b) accuracy scores, and (c) speed or rate 
norms. The second are provided in the form of 
five levels for each half grade. The third type 
of norm is based on the number of paragraphs 
attempted, This is, of course, not a true speed 


or rate score since comprehension played a part 
in determining how fast the pupil read. In this 
the Gates rate score is no worse than rate 
scores on other widely used reading tests. 

The manual contains an excellent reference 
list. 

The outstanding features of the Gates Basic 
Reading Tests have been presented in the pre- 
ceding paragraphs, but a few other notes and 
comments should be made. The wide range of 
grades (middle third through eighth) for 
which these tests are recommended is rather 
surprising when it is a well-known fact that 
other tests designed for as short a range as 
grades 3, 4, and 5 have been found to be rather 
unsuited for some third and some fifth grades. 
It is difficult to see how a basic reading test 
can omit such important areas of reading as 
organization of material read and the skills 
involved in locating information—the use of 
the index and sources of information. In the 
Gates Basic Reading Tests use is made of many 
short unrelated paragraphs, while in normal 
reading situations several paragraphs on the 
same topic are the rule rather than the excep- 
tion. Why not several paragraphs on the same 
topic is a question that test users should direct 
to test authors. 

The Gates Basic Reading Tests, as has been 
indicated, have relatively simple directions and 
therefore can be administered without much 
difficulty. While they are not as easily scored 
as they might be, the directions for scoring are 
easily followed. The tests require less than 40 
minutes of actual working time, measure im- 
portant areas of reading, and are provided 
with three kinds of norms. The wise use of 
these tests should aid in promoting worth- 
while study resulting in improvement of in- 
struction in reading. 


T. L. Torgerson, Chairman, Psycho-Educa- 
tional Clinic, and Professor of Education, The 
University of Wisconsin, Madison, Wisconsin. 
This battery is a series of reading tests of high 
reliability which is easily administered. The 
series possesses considerable flexibility because 
of its four types, four forms, and short work- 
ing time. The manual, on the whole, is very 
complete and should prove very useful to the 
user. Scoring would be facilitated if the keys 
were printed separately and keyed to the test 


booklets. 
One criterion of a valid reading test is the 


486 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 512 


inability of the pupils to answer the questions 
without reading the paragraph. Undoubtedly, 
many of the questions in Tests C and D can 
be answered by upper-grade pupils without 
reading the paragraphs because the questions 
deal with knowledge or information familiar to 
most pupils. 

Do these tests which are designed to measure 
rate of comprehension utilize a valid technique 
for measuring rate? The best-answer type of 
response used in Tests B and C is a useful 
technique in measuring power of comprehen- 
sion. Does*it yield a valid measure of the rate 
of reading? The technique of crossing out the 
absurd or irrelevant word or phrase as em- 
ployed in the Minnesota and the Michigan 
speed of reading tests is unquestionably supe- 
rior. The teacher in using these tests should 
be aware of the fact that the tests have been 
designed to determine how much reading mate- 
rial on a third grade level of difficulty pupils 
in grades 4 to 8 can read and comprehend ac- 
curately in a given period of time. In order to 
determine a measure of a pupil's level of com- 
prehension, a test which consists of reading 
material which increases in difficulty should be 
used. The Gates Reading Survey for Grades 
3 to ro provides such a measure. 


For reviews by Joseph C. Dewey and James 
R. Hobson and an excerpt from an appraisal 
by Edward William Dolch of the Gates Silent 
Reading Tests, see 40:1539. 
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Gates Primary Reading Tests. Grades 1-2.5; 1926- 
43; revision of Gates Primary Silent Reading Tests: 
Forms 1, 2, 3; 3 parts; $2.70 per 100; 30€ per specimen 
set; 50(80) minutes; Arthur I. Gates; Bureau of Pub- 
lications, Teachers College, Columbia University. 

@) TYPE I, WORD RECOGNITION. 15(25) minutes. 

b) TYPE 2, SENTENCE READING. 15(25) minutes. 

C) TYPE 3, PARAGRAPH READING. 20(30) minutes, 
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William S. Gray, Professor of Education, 
The University of Chicago, Chicago, Illinois. 
These tests are an integral part of a broader 
program for testing reading throughout the 
grades which Dr. Gates has developed. They 
include separate tests designed to measure the 
level and range of ability of three highly im- 
portant types of reading in the primary grades. 
Type 1, Word Recognition, measures the de- 
gree to which pupils can identify with reason- 
able accuracy representative primary words, 
The test consists of 48 items, each of which 
includes a picture and four words. The task is 
to identify and encircle the word that tells most 
about the picture. All the words used fall 
within the first 1,000 of the Gates Primary 
Word List. The first exercises are composed of 
the easy:and most commonly used words, but 
gradually become more difficult. 

Type 2, Sentence Reading, measures ability 
to read and understand sentences composed of 
words most commonly used in the primary 
grades. The test consists of 45 sentences, and 
comprehension is measured by the pupils’ 
ability to identify and mark one of several pic- 
tures which accurately illustrates the meaning 
of a given sentence. All the words used fall 
within the first 1,500 of the Gates Primary 
Word List. The sentences are so arranged that 
they gradually increase in difficulty, as deter- 
mined by the decreasing frequency and impor- 
tance of the words used and by the length of 
the sentences. 

Type 3, Paragraph Reading, measures ability 
to read thought units with full and exact un- 
derstanding. There are 26 exercises including 
one or more sentences accompanied by pictures 
or drawings. The test of comprehension meas- 
ures ability to execute accurately the directions 
in each exercise. Effort has been made to in- 
crease the difficulty of the verbal exercises 
through control of their vocabulary, sentence 
structure, and length. 

VALIDITY. No discussion of validity accom- 
panies the tests. A critical analysis of the test 
passages led the reviewer to the conclusion that 
the test exercises were reasonably valid meas- 
ures of the abilities they purported to measure. 
In a few exceptional cases, for example, Item 
41 in Sentence Reading, the picture assumed 
to be the right answer represents the meaning 


of only one word rather than the sentence as à 
whole. 


————— qe—— 
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RELIABILITY. No statistical data on the reli- 
ability of the tests are provided. The author 
rightly points out, however, that the reliability 
of the results of the tests is determined pri- 
marily by the skill of the examiner. The factors 
of greatest importance in this connection are 
the explanation of the tasks to the pupils and 
the management of the group during the test 
period. Careful directions are provided as an 
aid to the teacher in giving the tests properly. 
The reliability of the test for sentence reading 
may be reduced by the marking device em- 
ployed. Experience shows that some children 
are unable to indicate clearly whether they are 
making one, two, or three parallel lines in har- 
mony with the directions. 

norms. Tables of grade and age norms are 

given in the manual. Age and grade norms for 
each test score are provided, based on records 
from approximately 250,000 pupils in schools 
in all parts of the country. Unfortunately, no 
measure of variability is provided, excepting as 
suggested by the 75th and 25th percentile 
scores for various grade levels. 
. VALUE. It requires 50 minutes to give the 
tests, not including time for directions. They 
are easy to administer and can be used either 
for general survey or diagnostic purposes. The 
manual which accompanies the tests includes 
specific suggestions for use in making diag- 
noses, discusses at length the causes of diffi- 
culty in the types of reading measured, and 
offers suggestions for improving sentence and 
paragraph reading. Since three forms have 
been developed, they can be used in méasuring 
the progress of pupils at intervals of a semester 
or less or for checking individual progress. 


George Spache, Rohrer, Hibler and Replogle. 
New York, New York; formerly Psychologist, 
Public Schools, Chappaqua, New York. This 
group of tests includes measures of word recog- 
nition, sentence reading, and paragraph reading. 
The first two involve the association of words 
and pictures illustrating the test word or sen- 
tence. In the third, the child’s comprehension is 
measured by his ability to mark the related pic- 
tures in accordance with instructions contained 
in the paragraph. 

The tests are offered for use in the first 
grade and the first half of the second grade. 
They are rather sharply scaled in difficulty and 
discriminate well in their effective range. This 
we estimate to be from the middle of the first 


grade through the second grade. The fact that 
pupils of the first half of the first grade would 
be able to do less than 15 per cent, or only 2 
to 3 items, implies that the tests probably func- 
tion most discriminatively above the middle of 
the first grade. There also appears to be ade- 
quate ceiling for use of the tests through the 
second grade. 

The author gives no data on validity or reli- 
ability other than the comment that “the tests 
are long enough to yield reasonably reliable 
results in the individual case and highly reli- 
able scores for a class.” If this statement is 
based upon statistical investigation of the re- 
liability, there seems to be no good reason for 
withholding the data. The author suggests the 
use of two forms for “more precise individual 
appraisals” but gives no information regarding 
the known effects of practice on two forms. 
Similarly, although he recognizes the influence 
of the intellectual ability of the class upon its 
scores, all norms are expressed in terms of the 
“average” child. Use of the tests has demon- 
strated, as the author claims, that they are not 
unduly influenced by speed. The public school 
child of normal reading ability will commonly 
complete the tests in the time allowed. 

We believe it is misleading, to say the least, 
to give norms expressed in units as small as a 
hundredth of a school year, as is done here. It 
is highly doubtful whether any test now avail- 
able has the degree of reliability and discrimi- 
nation claimed by implication in these norms. 
Norms in such minute units are not justified 
for a test in which differences less than eight- 
tenths of a grade are only “fairly reliable.” 
Another objection to the norms arises from the 
common practice of teachers of believing that 
the grade scores obtained from reading tests 
indicate the actual reading levels of the pupils. 
It would be much more meaningful in this in- 
stance to give norms in terms of the expected 
performance at preprimer, primer, and first- 
and second-reader reading levels. 

Despite these minor criticisms of standardi- 
zation, the tests will serve, if interpreted cau- 
tiously, in screening and diagnostic testing to 
indicate pupils who are markedly deficient in 
the skills of sight word vocabulary, and sen- 
tence and paragraph reading. They will serve 
to rank children in the order of their effec- 
tiveness in these skills, an order roughly cor- 
responding to their classroom performance. 
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Gates Reading Survey for Grades 3 to 10: Vocab- 
ulary, Level of Comprehension, Speed, and Accu- 
racy. Grades 3-10; 1939; Forms I, 11; $6.85 per 100; 
254 per specimen set; (60-90) minutes; Arthur I 
Gates; Bureau of Publications, Teachers College, Co- 
lumbia University. 
REFERENCES 
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Dorothy E. Holberg, Elementary Supervisor 
and Reading Clinician, Alexis I. duPont Spe- 
cial School District, Wilmington, Delaware. 
After three years of using this test in grades 
3 through 6, the reviewer has reached the fol- 
lowing conclusions and suggestions: 

(a) The printed underlining of the correct 
answers in the Vocabulary Test, Level of Com- 
prehension Test, and Speed Test should be 
omitted in order that the student may practice 
the directions of drawing a line under the cor- 
rect word before beginning the test. This 
would enable the examiner to check on the 
child’s ability to follow directions accurately 
before beginning the test. 

'(b) Include in the oral directions for the 
Vocabulary Test the statement that the stu- 
dent must not work beyond page 3. Before be- 
ginning the Level of Comprehension Test, the 
examiner should orally stress the fact that the 
pupil must not work beyond page 7. Printed 
directions to stop work are included at the end 
of each test, but this does not necessarily im- 
press all students. 

(c) When used with elementary school stu- 
dents, an additional explanation might be psy- 
chologically sound. The students should be 
informed that this test is used with tenth grade 
pupils as well as third grade pupils. There may 
be many parts that they will be unable to read ; 
however, they should work as many as pos- 
sible. This will tend to eliminate guessing as 
well as signs of frustration. 

(d) In order to insure a feeling of success, 
it might be advisable to administer the Level of 
Comprehension Test first. Children are gen- 
erally more successful in reading the Level of 
Comprehension Test because they are aided 
by context clues, whereas in the Vocabulary 
Test the words are in isolation. 

(e) The directions for both the vocabulary 
and the level of comprehension tests fail to in- 
form the student that these tests are not speed 
tests. Test-wise students tend to conclude that 
all standardized tests are speed tests, and one of 


the most valuable factors of these power tests 
is lost unless this misconception is corrected. 

(f) In power tests such as Tests I and II, 
students who finish quickly may distract those 
still working. For the benefit of classroom 
teachers who might administer these tests it 
would be advantageous to include in the man- 
ual of directions several teaching techniques to 
avoid the above situation. 

(g) The size of the print utilized is unde- 
sirable for use with elementary school children. 

(h) This test may be used for analysis on 
the college level if a reading difficulty has been 
indicated by other tests such as college en- 
trance examinations. 

(i) When compared with informal reading 
testing techniques and teachers' opinions of 
reading ability, this test overestimates a pupil’s 
ability to read and tends to place him above his 
true instructional level in reading. 

(j) It is the writer’s opinion that this is the 
most valuable survey-type reading test at the 
present time. Where further diagnosis is de- 
sired, it may be supplemented by using the 
Gates Basic Reading Tests. Failure by a re- 
tarded reader on the Gates Survey Test may 
be handled by administering the Gates Ad- 
vanced Primary Reading Tests, the Gales 
Primary Reading Tests, or the Gates Reading 
Readiness Test as the case may be. It is sug- 
gested that a test similar to the Durrell—Sulli- 
van Reading Capacity Test be prepared by Dr. 
Gates. This would enable a reading clinician 
to determine a student’s retardation in reading 
by comparing his reading capacity score, i.e., 
hearing comprehension level, with that of his 
actual reading achievement level as provided 
by this test. This would give a complete pic- 
ture of a student’s reading achievement, status 
within a group, reading strengths and weak- 
nesses, as well as data as to whether or not he 
is working to full capacity in reading. 


Herbert F. Spitzer, Associate Professor of 
Education, The State University of Iowa, Iowa 
City, Iowa. According to the manual, this test 
was developed to complement the Gates Basic 
Reading Tests. This survey test purports to 
measure vocabulary, level of comprehension, 
speed, and accuracy. Two scores are obtained 
from the speed test, one for speed and one for 
accuracy. 

The vocabulary and level of comprehension 
tests are power tests and therefore do not have 
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fixed limits. However, the directions for ad- 
ministering suggest 20 to 30 minutes for the 
vocabulary and approximately 30 minutes for 
the comprehension test. The speed test re- 
quires 10 minutes for grades 3, 4, and 5, and 
7 minutes for the grades 6 and up. 

In the vocabulary test the subject is to se- 
lect and underline a word similar in meaning 
to a key word. In the comprehension test, one, 
two, or three key words are omitted from a 
short paragraph. The subject is to indicate by 
underlining which of five words presented 
would make the best sense when used in the 
appropriate blank of the paragraph. In the 
speed and accuracy test the subject reads a 
short paragraph and is then asked a question 
which can be answered by underlining a single 
word, four of which are presented. 

The directions for this survey test are sim- 
ple, although containing a few such indefinite 
statements as “allow 30 minutes more or less.” 
Directions of that type are frequently a cause 
of confusion with some teachers, especially 
when no directions are given regarding what 
children who finish are to do. " p 

The subjects should have little difficulty in 
marking the test. A minor inconsistency is 
found in directions for the speed test. The sub- 
ject is directed to draw a line under the word 
that best answers the question. In some in- 
stances, however, no question is asked, but an 
incomplete statement such as “These dogs are 
very ——” is used. Adequate sample exercises 
are provided for each section of the test. 

A key for scoring consisting of the correct 
words is provided. Unfortunately this scoring 
key requires comparison of the subject’s under- 
lined word and the key. Since the test requires 
only the underlining of words, a faster and 
more accurate method of scoring could be pro- 
vided by use of a separate answer sheet and a 
stencil-type key. Even if no answer sheet were 
used, a stencil-type key for the booklet, which 
would permit the scorer to count marks instead 
of having to compare, would be a marked im- 
provement. Time and effort saved in scoring 
a test is very important, for that time and 
effort may be used in instruction based on the 
test results. Too often teachers are so ex- 
hausted from scoring a test that they have 
little desire to do anything about the results 
they have obtained. 

The manual of directions for the Gates 
Reading Survey contains a brief but compre- 


hensive discussion of each of the parts of the 
test, a discussion of the function of the test, the 
interpretation of results, data on reliability, and 
the like. A section on instruction and remedial 
work in reading indirectly presents rather sig- 
nificant commentaries on the importance of the 
areas of reading measured. The following is an 
illustration: *A limited vocabulary may be due 
to low intelligence, to meager general experi- 
ences, and to little reading of varied materials 
of real substance.” In other words, the author 
wisely warns the test user to refrain from con- 
cluding that a low score is due to a poor read- 
ing program. 

The suggestions for improvement of reading 
in the manual are not extensive and not too 
encouraging to teachers or administrators who 
are seeking a definite remedial program. The 
following statement regarding comprehension 
is illustrative: “The only lasting cure for a low 
level of comprehension consists in getting the 
pupil to find and read materials of real interest 
and suitable difficulty.” 

The correlations between different forms of 
the test, which according to the manual are not 
so high as reliability coefficients reported on 
many tests, are high enough to be entirely satis- 
factory and might even be considered high. The 
range of correlations is from .82 to .92. These 
high correlations are not surprising when the 
nature and length of the tests involved are con- 
sidered. For instance, the vocabulary test con- 
sists of 85 items of the same type. 

Most of the strong features and minor limita- 
tions of the Gates Reading Survey have been 
presented in the preceding discussion. How- 
ever, a few other points should be noted. An 
excellent general bibliography is included in 
the manual. The testing technique, the actual 
procedures used, do not resemble very closely 
or approximate normal reading. Perhaps this 
lack of agreement with the reading process is 
not a limitation, but the presentation of para- 
graphs with key words omitted is seldom en- 
countered in everyday reading situations. The 
speed-of-reading test in the Gates Reading 
Survey, like most speed tests, is a measure of 
both rate ánd comprehension, and cannot, 
therefore, be considered solely as a rate score. 

Many test users have found tests designed 
for only grades 3, 4, and 5 rather difficult for 
grade 3, and occasionally such tests have in- 
sufficient range to measure adequately at even 
as low a level as fifth grade. In view of the 
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preceding facts it is difficult to see how the 
Gates Reading Survey can be used. effectively 
over as wide range (from middle third through 
eighth grade) as is assigned to it. 

While the manual for this test clearly states 
that this test was developed as a complement 
to the Gates Basic Reading Tests, its use is 
not limited to such a combination. It does pro- 
vide, as the title indicates, a survey test, and 
if the user keeps in mind its limitations, its 
use should be valuable in instruction. 


488 
High School end rece: ew Achievement 
Tests. Grades 7-12; 1939-45; Forms A, B; $2.50 per 
25; I5 € per specimen set; nontimed (40) minutes; 
Robert K. Speer and Samuel Smith; Acorn Publish- 
ing Co. 

Robert L. McCaul, Instructor and Adviser in 
the College, The University of Chicago, Chicago, 
Illinois. Presumably the High School Reading 
Test is a publication in separate form of the 
reading section of the National Achievement 
Tests for the secondary school level. What was 
said of the whole elementary battery in The 
Nineteen Forty Mental Measurements Year- 
book (see 40:1191) can probably be applied, 
mutatis mutandis, to the test being considered 
here. If the preceding sentences seem vague 
and noncommittal, the reason is that no in- 
formation about the test’s origin, construction, 
and standardization is supplied to the user, He 
gets a single sheet containing directions for 
administration and scoring, columns for re- 
cording class scores, and a set of norms. Con- 
sequently, no thorough evaluation of the test 
can be made. 

Although the test is divided into five parts 
—vocabulary, word discrimination, sentence 
meaning, noting details, and interpreting para- 
graphs—there are norms only for the total 
score. It is impossible, therefore, to determine 
whether variations in the student’s scores on 
the subtests are produced by differences in the 
difficulty of the subtests or by differences in 
the quality of his reading skills. The authors 
are aware of this flaw and suggest that the 
student’s subtest performance be measured in 
terms of the accomplishment of his particular 
class. Such a solution does no more than endow 
the test with a spurious diagnostic character, 
for the meaning of the group’s achievement upon 
each of the subtests is just as uncertain as the 
meaning of the individual’s achievement. 

The test reveals care in construction and 


expression. There are, of course, minor flaws 
which a captious critic could pounce upon and 
worry to an extent out of all proportion to 
their significance. In the sentence meaning 
part of the test, for example, the longest of the 
possible responses is usually right. This is true 
of 10 of the 15 sentence-meaning items in Form 
A and of 9 of the 15 in Form B. Again, some- 
times two of the answers to word-discrimination 
items could be correct; Item 11, Form A, for 
instance, is “I can devise no arguments 
with which to influence your decision,” followed 
by subtle, salacious, important, and revolting. 
The right answer according to the key is subile, 
but is not important as good? 

The defects pointed out in the previous para- 
graph are outweighed by the conveniences that 
the test possesses. It is a power test with no 
arbitrary time limits. Norms cover a range ex- 
tending from grade 6 through the sophomore 
year in college. No scoring key is required—an 
item is correct if it is designated by a letter 
in a code word like master. Because the test 
does have these advantages, and the three men- 
tioned are only samples, it is to be hoped that 
the authors will in the near future provide 
norms for each of the subtests. They should 
also furnish the user with more information 
about the test. 
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Iowa Silent Reading Tests: New Edition, Re- 
vised. Grades 4-9, 9-13; 1927-43; IBM for grades 
9-13; Forms AM (Rev.), BM (Rev.), CM, DM; 2 
levels; H. A. Greene, A. N. Jorgensen, and V. H. Kel- 
ley; World Book Co. 

a) ELEMENTARY TEST. Grades 4-8; 1933-43; $1.60 per 
25; 35¢ per specimen set, postpaid; 49(60) minutes. 

) ADVANCED TEST. Grades 9-13; 1927-43; IBM; sepa- 
rate answer sheets need not be used; booklets may be 
used once without answer sheets and thereafter with 
answer sheets ; $2.20 per 25 ; $1.35 per 25 machine-scor- 
able answer sheets; 6o¢ per set of machine-scoring 
keys; 35¢ per specimen sct, postpaid; 45(60) minutes. 


REFERENCES 
1-6. See 40:1: 


7. 
7. Hotcown, G. W., anp Lastert, H. R. “A Prognostic Study 
of Engineering Aptitude.” J Appl Psychol 16:107-15 Ap '32 * 
eA 4722) n 
,B. Sturt, Dewey B. “Differential teristics of Supe- 
rior and Interior Students?” Sch & Soe 46:7336 Da 37." (od 
9. Stuit, Dewey B., AND Donnetry, Mary CarrorL, ""Per- 
omncs in the Iowa Qualifying Examination of Majors in 
Various Academic Departments With Implications for Counsel- 
ing.” J Exp Ed 8:293-9 Mr "40. * (PA 14:4754) 
SATIRE ene END aS Water F. "Read- 
y elated to it iz yi H 
387-96, Ap ae (a IRIS chievement.” J Psychol 11 
;, LANGSAM, ROSALIND STREEP. “A i lysis of 
Reading Ability.” J Erp Ed 10:57.63 S'ar Pt emu) À 
12. PANKASKIE, MancanET. Factors in Reading Achievement 
at the College Level. Unpublished doctor's thesis, University of 
EIS (Doctoral Dissertations . . . 1940 and 1941, 1944, 
13. Sarre, JossPHiNE M. “The Prognostic Value of En- 
trance Tests in a Junior College." y 1584-92 | 
iene DOES lege.” J Ed Psychol 32:584-92 N 


$17] TESTS & REVIEWS: READING 


489 


14. Rarnter, RuTH NrrzLv; REHFELD, FLORENCE WASHBURN; 
AND Manican, Martan E. “The Use of Tests in Guiding Student 
Nurses," Am J Nursing 42:679-82 Je '42. * (PA 16:5029) 

E WirkiwG, S. Vincent. “Do Our Reading Tests Test the 
Right Words?" J Ed Res 36:35-9 S '42. * (PA 17:709) 

16. MonGAN, C. L., AND STEINMAN, C. C. “An Evaluation of 
a Testing Program in Educational Psychology." J Ed Psychol 
34:495-502 N 585 * (PA 18:1891) : 

17. ADAMS, ILLIAM MICHAEL. “Prediction of Scholastic 
Success in Colleges of Law: II, An Investigation of Pre-Law 
Grades and Other Indices of Law School Aptitude." Ed & Psy- 
chol Meas 4:13-9 spaa * (PA 18:3271) 

18, BLOMMERS, PAUL, AND LiNDQUIsT, E. F. “Rate of Com- 

rehension of Reading: Its Measurement and Its Relation to 
omprehension." J Ed Psychol 35:449-73 N '44. * (PA 19: 


1035 

n Preiscue, WarrER A. The Relationship of Certain Meas- 
urable Factors to Success in Secondary-School Physics. Unpub- 
lished doctor's thesis, New York University, 1944. (Abstracts of 
Theses . . . [School of Education] 1944, pp. 217-21.) 

20, SMITH, Henry Lester, AND EATON, Merritt T. Analysis 
of the Proficiency in. Silent Reading of 15206 Sisth Grade 

'upils in 648 Schools in Indiana. Bulletin of the School of Ed 
cation, Indiana University, Vol. 21, No. 6. Bloomington, In 
Indiana University Bookstore, November 1945. Pp. 47. Paper. 
$0.50. * (PA 20:2912) 

21. Tinker, Mies A. "Rate of Work in Reading Perform- 
ance as Measured in Standardized Tests.” J Ed Psychol 36:217- 
28 Ap '45. * (PA 19:2380) 

, 22. TOWNSEND, AGATHA, “A Study of the Revised New Edi- 
tion of the Iowa Silent Reading Tests," pp. 31-9. In 1944 Fall 
Testing Program in Independent Schools and Supplementary 
Studies. Educational Records Bulletin, No. 42. New York: Edu- 
cational Records Bureau, January 1945. Pp. x, 49. Paper, litho- 
typed. $1.00. * (PA 19:1811) 

23. Kitsy, Ricuarp W, “Relation of Iowa Silent Reading 
Test Scores to Measures of Scholastic Aptitude and Achieve- 
ment.” J Appl Psychol 30:399-405 Ag '46. * (PA A) 

,24. Puc, Grapvs S. "Summaries From ‘Appraisal of the 
Silent Reading Abilities of Acoustically Handicapped Chil- 
dren.’ " Am Ann Deaf 91:331-49 S '46. * (PA 21:290) 

25. SuirH, Henry LESTER, AND EATON, MERRILL T Analysis 
of the Proficiency in Silent Readin; d 11,424 Sophomore Hd 
in 243 High Schools $n Indiana. Bul etin of the School of Edu- 
cation, Indiana University, Vol. 22, No. 1. Bloomington, Ind.: 
ans University Bookstore, January 1946. Pp. 46. Paper. 

0.50. 

26. "A Note on the Correlation Between the Iowa and Co- 
operative Reading Tests," p. 66. In Dn Achievement Testing 

rogram in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 48. New York: Educational 
ere Bureau, June 1947. Pp. xii, 66. Paper, lithotyped. 

2.00. 

, 27. KELLEY, Victor H., AND Greenz, Harry A. Better Read- 
ing and Study Habits. Yonkers, N. Y.: World Hook Co., 1947. 
Pp. iv, 73. Paper. $0.25. * 


Frederick B. Davis, Professor of Psychology, 
George Peabody College for Teachers; and 
Director, Test Research Service; Nashville, 
Tennessee. The validity of the total score de- 
rived from the Jowa Silent Reading Tests as a 
measure of reading ability depends on the extent 
to which the subtests adequately sample the im- 
portant skills involved in reading ability of the 
work-study type (which is the announced objec- 
tive, according to the manual of directions). A 
conscientious effort has been made to measure 
the skills in reading considered most important, 
and the reviewer is inclined to believe that the 
total score is as valid a measure of the work- 
study type of reading ability as most reading 
tests are. The tests do not, and do not pretend 
to, measure the more subtle aspects of compre- 
hension that are measured in some degree by 
the Cooperative reading comprehension tests. 

The writer would be better satisfied with the 
total comprehension score if the subtests for 
rate of reading, the use of the index, and the 


selection of key words were omitted from its 
computation. Rate of reading is an almost 
meaningless concept because an individual's 
rate of reading depends on his immediate pur- 
pose and the difficulty of what he is reading. In 
the Iowa Silent Reading Tests, the directions 
for the rate-comprehension test say, “This is a 
test to see how well and how rapidly you can 
read silently. Read the story below very care- 
fully so that you can answer questions about 
it.” The testee has no idea what kind of ques- 
tions. Penetrating? Superficial? Some testees 
will rush through and take a chance with the 
questions. Others will play it safe and digest 
what comes along. The point is that they will 
not all set out to do the same thing because 
the directions are not sufficiently detailed and 
specific to evoke a closely similar mind-set in 
each testee. Much more definite instructions 
would have to be provided in order to stand- 
ardize the pupils’ purposes. 

The reviewer tried the 35 comprehension 
items in Test 1 of Form CM without first 
reading the two passages on which they are 
based. He got 23 of the 35 items correct on 
the basis of his general information about cork 
and city governments. This is better than 
chance would permit since, if he had marked 
the answer sheet without reading either the 
passages or the items, it would have been most 
reasonable to have expected him to get 12 cor- 
rect, or a standard score of 133. This is evi- 
dence that the so-called comprehension score 
involves quite a bit of general information. In 
all fairness, one should point out that this is 
a failing of most tests of comprehension in 
reading. 

The reviewer’s main criticism of the content 
of the test is that it never forces a testee to 
grapple with a difficult passage, to weave an 
author’s ideas together, to understand his over- 
all thought. Every part gave the reviewer the 
feeling of superficiality, of pecking at the mean- 
ing without ever being required to think hard, 
The reviewer is prepared to defend the state- 
ment that the Iowa Silent Reading Tests never 
get at the inner recesses of comprehension and 
that their use must tend to reward unduly the 
rapid, superficial reader who is in the habit of 
skimming along and parroting a phrase or two 
of the writer if any one presses him regarding 
the content of what he has read. 

In the manual of directions various types of 
data about the tests are provided. Reliability 
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coefficients obtained by means of the split-half 
Spearman-Brown procedure and Kuder-Rich- 
ardson formula No. 21 are presented. The 
fact is that meaningful reliability coefficients 
cannot be obtained for speeded tests, like the 
lowa Silent Reading Tests, by either of these 
two means. The best way to determine the re- 
liability coefficient (or better, the standard 
error of measurement at each of several levels 
on a standard-score scale) of any speeded test 
is to administer not less than three forms of the 
test to each one of a sample of testees in a 
single grade level. The intercorrelations of all 
three (or more) forms can then be used to get 
the reliability coefficient of each form sepa- 
rately and not the reliability coefficient of the 
average of a pair of forms (as is commonly 
done). This is getting reliability coefficients 
the hard way, but the reviewer is inclined to 
believe that if they are not worth the labor in- 
volved in this procedure they are not worth 
computing at all, 

In summary, the Jowa Silent Reading Tests 
are conscientiously planned tests of the work- 
study type of reading. The total reading score 
gives some weight to tests of three mechanical 
skills: speed of reading, use of an index, and 
selection of key words. The reviewer does not 
know how much, if any, weight should be given 
to these three skills in a total reading score; 
but he prefers a comprehension score that does 
not include them. His major criticism of the 
Towa Silent Reading Tests is that they appear 
to reward the pupil who reads superficially and 
who parrots glibly the reading material pre- 
sented, 

A genuine effort has been made to provide 
helpful, accurate, and complete information in 
the manual, It is unfortunate that reliability 
coefficients obtained by means of the split- 
half Spearman-Brown procedure and Kuder- 
Richardson formula No. 21 are presented with 
no warning that meaningful reliability coeffi- 
cients cannot be obtained for highly speeded 
tests, such as the Iowa Silent Reading Tests, 
by either of these two means. 


William W. Turnbull, Secretary of the Board 
and Head of the Test Construction Depart- 
ment, College Entrance Examination Board, 
Princeton, New Jersey.-The purpose of the 
test is to measure rate of reading; comprehen- 
sion of words, sentences, and passages read; 
and ability to use skills required in locating 


information. The Advanced Test has seven 
subtests and yields nine scores in 45 minutes 
of testing time (not including time to adminis- 
ter), the Elementary Test has six subtests, 
giving eight scores in 49 minutes. The tests at 
both levels include measures of the following : 
(a) rate of reading, (b) vocabulary, (c) un- 
derstanding of sentences, (d) paragraph com- 
prehension, (e) “directed reading,” and (f) 
ability to use a simple index. The Elementary 
Test includes also a test of ability to alpha- 
betize; while the Advanced Test measures 
ability to select words under which information 
about a given question might be found in an 
index. A further addition to the Advanced Test 
is a measure of comprehension of poetry. ln 
this section the technique is similar to that in 
the “directed reading” subtest: the student is 
to identify the parts of a passage which answer 
certain specific questions. 

scorING. Both the Elementary and the Ad- 
vanced Tests are scored objectively. The Ele- 
mentary Tests have the answers indicated (for 
stencil scoring) in the test booklet, while the 
Advanced Tests can be given in this manner or 
with separate answer sheets for scoring by 
hand or by machine. Each subscore is converted 
to a standard-score scale, and the median of 
the subscores provides the total test score. 

VALIDITY. From all appearances the test 
should provide a valid measure of reading 
ability. Unfortunately the authors rest their 
case after drawing attention to this fact. No 
validity coefficients are reported, which to this 
reviewer seems inexcusable. 

RELIABILITY. The total score for each form 
has a reliability coefficient (computed separately 
for each grade level by Kuder-Richardson 
formula No. 21, using over 1,000 cases in all 
grades but one) of .94 or .95, with a probable 
error of measurement of 2 points on a scale 
which has a standard deviation of 13 to 14 for 
each grade. Thus, the reliability of the total test 
is high. The coefficients for the subtests range, 
in general, from .70 to .85 for the Advanced 
Test (probable error of measurement from 5 
to 8 points) and from .70 to .9o (probable 
error of measurement from 4 to 7 points) for 
the elementary level forms. Thus, limited reli- 
ance can be placed on the subtest scores in in- 
dividual cases, although their reliability is ade- 
quate for group comparisons, 

NORMS. The test has been standardized on 
a national sample of between 1,500 and 2,000 
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students at each grade level. Percentile norms 
are given for standard scores on all subtests 
and on the total test. The standard-score scale 
for the Advanced Test is continuous with that 
for the elementary forms, so that direct com- 
parison from one score level to the other is 
possible. Tables are provided for the purpose 
of equating scores on this test with those on 
the 1939 edition. 

MANUAL. The directions for giving and 
scoring the tests are clear and complete. The 
manual contains relatively full information on 
reliability, standardization, and interpretation 
of scores, very little on construction (e.g., the 
level of discrimination below which items were 
rejected or revised) ; and, as already noted, a 
clearly inadequate treatment of validity. 

EVALUATION. Examination of this test sug- 
gests that its position of leadership in the field 
is justified. The material, while perhaps some- 
what artificial and academic, is in general ap- 
propriate and seems to have been carefully 
prepared and edited. Standardization appears 
to have been carried out thoroughly on large 
samples. The availability of four parallel forms 
at the two levels is an important advantage 
where measurement is to be repeated. No data 
are given on the proportion of students finish- 
ing the various subtests, but it is probable that 
the factor of reading rate enters into all sec- 
tions of the test, which affects the interpreta- 
tion of the scores. The comparatively low re- 
liability of individual sections (inevitable in a 
test of its length) limits its usefulness in diag- 
nosing areas of weakness in individual per- 
formance. The test should, however, be found 
entirely satisfactory for identifying students in 
need. of remedial work, for securing class 
norms, or as an aid in sectioning classes. 


Clearing House 19:322 Ja '45. E(arl) R. 
G(abler). * an excellent instrument if properly 
used, for the improvement of instruction. 

Teach Col J 12:69 Ja '41. Margaret Pan- 
kaskie. * A new edition....is welcomed * The test 
of rate of reading is doubtless an improvement 
over the rate test in the earlier edition. * A 
unique feature of the Iowa tests (both old and 
new editions) has been the tests of skills required 
in locating information. This factor is usually 
omitted in reading tests. The test as a whole has 
satisfactorily high reliability. What should be 
desirable is a measure of the reliability of each 
of the subtests. It is yet to be proven that read- 


ing may actually be divided into the number 
and kinds of skills which are indicated by the 
authors. If it is true that the seven tests repre- 
sent separate skills, then each should be tested 
separately, with the reliability of each test ascer- 
tained independently. If there is much over- 
lapping in the skills tested, it would seem that 
a test with fewer or no sub-tests would be quite 
as good a test and would in addition be simpler 
to score. One also wonders, if reading is to be 
separated into various independent skills, why 
the reading of technical or scientific material 
was omitted. 


For reviews by Ivan A. Booker and Holland 
D. Roberts, see 40:1547. 
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Lee-Clark Reading Test—Primer, 1943 Edition. 
Grade 1; 1931-43; Forms A, B; $1.20 per 25; 35¢ per 
specimen set, postpaid; (15) minutes; J. Murray Lee 
and Willis W. Clark; California Test Bureau. 


Ruth Lowes, Associate Professor of Educa- 
tion, The West Texas State Teachers College, 
Canyon, Texas. While most primary reading 
tests are designed for use in from one to three 
grades, this one is designated as a primer test to 
be given in the'middle of the first school year. The 
Gates vocabulary list was used as a guide in the 
selection of words, 80 per cent of the different 
words being found in the first five hundred of the 
Gates list. Because of the way in which the words. 
were. selected, the authors claim that their tests 
“provide as valid a measure of reading ability as 
is available for primary children." No other 
proof of validity is offered. The coefficients of 
reliability for the total test are .909 (Form A 
with Form B) and .952 (total reliability if both 
forms are used); scores from 232 first grade 
pupils were used in calculating these coefficients. 

Part I of the test measures the pupil's response 
to auditory stimuli and consists of fifteen rows 
of words, four words to the row. The pupil un- 
derlines the word pronounced by the examiner. 
Part 2 tests the pupil's ability to recognize eleven 
words. He draws a line from the word to an 
appropriate picture. Three minutes is the time 
allowed for this part. In Part 3 the ability to 
follow directions is measured by having the 
pupil mark and color certain objects pictured. 
Time is allowed for approximately 9o per cent 
of the class to finish. 

The test can be given at one sitting. Scoring is 
easy. Both grade and age norms are given, but 
the number of pupils used in establishing the 
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norms is not indicated. A table of percentile 
norms, based on the results obtained in the first 
grades in two city school districts, is given. 
Again the number of pupils tested is not indi- 
cated. A brief paragraph of directions for con- 
verting raw scores into percentile ranks will 
prove helpful to teachers who have had no spe- 
cial training in interpreting test results. 

There are several mechanical features of this 
test which merit comment. The blanks on the 
front page call for a minimum amount of data, 
and the spaces are wide enough so that most 
of the pupils can write in the information. Parts 
1 and 2 each occupy only one page of space. The 
left-hand page in each case is blank. This feature 
will be appreciated by first grade teachers who 
usually have to help pupils fold the booklet and 
place it in position so that the correct page is 
ready for use. Page numbers, large enough for 
the pupils to read, might prove helpful. 


[ 490a ] 

Metropolitan Achievement Tests [Reading]. 
Grades 3-4, 5-6, 7-9.5; 1933-47; Forms R, S; 35¢ per 
specimen set of any one level, postpaid; 35 (45) min- 
utes; Richard D, Allen, Harold H. Bixler, William L. 
Connor, and Frederick B. Graham; World Book Co. 
@) ELEMENTARY READING TEST. Grades 3-4; $1.60 per 
25. 

b) INTERMEDIATE READING TEST. Grades 5-6; $1.40 per 
25. 

C) ADVANCED READING TEST. Grades 7-9.5; $1.40 per 25. 


For reviews by Ivan A. Booker, Joseph C. 
Dewey, and D. A. Worcester of an earlier edi- 
tion, see 38:1105 and 40:1551. For reviews by 
Jack W. Dunlap, Richard Ledgerwood, Charles 
W. Odell, E. V. Pullias, and Hugh B. Wood of 
an earlier edition of the total battery, see 38:874 
and 40:1189. 


[ 491 ] 
Minnesota Reading Examination for College Stu- 
dents. Grades 9-16; 1930-35; Forms A, B ; $6 per 100; 
35f per specimen set, postpaid; 46(55) minutes; Mel- 
vin E. Haggerty and Alvin C. Eurich; University of 
Minnesota Press. 
REFERENCES 

ae ae 40'2584. COE 

A, Eurtcu, ALVIN C. An Experimental Stud. i 
Abilities of College Students. Unpublished doctor's Queste ane 
versity of TOADS Conk Method for M x 

5. Euricn, Arvin C, “ t] i ion ij 
Reading.” J Ed Res 24:2028 3r. T (PA Gane] eee i 

. LANGSAM, ROSALIND STREEP, “ torial if 
Reading Ability.” J Exp Ed 10:57:63 S 4u,* (PA tone) Of 


James M. McCallister, Director of Personnel 
Service, Herzl Branch, Chicago City Junior 
College, Chicago, Illinois, This test is designed 
to measure vocabulary and power of compre- 
hension. The section devoted to vocabulary con- 
sists of 100 words, each of which is followed by 


four alternatives. The student selects the alter- 
native that is the best definition of the word. 
Since few students will be able to finish the 
vocabulary section in the time allowed, it is an 
indirect test of speed as well as a measure of 
vocabulary. 

The section that measures power of compre- 
hension consists of ten paragraphs followed by 
directions for interpretation. These directions 
are varied so as to include sentence completion, 
selection of true statements, and selection of 
false statements. A careful comparison of the 
paragraph content with the questions reveals 
the measurement of several aspects of reading 
ability. The changing instructions require ability 
to follow directions. The various directions re- 
quire, in addition to fact selection, some organ- 
ization of content, occasional formulation of 
judgments and conclusions, and considerable 
comparison of ideas. The authors of the test 
have found correlations of .48 and .54 between 
the two parts of the test. They interpret these 
correlations to indicate that the two parts of the 
test measure the same psychological functions. 
It appears evident from the analysis mentioned 
above and these correlations that the test meas- 
ures a number of the mental processes involved 
in reading. As the test yields only a single score 
for power of comprehension, it is in no sense 
diagnostic but is a composite measure of these 
various aspects of reading. The time limit in the 
second section of the test is such that nearly all 
students will complete it before time is called. 
Therefore, it is primarily a test of power of 
comprehension independent of speed of reading. 

The validity of the test as a reading examina- 
tion is implied by the analysis in the preceding 
paragraphs. The validity of Form A has been 
studied further by correlating the reading scores 
with scores on intelligence tests and with meas- 
ures of scholastic achievement. The correlations 
obtained with the Minnesota College Ability 
Test, the Miller Mental Ability Test, and the 
Miller Analogies Test vary from .52 to .83. In 
each case the correlation between vocabulary 
and intelligence was higher than that between 
reading comprehension and intelligence. The 
relationship of the test to scholastic achievement 
was examined by correlating it with ratings of 
students in educational psychology and with 
average achievements in all courses taken dur- 
ing one quarter. These correlations were .52 
and .46, respectively. The authors express the 
conclusion that the reading test is as predictive 
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of college achievement as most tests of general 
intelligence. 

Reliability measures are reported in the test 
manual for Form A only. First, the odd and the 
even numbered items in the test were compared 
for a group of 283 high school seniors. The cor- 
relation coefficients were .928 for Part I, and 
.693 for Part II. Second, the test was repeated 
with a group of 216 college juniors. The reliabil- 
ity coefficient obtained from this group for Part I 
is 912; for Part II, .780; and for the test as a 
whole, .865. These correlation coefficients indi- 
cate that the test is adequately reliable for group 
measurements but that it should be used with 
caution when applied to individuals. 

Form A and Form B were not administered 
to the same population groups in the process of 
standardization. Form A was administered to 
889 college sophomores, juniors, and seniors in 
1929 and 1930. Form B was administered to 
slightly more than 1,000 sophomores, juniors, 
and seniors in 1933 and 1934. Percentile ranks 
and scale scores for these two groups reveal 
only slight differences at various points in the 
distributions. In so far as these two population 
groups were comparable, approximately similar 
scores may be expected from the use of the two 
forms. 

Percentile ranks are published in the manual 
for high school seniors and for university fresh- 
men, sophomores, juniors, and seniors. Conse- 
quently, the test is best adapted for groups at 
these levels. The arrangement of the test items 
and the scoring key makes it relatively easy to 
score. Since it does not provide a direct measure 
of rate of reading, it must be accompanied by 
another test if a speed score is desired. 


For reviews by W. C. McCall and Ruth 
Strang, see 38:1106 and 40:1554. 


[ 492 ] 
Nelson Silent Reading Test: NEUES and 
Paragraph: Clapp-Young Selí-Marking Tests. 
Grades 3-9; 1931-39; IBM for Forms A, B; Forms 
A, B, C; separate answer booklets must be used ; $1.98 
per 25 sets of test and answer booklet ; oo¢ per 25 hand- 
scored answer booklets; 12¢ per specimen set, post- 
paid; $1.35 per 25 machine-scored tests; 75¢ per 25 
machine-scorable answer sheets; 40¢ per set of ma- 
chine-scoring stencils; 30(40) minutes; M. J. Nelson; 
Houghton Mifflin Co. 
REFERENCES 


1. Grant, Azert. "Results of Nelson Silent Reading Test 
in Grade IX." Sch R 48:349 Ja '40. * (PA 14:2604) 


Constance M. McCullough, Assistant Pro- 
fessor of Education, San Francisco State Col- 


492 


lege, San Francisco, California. This test con- 
sists of a 10-minute vocabulary test of 100 words 
from the Thorndike and Horn lists, and a 20- 
minute, 25-paragraph test of three types of com- 
prehension: general significance, details, and 
prediction of outcomes. The first 22 paragraphs, 
mostly original, contain only words appearing 
on one of the lists mentioned above. A Clapp- 
Young marking booklet provides simplified, 
rapid scoring. Correlations with other tests have 
run around .8. Reliability coefficients on scores 
for different forms of the test have been around 
.9. Norms based upon 41,000 cases from various 
parts of the country present median scores for 
various grades, and grade and age equivalents 
for raw scores. A study by Grant (1) of over 
3,000 ninth grade pupils yielded median scores 
agreeing with the ninth grade norms. 

The teachers' manual describes a method of 
obtaining part scores for the types of compre- 
hension in the paragraph test, and suggests 
remedial measures. It gives in addition a list 
of helpful references on reading aids. 

The vocabulary test contains useful words of 
increasing difficulty. Five multiple-choice an- 
swers are sometimes synonymous with the 
words tested; at other times, descriptive of 
function or attribute. This variation in tech- 
nique requires continual mental adjustment by 
the pupil. Some of the words tested are easier 
than the answers, so that the test becomes a 
matter of recognizing the form and meaning of 
the answer rather than of the original word: 
“Workers receive launches, ships, hospitals, 
wages, levels.” Occasionally two answers are 
so related as to be eliminated from considera- 
tion as possible right answers: “A banker 
works with stages, boards, checks, saws, lead.” 
Like many vocabulary tests on the market, 
this test contains too few easy words to make 
it very useful or pleasant for its youngest read- 
ers. It reveals little about third graders of less 
than average ability. The ceiling, on the other 
hand, is challenging for ninth graders. 

The paragraph test contains narrative mate- 
rials with a personal tone. It deals largely with 
feelings, situations, and motives. The para- 
graphs are well written for the purpose of test- 
ing the three types of comprehension. Like the 
vocabulary test, however, this test is utterly 
discouraging to an average third grader. Ac- 
cording to the Lorge formula, the first para- 
graph in Form A is of 3.95 grade difficulty ; 
in Form B, 4.4 grade difficulty. The samples 
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are equally discouraging. In Form A the sen- 
tence length averages from about 7 to 30 words 
per sentence per paragraph. But the first para- 
graph in Form B averages 11 words per sen- 
tence. Thus in certain respects the forms are 
scarcely comparable. 

For some curious reason the authors have 
mixed the order of types of questions so that 
they appear in different order for each selec- 
tion. Considering the importance of setting 
purposes for reading and considering the fact 
that the sample already prepares the pupil for 
the types of question, it seems odd to offer a 
‘baffling lack of consistency to the pupil and 
unnecessary difficulty in scoring to the teacher. 

The test, excellent in concept and carefully 
developed, is unattractive in format, even un- 
pleasant to read, with small type, crowded lines, 
and poor paper. In its present form the younger 
readers should not be required to read it. A 
test as admirably constructed as this test de- 
serves a better appearance. 

It is to be hoped that the authors will extend 
the test downward in difficulty to accommodate 
the younger readers, rather than continue to 
disappoint the third grade teacher who accepts 
the publisher's claim that the test is suitable 
for grades 3 to 9. 

This is not a test of the pupil's ability to 
read science and social-study textbooks. It is 
a test of general vocabulary and of major 
types of comprehension in the reading of story 
material. It is one of the best available. 


For an excerpt from a review by Albert 
Grant, see 40:1558. 
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Primary Reading: Every Pupil Test. Grades 2-3; 
1946; new form usually issued each April and De- 
cember; forms April, December 1946; 2¢ per test; 1¢ 
per answer key; 13(20) minutes; constructed by pri- 
mary teachers of Maple Heights (Ohio) schools under 
the supervision of A. E. Hadfield; Ohio Scholarship 
Tests, Ohio State Department of Éducation, 


William S. Gray, Professor of Education, The 
University of Chicago, Chicago, Illinois. [Re- 
view of forms for April and December, 1946.] 
These tests are a part of a battery of reading 
tests for use in grades 2 to 12, inclusive, in 
studying the achievement of pupils, primarily in 
Ohio. They are not standardized tests in the 
usual sense of the term, different forms being 
developed twice a year for tse throughout the 
state. As soon as a specific form has been given 
in local school systems, the scores are sent 


to the State Department of Education, which 
prepares special Class Percentile Comparison 
Forms which make it possible for each class and 
school to compare the ranking of its pupils with 
state-wide norms. 

The test consists of six parts designed to 
measure achievement in six types of reading: 
Type I, Sentence Recognition, consists of ten 
exercises, each of which includes four sentences 
and a picture. The pupil is directed to draw a 
line from the picture to the sentence which 
belongs with it. Type II, Word Recognition, 
consists of eight exercises, each of which in- 
cludes four words and a picture. The pupil is 
directed to draw a line from a picture to the 
word that belongs to it. Type III, Following 
Directions, consists of five exercises, each of 
which includes a short paragraph followed by 
a picture and three sentences. Each sentence 
directs the pupil to do something. Type IV, 
Details, consists of six exercises, each of which 
includes a paragraph and two incomplete sen- 
tences followed by three words or phrases in 
each case. The pupil is directed to underline 
the word or words that make the sentence cor- 
rect. Type V, Larger and Total Meanings, 
consists of six exercises similar in form to the 
exercises in Type IV. The pupil underlines the 
word or phrase in each case that gives the cor- 
rect answer. Type VI, Auditory Test, consists 
of fifteen exercises in which the pupils must 
discriminate one word heard from three words 
seen. 

The test requires only 22 minutes of actual 
working time. The fact that it is so brief points 
to one of its chief weaknesses. Only one min- 
ute, for example, is allowed for the test on 
word recognition. The number of words for 
which meanings are indicated is eight. This 
section of the test is too limited, therefore, to 
insure a reliable measure of a pupil's meaning 
vocabulary. One of two alternative plans could 
be adopted: first, to extend the time for each 
part of the test, thus making more exercises 
possible; or, second, to reduce the number of 
types of reading tested. 

The exercises to test ability to follow direc- 
tions are open to the criticism that the right 
answers can be indicated in many cases with- 
out reading the introductory paragraph, for 
example, “Draw two lines under the mailman.” 
Pupils who merely execute the directions will 
obviously make a much higher score than the 
pupil who reads the paragraph first and then 
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does what the directions tell him to do. Similar 
criticisms may be made of other sections of the 
test. 

The validity of sections of the test is also 
open to question. For example, in the section 
on larger and total meanings, the pupils are 
asked.to read a paragraph and then to check 
the right answer: "What were the shoes made 
of (wood, cloth, animal skins)?" This meas- 
ures the recognition of a detail in one sentence 
only rather than the grasp of a larger or total 
meaning. Similarly, the auditory test is not a 
valid measure of auditory discrimination be- 
cause it is impossible in given cases to deter- 
mine whether poor auditory or poor visual 
discrimination is the cause of failure or just 
lack of word recognition. 


Virginia Seavey, Assistant, Child Placement 
Department, Public Schools, Brookline, Mas- 
sachusetts. [Review of forms for April and 
December 1946.] As these two tests were con- 
structed to provide for the needs of the schools 
in a particular state, they are probably of great- 
est value when used as originally intended to 
measure pupil achievement throughout the 
state of Ohio in December and again in April. 
The first test is also intended to be used for 
analyzing reading difficulties and pointing the 
way toward a remedial program. Teachers are 
furnished with percentile norms based on 
12,000 second grade scores, and 13,000 third 
grade scores. Any other school system admin- 
istering these tests should compute its own per- 
centile norms. This is a time-consuming task 
and a disadvantage to be considered in adopt- 
ing this testing program in preference to tests 
that provide national norms. It is also true that 
teachers in general find it more helpful to know 
reading age and grade norms than only the 
percentile rank and that it would be more prac- 
ticable to compare the December and April 
scores on that basis. It appears to this reviewer 
that teachers also need to know how their 
classes rate before the first of December, par- 
ticularly if a remedial program is to function 
effectively. Too little time elapses between the 
December and April tests for an appreciable 
amount of progress to take place in the case of 
average or inferior readers. If the first test 
were given earlier in the year, it would be nec- 
essary to include easier items at the beginning 
of each section. 

Reliability is not reported for these tests, 


nor is the method of selection of items de- 
scribed. The vocabulary has been drawn from 
standard word lists; and while the test has not 
been scaled, the material in each section in- 
creases in difficulty. An item analysis has been 
made on the difficulty of each item. The table 
of norms showing the per cent of. correct re- 
sponses to each item indicates that the test has 
been carefully constructed. The vocabulary is 
well chosen, particularly from the standpoint 
of measuring skill in phonetic analysis. The 
booklet is well planned except that the type is 
too small for second graders. In a few places 
the printing is poor with a letter blurred. The 
pictures, however, are clear and attractive. 

A measure of the five types of reading in- 
cluded in this test should give a teacher an 
excellent picture of her pupils' achievement. As 
the test now stands with very short time limits, 
it is definitely a speed test. It is more advisable 
in the primary grades to allow enough time to 
obtain a true measure of a child’s range and 


power of reading ability. 


[ 494 ] 
Primary Reading Test. Grade 1; 1939-40; Forms 
A, B; $1.14 per 25; I5¢ per specimen set, postpaid ; 
(40-55) minutes; Albert G. Reilley; Houghton Mif- 
flin Co. 
REFERENCES 
1. Poston, Freva L. An Evaluation of Techniques of Match- 


ing Words and Pictures as Measures of Word Recognition and 
Word Meaning in First and Second Grades. Unpublished mas- 
ter’s thesis, ‘Ohio ‘University, 1941. Pp. 76. (Abstracts of Mas- 
ters’ Theses... , 1941, DP. 67-8.) 

Ruth Lowes, Associate Professor of Educa- 
tion, The West Texas State Teachers College, 
Canyon, Texas. Achievement in word recog- 
nition, word meaning, sentence meaning, and 
paragraph meaning is measured. The test may 
be given at the end of the first grade or at the 
beginning of the second. Two sittings are re- 
quired. Time is allowed for approximately 90 
per cent of the class to finish. 

Claims that the test is a valid one are based 
on the fact that the following criteria were 
used in its construction: (a) Vocabulary was 
selected almost entirely from the first thou- 
sand words in the Thorndike list; of the words 
used nearly 90 per cent are in’ the Horn- 
Packer list. (b) Concepts used are all familiar 
to first grade pupils. (c) Mechanically the test 
approximates acceptable primary grade stand- 
ards. (d) The reading content and illustrations 
are interesting and stimulating. (e) All re- 
sponses are nonverbal. No statement is made 
concerning the reliability of the test. 
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For purposes of standardization the test was 
given to 700 pupils in seventeen carefully se- 
lected rural and urban schools. Several states 
were represented in the selection. A table of 
percentiles is given. The manual calls atten- 
tion to the advantage of the graph in compar- 
ing class averages or individual scores with the 
norm established for each part of the test. 

An effort has been made to suggest remedial 
treatment for pupils found weak in sentence and 
paragraph meaning. The suggestions are not 
as specific as they might be. 

The test folder is large (10 by 12 inches) 
and difficult to file. The pages devoted to word 
recognition and word meaning are well filled. 
One wonders if the pupils, especially the slow 
readers, would not be discouraged by the layout. 


[ 495 ] 
XPrimary Reading Test: Acorn Achievement 
Tests. Grades 2-3; 1943; Forms A, B; $1.50 per 25; 
15¢ per specimen set; 31(40) minutes; Winifred E. 
Stayton, Frances C. Ranson, and Roland L. Beck; 
Acorn Publishing Co. 


Alice N. Jameson, Assistant, Child Place- 
ment Department; Public Schools, Brookline, 
Massachusetts. "This test purports to measure 
word recognition, word meaning, and reading 
comprehension. Word recognition and reading 
comprehension are tested as distinct parts. 
Part A, Reading Vocabulary, comprises the 
first three tests: (1) Word Recognition, (2) 
Words—Similar Meaning, and (3) Word 
Meaning—Opposites. Part B, Reading Com- 
prehension, includes five graded stories each 
of which measures story, paragraph, and sen- 
tence meaning. ; 

This test could be used only for a rough 
survey and not for analysis or diagnosis. It has 
been standardized on a very small population 
about whom no information is given. Further- 
more, no data are presented on the reliability 
of the five part scores. Reliability is based on a 
wide range of ability—grades 2, 3, 4—which 
results in a high reliability index. 

The data on validity are based on too few 
cases and upon criteria some of which are of 
dubious value. It is correlated over a three- 
grade range, which gives a spuriously high 
coefficient. 

Norms are given for both Forms A and B 
which range from grades 2 to 7, but no data 

accompany them to explain their basis. 

This examiner would like to comment on 
the time limit given in Test 1, Word Recogni- 


tion. At the beginning of the test it states, 


“Time limit: 5 minutes.” This is superfluous 
if the later direction is followed—" Watch 
group and allow 1o seconds for each word.” 

In Part B, the examiner would question the 
suitability of the material for second and third 
grade children. 

The test is printed in large, boldface type, 
and it is well spaced and easy to read. 


[ 496 ] 
Purdue Reading Test. Grades 7-16; 1928; Forms 
A, B; $2 per 25; 256 per manual; 45¢ per specimen 
set, postpaid; 40(45) minutes; H. H. Remmers and 
John M. Stalnaker; Lafayette Printing Co, 


Albert J. Harris, Assistant Professor of 
Education, The City College of New York, 
New York, New York. The content of the 
silent reading test includes a 40-item sentence 
reading subtest and nine fairly long selections, 
taken from books: suitable for college use in a 
variety of fields. Each selection is followed by 
10 to 18 questions, for the most part in true- 
false form. A single score is obtained, which 
depends partly upon comprehension and partly 
on rate of reading. Scoring, by means of a key, 
is completely objective. 

The reliability is only fair by today’s stand- 
ards, with an r between the two forms of 855 
for a range of eight grades, or about .75 for a 
single grade. The norms provided include the 
arithmetic means for each grade, based on a 
maximum of 237 cases for any one grade, 
chiefly from consolidated schools in Indiana. 
A table of percentile scores is also given, in 
which the scores from all grades are combined. 

The test may be considered a commendable 
pioneering effort, but it does not equal in value 
such competing tests as the Nelson—Denny 
Reading Test or the Cooperative reading tests. 


[ 497 ] 
XReading Comprehension: Cooperative English 
Test: Lower and Higher Levels, Tests C1 and C2. 
Grades 7-12, 13-16; 1941-43; commonly called Cooper- 
ative Reading Comprehension Test; IBM; Forms R, 
S, T; 2 levels; $2 per 25 ; 25¢ per specimen set of either 
level, postpaid; 4o¢ per 25 machine-scorable answer 
sheets; 15¢ per stencil for Scoring answer sheets; 
40(45) minutes; Frederick B. Davis, Harold V. King 
(Form S), and Mary Willis (Form T) ; Cooperative 
Test Service. 
4) LOWER LEVEL. Grades 7-12. 
b) HIGHER LEVEL. Grades 13-16. 


REFERENCES 

1-2. See 40:1564. 

3. Ryans, Davro G. The First Step im Guidance: Self-Ap- 
praisal: A Report of the 1940 Sophomore Testing Program. 
Cooperative "Test Service Publications in Measurement and 
Guidance, Series 3, Vol. 1, No. t. New York: Cooperative Test 
Service, January 1941. Pp. 35. Paper. $0.10. * (PA 15:2382) 
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4. Davis, FREDERICK B. “Fundamental Factors of Compre- 
hension in Reading." Abstract. Psychol B 39:499-500 Jl '42. * 
(PA 16:5007, title only) 

s. Davis, FmepERICK B. "Two New Measures of Reading 
Ae J Ed IDOL My '42. * (PA VERD ) 
_ 6. SuwPsow, R. G. “The Vocabulary Sections of the Coopers 
tive English Tests at the Higher Levels of Difficulty.” J Ed 
Psychol 34:142-51 Mr '43. * (PA 18:311) 

7. Davis, FREDERICK B. “Fundamental Factors of Compre- 
hension in Reading.” Psychometrika 9:185-97 S '44. * (PA 19: 
242 

8. Davis, FmEpERICK B. “What Do Reading Tests Really 
Measure?" Engl J 33:180-7 Ap '44. * (PA 18:2250 

9. Henny, Lorne J; “The Diagnostic Value of à Standardized 
Reading Test.” Sch, Sec Ed 32:884-7 Je '44. * 

10. Davis, Frepertcx B. “A Brief Comment on Thurstone’s 
Note on the Reanalysis of Davis’ Reading Test.” Psychometrika 
11:249-55 D '46. * (PA 21:978) ‘4 

ir, Davis, FmEDERICK B. "Fundamental Factors of Reading 
Comprehension.” Psychometrika 11:185-8 S 746. * 

12. Davis, Frepertck B. “The Factorial Composition of Two 
Tests of Comprehension in Reading.” J Ed Psychol 37:481-6 
N "46. * (PA 21:1632) 

13. Perxotto, HELEN E. “The Relationship of College Board 
Examination Scores and Reading Scores for College Freshmen." 
J Appl Psychol 30 Moin Ag '46. * (PA 21:289) 

" um TuunzsroNz, L, L, “Note on a Reanalysis of Davis’ Read- 
ing Tests." Psychometrika 11:185-8 S '46. * (PA 21:295) 

1g. TRAXLER, ArTHUR E. “Reading and Secon "School 
Achievement,” pp. $9.65. In 1946. Achievement Testing Pro- 
gram in Independent Schools and Serene Studies. Edu- 
cational Records Bulletin, No. 45. New York: Educational 
Records Bureau, June 1946. Pp. x, 63. Paper, lithotyped. $1.50. 
* (PA 20:4354) $ 

16. Voraw, Davip F. “A Comparison of Test Scores of En- 
tering College Freshmen as Instruments for Predicting Subse- 
quent Scholarship." J Ed Res 40:215-8 N '46. * (PA 21:1304) 

17. "A. Note on the Correlation Between the Iowa and Co- 
operative Reading Tests,” 2 66. In 194g Achievement Testing 
Program in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 48. New York: Educational 
pap Bureau, June 1947. Pp. xii, 66. Paper, lithotyped. 

2.00. 


Robert Murray Bear, Professor of Psychol- 
ogy and Director of the Reading Clinic, Dart- 
mouth College, Hanover, New Hampshire. 
Each test has two parts, recognition vocabulary 
and paragraph reading. The range of difficulty 
of the 60 vocabulary words is adequate for the 
grades covered; one subsequent study of a 
form of the higher level, C2, has confirmed the 
generally accurate placement of words in order 
of difficulty. The undesirable influence of speed 
of recognition on the score is minimized through 
the use of an ample time limit. The reading 
paragraphs of Part 2 sample different types of 
subject matter and are followed by five-choice 
items testing four main types of skills. As with 
the vocabulary part, item validity and difficulty 
were determined by preliminary experimenta- 
tion. The tests received the benefit of rather 
careful standardization before publication. 

Perhaps the most unusual feature is that the 
90 comprehension items are arranged in three 
repeating scales of equivalent difficulty, each 
containing 30 questions based upon from 4 to 
8 paragraphs per scale. A few paragraphs are 
as short as three dozen words. Though Part 2 
yields scores that differentiate well enough be- 
tween students with wide differences in reading 
skill, differentiation is not as satisfactory for 
others, because of the fact that the scoring 


range is very contracted, particularly in the 
case of students with only one scale completed, 
of whom there may be as many as 10 per cent 
in a college freshman class. 

While a speed-of-comprehension score is ob- 
tained from a count of the total number of 
items accurately answered, level of comprehen- 
sion is scored by counting only items of com- 
pleted scales. This eliminates the influence of 
speed upon the comprehension score and makes 
it a better measure of power of comprehension 
than is obtained in other timed tests. The cor- 
relation. between level of comprehension and 
speed of comprehension for the lower level tests 
is .89 and for the higher level, .87, suggesting 
a considerable influence of comprehension upon 
the speed score. The speed score thus is not 
comparable with rate of reading as the stu- 
dent usually thinks of it or with his perform- 
ance upon the rate sections of several other 
tests used in these grades. For example, the 
correlation between words read per minute on 
the essay of the Booker Test of Achievement 
in Silent Reading and the speed of comprehen- 
sion on Form Q of Test C2 was only .317 for 
176 college freshmen. 

A useful feature in scoring is that by refer- 
ence to tables all raw scores can be converted 
into standard scores which are directly com- 
parable and permit comparison of relative pro- 
ficiency in vocabulary, speed, and level of com- 
prehension. 

Reliability coefficients have been computed 
which show considerable stability at the 50 
point of the scaled scores. These range from 
.75 on Cr and .82 on C2 for level of compre- 
hension with only one scale completed to better 
than .9 for vocabulary and total scores. For 
secondary and college groups, correlations of 
hetween .7 and .8 with intelligence tests have 
been reported, and of between .39 and .73 with 
school marks. Thus, we may conclude that 
these reading tests have about the same rela- 
tion to school achievement as do group intel- 
ligence tests. Incorporation by the publisher of 
the findings of these various studies into his 
descriptive folder would be helpful to users. 
Directions for administering each test are not 
supplied separately, but are given in a general- 
ized form to fit all Cooperative tests. One un- 
acquainted with Cooperative test procedure 
may at first be somewhat puzzled to fit to- 
gether the directions to be used. 

For the grades intended these survey tests 
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are among the best for measuring reading 
comprehension of the usual types of subj 
matter but should be supplemented by some 
other test if measures of the pupil's usual rates 
are desired. 


J. B. Stroud, Professor oj Education. and 
Psychology, The State University of lowa, 
Iowa City, Iowa. The test consists of Part I, 
Vocabulary (15 minutes), and Part II, Read- 


Scaled scores, the size of the unit being one- 
tenth of a standard deviation of the distribution 


error, percentile norms based upon 
scaled scores are provided for 11- and 12-year 
schools types of colleges, which 


1 


In the vocabulary test the examinee is not 
generally required to make any close discrimi- 
mation between the correct responses and the 
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foils. The foils contain a good many ruses, * 
words with the same prefixes or suffixes, and 
other kinds of formal similarity. Use is also 
made of a kind of second order similarity ; e.g., 
for the word rudimentary, the foil impolite 
suggests rude; for the word averse, the foil 
poetry suggests verse and the foil upside down 
suggests reverse. 

The reading test items are quite good and 
the reading test is well chosen, although it 
runs a bit heavily toward literature. The items 
will probably appeal to English teachers more 
than to social studies and science teachers. 
Tests of understanding of mood and purpose 
and of word meaning in context are featured. 

It would be difficult to find a test better con- 
ceived and better executed within the limits of 
its objectives. 

For reviews by J, Paul Leonard, Edward S. 
Noyes, and Robert C. Pooley of the entire bat- 
tery, see 120, 
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Re Gre CHE Test: National Achicve- 
ws rades 3-8; 1938; "o ^ Bs $1.50 per 
D r specimen set; nontimed (30 inutes; 
Robert We Eger and Sámuel Seith; Acorn Publish- 
ing 


James R. Hobson, Director of Child Place- 
ment, Publie Schools, Brookline, Massachusetts. 
This test is divided into three parts: I, Follow- 
ing Directions; II, Sentence Meaning; iil, 
Paragraph Meaning. There is nothing to indi- 
cate the more specific reading skills which this 
test is designed to measure. There is no man- 
ual worthy of the name and the teacher's 
directions, class record, answer form and norms 
are all contained on the two sides of an 874 
by 11 inch sheet of paper. This test is an- 
nounced as being part of a very ambitious 
teach-study-test program, whch was to have 
included achievement tests, diagnostic tests, 
bulletins for teachers, and bulletins for pupils 
in every subject practically from the cradle to 
the grave. If these materials have been devel- 
oped, this reviewer has not seen them. How- 
ever, they have little to do with the intrinsic 
worth of this test. 

The defects of this test are so many and so 
gross that it is difficult to assemble them in any 
systematic pattern for criticism. To begin with, 
the material in the test items strikes this re- 
viewer as utterly inane and as obviously fabri- 
cated for the purpose. It is in any event far 
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* removed from a normal functional reading situ- 
ation for any child. " 

Secondly, the grade range of the test is so 
extensive as to greatly diminish its actual use- 
fulness if it were otherwise meritorious. Any. 
test which contains enough items of sufficient 
difficulty to measure the reading abilities of 
cighth grade children is far too formidable and 
discouraging for the average third grade child, 
Conversely, the eighth grade pupils have to 
wade through too many easy test items which 
do not discriminate among them before they 
reach the ones which do. In effect a 30-minute 
test becomes only about a 15-minute test for 
the extremes of such a range. Such a test is 
actually fully effective for only about the mid- 
dle half of its announced grade range. It follows 
also that any reliability coefficients based. on 
the entire grade range are spuriously high. 
The reliability of this test for a single grade is 
quoted as .92. There is no information as to 
what grade this is for. This reliability seems 
very high for what amounts to about a fifteen- 
minute test at most grade levels. 

Another criticism is that in the authors' an- 
nounced attempt to include "important types 
of questions that had been ignored or under- 
emphasized in other tests," they appear to have 
gone too far in the other direction by making 
10 of 36 items in the test consist of following 
directions. This type of item is often used and 
quite properly belongs in a verbal intelligence 
test. Its extensive use in a reading test is not 
justified in the opinion of this reviewer. It is 
not surprising that the authors have felt con- 
strained to develop norms for high, low, and 
average IQ's, 

The fact that a difference of one item often 
means a difference of about three months in 
grade level, combined with the fact that three- 
choice restricted answer questions, which com- 
prise 26 of the 36 test items, are scored with- 
out any correction for chance guessing, not only 
shows that this test is not very finely gradu- 
ated but casts doubt upon its basic validity and 
reliability as well. 

The statement made that this test has no 
time limit is followed by the statement that 
“the time generally taken is 30 minutes” but 
that this “will vary according to age and 
grade." Such directions are much too ambigu- 
ous to give to classroom teachers. You cannot 
make a test a power test merely by eliminating 
the time limit. 


_——————_— 


Finally the claims made for this test in al- 
most every aspect are so extravagant and are 
backed up by so little real data that they im- 
mediately alienate anyone who wishes to know 
the detailed background of instruments he 
recommends for use, 
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*Schonell Reading Tests. 1942; 4 tests; 2s. 6d. 
12 of any test; c per o Schonell Bacadh 
ness in the Basic Subjects, Kanion, woh serves 
as the test manual for Tests Ra, R3, R4; Fred J. 
Schonell; Oliver & Boyd Ltd. . 
d) TEST X2, SIMPLE PROSE READING TEST, 6-9; also 
called My Dog Test; individual; (3-8) P 
b) TEST R3, SILENT READING TEST A. Ages 7-11; 9(15) 
minutes. 
€) mst 34, SILENT READING TEST 3. Ages 9-13; 15(20) — 
t 

d) GRADE WORD READING TEST, Ages £5] also called 
Graded eve vy 20 een Tessas a 

n n chol a caching 0| 
Reodin A Second Edition, wh serves as the test 
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dividual timing. 

The Graded Word Reading Test is similar 
to Burt's and Vernon's Graded Word Vocabu- 
lary Tests, and may be regarded as an alterna- 
tive version for them. It provides a rapid 
measure of in word attack from § to 
15 years and a useful preliminary to the more 
detailed tests. 

The Simple Prose Test is suitable for pupils 
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measurements may be made by the use of the 
first and easiest paragraphs. 

The Silent Reading Tests consist of para- 
graphs each followed by a question or a mul- 
tiple-choice problem, and they may be repeatedly 
used, since the answers are written on a sepa- 
rate paper. It will be noted that this implies 
writing a word, not merely underlining, and it 
might be objected that slow writers are 
penalised. 

One would be glad of a separate manual for 
the tests. Instructions and norms are to be 
found only in a large book Backwardness “in 
the Basic Subjects; and while it is appreciated 
that the clinic worker will wish to use this 
book for the study of cases, convenience in test- 
ing would be served by having a less cumber- 
some testing guide. 

Again, one would like to know more about 
the background group on whom the tests were 
standardised. The Simple Prose Test was 
based on 650 cases; numbers at each age are 
not given, and one misses data on reliability. 
For the Graded Word Reading Test no par- 
ticulars are given—only norms. The Silent 
Reading Tests were standardised on 1:860 
pupils, and again no particulars as to age are 
given. Correlations between these tests and 
the Simplex Junior Group Intelligence Scale 
are given in the case of 210 pupils from 8 to 
11; the coefficients for number correct with IQ 
range from .67 to .86, but the number of cases 
at any age is never above 63. The reliability 
coefficient of Test B is given as .92. No doubt 
Dr. Schonell has by this time accumulated 
more data, and it would be an advantage to 
have it published. 
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Schrammel-Gray High School and College Read- 
ing ae 7-13; 1940-42; Forms A, B; $1.80 
per 25; 36¢ per specimen set, postpaid; 2: in- 
utes; H. E. Schrammel and W. H. Gay: Fos Nps un 
Publishing Co. 


James M. McCallister, Director of Person- 
nel Service, Heral Branch, Chicago City Junior 
College, Chicago, Illinois. This test consists of 
25 paragraphs, each of which is followed by 
three or five objective items to be checked 
true or false. The total number of items is 100. 
The scoring arrangement provides three meas- 
ures of reading ability: (a) a gross-compre- 
hension score in terms of the number of ques- 
tions answered correctly, (b) rate of reading 
in terms of words per minute, and (c) compre- 


hension-efficiency as shown by the ratio of the ^ 
number of items checked correctly to the num- 
ber attempted. 

A test that covers the range of reading abili- 
ties from the seventh grade through college 
must of necessity contain a variety of materials 
as to difficulty and interest. The manual states 
that the paragraphs were selected from a wide 
variety of sources that would prove interesting 
to students at each level. Because of the wide 
range represented, it is important that the 
validity-of the test be established at the vari- 
ous levels. Validity coefficients are reported 
for the college freshman level only. These co- 
efficients are based on correlations between the 
test scores and composite decile ranks computed 
from scores on tests in intelligence, English, 
vocabulary, spelling, mathematics, reading, and 
current history. The coefficients, which range 
from .67 to .97, suggest that the test is vali 
for the upper levels of the range. Additional 
data are needed for the lower levels. 

Internal analysis of the test paragraphs and 
the true-false items indicates that the test meas- 
ures largely apprehension of specific f: It 
provides little opportunity for measuring such 
things as comprehension of main ideas, recog- 
nition of relationships and sequences among 
ideas, or judging relative importance of ideas 
from the standpoint of the intent of the writers 
of the passages. The limitations of a test to the 
measurement of restricted phases of reading 
ability may be readily justified by the authors, 
but it must be kept in mind in interpreting the 
Scores as measures of general reading ability. 

The test was studied for reliability by com- 
paring alternate forms and by comparing the 
three measures obtained on single forms. 
Coefficients of correlation reported for a ninth 
grade group vary from .63 to .86. The authors 
also report that “a student's true score, in 50 
per cent of the cases, does not diverge from 
his obtained score by 9.3 points or less in the 
case of rate, by 4.1 points or less in the case 
of gross comprehension, and by 4.7 points or 
less in the case of comprehension efficiency.” 
These data indicate that the test is sufficiently 
reliable for measurement of group accomplish- 
ments, but that it should be used with caution 
in making individual diagnoses. 

The test has the decided advantages of being 
easily administered and readily interpreted. 
The meanings attached to the three measure- 
ments make the results readily understandable 
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to both students and teachers. The manual 
contains excellent tables for translating raw 
scores into percentile ranks and grade norms. 
It should be particularly valuable in making 
initial surveys of the reading abilities of large 
groups for the purpose of screening out stu- 
dents who need further testing and study. 


Robert L. McCaul, Instructor and Adviser 
in the College, The University of Chicago, Chi- 
cago, Illinois. The authors declare that this test 
is an instrument “of considerable diagnostic 
value.” With all due allowance for the pride with 
which testmakers ordinarily speak of their 
products, this statement must still be consid- 
ered an exaggeration. The test is actually of 
little diagnostic value. It provides only three 
scores: (a) a gross comprehension score con- 
sisting of the number of correct answers; (b) 
a rate score of the number of words read per 
minute; and (c) a comprehension-efficiency 
score which is the ratio between gross com- 
prehension and rate. 

When these scores are analyzed, the test’s 
flaws as a diagnostic instrument become mani- 
fest. Each of the 25 test paragraphs is followed 
by three to five true-false items. Since these 
items are of a factual nature, the gross compre- 
hension score indicates merely that the student 
is strong, weak, or mediocre in one element of 
reading—apprehension of facts. We do not know 
why, nor do we have evidence of the quality of 
his ability to organize what he reads, select cen- 
tral ideas, follow and evaluate causal relation- 
ships, or perform any of the other tasks involved 
in securing meaning from the printed page. 

Similar deficiencies characteristize the speed 
score derived from the test. The material read 
includes a poem and paragraphs culled from 
psychology, music, biology, social science, and 
general publications. What is obtained from the 
test, therefore, is a single speed score which rep- 
resents heterogeneous speeds upon heterogene- 
ous materials. Not only is it impossible to deter- 
mine how adequate a measure of speed the score 
is, but it is likewise impossible to estimate how 
much the speed on each type of material con- 
tributes to the total raw score. Yet for diagnosis 
we should know whether the student read the 
poem more slowly than he did the paragraph 
about a frog jumping contest, the paragraph 
describing Chinese music more slowly than the 
one about coloring dogs to match women’s 
frocks, etc. It is fair to conclude that there exist 


more complete and less ambiguous diagnostic 
reading tests than the Schrammel-Gray. 

Elsewhere in the manual the authors assert 
that the "test possesses marked predictive value 
as regards scholastic success.” They base this 
statement upon correlations of .73, .67, and .87 
between college-freshman scores on the three 
parts of their test and composite decile ranks 
computed from tests of intelligence, English, 
vocabulary, spelling, mathematics, reading, and 
current history. Such reasoning seems hazard- 
ous. After all, the test correlates an average of 
.76 with a composite validity criterion which it- 
self probably correlates no more than .70 with 
teachers’ grades. Numerous investigations have 
proved, moreover, that a single test of a specific 
trait, aptitude, or achievement does not furnish 
a trustworthy predictive measure. Thus, the au- 
thors’ belief in the “marked” prognostic value 
of their test as well as their faith in its “con- 
siderable diagnostic value” must be accepted 
with reservations. 

The Schrammel-Gray High School and Col- 
lege Reading Test suffers more from the ex- 
travagant claims of the authors than from any 
serious weaknesses in construction and content. 
It does not compare unfavorably with many tests 
on the market, and it could be a rather useful 
survey instrument. 


Ed Res B 21:47 F 1842. William J. Jones. * 
includes unusually descriptive and informative 
reading material designed to keep the interest 
of the students while taking the test. * The test 
appears to have validity as a predictor of 
scholastic success. The test has reported reliabil- 
ities varying from .63 to .86 on part scores, not 
high enough to place a lot of confidence in the 
stability of individual student scores. * a good 


test * should be examined by those interested in 
reading. 

[ 501 ] 
Silent Reading Comprehension: Iowa Every- 
Pupil Tests of Basic Skills, Test A. Grades 3-5, 
5-9; 1940-45; IBM, grades 5-9; Forms L, M, N, O; 
2 levels; 44¢ per specimen set of either level; H. F. 
Spitzer in collaboration with Ernest Horn, Maude Mc- 
Broom, H. A. Greene, and E. F. Lindquist; Houghton 
Mifflin Co. 
a) ELEMENTARY BATTERY. Grades 3-5; $1.50 per 25; 
46(50) minutes. 
b) ADVANCED BATTERY. Grades 5-9; IBM; separate an- 
swer sheets need not be used; $1.60 per 25; 50¢ per 
25 machine-scorable answer sheets; 15¢ per machine- 
scoring stencil; 68(85) minutes. 


James R. Hobson, Director of Child Place- 
ment, Public Schools, Brookline, Massachusetts. 
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- These tests arë the reading tests of a compre- 


hensive achievement test battery measuring the 


- fundamental skills developed in the elementary 


school. This battery is the outgrowth of the 


Iowa Every-Pupil Testing Program initiated 


in 1929 and extended down as far as grade 3 
beginning in 1940. 

The nation-wide distribution and use of these 
tests is entirely, incidental to their annual use in 
all Iowa schools and has come about by reason 
of their intrinsic merit rather than through ex- 
ploitation by the Iowa State educational author- 
ities. 
` The four silent reading skills other than 
vocabulary measured by the elementary reading- 
comprehension test are paragraph comprehen- 
sion, noting details, organization of ideas, and 
grasp of total meaning. The test material ranges 
from short paragraphs followed by from three 
to five questions to a 450-word story followed by 
15 questions. The restricted answer technique 
is used with four choices in each instance. 

The Advanced Battery is parallel to the Ele- 
mentary Battery in abilities measured and 
method of indicating answers. The reading selec- 
tions used, however, are of much greater length 
and consist of description, exposition, and his- 
torical narrative as contrasted with the simple 
exposition and fictional narrative of the elemen- 
tary test. The selections in the advanced test 
average nearly 700 words in length, and as 
many as I9 questions are asked about each 
selection. 

The reading selections in both tests appear 
to be interesting and well chosen. They are obvi- 
ously the type of reading a child encounters in 
normal elementary school assignments in vari- 
ous subject matter fields. In both tests the skills 
to be measured govern the types of questions 
asked. 

The vocabulary test consists of 40 items in 
the elementary test and 50 items in the advanced 
test. In some instances the word to be defined is 
presented in a complete sentence and in others 
in only a two- or three-word phrase, which in 
the opinion of this reviewer is less desirable. The 
four-choice restricted-answer technique is used 
in both tests. 

All of the answers for the advanced test are 
marked on a single answer sheet on the inside 
of the front cover, which is perforated for easy 

detachment. There is no reason why standard 
IBM answer sheets could not be adapted to this 
test and machine scoring used. 


The manuals for these tests are extremely 
practical and well arranged for easy under- 
standing by the classroom teacher. The direc- 
tions for administering and scoring are clear 
and concise. Three kinds of norms are given: 
grade equivalents, age equivalents, and grade 
percentiles, all based, of course, on the perform- 
ance of Iowa public school children. 

The aids to interpretation and practical use of 
the results of these tests are many and varied. 
These include the class record, the individual 
profile and cumulative record, an item analysis 
of the skills involved, and a particularly com- 
plete set of suggestions for developing the read- 
ing skills in which pupils may show deficiencies. 

No objective data are presented regarding the 
validity and reliability of these tests. In fact, no 
particular section of the manual is devoted to 
these subjects, although the method of devclop- 
ing the tests and selecting the items is given. 
In spite of the lack of statistical evidence of 
validity, the use of such criteria in the selection 
of test items as analysis of course of study, text- 
books, and instructional procedures as well as 
the pooled judgments of the group of authors, 
most of whom are outstanding figures in the 
evaluation field, does much to guarantee high 
validity for this test battery when applied to 
Iowa children. 

Theoretically, this validity will be reduced 
somewhat when the tests are used outside the 
Towa situation. Actually, the nature of reading 
skills is such as to give this type of test practi- 
cally nation-wide applicability. Since such a 
strong point is made of the equivalence of the 
annual forms of this test and the comparability 
from year to year which makes a cumulative 
record and profile really meaningful, consist- 
ency would seem to demand some statistical 
evidence of the reliability of these instruments. 
None is given. 

It goes without saying that none of the three 
types of norms given will prove very useful out- 
side Iowa except for purposes of comparison 
and that any school system making use of these 
tests will have to develop its own norms, which 
of course is always a good idea. It does appear, 
however, that a publishing house undertaking 
the national distribution of tests of this appar- 
ent high quality should seek a wider base for 
their standardization. 


Constance M. McCullough, Assistant Profes- 
sor of Education, San Francisco State College, 
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San Francisco, California. The test authors have 
developed the instructions for administering, 
scoring, interpretation, and remediation, with 
painstaking care. Their sound insistence, within 
the test itself, upon comprehension questions 
that require genuine reading ability and genuine 
thought is admirable and somewhat unusual in 
the commercial field. 

The paragraph content is science and social 
studies, sometimes disguised in narrative and 
conversation form. The elementary battery 
starts with one paragraph and builds up to a 
six-paragraph selection; the advanced battery 
starts with three and builds to six. The number 
and types of question vary with the content of 
the selection. The authors profess to be testing 
four types of comprehension : paragraph, details, 
organization, and total meaning. But the over- 
lapping of their suggestions for remedial work 
establishes their failure to select distinct types. 
For this reason the value of the part scores 
within the comprehension test is considerably 
diminished. 

Because of their effort to suit the questions 
to the passages tested, the number of items avail- 
able to suggest mastery of each type of compre- 
hension is often uneven and inadequate within 
a given form and extremely irregular between 
forms. The forms are not at all comparable in 
this respect. The test is overburdened with de- 
tail questions. The authors would have done bet- 
ter, both for scorers and for balance, to construct 
paragraphs with uniformity of questioning in 
mind. Aside from this weakness, the excellence 
of the questions themselves cannot be over- 
emphasized. Someone with a keen appreciation 
of reading-as-thinking had a hand in this test. 
A teacher capable of reclassifying the questions 
as to the comprehension ability tested could 
draw much of value from her findings. 

The difficulty of the paragraphs according to 
the Lorge formula is not uniform for the differ- 
ent forms or extensive enough for the grades the 
test claims to cover. In the Elementary Battery, 
the first paragraph in Form L is of 4.8 grade 
difficulty; in Form M, 4.0 difficulty. Hardest 
paragraphs in Form L are about 4.9; in Form 
M, about 4.6. The range scarcely justifies the 
claim that the test is suitable for grades 3 to 5. 
Easier and harder paragraphs are needed. 

The elementary vocabulary test contains 40 
items; the advanced 50, of general vocabulary. 
In both cases the words are much too difficult 
for the younger pupils tested and also in com- 
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parison with the vocabulary of the paragraphs in’ 
Part 1. Inthe Elementary Battery, Form L, only 

2 of the vocabulary words appear in Dale's list 
of 769 easy words, and 35 per cent of the right - 
answers are harder words than the ones tested, 
The norms reflect the fact that the vocabulary ^ 
section is far out of the reach of many of the 
children tested. Logically, a good many words. 
in the vocabulary section should have been 
chosen from the harder words in the paragraphs 
in Part 1. If the paragraphs were representative 
of material the children should be able to read, 
the words they contain would be also. The au- 
thors missed the opportunity to make the vocab- 
ulary section a tool in the interpretation of the 
comprehension results. 

The elaborate directions in the manual for 
scoring the test, the perforated answer keys, 
and the conversion tables on the cover page of 
the test booklet combine to give the prospective 
examiner a sense of fear or revulsion. The 
teacher of 40 children does not want to spend 
valuable time figuring out the authors' thoughts 
on how to obtain 40 simple scores on an hour 
test. It could be wished that there were simple 
and brief means of obtaining ultimate efficiency 
and that so much manipulation might be re- 
warded by a more meaningful, clear-cut diagno- 
sis of separate skills. As it is, the aids to the 
teacher have far exceeded the point of diminish- 
ing returns and are approaching the point of 
exasperation. 


For reviews by Frederic L. Ayer, Gustav J. 
Froehlich, and Ralph C. Preston of the com- 
plete battery, see 10. For reviews by William 
A. Brownell, J. Murray Lee, and Charles W. 
Odell of an earlier form of the complete bat- 
tery, see 38:872. 
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XSRA Reading Record. Grades 7-12; 1947; 1 form; 
separate answer pads must be used; 43¢ per test book- 
let and answer pad ; $1.65 per 25 answer pads; 45¢ per 
25 profile sheets; 75¢ per specimen set; 28(40) min- 
utes; Guy T. Buswell; Science Research Associates. 


Frances Oralind Triggs, Educational Records 
Bureau, New York, New York. The measure- 
ment of reading skills is somewhat difficult be- 
cause there is no real agreement among authori- 
ties on the definition of reading. Buswell defines 
reading in somewhat narrower terms than do 
most other reading specialists and his Reading 
Record reflects his thinking. The manual states 
that the test measures the ten skills “found most 
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essential to satisfactory reading adjustment"— 
these skills being : rate of reading, reading com- 
prehension, paragraph meaning, directory read- 
ing, map-table-graph reading, advertisement 
reading, index reading, sentence meaning, tech- 

~ nical vocabulary, and general vocabulary. The 
test furnishes a score on rate of reading and 
total comprehension, as well as scores on nine 
other skills measured, “a breakdown which is 
essential to complete reading diagnosis,” accord- 
ing tothe manual. 

The pinprick type of answer sheet accom- 
panies this test ; it is scored by counting the holes 
which appear in the circles printed on the inside 
of the answer sheet. The answer sheets may be 
removed from the test booklets and scored by 
the students, if rescored for checking later. The 
present edition of the Record is not adapted for 
machine scoring, but it would not be difficult to 
adapt it. Unless this is done, schools using this 
test as one of a battery to be scored by a test- 
scoring service will be handicapped. 

The format makes the test physically attrac- 
tive. It is spirally bound into a seemingly dur- 
able, heavy paper cover. The answer sheet is 
punched to fit easily on to the spiral binding 
from which it can also be very easily removed. 
The size of each page of the test booklet varies 
in accordance with that column on the answer 
sheet on which the answers are to be punched, 
making for ease and accuracy of recording. Each 
test takes up one page. The instructions for each 
test are printed at the bottom of the preceding 
test, thus making for ease and accuracy in ad- 
ministering and timing the test. 

A number of comments may be made concern- 
ing this test. However, it should be noted first 
that this review is being written before the test 
is generally available for sale. Thus, the only 
experimental data available to the reviewer are 
those furnished by the author in the manual. 
Comments are based on these data and the expe- 
rience of the reviewer with other tests in the 
field. 

* The test is very closely timed. The manual 
warns users that "Exact timing of the tests is 
exceedingly important. Deviation of even five 
seconds can increase a score ten points. A score 
on an incorrectly timed test is valueless.” Such 
warning is very pertinent. Excluding the rate 
of reading test, the number of items varies ftom 
10 to 25 in each subtest; the timing varies from 
two to three minutes. 

The manual gives the median reliabilities for 


two groups of 1,000 ninth and eleventh grade 
students. At first glance one wonders, with these 
short tests, how the reliabilities can be as high 
as they are: Test 1, Rate of Reading .79 (2 min- 
utes) ; Test 2, Reading Comprehension 75 (2 
minutes) (16 items) ; Test 3, Paragraph Mean- 
ing .86 (2 minutes) (10 items) ; Test 4, Reading 
an Alphabetical Directory .96 (3 minutes) (25 
items) ; Test 5, Interpretation of Map-Table- 
Graph Material .86 (3 minutes) (12 items) ; 
Test 6, Advertising Reading Test .83 (3 min- 
utes) (20 items) ; Test 7, Index Usage .96 (3 
minutes) (14 items) ; Test 8, Technical Vocab- 
ulary .79 (3 minutes) (23 items); Test 9, 
Sentence Meaning .78 (3 minutes) (19 items) ; 
Test 10, General Vocabulary .75 (3 minutes) 
(25 items) ; total score .93 (164 items). While 
the manual does not so state, the reviewer un- 
derstands that these reliabilities are Spearman- 
Brown except for rate of reading, which must 
be either test-retest (which is not probable as 
there is only one form of the test) or for which 
the administration may have been varied to fur- 
nish a score for each minute read. It is known 
that Spearman-Brown reliabilities tend to be 
overestimates when a test is highly speeded. The 
reviewer would caution users, therefore, not to 
put too much emphasis on any one score on 
this test until more data concerning reliability 
are available, especially for individual diagno- 
sis. It is probable that the reliabilities on tests 
which measure specific homogeneous skills— 
Test 3, Paragraph Meaning; Test 4, Reading 
an Alphabetical Directory; Test 5, Interpreta- 
tion of Map-Table-Graph Material ; Test 6, Ad- 
vertisement Reading; and Test 7, Index Usage 
—are reliable enough for estimating an indi- 
vidual's skills for the specific skills measured 
by these tests. Reliability of the total score on 
the test is high enough so that, even though the 
value is undoubtedly affected by speed, it can 
be used with confidence that it will not vary 
markedly on retest. 

What of the validity of this test? To what 
extent does the total score discriminate between 
efficient and inefficient readers, and to what ex- 
tent do scores on subtests measure the skills 
they purport to measure? Also with regard to 
reliability, it is pertinent to ask to what extent 
can scores on this test be used to guide a teacher 
or clinician in remedial work with a student. 

It must be stated that validity of a reading 
test may be judged to some extent on the defini- 
tion of reading subscribed to. In order to avoid 
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theoretical discussion, it seems wise first to 
examine the subtests in accordance with the 
definition of them in the manual. 

Test 1 is defined as a measure of rate of read- 
ing. Two minutes are allowed for reading a 
passage of approximately 1,200 words. The 
topic is of interest, and the vocabulary level 
would seem to be somewhat simple. 

Test 2 is a reading comprehension test. A 
well-informed reader can answer more than half 
of the questions without reading the material; 
most of the questions are questions of detail. 
The first nine questions are answered in the first 
six hundred words. This test obviously meas- 
ures reading comprehension in a very narrow 
sense—only one question requires any general- 
ization at all, the rest being exactly answered 
by the content. 

Test 3 is called a test of paragraph meaning. 
Each paragraph is made up of two sentences. 
One word in the second sentence of each para- 
graph "spoils the meaning of the paragraph." 
This test resembles closely a vocabulary test 
which requires the examinee to recognize the 
meaning of the word as given by the context. 

Test 4 is a measure of ability to read an alpha- 
betical directory. Twenty-five names and ad- 
dresses are given. The examinee responds by 
matching correct telephone numbers. This test 
measures a very specialized, homogeneous skill, 
which many test specialists would consider cleri- 
cal, and the reliability reflects this fact. It would 
be interesting to know the effect which a few 
short practice periods would have on scores on 
this test. 

Test 5 is a measure of map-table-graph read- 
ing. A map, a weather report in tabular form, 
and a bar graph of average temperature are 
given. Examinees must find answers to factual 
questions, based on these pictorial representa- 
tions. Some questions require reference to more 
than one of the sources of data. Again, a very 
specific skill is measured, one that can be learned 
with very little practice. Though it is true that 
stadents do not always have this skill, it can be 
easily acquired by those who have mastered basic 
reading skills but probably not by poor readers. 

"Test 6 is an advertisement reading test. Fac- 
tual questions are answered by reference to four 
advertisements of the usual newspaper type. 
"This test probably measures a narrow and spe- 
Cific form of skimming skill. As recognition 
only, not thought processes, is measured, the 
skill probably could be attained with a small 


amount of practice by persons who have mas- 
tered basic reading skills. 

Test 7 is an index usage test. This test, while 
of a very specific nature, requires the applica- 
tion of more thought processes than most of the 
others. The reliability of this test is .96, a sur- 
prisingly high value for only 14 items, even 
though it is probably affected by the highly 
speeded nature of the test. 

Test 8 is called a test of technical vocabulary. 
The synonym type of item is used. This test is 
similar to the usual vocabulary test. Test 9 is 
called a test of sentence meaning but is essen- 
tially a measure of vocabulary knowledge. Test 
10 is called a test of general vocabulary and 
utilizes again the usual synonym type of item. 
It is improbable that the vocabulary tests are 
difficult enough to discriminate between good 
and poor readers at the higher level, except as 
they are very closely timed. 

Of these ten subtests, five measure specific 
skills which may be obtained with practice by 
good readers but which poor readers would have 
difficulty attaining until they had mastered the 
basic reading skills, four measure vocabulary 
essentially though they are not all entitled vocab- 
ulary, and one test measures comprehension and 
memory of details. This leaves the comprehen- 
sion of the main thought of a paragraph, ability 
to draw conclusions and support those conclu- 
sions by generalizations and details drawn from 
reading, ability to read to support generaliza- 
tions and ability to organize material read almost 
untouched by this test. These skills are all study- 
type reading skills needed by students as are the 
more superficial skills measured by this test. 
Also, the test does not diagnose the deficiencies 
of the reader with poor basic reading skills. Few 
clues for approaching remedial work could be 
obtained from scores in these cases. 

The manual states that “relationships between 
the tests of the Record are moderately high.” If 
these relationships are known, they should have 
been furnished to users. The point is well taken 
in the manual that parts which furnish clues 
for remedial work need not be eliminated just 
because scores show relationship. There is prob- 
ably also a close relationship between scores on 
the types of reading skills which the Record does 
not measure and those measured by the Record. 
However, for a complete diagnosis and for 
remedial treatment of the usual range of reading 
deficiencies from the seventh grade through the 
college freshman year, this test is not broad 
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enough ; in other words, it does not measure the 
types of reading skills most essential to students' 
everyday reading problems. Four of the tests 
measure vocabulary skill but do not segregate 
the skill by specialized fields of knowledge. Five 
subtests measure skills easily attained by good 
readers through practice. Therefore, very little 
of the testing time is left for measuring the type 
of reading skills most needed by students. 

A word should be said about the profile chart. 
The format is good, and it makes an excellent 
visual picture of the scores obtained on the test. 
A study of this profile certainly clearly indicates 
the extent to which one score point raises a per- 
centile rank. For instance, on Test 5, a score of 
six is at the 20th percentile and a score of ten 
is at the 75th percentile, while the ceiling on the 
test is I2 raw score points. This fact places a 
great deal of importance on one score point and 
emphasizes again the effect the speed factor may 
have on scores in this test. 

This reviewer would recommend that this test 
be used with students known to have the reading 
skills essential for efficient reading but who 
may not be using these skills efficiently when 
applied to the specific skills measured by this 
test, but not to screen retarded readers or diag- 
nose their deficiencies for the purpose of reme- 
dial work. 

It would seem that the extent of the use of 
this test might be affected by its cost. Progress 
in reading should be measured at least once a 
year, preferably oftener. There is only one form 
of this test, a drawback to its frequent use for 
measuring progress in reading ; the cost of fre- 
quent testing, however, would be a greater de- 
terrent, since the cost of a single booklet is 43 
cents and answer sheets are $1.65 per 25. Profile 
Sheets, which are not absolutely necessary, are 
45 cents per 25. 

Experience shows that scoring pinprick tests 
with the clerical help used in a test scoring unit 
is difficult. It is a slow process, and scorers dis- 
like this type of recording and do not work as 
accurately as they do with other types of record- 
ings. This is another argument for adapting the 
test for regular machine-scoring answer sheets. 


J Consult Psychol 11:340-1 N-D "47. * The 
test’s greatest value is for use in surveys to 
discover deficiencies in reading skill, to be fol- 
lowed by a remedial program. 


[ 503 ] 
Stanford Achievement Test [Reading]. Grades 
2-3, 4-6, 7-9; 1922-40; same as reading tests in the 
Sianford Achievement Test; 3 levels; $1.20 per 25 of 
any one level; 35¢ per specimen set of any one level, 
postpaid ; Truman L. Kelley, Giles M. Ruch, and Lewis 
M. Terman; World Book Co. 
G) PRIMARY READING TEST. Grades 2-3; Forms D, E, F; 
25(30) minutes. 
b) INTERMEDIATE READING TEST. Grades 4-6; Forms D, 
E, F, G, H; 30(35) minutes. 
C) ADVANCED READING TEST. Grades 7-9; Forms D, E, 
F, G, H; 35(40) minutes. 


Margaret G. McKim, Assistant Professor of 
Education, University of Cincinnati, Cincinnati, 
Ohio. Forms D to H of the Stanford Achieve- 
ment Test comprise the 1940 revision of this 
battery. The reading tests contain the sections 
on paragraph meaning and word meaning from 
the complete battery. The tests are similar in 
pattern at all three levels. 

The value of these tests lies in their general 
estimate of paragraph comprehension and vocab- 
ulary. The series of short paragraphs compris- 
ing the paragraph meaning sections covers a 
variety of informational materials. A system 
requiring the reader to write in the word which 
best fills a blank in the paragraph allows for con- 
siderable variation in the type of understanding 
required. The word meaning sections are equally 
general in content. A key word in a partial sen- 
tence followed by five responses is the form used. 
Precise time limits make speed a factor in all 
tests. No information is given as to the bases 
upon which the test items were chosen. 

"The primary test, particularly the paragraph 
meaning section, is the most open to question. 
Here requiring a written response seems unsuit- 
able. The difficulty is increased by blanks too 
small for the normal handwriting of the primary 
child. Few of the paragraphs and few of the 
items in the word meaning section are simple 
enough to provide any sure measure of the 
ability of the poor reader. Extrapolated scores 
below a grade level of approximately 2.75 are 
further evidence that this test will not give a 
satisfactory picture of an entire class much be- 
fore the end of the third grade. 

Standardization was done on a random sam- 
ple of 50,955 cases drawn from a total popu- 
lation of 300,000 pupils distributed over 173 
communities, Spearman-Brown reliability co- 
efficients calculated from odd and even numbered 
items range from .883 to .969 according to un- 
published information supplied by the pub- 
lishers. Equated scores make it possible to com- 
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pare the two sections of a test or a test at one 
level with that at another. Age and grade norms 
have been calculated using only those children 
who were at grade for their age. While this does 
not solve the problem of interpreting norms for 
a school system in which children are not pro- 
moted automatically, it attempts to reduce the 
influence of varied promotion policies in the 
norms themselves. Age and grade norms calcu- 
lated in customary fashion are also provided. 

The accompanying manual explains the pro- 
cedures used to establish the norms and equated 
scores, and presents evidence regarding reliabil- 
ity. There is no discussion of reading as a skill 
nor any suggestion of ways in which the test 
results might be used to improve the reading 
program other than a general discussion of the 
use of achievement tests in the manual accom- 
panying the complete battery. The tests will 
probably be of greatest value as general achieve- 
ment tests in the intermediate and junior high 
school grades. They will supplement but not 
replace the reading tests which provide more 
specific diagnostic evidence. 


For reviews by Walter W. Cook and Ralph 
C. Preston of the entire battery, see 18. 


[ 504 ] 
Stone-Webster Test in Beginning Reading. Grade 
1; 1936-37; Forms 1, 2; 7¢ per test; manual free on 
request ; 40 (45) minutes ; Clarence R Stone; Webster 
Publishing Co. 
REFERENCES 

1. Strong, CLARENCE R. “Validity of Tests in Beginning Read- 
ing” EI Sch J 43:3615 F "43. * (PA 18:2605) 

Ruth Lowes, Associate. Professor of Educa- 
tion, West Texas State Teachers College, Can- 
yon, Texas. As a guide in selecting words for 
this test, the author used his own graded vocabu- 
lary list based on an analysis of the early read- 
ing material of pupils. 

The test was standardized on scores made by 
almost two thousand pupils in town and city 
schools of eight widely scattered states. A table 
of median scores with age and grade equivalents 
is given. Also given are tables of percentile 
norms for the end of the low first grade and the 
high first grade. Helpful suggestions for inter- 
preting percentile scores are given. No reliabil- 
ity or validity coefficients are reported. 

The test is divided into two parts. Part I 
measures word recognition. Sixty excellent pic- 
tures, simple in detail, are given. Each is accom- 
panied by four words, one of which the pupil 
circles. Part II consists of twenty-four sen- 


tences. With each sentence are four carefully 
executed drawings. The pupil marks the picture 
which represents the sentence. Each part re- 
quires a sitting of 20 minutes. The test is easily 
scored. 

The manual calls attention to certain diagnos- 
tic values of the test. One of the most important 
is that of determining discriminatory weaknesses 
when words similar in appearance are pre- 
sented. 


[ 505 ] 
Techniques in Reading Comprehension for Jun- 
ior-Senior High School: Every Pupil Test. Grades 
7-12; 1943-47; new form usually issued each Decem- 
ber and April; forms December 1946, April 1947; 2€ 
per test; 17(25) minutes; Ohio Scholarship Tests. 


Ivan A. Booker, Assistant Director, Research 
Division, National Education Association, 
Washington, D. C. [Review of April and De- 
cember 1946 forms.] This test is intended as a 
general measure of reading achievement for 
junior and senior high school pupils. It is one of 
the two high school reading tests in the total 
battery of more than three dozen tests which 
now constitute the series used in a state-wide 
testing program. The companion test, Applied 
Reading for Junior-Senior High School, is de- 
scribed elsewhere in this volume. 

In Techniques in Reading Comprehension 
the emphasis falls on various aspects of com- 
prehension. However, the element of speed is 
by no means eliminated. The time limits are 
short enough that pupils work under pressure, 
and also short enough that the great majority 
of scores are affected by rate of work. Like many 
other reading tests, therefore, this one yields a 
score in which rate and comprehension are in- 
separably tangled. Two students may make iden- 
tical scores: one a rapid worker but rather 
inaccurate ; the other, a slower but quite accurate 
reader. From the test scores alone, this differ- 
ence would not appear. 

The test has four parts designed to measure 
the pupil's ability: (a) to identify the central 
theme of a paragraph; (b) to find pertinent de- 
tails; (c) to recognize correct word meanings; 
(d) to interpret common idioms. Each part of 
the test uses a commonly employed type of test 
item and response. 

The first part, on identifying the central 
theme, contains ten short paragraphs, several of 
them usually written about the same general 
topic, for example, the Holland tunnel in New 
York City. The pupil selects the main idea of 
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each paragraph from four choices provided. The 
time limit of six minutes is much more generous 
for the April test than for the December form. 
The ten paragraphs of the latter contain thirty- 
eight more lines of printed material than the 
former and the multiple-choice items an addi- 
tional fourteen lines. Apparently the test para- 
graphs are chosen with little attempt to equate 
them for length, and the time limit is set arbi- 
trarily irrespective of the reading time involved. 

The second part of the test which involves 

reading (or skimming) for details consists of 
one or more selections of double-spaced material 
with words and phrases underlined and num- 
bered. Alongside this material are thirty ques- 
tions which must be answered by finding the 
numbered words or phrases which answer them. 
This seems the least useful of the parts of this 
test. The response technique, widely criticized 
as used in the Jowa Silent Reading Tests, gains 
no luster as used in the Ohio test series. Many 
of the questions can be answered by more than 
one underscored phrase (and alternate answers 
are shown in the scoring key). Other items re- 
quire two numbered items as a correct answer 
although one could scarcely infer as much from 
the directions and sample exercises with which 
the exercise begins. Still another objectionable 
feature in this part of the test is that several 
test items appear on a page other than that on 
which the answers are found, requiring pupils 
to shift back and forth from one page to another 
and sometimes to turn a page to find an answer. 
Again the time limit is six minutes, for reading 
about eighty-five lines and answering thirty 
questions. The directions are: “Read the follow- 
ing paragraphs. In the parentheses, write the 
numbers of the underlined words which cor- 
rectly answer the question.” However, if stu- 
dents proceeded in that order, the slower readers 
would do relatively few questions. Method of 
attack undoubtedly has much to do with the 
pupil’s score on this part of the test. 

The thirty-word vocabulary test is of stand- 
ard type. The test words are presented in isola- 
tion rather than in a context setting, and for each 
test word a four-part multiple-choice item is 
given. The three minutes allowed for this part 
of the test is a relatively more generous time 
allowance than that for either of the preceding 
sections. 

The final part of the test, Reading for Phrase 
Meaning, requires the pupil to interpret such 
idioms as “in the same boat,” “the lion’s share,” 


"and “put in his oar.” According to the manual, 


an “increment in difficulty is obviously estab- 
lished” in this part of the test. However, accord- 
ing to the test results of the April program, the 
fifth item was the easiest in the series and the 
first was about as difficult as any in the list ex- 
cept the last. 

The norms provided for the test take the form 
of percentiles based on the total scores for the 
four parts of the test. These become available 
about one month from the time the tests are 
given in Ohio schools. For this test the norms 
are given for each grade, instead of merely for 
junior and senior high school grades, respec- 
tively, as is true for some of the Ohio Scholar- 
ship Tests. Also, for such help as they may be 
in diagnostic work, per cents are given in the 
published reports to show what portion of the 
pupils answered each test item correctly. 

As a general survey of reading achievement, 
with the emphasis on certain phases of compre- 
hension, this test is a useful measuring instru- 
ment. Except for. Part IT, Reading for Details, 
the specific skills involved are clearly identified 
and nicely isolated for the purposes of measure- 
ment. For careful evaluations, however, other 
tests should be used at least in a supplementary 
way because of the lack of available data with 
respect to the validity, reliability, and compara- 
bility of the various forms of this test. 

Teachers who use the Universal Class Error 
Check Sheets and the Universal Class Percentile 
Record Blanks which accompany the test will 
find these charts helpful in planning class activi- 
ties and in individual pupil guidance even though 
the test has distinct limitations as a diagnostic 
instrument. 

TThe directions for administering the test are 
clear and brief. The test is inexpensive, objec- 
tive, and easy to score. The availability of similar 
(if not strictly comparable) forms prepared each 
year is a real advantage, and the percentile grade 
norms established on a state-wide basis provide 
splendid points of reference., The test can be 
administered within a single class period and, 
unless other more comprehensive reading tests 
are already being used in a school, will give a 
helpful overview of the types of reading ability 
which this test involves. It will be most useful 
when used as the initial step in a program of 
measurement and instruction—least useful when 
regarded as an adequate and final measure of 
reading achievement or as a means of accurate, 
individual diagnosis. 


$ 
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James M. McCallister, Director of Personnel 
Service, Hersl Branch, Chicago City Junior 
College, Chicago, Illinois. [Review of April and 
December 1946 forms.] This test is prepared for 
use in the state-wide testing program in Ohio. 
Parallel forms are provided for use in December 
and April. After the scores are reported to the 
Ohio State Department of Education, the 
teacher is furnished with a report of percentile 
norms and item norms that enables her to ascer- 
tain the standing of her class as a whole and the 
relative standing of each pupil. 

According to the printed announcement, this 
test serves two purposes: “(a) to determine the 
general level of reading comprehension of sec- 
ondary school pupils, and (5) to provide a basis 
for the improvement of pupils’ study through 
a detailed understanding of their reading 
strengths and weaknesses.” The validity of the 
test is conditioned by how well it serves these 
purposes. 

An internal analysis of the four parts of the 
test leaves the impression that it is a good meas- 
ure of the pupil’s ability to apprehend the exact 
meaning of a passage. The percentile norms 
indicate the general reading level of each pupil 
with respect to this ability. In these respects, it 
accomplishes the first purpose. 

It does not appear to serve the second pur- 
pose so well, however, because it measures only 
limited phases of the application of reading to 
study. It is conceivable that a pupil might suc- 
ceed well on this test but at the same time be 
poorly prepared to evaluate the correctness of 
ideas or detect inconsistencies, to make implica- 
tions or generalizations based on the material 
read, to apply ideas gained in the solution of 
problems, and to perform other study activities 
that accompany reading. When the test is used 
as a basis for the improvement of study activi- 
ties, this limitation must be recognized. 

Since tests for state-wide testing programs 
must of necessity be prepared for immediate use, 
they cannot be subjected readily to correlational 
studies as to validity and reliability. Therefore, 
no studies of these types are reported for the 
test. 

Excellent devices for the diagnostic analysis 
of the performances of individuals are provided 
with the test. Among these devices are a class 
error check sheet and a percentile record blank 
for diagnosing weaknesses and estimating the 
progress of individuals. These devices, when 
used in connection with the report of percentile 


norms, give a thorough analysis of pupil per- 
formances that is especially valuable in ascer- 
taining individual needs. 


[ 506 ] 
X*Thorndike-Lorge Reading Test for Grades 7 
to 9. Grades 7-9; 1945; Forms 1, 2; $5.40 per 100; 35¢ 
per specimen set; 40(45) minutes; E. L. Thorndike 
and Irving Lorge; Bureau of Publications, Teachers 
College, Columbia University. 


REFERENCES 

1. Lorce, Irvine. "The Thorndike-Lorge Reading Test for 
Grades 7-9." Teach Col Rec 46:453-9 Ap '45. * (PA 19:2754) 

Robert L. McCaul, Instructor and Adviser 
in the College, The University of Chicago, 
Chicago, Illinois. This test follows conventional 
patterns in validation and standardization. It 
differs from other reading tests by virtue of the 
fact that approximately half of the items de- 
mand of the pupil thoughtful, creative reading. 

The content of the test reflects the authors’ 
belief that “any adequate measure of ability to 
read will be more or less a measure of intelli- 
gence as it operates with language." Form r, for 
example, consists of four riddles, twenty-one 
brief aphorisms from Longfellow, Shakespeare, 
the Bible, and similar sources, and four rela- 
tively long prose selections, one of which is 
general narrative and the other three expository 
material from social science and natural history 
books. While the majority of the questions about 
the selections solicit a type of rote reproduction 
of details common to most reading tests, there 
are included some questions which require com- 
prehension of the whole passage and reasoning 
with the data presented. To the aphorisms the 
pupil responds by picking out from a series of 
sentences the one which “means most nearly the 
same." Obviously, these items correspond very 
closely to the proverb tests at the Average Adult 
and Superior Adult II levels of the Revised 
Stanford-Binet Scale. 

Granted that riddles and aphorisms serve a 
legitimate purpose in a reading test, there still 
remain several questions to be raised. If, as the 
authors claim, their test is "planned to include 
all the important factors in silent reading and 
give reasonable weight to each of them," are 
twenty-five riddle and aphorism items out of a 
total of fifty-five items a “reasonable weight"? 
Do riddles and aphorisms occur so frequently 
in children’s reading or occupy so important a 
place that they should comprise nearly half of 
a “general” reading test? If, on the other hand, 
these items are intended to provide a measure 
of how well the pupil’s intelligence “operates 
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with language," is the kind of reading exacted 
by riddles and aphorisms representative of all 
the complex, diverse elements in thoughtful, cre- 
ative reading? From a pupil's performance on 
such items can we generalize safely about how 
adequately his intelligence "operates with lan- 
guage"? The authors supply no information 
which will enable the prospective test user to 
answer these questions. 

Because parts of the test sample functions for 
which no precise nomenclature exists, the diag- 
nostic breakdown of the items is, as the authors 
imply, unsatisfactory. An item like “He goes 
where he pleases, low or high, and can walk just 
as well with his feet to the sky" may measure 
"knowledge of the meanings of words and 
phrases in context"; but to answer that fly is 
the word which best fits this sentence requires, 
in addition, a stroke of constructive imagination, 
an inference, or an insight much higher in the 
hierarchy of mental processes than mere knowl- 
edge of the meaning of words and phrases. 
Again, more than a "knowledge of construc- 
tions, including idioms" operates when a pupil 
chooses “To the boiling pot the flies come not" 
as the appropriate sentence to match with “The 
busy man has few idle visitors." For this reason 
the single score—the number of right answers— 
derived from the test is difficult to interpret. 

It should be remarked here that the portion 
of the test devoted to measuring comprehension 
of literal meaning and other concrete skills is 
well and carefully constructed. The reviewer is 
convinced, nevertheless, that a person's judg- 
ment of the whole test will be determined by 
his attitude toward the sections designed to 
measure thoughtful, creative reading. If the 
prospective test user feels that reading is no 
more than simple apprehension of meaning and 
recognition of symbols, he may condemn the 
test as an impractical and inadequate cross of 
an intelligence test and a reading test. If, con- 
versely, he believes that reading includes all the 
intellectual and emotional reactions which can 
possibly be made to the printed page, he may 
hail the test as almost the only one in its field 
that attempts to appraise the efficiency. with 
which the higher mental processes function in 
reading. Actually, the Thorndike-Lorge Read- 
ing Test undertakes to measure certain factors 
in reading that are neglected by more pedestrian 
reading tests. It supplements rather than dis- 

places these tests and may be used most profit- 
ably in conjunction with them. 


J Bus Ed 20:37 Ap "45. * a general reading 
such as this....will help to discover shortcomings 
in word knowledge, understanding and appre- 
ciation of the printed page, in ability to concen- 
trate to get thought, or in ability to read fast 
enough to make reading an asset in other high 
school subjects. * 
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Whipple's High-School and College Reading Test. 
Grades 9-16; 1925; Forms A, B ; $3.60 per 100; 12 per 
specimen set; 10(15) minutes; Guy M. Whipple; Pub- 
lic School Publishing Co. 
REFERENCES 


1. Book, WiLLiIAM F. “How Well College Students Can 
Read.” ‘Sch & Soc VI 


3. ANDERSON, Irvine H., AND DEARBOR: Xx - 
ing Ability as Related to College Achievement." J Psychol 11: 


Frederick B. Davis, Professor of Psychology, 
George Peabody College for Teachers; and 
Director, Test Research Service; Nashville, 
Tennessee. This test, copyrighted in 1925, has 
been on the market for over twenty years. At 
first thought, one would be likely to conclude, 
with all the research and attention devoted in 
that time to the problems of reading and to the 
measurement of achievement, that any reading 
test published in the last few years would be 
markedly superior to a test over twenty years 
old. The facts, however, do not justify such a 
sweeping conclusion, although there are better 
reading tests on the market. 

Each of the two forms of the Whipple’s High- 
School and College Reading Test consists of 
twenty items based on a single long passage 
characteristic of a textbook in modern history. 
These passages are not exactly boring, but they 
are of the type no normal high school or college 
student would reread except by necessity. They 
are, to put it briefly, the type of passages used 
characteristically for tests of reading compre- 
hension. Except for the Cooperative reading 
comprehension tests, the writer has yet to find 
a reading test that contains both humorous pas- 
Sages and excerpts of genuine literary merit. 

Since data have been obtained showing that 
ability to understand passages in the social 
studies is by no means identical with ability to 
understand passages taken from other academic 
fields (such as Science), the validity of the 
Whipple test is at once brought into question. 

The twenty items consist of directions to per- 
form some operation and are interpolated at 
intervals into the content of the passage. Re- 
Sponses are indicated by writing on the test 
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. booklet, a procedure which makes it consumable 


and necessary to score by hand in a rather in- 
convenient way. Scoring is not entirely objec- 
tive. The reliability of the scores derived from 
the test is not stated in the manual. The writer 
guesses that it might be about .75 in a typical 
high school grade. If so, the standard error of 
measurement of an individual raw score would 
probably work out to be about 1.5. Taking this 
estimate as better than no data at all (which is 
what the manual provides), we find that a 
twelfth-grade student's score could readily vary 
by sheer chance as much as twenty percentile 
points if it were close to the mean for the grade. 
In this connection it is interesting to note the 
range of raw scores between selected percentile 
points, say the tenth and ninetieth. For the ninth 
grade, this range is 7.5; at no grade level is it 
as great as 10. These data serve to illustrate why 
a test with so few items cannot be highly dis- 
criminative in placing a large group of students 
in rank order of ability. 

summary. The Whipple’s High-School and 
College Reading Test is a reasonably satisfactory 
survey test of speed of comprehension of mate- 
rial of the kind found in social studies textbooks. 
It does not appear to measure this ability reliably 
enough to permit accurate comparisons of indi- 
viduals at a given grade level. Too limited a 
range of reading skills and content is presented 
to warrant confidence in the validity of the test 
as a measure of general reading ability. The 
score resulting from the test is a mixture of 
speed of reading and level of comprehension. 
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Williams Primary Reading Test. Grades 1-3; 1926; 
Forms A, B; $2.40 per 100; 12¢ per specimen set; 
Allan J. Williams; Public School Publishing Co. 

Alice N. Jameson, Assistant, Child Place- 
ment Department, Public Schools, Brookline, 
Massachusetts. This test “was designed to test 
the ability of the pupils in grades 1-3 to get the 
thought of the printed page.” In 1926 this test 
may have accomplished its purpose, but today 
it is inadequate as a measure of reading ability. 
It is stated that “the material used in the test 
is common to most of the widely used texts." 
The books most used today are entirely differ- 
ent in content and construction. It is stated 
that "the first part of the test is easy enough 
so that most of the pupils at the end of the first 
year are able to answer two or more questions.” 
Here again the test is poorly graded, for cer- 
tainly two or more questions at the end of a 


school year are too few to test a whole year’s 
work in grade one. 


MISCELLANEOUS 
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Examiner’s Reading Dia, sonst Record os High 
School and College Students. 1938-39; Forms A, 
B, C, D; 35€ per set (includes manual, record blank, 
and oral. reading passages); 3o¢ per set, 10 or more; 
Ruth Strang with the assistance of Margaret E. Martin, 
Margaret C. McKim, and Mary Alice Mitchell; Bu- 
reau of Publications, Teachers College, Columbia Uni- 
versity. 


Robert Murray Bear, Professor of Psychol- 
ogy and Director of the Reading Clinic, Dart- 
mouth College, Hanover, New Hampshire. The 
chief value of this instrument lies in its making 
generally available a record form for a thorough 
case study of an individual with rather severe 
reading difficulty. Its various parts, eleven in 
number, range from personal data, records of 
scholastic achievement and tests, developmental 
history, and interests through summary, inter- 
pretation, recommendations, and follow-up. An 
accompanying five and one-half page manual 
supplies further suggestions for obtaining and 
recording the data with comments upon their 
significance. 

The Examiner's Reading Diagnostic Record 
performs a special service in calling attention to 
the values derived from appraisal of oral read- 
ing and in providing a rather brief oral test. 
There are four 100-word paragraphs to be read 
orally by the student. These are graduated in 
difficulty, though unfortunately no information 
is supplied the user regarding their levels of dif- 
ficulty, either in vocabulary, sentence structure, 
or ideational complexity. Norms are not pro- 
vided, though the user is told a number of things 
which should be observed during the reading. 
Errors are to be marked in a standard way, but 
their particular significance as clues to faulty 
perceptual or mechanical reading habits and the 
possible use to be made of them in remedial work 
are not explained. 

Experienced workers derive greater value 
from case histories than do the inexperienced, 
but the seasoned clinician will probably already 
have devised a record form containing the items 
he has found useful in individual diagnosis. It is, 
therefore, probable that the Examiner's Read- 
ing Diagnostic Record will hold greater interest 
for those beginning this work. Quite correctly 
the manual is written from this standpoint and 


$10 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 540 


contains many valuable suggestions for the 
guidance of the beginner. One wishes it had gone 
even further and into more detail, as, for exam- 
ple, in suggesting types or even names of reading 
tests which might be used for various diagnostic 
purposes, in greater analysis of interests other 
than reading, in raising questions regarding en- 
vironmental and emotional factors in poor read- 
ing, and in explaining the significance mentioned 
above of oral reading as an index of silent habits 
of word recognition. 

In summary then, we have here a convenient 
record form for those without one of their own, 
a brief, nonstandardized oral reading test suit- 
able for upper class pupils, and a guide for the 
beginner which should help prevent him from 
overlooking factors of possible relevance to a 
diagnosis. 


Carolyn M. Welch, Formerly Acting Super- 
visor, The Reading Clinic, Baltimore, Mary- 
land. This is a form for obtaining information 
concerning a reading problem at the high school 
and college levels. The author emphasizes, in 
the accompanying manual, that the procedure 
is clinical in nature, an individual approach. 
The record embraces eleven sections including 
items to be recorded, suggestions for stand- 
ardized and informal tests, informal questions, 
reading material, and directions for the exami- 
ner. The manual offers an explanation of the 
diagnostic procedure and specific directions on 
how and where to locate information. 

The format of the record allots ample space 
for notation and encourages completeness of 
analysis. The first section, Identifying Data, 
reveals home background, reasons for referral, 
and a statement of the reading problem as rec- 
ognized by the student and his parents or 
teachers. The second and third sections, Sum- 
maries of Results of Standardized Tests (In- 
telligence, Achievement, and Reading) and 
‘Scholastic Achievement, respectively, give in- 
dexes to capacity and achievement relation- 
ships; hence, they indicate amount of retarda- 
tion. The physiological factors are examined in 
the fourth section, Medical Examination and 
Reports on Physical Condition. Questions 
listed in the fifth section, Developmental and 
Educational History, are particularly helpful as 
a guide to spotting the initiation of the problem 
and further defining it. Of significant aid for 
instructional purposes is the information on 
pupil interests obtained through the interview 


in the sixth section, Present Reading Interests, 
and in the seventh section, Other Interests and 
Activities. : 

In the eighth section, Present Reading 
Status, the author presents a series oi four 
paragraphs, arranged in order of increasing 
difficulty, for the analysis of oral reading habits. 
This appraisal is informal in nature, since the 
scoring has not been standardized and norms 
are not presented. However, the manual sug- 
gests that in most cases it will not be necessary 
to administer more than two paragraphs, se- 
lected to fit the estimated ability of the ex- 
aminee. Herein lies a hazy concept of what is 
meant by “ability.” This examiner has inferred 
that the comprehension of the particular test 
selections, word recognition skills and abilities, 
and certain study habits are to be determined 
through the use of the oral reading and the 
dictionary tests. No opportunities for determin- 
ing the instructional level in terms of graded 
materials are offered. No opportunities for ob- 
taining an informal appraisal of the child’s 
capacity for understanding the material in spite 
of the reading handicap are suggested. 

The ninth, tenth, and eleventh sections pro- 
vide spaces fér summary of results, recom- 
mendations, and follow-up. A paragraph in the 
manual suggests that records be continued dur- 
ing the instructional program. 

This diagnostic record presents an analytic 
procedure for screening and for diagnosing, to 
a certain degree, a retarded reader at high 
school and college levels. It leaves for more 
specific analysis those severe reading disability 
cases with neurological involvements and with 
severe emotional aberrations. 

This examiner continues to search for a way 
of recording the relationships of findings in 
order to determine the specific nature of the 
reading problem, to define the related and in- 
fluencing factors, and to determine which pro- 
cedures to recommend for instruction. 


For a review by Henry D. Rinslind and ex- 
cerpts from reviews by Seth A. Fessenden, 
Austin G. Schmidt, and one other, see 40:1535. 
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Gates Reading Diagnostic Tests, Revised Edition. 
Grades 1-8; 1926-45; individual; Forms I, II; 20¢ per 
record booklet ; 45¢ per manual ; $1.10 per set of oral re- 
sponse test material; 1o¢ per set of tachistoscopic 
cards; (60-90) minutes; Arthur I. Gates; Bureau of 
Publications, Teachers College, Columbia University. 
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REFERENCES 
1. GATES, AnrHUR I. The Improvement of Reading: A Pro- 


ram of Diagnostic and Remedial Methods, Revised Edition. 
D or latest edition seg (3) below Pp. xvii, 668. * (PA 9:5252) 
Ew RAPCU v pe Td 
of Determining Reading Readiness, New York: Bureau of Pub- 
lications, Teachers College, Columbia University, 1939. Pp. iv, 
in ARS UR T DE TE i 

gram of Diagnostic and Remediat Methods, "hid Eikon New 
York: Macmillan Co., 1947. Pp. xxi, 657. $4.25. * (PA 22:3195) 

T. L. Torgerson, Chairman, Psycho-Educa- 
tional Clinic, and Professor of Education, The 
University of Wisconsin, Madison, Wisconsin. 
These tests are available in two forms, con- 
sisting of a pupil’s record booklet, two sets of 
cards containing test material for the pupil, 
and a manual of directions. The 16-page record 
booklet contains a cover page for recording 
test scores, 12 pages of test material, a one- 
page check list of errors and difficulties, and 
a two-page analytic summary of other.tests and 
observations of vision, hearing, speech, per- 
sonality, interests, home, and school life. A 35- 
page manual contains directions, norms, and 
references to remedial procedures found in the 
author's Improvement of Reading (1). 

The series consists of the following eighteen 
tests: (a) oral reading, (b) oral vocabulary, 
(c) reversals, (d) phrase perception, (e) 
timed word perception, (f) untimed word 
pronunciation, (g) spelling, (4) syllabication, 
(i) recognition of syllables, (7) recognition of 
phonograms, (k) blending letter sounds-visual, 
(I) giving letter sounds, (m) reading capital 
letters, (n) reading small letters, (0) blending 
letter sounds-auditory, (p) giving letters from 
sounds, (q) giving words with initial sounds, 
(r) giving words with stated final sound. 

Both age and grade norms are contained in 
the manual for each of the tests. Grade norms 
for the oral reading test, which consists of 
seven paragraphs, extend from 1.6 to 8.5. The 
grade norms for the oral meaning vocabulary, 
consisting of 30 words, extend from 2.0 to 13.0. 
The grade norms for word perception—timed, 
phrase perception, pronunciation and syllabi- 
cation extend to 7.0, 6.5, 6.0, and 4.6 respec- 
tively. With the exception of the spelling tests, 
the norms on all of the remaining tests of basic 
skills in reading are based on perfect scores 
at a grade status of 3.0 for giving words with 
initial sounds and final sounds; 3.7 for rever- 
sals, recognition of syllables, and phonograms; 
and 4.0 for speed of recognizing capital letters 
and lower-case letters, letter blends, and letter 


sounds. 


Scores on the oral reading test are based 
upon six types of errors: hesitation of five or 
more seconds on a word, mispronunciation, 
omissions, substitutions, insertions, and repeti- 
tions. The length of time used by the pupil in 
reading a paragraph is not taken into consider- 
ation in scoring. The seven paragraphs increase 
in difficulty “at approximately uniform steps 
of a grade each." 

In an unpublished study by the author, the 
following reliability coefficients are reported: - 
oral reading .86, oral vocabulary .80; these are 
based upon correlations obtained between the 
two forms using 9o cases in the third grade. 
Correlation between odd and even items stepped 
up by the Spearman-Brown formula yielded 
coefficients ranging from .88 to .96 for eleven 
of the tests. Blending letter sounds orally, giv- 
ing letters for sounds, and giving words with 
stated initial sounds yielded coefficients of 84, 
.77, and .63 respectively. 

This series of diagnostic tests is the most 
comprehensive and usable individual test for 
diagnosis in reading at present available. The 
classroom teacher can learn to administer the 
series after a reasonable study of the materials. 
The administration of both forms to each case 
studied will insure adequate reliability for most 
of the tests. The observational check lists of 
more than 60 specific difficulties in reading, 
together with the student’s developmental his- 
tory, intelligence, hearing, vision, and speech, 
will provide an excellent basis for making a 
diagnosis of a reading disability case. 

The oral reading paragraphs provide an in- 
teresting story for the pupils. Recall questions 
which would measure comprehension of these 
paragraphs would add to the completeness of 
the diagnosis. Teachers who differ as to the 
“brand” of phonics emphasized in their teach- 
ing may omit the tests that have no curricular 
validity for them without interfering with the 
results of the remaining tests. Many teachers 
would. welcome a profile chart that would en- 
able them to record the test scores graphically. 
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Inventory of Reading Experiences. Grades 9-16; 
1940; 1 form; $1.75 per 25; 15¢ per specimen set; non- 
timed (15) minutes; Frederick L. Pond; Stanford 
University Press. 
REFERENCES 
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Albert J. Harris, Assistant Professor of 
Education, The City College of New York, 
New York, New York. This inventory is a 
carefully developed questionnaire intended for 
use in appraising the quality and quantity of 
the reading experiences of high school and col- 
lege students. Part I, Qualitative Inventory, 
contains 100 items, grouped in ten sections, to 
be marked on a three-point scale: never, some- 
times, often. Part II, Quantitative Inventory, 
contains 50 items. Most of the items seem rele- 
vant and appropriate. Twenty of the qualita- 
tive items, however, including 9 with very high 
weights, deal with radio and moving picture 
interests; the inclusion of these seems unfor- 
tunate. There is one form, administered with- 
out time limit. The format is satisfactory. A 
weighted scoring scheme similar to that used 
for the Bernreuter Personality Inventory is 
employed. Cardboard stencils are provided, and 
the use of Veeder counters is recommended for 
rapidity in scoring. 

The inventory seems to have been developed 
through the use of elaborate research tech- 
niques. Reliability is satisfactory: .92 for Part 
I and .91 for Part II, for a single grade, using 
the split-half, Spearman-Brown technique. Re- 
liabilities for the 10 sections of Part I are not 
given but are probably low. Validity data of 
several kinds are presented, which seem to 
show that the inventory has low positive corre- 
lations with IQ, reading ability, literary knowl- 
edge, school marks, and socio-economic status ; 
differentiates between remedial readers and 

. mormal readers less satisfactorily than a good 
‘vocabulary or reading test does; and correlates 
highly (above .80) with reading diaries kept 

. by high school students, both for single items 

and for total scores. It is, of course, subject 
to any tendencies toward falsification that 
might also influence reading diaries. 

Percentile norms are given by grades for the 
two main scores, and quartile norms for each 
of the ten sections. The norms for grades 9, 
10, 11, and 12 are based on over 1,000 students 
in each grade. College norms are based on 187 
freshmen, r28 sophomores, and 84 graduate 
students. No information is provided which 
allows a judgment about the representativeness 
of the sampling. The norms must, therefore, be 

regarded as tentative. 

This inventory is the first to provide quanti- 
tative scores and norms for the reading inter- 
ests of high school and college students. It will 
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probably prove useful to instructors in litera- 
ture and remedial reading courses and in re- 
search studies on reading at the high school 
and college levels. 


[512 ] 
Record for Reading Diagnosis. Grades 1-8; 1936; 
withdrawn from publication; C. R. Stone; Webster 
Publishing Co. 


Carolyn M. Welch, Formerly Acting Super- 
visor, The Reading Clinic, Baltimore, Mary- 
land. This four-page leaflet was designed to 
record data and information pertinent to the 
causes of a reading problem. The record con- 
sists of sixteen sections suggesting the gather- 
ing of data on intelligence; reading achieve- 
ment; oral reading habits; word recognition 
skills and abilities; emotional characteristics; 
physical status; handedness; eyedness; speech, 
sight, and hearing deficiencies; handwriting 
and language habits; school progress; and 
reading attitudes. Most of the data must be 
obtained in an individual testing situation. 

In general, the record suggests that a num- 
ber of factors be considered in diagnosing a 
reading problem. In many sections, responses 
of correct and incorrect nature are given. For 
example, under the heading Speech in Section 
IX, these responses are found: “Normal; baby 
talk; lisps; stutters; stammers.” Those who 
wish to record more specifically the nature of 
the speech deficiency will have difficulty en- 
closing notes in the space provided. The record 
allots space for hardly more than summary- 
type recordings of observation. 

This user of the record finds the following 
needs unsatisfied: A column for the chrono- 
logical age in the first section on intelligence. 
A definition of the word "easy" as used in the 
second section describing the type of story to 
be used for silent reading. (It has been the 
writer’s experience that many children can be 
given material easy enough so that a satisfac- 
tory rate of comprehension exists with satis- 
factory reading habits, thus resolving the read- 
ing problem.) A statement regarding the type 
and the difficulty of the material to be used for 
oral reading in the third section. Methods for 
determining handedness and eyedness. And a 
section for reporting language handicaps of 
a structural type, not involving speech defi- 
ciencies or foreign language handicaps. 

The criteria for analyzing oral reading habits 
(Section IIT), word recognition skills and 
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, abilities (Section IV), and emotional charac- 
teristics (Section V) are very helpful. The 
fact that the record can in this respect be used 
at any grade level is a salient feature. The 
quantity of factors to be considered for diag- 
nosis should be noted. However, the organiza- 
tion of the record presupposes the competence 
of the examiner in relating the findings for the 
summary (Section XV) and the recommenda- 
tions (Section XVI). These latter sections 
offer very little assistance in recording sugges- 
tions for the correction or remediation of read- 
ing retardation or disability. 


READINESS 
[ 513 ] 
X*American School Reading Readiness Test. 


Grade 1 entrants; 1941; Form A; $4.80 per 100; 30¢ per 
specimen sct, postpaid; nontimed (45) minutes; Rob- 
ert V. Young, Willis E. Pratt, and Carroll A. Whit- 
mer; Public School Publishing Co. 


REFERENCES 
1. Pratt, WirLIs E. The Construction of a Group Test of 


Reading Readiness. Unpublished doctor's thesis, University of 
Bu 1940. (Abstracts of Theses . . . 1940, 1941, pp. 265- 

David H. Russell, Professor of Education, 
University of California, Berkeley, California. 
This test consists of an above-average number 
of subtests with a below-average number of 
items in each subtest It comprises eight sub- 
tests, but the largest number of items in any 
test is ten and three tests contain only six items. 
The student of test construction, accordingly, 
will immediately question the reliability of the 
subtests. The authors report a split-halves re- 
liability coefficient of .91 for the test as a whole 
but give no indication of the reliabjlity of the 
short subtests. 

The authors state that the eight subtests were 
selected from items mentioned in the literature 
as being related to reading readiness which can 
be tested by group methods. They include vo- 
cabulary (marking pictures), discrimination of 
letter forms (selecting one letter unlike three 
others), discrimination of letter combinations 
(matching one of four combinations with a 
criterion combination), recognition of words 
(selecting a letter that does not belong), recog- 
nition of words (matching one of four with a 
criterion word), discrimination of geometric 
forms (matching as above), following direc- 
tions (marking pictures), and memory for geo- 
metric forms (drawing from memory after a 
short exposure). The authors report intercor- 


relations of these subtests ranging from .17 to 
.61 with a group of children who had been in 
kindergarten and from .29 to .69 in a nonkin- 
dergarten group. 

The authors of this test are to be commended 
for the statement they make in the accompany- 
ing manual of their experimental work in de- 
veloping the test. The description of successive 
stages of development is succinct and clear. 
The final validation of the test was accom- 
plished on too small a group to please all test 
users, but the statement is available to all. The 
authors found a multiple correlation of .53 be- 
tween their test and Gates Primary Reading 
Tests given six months later. 

Since the predictive value of the test is low, 
the authors have added a feature not usually 
found on similar tests. By means of weighting 
the scores on the subtests and regression equa- 
tions, they predict “an estimated reading grade 
on a reading test at the end of the first school 
year rather than the arbitrary determination 
of some critical point above which the pupil will 
succeed and below which he will fail.” These 
predicted grades range from 1.2 to 3.2. In view 
of the wide variety of factors affecting success 
in beginning reading, the authors have made 
a commendable effort to avoid the use of a 
critical score which has little or no value in 
dividing the sheep from the goats. If they could 
improve their total test to give it higher pre- 
dictive value, their approach would be a very 
useful one. 


Paul A. Witty, Professor of Education, 
Northwestern University, Evanston, Illinois. 
This survey test and manual constitute a con- 
scientious effort to meet the need for “a meas- 
ure of reading aptitude which may be used 
easily by the regular teacher in the classroom.” 
Tests and studies were examined and the fol- 
lowing types of tests were selected for the 
American School Readiness Test: (a) visual 
discrimination, (b) vocabulary, (c) copying 
geometric forms from memory, and (d) ability 
to follow directions. From 143 items which 
were judged to be discriminatory, 60 items 
apiece were selected for Form A and for Form 
B. Experimental work on reliability was based 
on the study of 226 pupils “in twelve rooms in 
three typical school districts.” 

The American School Reading Readiness 
Test is designed as a survey test to be used in 
examining groups of children. In constructing 
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the test, the predictive value of individual items 
was considered; the multiple correlation tech- 
nique was utilized to weight each part of the 
test. Separate norms were developed for kin- 
dergarten and nonkindergarten pupils. Several 
measures of validity appear to be substantially 
high. So too are the measures of reliability. 
Directions in general are concise and clear. 
However, lack of time limits proves a handicap 


to some teachers in giving this test. Directions 


for some tests should be made more specific. 
For example, in Test I, the pupils are in- 
structed as follows: “ ‘Put your piece of paper 
on the page so that only the first row of pic- 
tures is showing. Now put a mark on the boy." 
(Show on the board how to make an x)" 
This direction is illustrative of several that 
could be greatly improved. For example, the 
examiner might be instructed to demonstrate 
with his own paper how to use the marker. 
The marker should be designated as a marker 
rather than as. “a piece of paper.” 

In testing young children, it is best for the 
examiner to work with them, studying one row 
of materials at a time (Test II) instead of in- 
Structing pupils to follow complex directions 
such as: "Put aside your piece of colored 
paper. Look at the rest of the page and you 
will see more blocks. In each one put a mark 
on the letter, etc." Test IV is crowded, and 
instructions for its administration are incom- 
plete. The samples should be made clearer in 
order that children will not place the marker 
incorrectly when the teacher says, "Look at the 
two blocks. In the block on the xipht; 5 2o" 
Here the child may think the block on the right 
is the large square block at the extreme right 
which has nothing to do with this test. 

The giving of the test would be less confus- 
ing if the pictures in Test T were improved; 
€g., the rock in row 3 looks like a potato. 
Children who succeed in such tests do so by a 
process of elimination rather than discrimina- 
tion. 

Directions for Scoring are, on the whole, 
direct and clear, It is paradoxical that for some 
of the tests the weighted scores decrease as the 
Taw scores increase. “This phenomenon,” men- 
tioned in a footnote on page 11 of the manual, 
and the fact that for kindergarten children the 
decrease occurs in Tests I, III, and IV, and 

for nonkindergarten children it occurs in Tests 
II and IV, merit further study. 


The test can be given by a classroom teacher 


to approximately 15 children at one time. More- 
over, the first test in the series is so easy that 
good rapport is established and children enjoy 
taking the test. However, there are some de- 
tails in giving and scoring which should be 
remedied. Also, some of the pictures should 
be improved, and the print for the entire test 
should be clearer and bolder. One page each 
should be provided for Tests IV and V. 

The American School Reading Readiness 
Test can be used effectively by the classroom 
teacher. Results secured from its tryout showed 
that this test is a superior one of its type and 
is practical to give and use in the typical class- 
room. 

[ 514 ] 
*Binion-Beck Reading Readiness Test for Kin- 
dergarten and First Grade. 1945; 1 form; $1.75 per 


25; 50¢ per manual; 50 per specimen set; (40) min- 
ae Harriet Seay Binion and Roland L. Beck; Acorn 


Publishing Co. 
REFERENCES 

1. Brinton, Harrier Seay. “Do We Know When a Child is 
Ready to Read.” Abstract. Proc Okla Acad Sci 21:139 '41. 
(PA 16:1153, title only) 

Irving H. Anderson, Professor of Educa- 
tion, University of Michigan, Ann Arbor, 
Michigan. This test is divided into four parts. 
Part I measures picture vocabulary as well as 
the ability to detect likenesses and differences 
in pictures. Part II measures the ability to 
follow directions, Part III measures picture vo- 
cabulary and memory for a story. Part IV is a 
test of motor control. The reliability of the test 
as a whole is .95. No reliability coefficients for 
the individual parts of the test are reported. 
Validity coefficients were calculated between 
Scores on the reading readiness test given in 
September and scores on the Gates Primary 
Reading Tests given in May of the same school 
year. The highest correlation obtained was .8o. 
Correlations between scores on the Piníner- 
Cunningham Primary Test and scores on the 
Gates Primary Reading Tests are also re- 
ported. These correlations run lower than the 
correlations involving the reading readiness 
test. These results suggest that the reading 
readiness test is a better predictor of achieve- 
ment in reading than the Pintner-Cunningham 
intelligence test. These correlations are based, 
however, on only 20 cases. 


Paul A. Witty, Professor of Education, 
Northwestern University, Evanston, Illinois. 
The purposes of the test are: *(1) to aid the 
teacher in determining whether or not pupils 
entering first grade are ready to learn to read. 
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and (2) to classify children in particular types 
of instruction, thus securing the development 
of right attitudes and habits and the prevention 
of wrong ones." Furthermore, the makers of 
the test assert: “The exercises which compose 
the Reading Readiness Test require the child 
to bring into use the developed factors in his 
physical, mental, emotional, and social matu- 
rity.” Validity is presumably established by 
measures of agreement with standard reading 
and mental tests. Coefficients for reliability ap- 
pear satisfactory. 

These tests are somewhat complicated and 
difficult to administer. The loose sheets found 
in the testing booklet, the poor drawings, the 
need for the examiner to assemble materials, 
and the mimeographed directions suggest the 
incompleteness of the test and the need for 
thorough revision. 

Some of the pictures are especially crude; 
for example, the family pictures for Test II, 
Row 6, and the animal pictures for Test II, 
Row 7. There is, also, a crowded effect with 
too much material on some pages. It is burden- 
some to assemble the materials necessary for 
administering this test. Test manufacturers 
should supply the necessary articles. In fact, it 
might be well to attempt to standardize the test 
using pictures of objects. 

Some of the instructions for giving and scor- 
ing the test are inadequate. For example, in 
Test IV, samples should have been included 
to show what are considered "best, average, 
and poor" drawings. The authors state that 
“This testing technique" (having each picture 
on the page marked, rather than just one pic- 
ture in each row) "provides picture-test items 
at less than one-third the usual cost." But the 
cost in energy and strain on the part of the 
teacher and the pupil in giving and taking this 
test is much too great. Many pictures are so 
poorly drawn that the child must proceed by 
the process of elimination to obtain his answer. 

The Binion-Beck Reading Readiness Test 
should be considered as an interesting effort to 
develop a useful test. Thoroughgoing develop- 
ment and revision are needed in the test and 
the manual before this readiness test can be 
recommended. 


[ 515 ] 
Classification Test for Beginners in Reading. 
First grade entrants; 1933; 1 form; of per test; man- 
wal free on request; 14(20) minutes; Clarence R. 
Stone and Clifford C. Grover; Webster Publishing Co. 


REFERENCES 
1. Wirsow, Franx T., AND Burke, AcNzs, “Reading Readi- 
ness in a Progressive School.” Teach Col Rec 38:565-80 Ap ‘37. 


Wendel Keng Reese ey, Kork, uso on 
Fea 13:6466) dE psa 

Marion Monroe Cox, Supervisor of the 
Reading Division, Psycho-Educational Clinic, 
The University of Southern California, Los 
Angeles, California. This test consists of two 
parts. Part I requires the child to look at a 
printed word and find it again among four 
words, three of which are similar in some re- 
spect to the test word. Part II requires the 
child to compare two words and indicate 
whether they are alike or different. Each part 
has several preparatory exercises for practice 
and a time limit of seven minutes. 

The test does not require the child to read 
the words but merely to observe and note their 
details carefully. The assumption is made that 
children who can compare and match printed 
words accurately will become better readers 
and learn more rapidly than those who cannot 
do so. To measure the validity of this assump- 
tion, the test scores made by children on the 
Classification Test for Beginners in Reading 
at the beginning of the semester were corre- 
lated with scores by them on the Lee-Clark 
Reading Test—Primer at the end of the semes- 
ter. The correlation coefficient reported by the 
authors is .62. While the test predicts success 
in reading in a general way, the prediction for 
any individual pupil is not very accurate, Many 
children possess this skill and yet do not learn 
to read easily because of other reasons, such as 
physical defects, emotional blockings, person- 
ality disturbarices, and other factors which may 
interfere with normal learning. 

The Classification Test for Beginners in 
Reading has a reliability coefficient of .97 and 
is easily administered; items for the test were 
selected after preliminary study of experimen- 
tal editions of the test. The norms consist of 
the scores at the roth, 25th, soth, 75th, and 
goth percentiles of beginning first grade chil- 
dren. The authors suggest that first grade 
teachers divide their pupils into three groups 
on the basis of the test scores: superior, aver- 
age, and low groups, each to be given a type 
of instruction suitable to the ability of the 
group. As a further aid in grouping the chil- 
dren, the authors suggest giving an intelligence 
test and comparing the mental ages of the chil- 
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dren with ratings on the Classification Test for 
Beginners. 

The authors recognize that the test has cer- 
tain limitations of which the teacher should be 
aware. They caution her to consider the clas- 
sification of pupils by their test scores as tenta- 
tive; whenever the teacher is convinced that a 
child would be better off in another group, she 
should use her judgment in making the change. 
Test results, therefore, are not to be adhered 
to rigidly but are to be supplemented with a 
generous amount of the teacher's common 
sense. It takes some time, however, before a 
teacher can become well enough acquainted 
with her pupils to estimate their abilities accu- 
rately. The authors offer the test as a quick 
screening device to classify pupils tentatively 
into fast, slow, and average groups with a fair 
chance that many pupils will be correctly placed 
and that the teacher will not need to make a 
great many changes later. 


David H. Russell, Professor of Education, 
University of California, Berkeley, California. 
The Stone and Grover Classification Test for 
Beginners in Reading must be given credit for 
being an early attempt to measure readiness for 
reading. For example, it appeared three years 
before Harrison's Reading Readiness (Hough- 
ton Mifflin Co., 1936), a book which stimu- 
lated greater interest in the field. Two other 
tests were published about the same time, but 
the Stone and Grover test must be considered 
one of the pioneers in this area. 

The relatively early publication of the test 
may contribute to its chief weakness as a readi- 
hess test, the narrow range of abilities it meas- 
ures. The two parts of the test both appraise 
visual perception abilities, similarities and dif- 
ferences in words. The narrow range of abili- 
ties tested simplifies procedures and reduces 
the time for administration of the total test (20 
minutes) but, of course, reduces the diagnostic 
possibilities inherent in the test. It gives a 
measure of children's word-recognition abili- 
ties but gives no information on other factors 
related to success in beginning reading such as 
size of understanding vocabulary, ability to fol- 
low directions, and ability to complete a story. 

The authors of the Classification Test for 
Beginners in Reading give a split-halves relia- 
bility of approximately .97 and found a corre- 

lation of .62 between scores on the test given at 
the beginning of a semester and scores on the 


Lee-Clark Reading Test—Primer given at the 
end of the semester. This does not indicate as 
high a predictive value as is usually found in 
a test more composite in nature. However, the 
same coefficient of correlation, .62, was found 
for this test by Gates, Bond, and Russell (2) 
to be the highest of the correlations obtained 
between scores on reading ability at the end 
of one, two, and three terms and 39 other tests 
sometimes used as possible predictors of read- 
ing success. Accordingly, the test would seem 
to have little diagnostic value but some ad- 
ministrative value for a tentative grouping of 
first-graders for purposes of instruction in 
reading. 
[ 516 ] 

Gates Reading Readiness Tests. Grade 1; 1939; 
1 form; $4.85 per 100; 30¢ per specimen set; nontimed 


(50) minutes; Arthur I. Gates; Bureau of Publica- 
tions, Teachers College, Columbia University. 


REFERENCES 

1-5. See 40:1537. 

6. Gates, ARTHUR I. “A Further Evaluation of Reading 
Readiness Tests.” El Sch J 0:577-91 Ap "40. * (PA 15:3581) 

7. WiLsoN, Frank T. "Early Achievement in Reading.” El 
Sch J 42:609-15 Ap '42. * (PA 16:3816) 

8. GATES, ARTHUR I. The Improvement of Reading: A Pro- 
Sork: Menendus Co 1997. Pp o, 6S7. 84.26." (BA 2253199) 

Marion Monroe Cox, Supervisor of ihe 
Reading Division, Psycho-Educational Clinic, 
The University of Southern California, Los 
Angeles, California. This test has three main 
objectives: (a) to measure readiness for be- 
ginning reading, (b) to predict rate of devel- 
opment of reading ability, and (c) to diagnose 
the pupils’ needs in several important abilities 
required in learning to read. 

The tests consist of five subtests: Picture Di- 
rections, Word Matching, Word-Card Match- 
ing, Rhyming, and Reading Letters and Num- 
bers. The first subtest involves hearing, under- 
standing, remembering, interpreting, and follow- 
ing verbal directions. The second and third sub- 
tests require observation of printed words. The 
fourth subtest requires recognition of similar 
word sounds. The fifth subtest is an achieve- 
ment test to determine whether the child has 
already formed associations between printed 
letters and numbers and their names. 

While it is recommended that all five sub- 
tests be used, norms are provided separately 
for each, so that a teacher may omit one or 
more of the tests and still have a rough meas- 
ure of readiness. On the other hand, space is 
provided on the face sheet for recording men- 
tal age, vision, hearing, and additional items of 
information that may add to the diagnostic 
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value of the tests. The reliability coefficients of 
the separate tests range from .78 to .96, and 
the coefficient is .97 for the whole test. Validity 
was measured by correlating the tests given to 
pupils on entering first grade with the Gates 
Primary Reading Tests at the end of the se- 
mester. The correlation coefficients range from 
.57 to .89 for various groups. The Gates Read- 
ing Readiness T'ests correlate slightly higher 
with later reading achievement when combined 
with scores from an intelligence test. 

Percentiles are given for each subtest, and 
a comparison of abilities on the subtests may 
indicate specific weaknesses of individual pupils. 
One section of the manual gives helpful sug- 
gestions for improving the abilities required by 
the first four subtests in a prereading program 
aimed to develop reading readiness. Although 
the test with the highest predictive value for 
later reading is the fifth subtest, i.e., ability to 
read letters and numbers, the author does not 
suggest that letters and numbers be taught as 
a prerequisite to later reading. Those children 
who are interested enough before entering the 
first grade to learn isolated letters have a drive 
toward reading which will facilitate rapid prog- 
ress, but an intensive drill on letters is not 
recommended as a substitute for a broad read- 
ing program out of which the knowledge of let- 
ters will gradually, and in due time, emerge. 

Dr. Gates warns against using the reading 
readiness tests as a sole basis for classification 
of beginners into ability groups. Reading 
achievement alone, is too narrow a criterion for 
promotion from one grade level to the next in 
the middle grades. Similarly, promise of ability 
to read is too narrow a basis for grouping 
within the first grade. Rather, he suggests, 
group pupils whose interests and needs are 
similar, but study the abilities of each child 
within the groups in order to individualize in- 
struction for effective learning. 


Paul A. Witty, Professor of Education, 
Northwestern University, Evanston, Illinois. 
On the whole, the mechanical features of the 
tests are satisfactory. However, pages are not 
numbered, and tests are not differentiated by 
titles, These factors produce confusion in some 
instances, although the clear outlines of the 
pictures tend to offset this confusion. Time 
limits are not given. It would be reassuring 
and helpful to many teachers if time limits 
were made available. Rows of words and pic- 


tures in Tests 2, 3, and 4 are not numbered. 
Picture labels or numbers would help some 
children identify the rows. è 

Twenty cards have to be printed by the ex- 
aminer to accompany Test 3. Making these 
cards is time-consuming. Commercial cards 
might well be made available. In taking this 
test, the children have to look at examiner’s 
printed card and then match it with a word 
chosen in their test. This is a tedious process 
for many children. It might be desirable to 
have the two words printed on the same page 
in the child’s test booklet and let him identify 
the required word from his own page. The 
alphabet and number test is administered in- 
dividually. This is extremely time-consuming. 
It might be desirable to attempt to obtain simi- 
lar results by use of group techniques. 

In Tests 2, 3, and 4 suggestions are made 
as to ways of scoring tests and of making sten- 
cils and keys. Many examiners desire to have 
commercial keys and stencils. In scoring some 
tests, explicit instructions are offered as to 
whether credit is given when the crosses and 
circles are interchanged; in other tests no in- 
formation is given as to the exactness of the 
circles or crosses. Hence, some confusion is 
found. 

The Gates Reading Readiness Tests conform 
to acceptable standards for test construction. 
However, these tests did not consistently hold 
the attention and interest of some public and 
private school children to whom they were ad- 
ministered recently. Examiners report that the 
use of both group and individual methods of 
administering the test was a most important 
factor in causing interest to lag. Nevertheless, 
these tests have proved useful in securing one 
type of information which, when accompanied 
by adequate supporting data, seems to indicate 
a child's readiness for reading. 


For excerpts from reviews by Austin G. 
Schmidt and one other, see 40:1537. 


[2517-1 
Lee-Clark Reading Readiness Test, 1943 Revi- 
sion. Grades kgn-1; 1931-43; 1 form; $1.20 per 25; 35€ 
per specimen set, postpaid; J. Murray Lee and Willis 
W. Clark; California Test Bureau. 


REFERENCES 
x. Lee, J. Murray; Crank, WirLrs W.; AND Lee, Dorris 
May. “Measuring Reading Readiness.” El Sch J 34:656-66 My 
*34. * (PA B7) Y "ir 
2. WiLMoxE, Warpno W. Relative Validity of Three Group 
Readiness Tests in Predicting Reading Achievement, Unpub- 
lished master’s thesis, University of Kansas, 1939. 
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Marion Monroe Cox, Supervisor of the 
Reading Division, Psycho-Educational Clinic, 
The University of Southern California, Los 
Angeles, California. This test was designed to 
assist teachers in identifying children who are 
ready to learn to read. It consists of three 
parts, each of which measures a skill consid- 
ered essential to learning to read: (a) ability 
to discriminate printed letter forms; (b) abil- 
ity to select pictures according to verbal de- 
scriptions; and (c) ability to discriminate 
printed word forms. Since both the first and 
third parts of the test involve detecting simi- 
larities and differences between printed forms, 
the test is more heavily weighted with items 
requiring visual discrimination than with ver- 
bal items. 

The test is brief enough that first-grade chil- 
dren can usually take the entire test in one sit- 

ing without fatigue. Directions are simple and 
the test is easily given and scored. Norms are 

interpreted for the teacher in terms that are 
understandable without special training in 
Statistics or measurements. Because of its ease, 
simplicity, and brevity the test would appeal to 
a busy teacher who wants a quick method of 
grouping children and does not have the time 
or training for administering a more extensive 
or diagnostic battery of tests. 

The test has a reliability coefficient of 92 
and predicts later reading achievement as well 
as many group intelligence tests do at this 
level. The correlation coefficients between the 
Lee-Clarb Reading Readiness Test and the 
Lee-Clark Reading Tests—Primer are re- 
Ported to be .67 for unselected first grades and 
-43 for first grades whose pupils were above 
average in ability. These correlation coefficients 
indicate that the skills involved in the tests are 
related to reading, but that the predictive value 
for any individual child is not highly accurate. 
Hence, teachers should interpret the results 
not too rigidly but supplement the ratings by 
their own judgment and evaluation of pupils’ 
abilities. 

The authors are aware of the limitations of 
the test as a sole criterion of readiness to read 
and suggest that an intelligence test also be 
given. When both his mental age and reading 
readiness test score indicate sufficient matu- 
rity, the child’s success in learning to read can 
be predicted more reliably than by either test 
alone. The authors also suggest that the teacher 
investigate many other factors which may im- 


pede reading, such as foreign language ; visual, 
hearing, or speech defects; social and person- 
ality maladjustments; etc. Children who have 
any of these difficulties should be studied fur- 
ther in order to give them a type of develop- 
mental reading program which is best suited to 
their needs. 

The Lee-Clark Reading Readiness Test is 
useful to teachers who wish to discover quickly 
and at an early date children who cannot dis- 
criminate printed forms or respond accurately 
to verbal directions. Such children undoubt- 
edly lack important and desirable skills for 
reading. As a screening device to locate cer- 
tain types of difficulties, teachers will find this 
test appropriate and useful, but should not 
consider it a thorough analysis of readiness to 
read. 


David H. Russell, Professor of Education, 
University of California, Berkeley, California. 
This is a revísion of a test, first published in 
1931, which was a widely used pioneer instru- 
ment in the testing of reading readiness. The 
present form consists of four tests: (a) match- 
ing letter symbols in two columns, (5) crossing 
out one letter of four which is unlike the other 
three, (c) testing vocabulary and ability to fol- 
low directions by marking pictures according 
to instructions, (d) identifying letters and 
words by marking one of four symbols that is 
the same as a criterion symbol. The test ac- 
cordingly gives more place to letter symbols 
than most other readiness tests. Since Gates 
reports that “the predictive value of a particu- 
lar test varies with the teaching method,” * this 


-may lower the value of the test. Most first- 


grade teachers today do not give much place to 
teaching letters at beginning levels of reading. 

Unlike a number of readiness tests evaluated 
elsewhere in this volume, the authors of the 
Lee-Clark Reading Readiness Test give the 
split-halves reliabilities of their subtests as well 
as the whole test. These range from .83 to .94. 
In checking the validity of their test, the au- 
thors report a correlation with the Lee-Clark 
Reading Tests—Primer of 67 in two first 
grades and a lower correlation of .432 with a 
group of superior children. The correlation of 
the readiness test with the California Test of 
Mental Maturity is .65, an average figure for 
such calculations. 


* Gates, Arthur I. “An E; i it dii 
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The manual accompanying the test lists 
grade equivalents for various total scores, a 
unique feature. A number of test users will 
criticize Table II of the manual stating “Prob- 
able Per Cent of Failure" since it assumes that 
failure is necessary or inevitable. A derivation 
of this table stating the amount of delay before 
beginning reading corresponding to various 
scores gives the first grade teacher a more posi- 
tive approach to her problems of grouping for 
instruction. However, there are so many fac- 
tors affecting the child's early progress in 
school that setting a definite period for “delay” 
may be dangerous. 

Although it is unfortunate that the authors 
suggest per cent of failures and length of “de- 
lay,” they must be given credit for stressing 
the value of supplementary information about 
the child such as mental age, vision, speech, 
and teacher’s rating. A short bibliography in 
the manual should be useful to primary teach- 
ers, Finally, the Lee-Clark test achieves some 
sort of balance between oversimplification and 
too great complexity of tests, noted elsewhere 
about certain other readiness tests reviewed in 
this volume. 


[ 518 ] 
Metropolitan Readiness Tests. Kindergartners and 
grade I entrants; 1933-39; 1 form; $1.55 per 25; 35€ 
per specimen set, postpaid; (70) minutes; Gertrude 
H. Hildreth and Nellie L. Griffiths; World Book Co. 
REFERENCES 


1-10, See 40:1552. 

11, Nees, MARIE MARGUERITE. The Prognosis of Success in 
14 Reading. Unpublished master's thesis, George Washington 
University, 1937; Pp. t 

12. WILSON, FRANK T., AND BURKE, AcNzs. “Reading Readi- 
ness in a Progressive School.” Teach Col Rec 38:565-80 Ap 


"37. * (PA 11:3918) 
13. Dean, CHamLes D. “Predicting First-Grade Reading 


Achievement.” El Sch J 39:609-16 Ap '39. * (PA 13:6458) 

14. FitzGerap, Frances. Predicting Reading Success of 
First-Grade Entrants. Unpublished Tamers ess, Duke Uni- 
versity, 1939. Pp. 96. 

is. TmAxLER, Artuur E. “Reliability and Predictive Value 
of the Metropolitan Readiness Tests,” pp. 1935. In 1946 Fall 
Testing Program in Independent ‘Schools and Supplementary 
Studies, Educational Records Bulletin, No. 47. New York: Edu- 
cational Records Bureau, February 1947. Pp. x, 58. Paper, 
lithotyped. $1.50. * (PA 21:2498) 

Irving H. Anderson, Professor of Educa- 
tion, University of Michigan, Ann Arbor, 
Michigan. The Metropolitan Readiness Tests 
continue to be among the most widely used of 
the readiness tests. The tests are intended to 
measure readiness for reading, arithmetic, and 
even writing and are thus somewhat broader 
in scope than most other readiness tests, which 
are rather generally designed to measure readi- 
ness for reading alone. The Metropolitan tests 
are easily administered to first-grade children 


—more easily than the Monroe Reading Apti- 
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tude Tests, for example. However, the Metro- 
politan tests are probably not as analytical of 
readiness for reading as the Monroe tests. 

Metropolitan tests require approximately 

minutes to administer. Percentile norms are 
available in the manual of directions. These 
norms show that there is a significant relation- 
ship between scores on the test and achievement 
in the first grade. High correlations exist be- 
tween scores on the Metropolitan Readiness 
Tests and scores on intelligence tests. These 
correlations are of approximately the same 
magnitude as the correlations between scores 
on different intelligence tests. It would probably 
be unnecessary to give the pupils both an intel- 
ligence test and the Metropolitan Readiness 
Tests. A good intelligence test would serve 
essentially the same purpose. Readiness for the 
work of the first grade is not difficult for th 


observant teacher to determine. * 
For review by W. J. Osburn, see 40:1552. 
[ 519 ] 


Reading Aptitude Tests. First grade entrants ; 1935; 
individual in part; I form; $1.35 per 25; 72€ per set 
of testing materials needed by examiner ; 10¢ per speci- 
men set; (40-55) minutes; Marion Monroe; Houghton 
Mifflin Co. 
REFERENCES 

. Wvri, Atice E. A Stud: the Monroe Reading Apti- 
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3. Kirk, SAMUEL A. ing Aptitudes of Mental ly Re- 
tarded Children." J Psycho-Asthenics 44:156-62 "39. * (PA 14: 


a Sypney. heading Readiness and Readin, 
xp 


., RosLow, 
Adchicsement in First Grade.” J Exp Ed 9:154-9 D '4o. * (P 
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Irving H. Anderson, Professor of Educa- 
tion, University of Michigan, Ann Arbor, 
Michigan. An effort has been made in this test 
to include items which the literature shows 
have differentiated between good and poor 
readers. For example, the literature suggests 
that children who are successful in beginning 
reading have better motor control than children 
who ate unsuccessful. A test of motor control, 
therefore, is included in the examination. The 
other tests of the battery have been selected in 
the same way. These other tests cover such 
things as memory of orientation of forms, ocu- 
lar-motor control and attention, visual memory, 
auditory word-discrimination, sound blending, 
and picture vocabulary. The foregoing are 
tested on a group basis. A few individual tests 
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are also included. Auditory memory, speed and 
accuracy of articulation, and speed of associ- 
ation are examples of the individual tests. 
Teachers like the Monroe test because they 
consider it to be diagnostic of specific difficulty. 
The group tests can be administered in 30 to 
40 minutes. The individual tests require an 
additional ro to 15 minutes per child. Percen- 
tile norms are provided in the manual of direc- 
tions. These norms show that high and low 
Scores on the total test are good predictors of 
success and failure in beginning reading. The 
test represents a good effort to apply the re- 
Sults of scientific investigations in the field of 
reading. 


[ 520 | 
Reading Readiness Test. First grade entrants ; 1932- 
38; individual; Forms A, B in one booklet ; $1.50 per 
25 tests; 50f per manual and accessories needed by 
examiner; 5o¢ per specimen set, postpaid ; (30) min- 
utes; M, J. Van Wagenen; Educational Test Bureau. 


1. WILSON, FRANK T., AND BURKE, AGNES. "Reading Readi- 
ness in a ei iyo School.” Teach Col Rec 38:565-80 Ap '37. 


David H. Russell, Professor of Education, 
University of California, Berkeley, California. 
Since Van Wagenen's Reading Readiness Test 
is made up of six subtests administered indi- 
vidually it has more diagnostic possibilities 
than a simpler test such as Stone and Grover's 
Classification Test for Beginners in Reading. 
This advantage is at least partly offset by the 
longer time required for administration and a 
rather elaborate System of scoring. 

The six subtests are as follows: (a) Range 
of Information, composed of 30 items of com- 
mon knowledge such as the number of cents 
in a dime; (b) Perception of Relations, com- 
posed of thirty analogies to be completed; (c) 
Vocabulary-Opposites, which requires the 
child to give the opposites of such words as 
empty and throw; (d) Memory Span for Ideas, 
which requires the child to Tepeat sentences of 
increasing length; (e) Word Discrimination, 

in which the child picks out from a line of five 
words (viewed in a slit) one word which is 
unlike the four others; and (f) Word Learning, 
an associative learning test in which the child 


associates an English word with a foreign 
word (such as bird with mesto) exposed five 
different times on a card. 

These six tests obviously attempt to measure 
quite different abilities and agree with the 
common view that readiness for reading is a 
complex of factors. The authors, however, 
give little evidence to suggest that each of these 
subtests has considerable value in predicting 
later reading success. The manual quotes only 
one result, a correlation of .73 between the 
average of total scores on the two forms of 
the test (omitting the fifth subtest) given at 
the beginning of the first grade and an average 
of four reading tests given at the end of the 
first grade, Intercorrelations between the sub- 
tests and results, including the fifth test, Word 
Discrimination, are not given. In a study by 
Gates, Bond and Russell (3) the average cor- 
relation between the total score and reading 
Scores after one, two, and three terms was . 52 
in four New York City classes, and the cor- 
relations for the subtests ranged from .23 for 
the Memory Span for Ideas subtest to .17 for 
the Perception of Relations subtest. The valid- 
ity of the parts of the test would accordingly 
seem to be still in doubt. 

The rather elaborate system of scoring the 
test may discourage some prospective users 
among teachers but probably will not dissuade 
testing bureaus or trained test users from em- 
ploying the test. Conrad (see. 38:878) has 
stated clearly some of the disadvantages in 
using C-scores. However, the use of such 
Scores here has the advantage of giving the 
child's ability in relation to the difficulty of the 
tasks in the various scales. In addition, the use 
of the Aptitude Index (based only on the first 
four subtests) gives a measure of his probable 
Success, taking into account his chronological 
age. Van Wagenen recommends a “much 
slower modified program" for children who 
have Aptitude Indexes below 90, particularly 
if they start before six or six and one-half 
years of age. If the author can establish its 
validity more clearly, the results of the test can 
be interpreted, by means of the scores men- 
tioned and the profile chart provided, to pro- 
vide definite guidance in the child's early read- 
Ing experiences. The two forms of the test and 
the provision for incorporating its results into 
cumulative records are Strengths. Because of 
administration and Scoring difficulties, it is 
probably best used as a follow-up test for diag- 
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nostic purposes if a simple test indicates pos- 
sible difficulties. 
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Stevens Reading Readiness Test. First grade en- 
trants; 1938-44; formerly published by American Edu- 
cation Press, Inc.; 1 form; $2.40 per 25; 35€ per speci- 
men set, postpaid; ro daily sittings over a 2-week 
period ; Avis Coultas Stevens; World Book Co. 


Irving H. Anderson, Professor of Educa- 
tion, University of Michigan, Ann Arbor, 
Michigan. This test consists of three parts. 
Part I is designed to measure visual discrimina- 
tion. Part II is designed to measure the ability 
to listen to, comprehend, recall, and retell a 
short story. Part III, which is optional, is de- 
signed to measure visual-auditory recall, Other 
factors related to reading readiness are assessed 
by means of a detailed individual check list or 
child study record. The check list is included 
as a part of the test blank proper. Reliability 
coefficients are reported for the test as a whole 
as well as for the individual parts and various 
combinations of parts. All reliability coeffi- 
cients are satisfactory for individual diagnosis 
and prediction. The intercorrelations between 
the various parts of the test are sufficiently low 
to suggest that different functions are measured 
by the separate parts. Satisfactory validation 
data are reported. The statistical data as a 
whole are impressive. Norms are presented 
both in percentiles and standard units. Some 
teachers will object to the time it takes to ad- 
minister the test. Several sittings over a period 
of two weeks are recommended. Part I alone 
consists of 16 pages involving 120 items. How- 
ever, the information which the test provides 
on each individual child is worth having. 
Teachers will do well to give this reading 
readiness test careful study. 


Marion Monroe Cox, Supervisor of the 
Reading Division, Psycho-Educational Clinic, 
The University of Southern California, Los 
Angeles, California. This test consists of a six- 
teen-page test booklet encased in a four-page 
folder which contains a check list of observa- 
tions about the child; judgments as to his per- 
sonality, social and emotional behavior, back- 
ground, and family relationships; and space for 
recording data from intelligence tests, physical 
examinations, and kindergarten attendance. 
The battery of tests includes three parts: (a) 
group tests of ability to discriminate printed 
letters, words, and phrases; (b) an individual 
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test of ability to reproduce a story heard; and 
(c) an individual test of ability to recall words 
which have been associated with pictures in a 
group game played daily for three days pre- 
vious to the test. 

The testing program requires two weeks 
early in the first grade during which the pupils 
are given certain sections of the test according 
to a daily schedule. As the testing proceeds, 
the teacher becomes acquainted with each child 
personally and fills out the check list concern- 
ing his personality and history. The Stevens 
Reading Readiness Test, therefore, stimulates 
the teacher to make a brief "case history" of 
each pupil, to gather together pertinent data, 
and to observe each pupil's reactions in both 
group and individual testing. 

A teacher who has followed the suggested 
program carefully undoubtedly discovers many 
things about each child which will enable her 
to individualize instruction and adapt to the 
child's needs in teaching him. 

The validity of the test was determined by 
correlating the total test scores at the beginning 
of the semester with teachers’ reports of 
achievement in reading at the end of the se- 
mester, The correlation coefficient obtained is 
.8o, an unusually high one in the light of the 
many factors that influence children's ability to 
learn. The test predicts very well how each 
child's reading achievement will be rated by 
his teacher at the end of the semester. How- 
ever, pupils who scored very low on the tests 
were not exposed to reading, those who scored 
low were given only simple charts for reading, 
and at each level the work was adapted to the 
children's abilities as shown by the test. A high 
correlation between test scores and final achieve- 
ment is, therefore, to be expected. 

The manual which accompanies the test con- 
tains not only directions for administering the 
tests, and the norms, but also a number of help- 
ful suggestions and recommendations for im- 
proving the child’s ability in cases where spe- 
cific weaknesses have been revealed by the test. 

The Stevens Reading Readiness Test may 
require more time in testing and observing chil- 
dren individually than many teachers will be 
willing to give, and not all the traits listed in 
the Check List are always closely related to 
reading readiness. An investigation of each 
child while compiling such a list is illuminat- 
ing, however, and the teacher may be well re- 
paid for her trouble by a better understanding 
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of each child's problems: In obtaining a his- 
tory, the cooperation between parents and 
teachers is essential, and a close parent-teacher 
relationship may clarify goals and pave the way 
toward better learning. If teachers using the 
tests were well trained in mental hygiene or 
had access to a counselor who could help inter- 
pret the data wisely, the individual analysis 
stimulated by the Stevens Reading Readiness 
Test would be as helpful as or even more help- 
ful than zhe actual test scores in improving 
instruction in reading. 


SPEED 


[ 522] 
Chapman-Cook Speed of Reading Test. Grades 4-8; 
1923-24; Forms A, B; 75¢ per 25; as UI specimen 
set, postpaid ; 2 (10) minutes; J. C. Chapman and 
Sidney Cook; Educational Test Bureau. 


REFERENCES 
„1. CHAPMAN, J. Crossy, AND Coox, S. “The Principle of the 
Single Variable in a Speed of Reading Cross-Out Test.” J Ed 
Res 8:389-96 D '23. * 

Eason Monroe, Chairman, Language Arts 
Division, San Francisco State College, San 
Francisco, California. The Chapman-Cook 
Speed of Reading Test, one of the early tests 
of its kind, is designed to measure the rate of 
reading of pupils in grades 4 through 8. This 
test offers the advantage of a short testing time 
(two and one half minutes), simplicity and 
ease of administration, and objective scoring. 
Norms are furnished for each grade for which 
the test may be used. Two forms of the test 
are available. 

Very much like the Michigam Speed of 
Reading Test, for which it served as model, 
the Chapman-Cook Speed of Reading Test is 
composed of thirty-word items. In the latter 
part of each item one word is inserted which 
is out of meaning relationship with the rest of 
the item—‘one word which spoils the mean- 
ing." The reader is required to find this word 
in each item and cross it out. The test includes 
thirty such items. The content of the separate 
items is unrelated. 

The entire test is printed on the two sides of 
a standard 874 by 1r inch sheet, the directions 
and practice exercises on one side and the 
thirty items arranged in separate paragraphs 
and in two columns on the reverse side. Type 
Size and format appear to be satisfactory. 

The nature of the reading task involved in 

this test is highly specialized and unusual. It 
seems questionable, therefore, that the test re- 


flects validly those rates of reading which func- 
tion in the more representative types of read- 
ing. (See 523 for the reviewer's comments 
on the Michigan Speed of Reading Test.) 

The short testing time and the limited num- 
ber of items raise also the question of whether 
this test provides a reliable measure of reading 
rate for all the grade groups for which it is 
intended. 

This test appears to be most useful for teach- 
ers who are especially interested in studying 
the reading rates of intermediate grade pupils. 
As a survey device, the test may single out 
those pupils who should receive further testing 
or other forms of special attention. 


[ 523 ] 
Michigan Speed of Reading Test. Grades 6-16; 
1932-37; Forms I, II; $1.75 per 25, postpaid; 35¢ per 
specimen set, postpaid; 7(15) minutes; Edward B. 
Greene; Psychological Corporation. 


REFERENCES 
1-2, See 40:1553. “ 
3. Lancsam, RosaLinp Srreer. “A Factorial Analysis of 
Reading Ability.” J Exp Ed 10:57-63 S '41. * (PA 16:1187) 


Eason Monroe, Chairman, Language Arts 
Division, San Francisco State College, San 
Francisco, California. This test is designed to 
measure "speed of reading of easy material." 
Simplicity of organization and procedure and 
a short testing time (seven minutes) make this 
test easy and convenient to administer. Two 
forms of the test are available, with norms for 
subjects from third grade through the college 
senior year based upon 3,302 cases. Scoring is 
objective. 

The test is composed of 75 items. Each item 
consists of 30 words, arranged in two or three 
Short related sentences. There is no continuity 
of meaning from item to item. With few excep- 
tions, the vocabulary of the items is limited to 
the first 5,000 words in the Thorndike Teach- 
er's Word Book. The content of the items is 
said to be representative of the school and 
home situations common to third and fourth 
grade children (1). 

The test items are arranged in five-item par- 
agraphs in double columns on both sides of a 
standard 834 by 11 inch sheet. To provide this 
convenience of format, it has been necessary to 
use a size of type which is four or six points 
smaller than intermediate grade pupils are ac- 
customed to read. Otherwise, the test appears 
to be carefully prepared and presented. 

For the purpose of comprehension control, 
the Michigan Speed of Reading Test employs 


— 
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the technique used in the earlier Chapman- 
Cook Speed of Reading Test. In the latter por- 
tion of each item a single word has been in- 
serted which is out of meaning with the rest 
of the item. ds he reads, the subject is re- 
quired to "cross out, in the second part of each 
numbered passage, one word which spoils the 
meaning." 

Reliabilities of this test are reported to vary 
between .653 and .94, "with the mean relia- 
bility of .87 for single age groups" (1). De- 
spite the fact that the vocabulary and concepts 
of the test are adapted for third and fourth 
grade pupils, the relatively low reliability of 
the measure at these grade levels casts doubt 
upon its appropriateness for these young read- 
ers. At the junior high school level and above, 
however, the test maintains consistently satis- 
factory reliability. 

It is a difficult task to prepare a discrete 
measure of reading rate which is highly valid 
and reliable and yet convenient to administer 
and to score. Frequently, in an effort to pro- 
vide strongly for one or two of these important 
elements, test producers introduce shortcom- 
ings in other respects. Tests of reading rate 
which, for the sake of convenient administra- 
tion or high reliability, involve unusual or non- 
representative reading tasks may fall seriously 
short of valid measurement. 

Serious question can be raised concerning 
the extent to which this test measures the types 
of reading rate which are most representative 
and significant in general school and life read- 
ing. The reading problem involved throughout 
the entire test—looking for and crossing out 
in each item the single word which is out of 
meaning relationship with the remainder of 
the passage—focuses the reader's attention 
upon an unusual comprehension task. Instead 
of reading for the integrated meaning of each 
item, the reader may find himself scanning the 
first part of the item for a general clue which 
he can use merely to identify the misfit word. 
This specialized process may not be uniformly 
related to the reader's other reading rate per- 
formances and is certainly not a representative 
reading process. It is likely also that different 
readers may react variously to the annoyance 
of interrupting the flow of reading to cross out 
the inappropriate words. 

One of the most serious shortcomings of this 
test, in the reviewer's opinion, is the complete 
lack of continuity of meaning throughout the 


sequence of items. The reader, accustomed in 
his regular reading to deriving meaning through 
the continuous process of interpreting relation- 
ships, during this test finds that he must make 
a new reading adjustment for each item. He 
must demonstrate his rate of reading in a situ- 
ation which, lacking the dynamic qualities that 
support efficient efforts in normal reading, ac- 
tually prevents or inhibits rapid reading. In 
this respect the test calls for a type of perform- 
ance which is not commonly involved in gen- 
eral reading and which is perhaps relatively 
unimportant. 

Test users who are interested solely in the 
measurement of reading rate for junior and 
senior high school pupils may find this test 
useful for simple and convenient group screen- 
ing. Because of the important inadequacies of 
the test, however, pupils who make unusually 
high or low scores or who demonstrate in 
other ways a need for special attention should 
be checked by means of additional standardized 
and informal measures of reading. 


For reviews by Richard Ledgerwood and 
M. R. Trabue of an earlier edition, see 38:1171. 
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Reading Speed and Comprehension: Ohio Senior 
Survey Tests. Grade 12; 1935-41; Forms A, B; 3¢ 

er test; (25) minutes; S. L. Pressey and J, W. Sher- 
Eos (Form B) ; Ohio Scholarship Tests, Ohio State 
Department of Education. 


J. B. Stroud, Professor of Education and 
Psychology, State University of Iowa, Iowa 
City, Iowa. Each of two forms of this test, a 
part of The Ohio Senior Survey Tests battery, 
consists of a single reading selection of about 
2,400 words. The student is given an instruc- 
tional set to read as rapidly as is consistent 
with his ability to answer questions over the 
material after he has read it. At the end of five 
minutes he marks the line he is then reading. 
He is then allowed to finish the piece. Presum- 
ably the rate score is determined from the num- 
ber of words read in the five-minute period. 
The questions are answered without recourse 
to the reading text. 

This sort of test fulfills an important need 
in testing reading. In some respects it is a more 
exacting test of ability to read than are the 
tests in which the student has access to the text 
when answering the questions. This test re- 
quires self-direction. The reader must supply 
his own set as he must in any natural reading 
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situation. His own perceptual and thought 
processes rather than the nature of the ques- 
tions asked determine what elements in the 
piece shall be reacted to and how he shall react 
to them. Perhaps the most exacting gauge of 
the quality of reading is what responses the 
reader can make about the content after he 
has read a piece. The criticism that such tests 
are memory tests is not crucial if the test ques- 
tions are responded to immediately and if the 
items are of the kind and quality they should 
be. After all, items of detail—the kind for 
which the memory factor would be vitiating— 
are not good tests of reading ability, except of 
a very elementary kind, whether they are re- 
sponded to from memory or from inspection of 
the text. The items of the two forms of the 
test in question appear to be adequate. 


Miles A. Tinker, Professor of Psychology, 
The University of Minnesota, Minneapolis, 
Minnesota. This is one of the tests devised for 
use with Ohio high school seniors. A case study 
of a college student constitutes the reading ma- 
terial. Form A is to be administered before 
remedial training in the senior survey course 
is begun; Form B, after the program is com- 
pleted. The student reads as rapidly as possible 
for 5 minutes, marks the amount done, and 
then reads the rest of the material in the test, 
When finished, the booklet is collected, and a 
set of 30 multiple-choice comprehension ques- 
tions is given without time limit, Thus, the test 
yields a measure of speed in terms of words 
read per minute and a measure of comprehen- 
sion. Directions are clear and concise. 

There are no data in the manual or in the 
materials accompanying the test on number of 
cases employed for standardization or on equiv- 
alence of the two test forms. Percentile norms 
are given for speed and for comprehension 
Scores. Item norms (ie., per cent answering 
each item correctly) are also listed for the 
comprehension questions, Although Form A is 
printed in uniform line lengths, Form B has 
exactly five words on each line. This results in 
a marked variation in length of successive lines, 
ie. an undesirable typographical arrangement. 
Perhaps this explains the fact that Form B was 
read much slower than Form A although re- 
medial training had intervened. The obvious 
lack of equivalence in reading rate between the 
two forms makes interpretation of progress in 

reading hazardous, One can only note changes 


in percentile rank from one test to the other. 
The significance of such shiftings in rank nec- 
essarily must remain obscure in the absence of 
equivalent test forms. For instance, the student 
who reads 229 words per minute on Form A 
gets a percentile rank of 40. If, after training, 
he reads 199 words per minute on Form B, his 
percentile is 50. Interpretation of the compre- 
hension scores involves similar difficulties, 

The materials accompanying the test furnish 
no information on either the reliability or the 
validity of the speed of reading and the com- 
prehension scores. The trained worker objects 
to using tests either for clinical or for survey 
purposes unless he knows their reliability and 
validity, since this information is needed for 
interpreting the significance of the scores. 

The typography of the test is very illegible, 
The print is small, and the letters are blurred 
as if badly worn type had been used. 

This test is constructed in a fundamentally 
sound way. Rate of reading and comprehension 
scores are derived from the same material, The 
reader knows there will be a comprehension 
check when the test is started, and the timed 
reading period is of adequate duration, Choice 
of reading material appears adequate and or- 
ganization of the comprehension questions well 
done. Nevertheless, lack of information on 
standardization and the illegible print are 
serious obstacles to use of the test. The more 
adequately standardized C. hapman-Cook Speed 
of Reading Test and the Minnesota S. peed of 
Reading Test for College Students are more 
satisfactory for measuring speed of reading 
and gains in speed of reading than the test re- 
viewed here, 


[525 ] 

Reading Speed Test: National Achievement Test. 

Grades 3-8; 1938-39; Forms A, B: withdrawn from 

ublication in 1947; 3 or 4(10) minutes; Robert K. 
peer and Samuel Smith; Acorn Publishing Co. 


Eason Monroe, Chairman, Language Arts 
Division, San Francisco State College, San 
Francisco, California. This test “has been con- 
structed to test the pupil’s speed of reading.” 
The directions for administration are clearly 
Phrased and the test procedure is relatively 
simple. 

Each form of the test consists of a fairly 
long, somewhat continuous account of chil- 
dren’s activities which are likely to be mean- 
ingful for elementary school pupils. Normal 
paragraphing, favorable type size, and double- 
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columning of the text increase the readability 
of the material. 

Comprehension is presumably controlled dur- 
ing the reading process by requiring the reader 
to underline certain sentences throughout the 
text. In Form A, for example, the pupil is 
asked to “draw a line under every sentence that 
tells about John's dog, Fido." In each form, 
twenty-five sentences should be underlined. 
These sentences are distributed throughout 
twenty-two paragraphs in Form A and twenty 
paragraphs in Form B. 

This test offers an important advantage over 
other rate of reading tests which involve under- 
lining or crossing-out techniques for control- 
ling comprehension. This advantage lies chiefly 
in the relative continuity of the text, which pro- 
vides a more representative reading experience 
and increased pupil interest. Much of this 
value, however, is seriously reduced by the re- 
quirement of underlining the entire sentence in 
which the key symbols occur. For this reason, 
the test comes to be a measure not only of 
speed of reading but also of speed and skill of 
underlining. Conscientious pupils are likely to 
sacrifice reading time in their effort to under- 
line in exact accord with the test directions. No 
score may be given for sentences "incorrectly 
underlined." 

Alert pupils may soon discover that the sen- 
tences to be underlined in Form A invariably 
contain one of three clues—Fido, John's dog, 
or his dog. Scoring lines beside each paragraph 
indicate clearly for such pupils the number of 
sentences to be underlined in each paragraph. 
Similar clues are evident in Form B. Having 
made these discoveries, a pupil may shift his 
method of performance and make satisfactory 
test responses without continuing the normal 
process of sequential reading. The scores of in- 
dividual pupils are likely to be distorted by this 
possibility. 

This test may be used for group survey pur- 
poses, but it is important to use additional 
measures in the case of pupils who make un- 
usually high or low scores or who demonstrate 
in other ways the need for special study. 


SPECIAL FIELDS 
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x Interpretation of Reading Materials in the Natu- 
ral Sciences: Tests of General Educational Devel- 
opment: College Level, Test 3. College; 1944-45; 
IBM; Form B; separate answer sheets must be 


used; $3 per 25; 4o¢ per 25 machine-scorable answer 
sheets ; 15¢ per scoring key ; 25£ per specimen set, post- 
paid; worklimit (120) minutes; prepared by the Exam- 
inations Staff of the United States Armed Forces Insti- 
tute; published by the American Council on Education; 
distributed by the Cooperative Test Service. (Also dis- 
tributed by Science Research Associates: $3 per 25; 
65¢ per 25 machine-scorable answer sheets; 50¢ per 
key; 50¢ per specimen set.) 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 
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Interpretation of Reading Materials in the Natu- 
ral Sciences: Tests of General Educational De- 
velopment: High School Level, Test 3. High 
school; 1944-45; IBM; Form B; separate answer 
sheets must be used; $2 per 25; 40¢ per 25 machine- 
scorable answer sheets; 15¢ per scoring key; 25¢ per 
specimen set, postpaid; worklimit (120) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by the Cooperative 
Test Service. (Also distributed by Science Research 
Associates: $2 per 25; 65¢ per 25 machine-scorable 
answer sheets; 50¢ per key; 5o¢ per specimen set.) 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 


[ 528 ] 
XInterpretation of Reading Materials in the So- 
cial Studies: Tests of General Educational Devel- 
opment: College Level, Test 2. College; 1944-45; 
IBM; Form B; separate answer sheets must be 
used; $3 per 25; 40 per 25 machine-scorable answer 
sheets; 15 per scoring key; 25¢ per specimen set, post- 
paid; worklimit (120) minutes; prepared by the Ex- 
aminations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by the Cooperative Test Service. 
(Also distributed by Science Research Associates: $3 
per 25; 65¢ per 25 machine-scorable answer sheets; 
so¢ per key; 50¢ per specimen set.) 
REFERENCES 


1. Braptey, Many Evrru. “A Study of the Validity of the 
Armed Forces Institute Tests of General Educational Develop- 
nent in the Field of Social Studies.” Ed & Psychol Meas 6:265- 


8 su '46. * (PA 21:595) 

W. E. Hall, Associate Professor of Educa- 
tional Psychology, The University of Nebraska, 
Lincoln, Nebraska. This test is constructed on 
a sound basis in terms of the type of reading 
material presented in the text. This material 
resembles closely the type of material a college 
student will be apt to read in the social science 
area. The questions on the social science read- 
ing selections closely resemble the type of ques- 
tions that will be asked of many students in 
college courses. 

The questions are of a nature to require the 
student “to dig out” the meaning rather than 
answer the obvious. There is a novel type of 
question included in each group of questions 
following the selection read. The questions can 
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be answered directly. out of the general knowl- 
edge of the testee rather than from the passage 
read, 

The standardization has been established on 
the success of high school seniors who have 
taken the test and who have passed social sci- 
ence courses, Since the three types of percentile 
tables have been based on the scores made by 
students who have been classified by the Amer- 
ican Council of Education Psychological Ex- 
amination and the tests are constructed on 
some activity which is practically identical to 
college work, it would seem that this test goes a 
long way toward realizing its objective. 

Tt will be hoped that further validation will 
be published in terms of college success in so- 
cial science and scores on the test. 


C. Robert Pace, Associate Director, Evalua- 
tion Service Center, Syracuse University, Syra- 
cuse, New York. This test is, for the most part, 
competently constructed and attempts to meas- 
ure important and worth-while objectives; its 
intended use represents a distinct advance from 
the practice of granting "blanket credits" fol- 
lowed after World War I. Some elements of 
the test, however, warrant criticism; and the 
norms given for college populations not only 
are of doubtful validity but may be misleading. 

The test consists of ten paragraphs, one 
graph, and one table, each followed by a series 
of questions presumably requiring an interpre- 
tation or evaluation of the material. There are 
91 items in all, 20 of which are starred as being 
answerable without reference to the reading 
materials. In my judgment, however, at least 
20 additional items could appropriately be 
starred, thus making it possible to answer about 
half of the test items without reference to the 
reading materials. Further, one may properly 
object to the "correct" answer which is keyed 
for some of the questions. Especially in inter- 
pretation and evaluation items, testmakers must 
be on guard to avoid penalizing the individual 
who is able to bring a great deal of knowledge 
to bear on a given topic. There may be several 
good answers rather than one best answer ; and 
che results of item analysis do not prove to the 
scholars which answer is best. For example, the 
answer to a question regarding the concentra- 
tion of economic power is “directors of corpo- 
rations”; but the student should know that 
“bankers,” even though they are not referred to 
in the paragraph, hold economic power over the 


corporations. Again, social security is con- - 
sidered the correct response to a question about 
ways in which the United States has lagged in 
the extension of democracy; the right to vote, 
however, is also a reasonable choice for we did 
not obtain women’s suffrage until 1921, and we 
still do not have suffrage for large groups in 
parts of our society. And again, the correct 
response to a question about which is the best 
reply to a prejudiced argument in one of the 
paragraphs is an argument by analogy (such 
arguments are always weak from the logician’s 
point of view), whereas a direct and ethical 
reply is considered "incorrect." Other examples 
could be cited. 

With regard to the college percentile norms, 
the test authors acknowledge that their mean- 
ingfulness is limited and that their uncritical 
acceptance may be misleading. Percentile norms 
for Type III institutions offering courses in 
world history are based on the test perform- 
ance of 53 students from one institution! For 
Type I institutions offering general survey 
courses in social studies, percentile norms are 
based on 85 students from 3 institutions! These 
are the two worst examples. In addition to the 
total inadequacy of these figures (it might have 
been better not to have published them), a more 
basic fault lies in the selection of institutions 
to comprise the norm group. This group is lim- 
ited to institutions which offer “survey” courses 
and which also used the ACE psychological 
examination in 1941. The decision to limit the 
selection of institutions to those which happen 
to offer “survey” courses seems to me quite 
unjustified, especially when the use of the test 
is practically forced on a nation-wide scale. I 
would argue that there should have been na- 
tional college norms, based on a broad sampling 
of students who had taken any social science 
course during their first two college years. The 
composite thus obtained would be representative 
not only of the kinds of students who take social 
science courses but also of the variety of social 
science courses offered in American colleges and 
universities. In contrast, the high school norms 
for this and the other tests in the series are 
based on an excellent national sample. 

It should be recognized that the GED tests 
were constructed to meet a postwar emergency. 
Within a relatively short time this emergency 
will have passed. Meanwhile, the GED tests 
should be compared experimentally with others 
which might be used to predict ability to carry 
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. on advanced college work—such as those of 


the Cooperative Test Service, the College En- 
trance Examination Board, the Graduate Rec- 
ord Examination, etc. 

The attempt to measure fundamental objec- 
tives of understanding and critical judgment 
deserves every encouragement. It is hoped that 
the increased familiarity with this type of test 
which has occurred by virtue of its nation- 
wide use will encourage local institutions to 
prepare tests along similar lines for local pur- 
poses and will encourage central testing agen- 
cies to seek improvements and refinements in 
future tests designed to measure such basic 
objectives. 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 
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XInterpretation of Reading Materials in the So- 
cial Studies: Tests of General Edücational De- 
velopment: High School Level, Test 2. High 
school; 1944-45; IBM; Form B; separate answer 
sheets must be used; $2 per 25; 40¢ per 25 machine- 
scorable answer sheets; 15¢ per scoring key; 25¢ per 
specimen set, postpaid; worklimit (120) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by the Cooperative 
Test Service. (Also distributed by Science Research 
Associates: $2 per 25; 65¢ per 25 machine-scorable 
answer sheets; Sof per key; So¢ per specimen set.) 


For reviews by Herbert S. Conrad and War- 
ren G. Findley of the entire battery, see 20. 


1530 ] 
XReading Scales in History. Grades 7-12; 1938-39; 
Forms A, B; $1.25 per 25; 25€ per specimen set, post- 
paid; nontimed 45(60) minutes; M. J. Van Wagenen; 
Educational Test Bureau. 


Paul Blommers, Acting Dean of the College 
of Education and. Assistant Professor of Edu- 
cation, The State University of Iowa, Iowa 
City, Iowa. This test purports to measure power 
to comprehend the subject matter which is en- 
countered at the junior and senior high school 
levels in history. The test consists of a series of 
I5 paragraphs, each followed by from 4 to 6 
statements. The students are directed to mark 
those statements which contain an idea that is 
either specifically mentioned in or implied by the 
paragraph. The statements following each para- 
graph are scored as a unit, the maximum score 
for no errors being two points. One point is 
given if one error is made and no points are 
given if more than one error is made. The 
paragraphs are graded with respect to difficulty 


—difficulty being defined in terms of the pro- 
portion of successes on the accompanying state- 
ments. A table is provided for converting the 
total test score into a C-score, which, according 
to the manual, indicates the difficulty level of 
the content which the student can understand 
and mark correctly in 50 per cent of his at- 
tempts. Age and grade norms are provided. 

The test has been carefully constructed and 
should be fairly useful as a measure of general 
reading comprehension. It appears legitimate, 
however, to question its validity as a measure of 
the specific ability to comprehend historical 
material. Consideration of the test statements 
does not indicate an attempt to measure skills 
differing from those employed in reading other 
types of material. It is, of course, true that the 
content of all paragraphs is historical in nature, 
but is this in itself sufficient to accomplish the 
purpose? 

In answering this question it must be ac- 
knowledged that two individuals equal in read- 
ing skill per se may differ in ability to compre- 
hend history. This difference, however, is in 
a large part due to differences in background 
knowledge. To the extent that background 
knowledge enters into the score, the test will 
differentiate between these individuals. In this 
test, however, every attempt is made to mini- 
mize the effect of background knowledge. The 
student is told: “there may be statements that 
are true in themselves, but do not mark them 

. unless the idea is contained in the para- 
graph or can be inferred from the facts or 
ideas in the paragraph.” Evidently the student 
is to come equipped with only his reading cap 
—his background knowledge being left in the 
cloakroom. It is difficult to understand how a 
test which purports to measure an individual’s 
specific ability to comprehend history can be 
valid for that purpose if it employs such direc- 
tions. Moreover, such directions are apt to con- 
fuse students who take them seriously as the 
better students are likely to do. At least, there 
is room for debate on the keying of certain re- 
sponses when the issue reduces to whether the 
statements can be inferred from the ideas in the 
paragraph (e.g. see Statement 2, Paragraph 
11, Form B.) 

As already indicated, the manual states that 
the C-score is indicative of the difficulty level 
of the material the student can understand. Ac- 
tually the C-score bears little relationship to 
the difficulty of the content of the paragraph 
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per se. This is evidenced by the different posi- 
tioning within the two forms of the ten para- 
graphs common to them. The C-score is more 
indicative of the type of statement the student 
can correctly mark, that is, of the depth of his 
comprehension of the given material rather 
than of the difficulty of the material itself. This 
does not constitute an objection to the test and 
is mentioned in the interest of preventing a 
misunderstanding of the relationship between 
the C-score and “difficulty level.” 

Finally, it should be observed that the over- 
lapping of the content of the two forms is so 
great as to impair somewhat the value of hav- 
ing the second form. 


Albert J. Harris, Assistant Professor of Edu- 
cation, The City College of New York, New 
York, New York. This test is intended to 
measure the ability to comprehend historical 
reading material at the high school level. Simi- 
lar tests of literary reading and scientific read- 
ing have been constructed by the same author. 

The test is given as a power test without a 
specific time limit. Each of the two forms in- 
cludes fifteen selections followed by four, five, 
or six trüe-false items. The selections are sup- 
posed to be arranged in ascending order of dif- 
ficulty. Of the fifteen selections in Form A, ten 
also appear in Form B, with slightly different 
questions. Selection 1 in Form A is Selection 5 
in Form B; Selection 12 in Form A is Selec- 
tion 2 in Form B; etc. To make matters even 
worse, Selection 8 in Form A reappears as Se- 
lection 11 in the same form, with a different set 
of questions based on it! A well-constructed 
power test should certainly have a more stable 
order of difficulty than this, 

The raw scores are translated into C-scores, 

a type of scaled score based on the quartile de- 
viation, which is used in several other tests by 
the same author. Age and grade equivalents are 
provided for the C-score scale. The reliability 
is stated as a probable error of 2.6 C-score 
points. This is the equivalent of between 4 and 
-5 times the quartile deviation for a single 
grade and is below what is generally consid- 
ered good reliability for the measurement of 
the status of individuals. Norms include age 
and grade equivalents and the 25th, soth, and 
75th percentiles for grades 9, I0, II, and r2, 
No information is given in the brief four-page 
manual that would allow an evaluation of the 
adequacy of standardization. 


There is no evidence presented that this test , 
would be more helpful than a good general 
reading comprehension test in assaying a high 
school student's ability to read historical ma- 
terial. The idea of testing comprehension in a 
specific field is a good one; the execution, in 
this case, leaves much to be desired. 


[ 531 ] 
XReading Scales in Literature. Grades 7-12; 1939; 
Forms A, B, C; $1.25 per 25; 25¢ per specimen set, 
postpaid; nontimed (45-60) minutes; M. J. Van 
Wagenen; Educational Test Bureau. 

[ 532 ] 
*Reading Scales in Science. Grades 7-12; 1938; 
Form A ; $1.25 per 25; 25¢ per specimen set, postpaid; 
nontimed (45-60) minutes; M. J. Van Wagenen; Edu- 
cational Test Bureau. 


Ivan A. Booker, Assistant Director, Research 
Division, National Education Assoc 
Washington, D. C. The Reading Scales in Sci- 
ence are designed to measure a pupil’s ability 
to comprehend the materials usually encoun- 
tered in high school science textbooks. They 
have been developed in such a way and stand- 
ardized in such manner that the C-scores from 
this test are said to be comparable to those 
from the reading scales for junior and senior 
high schools in the Unit Scales of Attainment. 

Each form of the Reading Scales in Science 
consists of fifteen paragraphs, each followed by 
four to six objective test items. The paragraphs 
typically contain about 150 words. The test 
items take the form of statements which are to 
be checked only when “the idea is either con- 
tained in the paragraph or can be inferred 
from the facts or ideas in the paragraph.” All 
other statements are to be left unchecked. 

The paragraphs are scaled according to diffi- 
culty, However, it is the difference in test items 
instead of a difference in the difficulty of the 
test paragraphs which accounts for the para- 
graph placement. This must be the case in view 
of the amount of repetition of paragraphs in 
the tests, with quite different placement of 
identical paragraphs. The extent to which test 
paragraphs appear more than once in these 
scales is indicated in part by the facts that (a) 
three paragraphs are used twice in Form A, 
and three other paragraphs appear twice in 
Form B; (b) eight paragraphs appear one or 
more times in one form of the test and also 
once in the other form; and (c) only three 
paragraphs in Form A and one in Form B are 
not repeated. When a paragraph is repeated, 
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. however, every test item used with it is differ- 


ent from the items used elsewhere. Repetition, 
therefore, merely has the effect of reducing the 
reading load or holding it within selected limits 
while increasing the number and difficulty of 
the test items. 

Each paragraph is scored as a unit, irrespec- 
tive of whether it is accompanied by four, five, 
or six test items, Each disagreement between 
the pupil’s response and the scoring key (failing 
to check an item or checking one that should 
not be checked) is counted as an error. A 
paragraph is given a score of two if no error 
occurs, a score of one if there is only one error, 
and a score of zero if there are two or more 
errors. Thus, the actual scores must range be- 
tween zero and thirty. The scoring key used in 
checking errors is exceptionally well arranged 
for rapid use. Having to score each paragraph 
as a unit, however, is time-consuming and 
greatly increases the hazard of scoring errors. 
It is difficult to understand how this compli- 
cated procedure improves on the total score 
which would be obtained by simply counting 
the total number of correct responses for the 
70-odd items in the total series. The C-scores, 
however, are based on the 30-point scale of 
total scores. 

The manual of directions gives no informa- 
tion as to the validity and reliability of the test, 
neither for the forms when considered as wholes 
nor for the individual paragraphs and test items 
which constitute them. Users of this test—as is 
true for so many other tests also—are given no 
description of the standardization procedures 
and no indication of the adequacy of the norms 
suggested. 

The scoring key gives C-scores ranging be- 
tween 48 and 122 to accompany each actual 
score of o to 30. The manual gives a median 
and a 25th and 75th percentile C-score for each 
grade from 8 to 12, inclusive. Norms are not 
included for grade 7, although the test purports 
to cover grades 7 to 12. 

Lists of C-scores are also given for each 
form of the test to indicate the relative levels 
of difficulty of the various paragraphs and 
items. A part of this scale for Form A follows: 


ggg. v rae 03 


PARAGRAPH : EN EAE i 
81 83 .... 89 91 92 .... 99 102 


C-SCORE VALUE: 


These are interpreted as follows: a pupil with 
a C-score of 91 can be expected to do correctly 
about "half of such paragraphs and sets of 
statements as paragraph 8; three fourths of 


such paragraphs and sets of statements as 
paragraph 3 [ten C-score points lower]; but 
only about one fourth of such paragraphs and 
sets of statements as paragraph 13 [ten C-score 
points higher]." Such interpretation, however, 
is of doubtful practical value. The teacher has 
no way to determine whether the content of a 
given science selection and the pupil's tasks re- 
lating to it are more nearly equivalent to Para- 
graph 3 or Paragraph 13. As has been pointed 
out, the scaled difficulty is achieved, not through 
the paragraphs but through the test items. To 
know, for example, that a pupil can do about 
three-fourths of the tasks comparable to Para- 
graph 5 is of little consequence, therefore, 
when Paragraph 5 is the same as Paragraph 15 
except for the test items. 

A criticism often voiced against the Unit 
Scales of Attainment, to which these. tests are 
related, seems fully justified here. The C-scores 
provided with the tests probably are somewhat 
misleading, especially to teachers not thoroughly 
grounded in testing procedures. They tend to 
imply an exactness of pupil placement or an 
accuracy of measurement which these tests can- 
not supply. 

The unique value in the Reading Scales in 
Science is their use of science content material 
in a reading comprehension test. Often there is 
a need for a reading test which will differenti- 
ate among the pupils as to their ability to read 
and understand the materials in a given subject- 
matter area, And where such a survey of abil- 
ity to read high school science materials is 
needed, the Reading Scales in Science can be 
appropriately used. Even the grade norms and 
C-scores should prove helpful if their limita- 
tions are recognized, particularly with respect 
to the scores of individual pupils. Also, for the 
teacher who can use tests for something more 
than merely to obtain a total score, analysis of 
the kinds of test items on which a pupil made his 
errors will provide helpful insight as to the 
kind of guidance needed. 


[ 533 ] 


xTest of Reading Ability for . College; 
1940; 4¢ per reading booklet; 2¢ per test booklet; rog 
per manual; 4o¢ per specimen set; (15) minutes; 
Francis P. Robinson and Prudence Hall; Ohio State 
University Press. 

d) A TEST OF READING ABILITY FOR ART. I form. 

b) A TEST OF READING ABILITY FOR GEOLOGY. I form 

C) A TEST OF READING ABILITY FOR HISTORY. Forms 
Canada, Russia. 

d) A TEST OF READING ABILITY FOR FICTION. I form. 
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*Applied Reading for  Junior-Senior High 
School: Every Pupil Test. Grades 7-12; 1946; new 
form usually issued each April and December; forms 
April, December 1946; 2¢ per test; 1¢ per answer 
key; 7(15) minutes; Marion Steele and Margaret 
Egeland; Ohio Scholarship Tests, Ohio State Depart- 
ment of Education. 


Ivan A. Booker, Assistant Director, Research 
Division, National Education Association, 
Washington, D. C. [Review of the 1946 forms. ] 
This test is one in the series of Every Pupil Tests 
designed for use in all the schools of Ohio. 
More than three dozen tests including two on 

high school reading are now used in this state- 
wide testing program which was begun in 1929. 
New editions are developed each year by com- 
mittees of teachers working in collaboration 
with the Testing Division of the State Depart- 
ment of Education for state-wide use in Decem- 
ber and April. The two high school reading tests 
are Applied Reading:for Jwnior-Senior High 
School and Techniques in Reading Comprehen- 
sion, the latter described elsewhere (505) in this 
volume. 

The test of applied reading is designed to 
measure three selected types of reading skills: 
(a) skill in using an index; (b) skill in recog- 
nizing, from the running heads used in a dic- 
tionary, whether or not a given word is on the 
page with the heading shown; (c) skill in dis- 
criminating among word meanings for homo- 
nyms and for words with multiple meanings. 
This is not a test of general level of reading 
ability, but a test of proficiency in these three 
specific skills, with emphasis on speed of per- 
formance. The over-all time for administering 
the three-part test in a class is not more than 
fifteen minutes—the total actual working time, 
seven minutes. 

Part I of the test, on the use of an index, 
makes use of the usual procedure in testing this 
skill. Typical index items from.A to W are 
given as a point of reference; the pupils are 
asked to find the page number or to select the 
index heading where they would find informa- 


tion about various subjects. The three-minute 
time limit is not generous, but adequate. 

The next part of the test carries seven boxed 
page headings such as one finds in most dic- 
tionaries. Each such heading is followed by 
four test items consisting of words closely re- 
lated alphabetically to one of the key words in 
the heading. The pupils indicate whether or not 
each test word should be on the page by writ- 
ing "yes" or "no," The time limit on this part 
of the test is extremely brief—only two min 
utes. Those who write slowly are particularly 
handicapped. Merely to write “yes” and “no” 
alternately for twenty-eight items, without the 
slightest pause between items, usually requires 
from forty-five to sixty seconds for high-school 
pupils, which means that three-eighths to half 
of their total working time is used up in 
handwriting. A system of check ( ,/) and zero 
(0) would be just as useful; but of course, if 
any such plan were followed, it would destroy 
the comparability of results with pupils who 
had been forced to write “yes” and “no” to 
each item. The directions contain no warning 
against a system of abbreviations, only a posi- 
tive direction to write “yes” or “no.” Do some 
pupils on their own initiative use a simplified 
code to save time? If so, are their test papers 
eliminated? In a state-wide testing program, 
might some teachers encourage a short cut in 
marking, believing that the ability to discrimi- 
nate was the only important part of the test? 
Where time limits are as brief as in this exer- 
cise, any variation in the mechanics of response 
becomes highly significant. 

The final section of the test is focused on the 
ability to distinguish between word meanings. 
The words used are quite simple, but in each 
case two to five different meanings are involved. 
One series uses the word “contest,” giving two 
definitions of the word when used as a noun 
and three when used as a verb. Test items then 
ask the student to indicate which definition 
would hold in each of several sentences. No 
writing is required other than to enter the letter 
of the correct definition in the space provided. 
Again the time limit is rather brief, which 
means that speed of performance is rewarded. 
The slow reader, irrespective of his accuracy in 
interpreting word meanings, is at a disadvan- 
tage on this exercise. A slight difference in 
content appears between the April and Decem- 
ber 1946 editions, which may mean that the two 
forms are not strictly comparable. The April 
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edition uses homonyms, such as "to," “too,” 
and "two"; and "cite," “sight,” and "site." The 
December edition uses for each unit in the ex- 
ercise a^single word with several meanings, 
such as "contest" already mentioned. Some 
students work much faster with the homonyms, 
knowing the differences in meaning so well 
that they can do the test items without having 
to read the definitions. This advantage becomes 
increasingly great for those in senior high 
schools. 

Taken as a whole the test of Applied Read- 
ing for Junior-Senior High School undoubtedly 
is a useful measure of three related reading 
skills. Especially in classes where practice in 
the use of the index and practice in dictionary 
usage are being given—or being considered— 
this brief test should be quite helpful. When 
administered according to directions as a timed 
test, it shows the speed of performance of 
members of the class in comparison with that 
of other high school students. It might also be 
used to advantage in some classes as an un- 
timed test or with individual timing to see how 
accurately the pupils work when not subjected 
to time pressures. These results could not be 
compared with available norms but might give 
useful diagnostic leads to the teacher. 

The manual contains no information as to 
the procedure followed by the test development 
committees either to equate the forms in ad- 
vance of their use or to assure the reliability 
and item validity of the tests. Perhaps, in view 
of the purpose of the tests, no elaborate pre- 
liminary tryouts and evaluations are feasible. 
Even the bulletin which reports the results of 
the 1946 testing program contains no data on 
reliability or validity, only the percentile scores 
and a table showing the percentage of pupils 
who answered each test item correctly. The fact 
that percentiles are used almost exclusively 
suggests that exact equality of forms is not 
expected. 

The percentile norms issued as soon as pos- 
sible after the state-wide testing program are 
based on the total scores—not on the scores for 
the three parts of the test—which limits its 
diagnostic value. Likewise the percentile norms 
are published separately only for junior high 
schools and for senior high schools rather than 
for individual grades. As stated earlier, there 
is a table showing the percentage of pupils who 
answered each test item correctly. This would 
be of some use in the diagnostic study of indi- 


vidual test papers. However, because of the 
emphasis on speed of performance which the 
test involves, the number of semichance errors 
is perhaps too large for individual responses to 
have real significance. 

An excellent class record sheet is included 
with the test, the Universal Class Error Check 
Sheet. On it the errors and omissions for an 
entire class can be recorded to show the parts 
of the test which were and were not correctly 
answered by all the pupils. This record sheet 
would indicate for an entire class and for in- 
dividual pupils the particular skill or skills on 
which further work is most desirable. 

Those who use the test should be fully aware 
that it is sharply focused. It is not a general 
measure of the student’s ability to apply read- 
ing successfully to all study situations—in spite 
of the title, Applied Reading. It measures only 
the reader’s skill in using an index and in per- 
forming two of the processes involved in using 
a dictionary. Wherever these are the skills for 
which a brief test is needed, the Ohio tests 
have many things to commend them, not the 
least of which are the annual revisions and the 
state-wide norms. For a comprehensive study 
of applied reading skills, however, teachers 
must rely on other tests as well as on these, 
according to the needs and demands of their 
respective classes. 


[ 535 ] 
Interpretation of Data. Grades 9-12; 1930-47; re- 
vision of Interpretation of Data, Test 25; IBM; 
Forms 2.51, 2.52; separate answer sheets must be used ; 
$2 per 25; 4o¢ per 25 machine-scorable answer sheets ; 
$1.05 per set of stencils for scoring answer sheets; 75¢ 
per specimen set, postpaid; worklimit (90) minutes ; 
prepared by the Evaluation Staff of the Eight Year 
Study, Progressive Education Association under the 
direction of Maurice L. Hartung; Cooperative Test 
Service. 
REFERENCES 
1-9. See 40:1544. 


1o. Teter, James D. "Improving Ability to Interpret Edu- 
cational Data." Ed Res B 19:363-71+ S 25 '40. * 

ir. Hartunc, Maurice L.; WEISMAN, Lean; McMuttzn, 
Haxorp G.; AND TniMBLE, HanoLp C. “Interpretation of Data," 
pp. 38-76, 509-13. In Appraising and Recording Student Prog- 
ress. By Eugene R. Smith, Ralph W. Tyler, and the Evaluation 
Staff. E Education Association Publications, Commis- 
sion on the Relation of School and College, Adventure in 
American Education, Vol. 3, New York: Harper & Brothers, 
1942. iy xxiii, 5$ $3.00. * (PA 16:5033) z 

12. NEUHOF, . "Integrated Interpretation of Data 
Tests.” Sct Ed 26:21-6 Ja "42. * 

13. MANDELL, MILTON M., anp Apxtns, Domornv C. “The 
Validity of Written Tests for the Selection of Administrative 
Personnel.” Ed & Psychol Meas 6:293-312 au '46. * (PA 21; 
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J. Wayne Wrightstone, Assistant Director, 
Bureau of Reference, Research and Statistics, 
Board of Education, New York, New York. 
Ingenious tests constructed by the staff of 
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Evaluation in the Eight Year Study of the 
Progressive Education Association are avail- 
able through the Cooperative Test Service. The 
test on interpretation of data is a novel contri- 
bution to the measurement of a newer objective 
of modern education. It is encouraging that the 
results of experimentation with new test exer- 
cises are to be shared more widely. 

Commendable features of this test are the 
breakdown of interpretation of data into vari- 
ous analytical or diagnostic features. These in- 
clude general accuracy of interpretation, accu- 
racy with probable true and probable false, 
accuracy with insufficient data, accuracy with 
true and false data, caution in judgment, gen- 
eralizing beyond the data, and crude errors in 

"interpretation. 

The reliability of the general accuracy score, 
the beyond data score, the caution in interpre- 
tation score, and insufficient data score are suf- 
ficiently high so that the results may be used 
Íor individual diagnosis. Other subtests, or 


- analytical features, must be used with extreme 


` caution for individual diagnosis because of 
their lower reliability, The validation of the 
test against the criteria of pupil behavior de- 
fining the objective and against an essay form 
of the test is adequately described in the man- 
ual. The pupil behaviors—described in some 
detail in the manual—clearly indicate the na- 
ture of abilities and skills measured by the test. 
One disadvantage of the test is the lack of 
definitive norms. Interpretation of scores is 
presented in terms of the percentage of items 
correctly answered by each pupil for the total 
test and for the various analytical features of 
the test. For general use, it is desirable to pro- 
vide norms achieved by a representative sample 
of pupils at the various grade levels of the 
high school. This is in conformity with the 
usual practice of providing a table for convert- 
ing raw scores into equivalent scores, such as 
percentile scores or standard scores, 
Considerable time is required for scoring the 
analytical features of the test unless the In- 
ternational Business Machines electrical test- 
scoring machine is available. Compared with 
the usual test, the manual scoring and the arith- 
metic computations required to convert each 
Score to a percentage of correct responses will 
present mental hazards for many teachers. Yet 
it is evident that the authors of the test have 
attempted to simplify the scoring by the use 
of a separate answer sheet to be scored by su- 


perimposing specially punched scoring keys. 

For teachers who are accustomed to admin- 
ister a test in a 40- to 45-minute period, this 
go-minute test may arouse hostility because of 
rearrangement of class schedules, regardless of 
the inherent values involved and the need for 
emphasizing thought rather than speed. It is 
hoped that attitudes of supervisors and teachers ^ 
will change on the stereotype of the 40-minute 
testing period. 

For teachers who are not conversant with 
instructional methods for teaching aspects of 
critical thinking—especially the changes in pupil 
behavior implicit in this test—detailed sugges- 
tions for using the test results wisely must be 
provided. The authors have not given concrete 
suggestions required for such a new emphasis 
in teaching, nor have they cited sources where 
the teacher may turn for help. 

Since these tests were originally designed for 
use with college-preparatory pupils, it is desir- 
able to try out this test in average public school 
systems with pupils in all types of courses. This 
will serve several purposes. One purpose is to 
define the high school student population that 
can profitably interpret and use the concepts, 
vocabulary, and skills involved in the test. An- 
other purpose is to discover the needs of teach- 
ers for intelligent use of the test results. Such 
a tryout may reveal needs for further simplifi- 
cation of the scoring and elaboration of the 
suggestions on the interpretation and use of the 
test scores. a 

This type of test marks a new and promising 
departure from the conventional types of test 
exercises. As a pioneer effort, it needs to be 
evaluated in a variety of school situations. 
More extensive use may point toward modifi- 
cation or revision of the present edition of the 
test and its manual. It is a challenge to better 
teaching and learning. 


[ 536 ] 
XLibrary Test for Junior High Schools. Grades 7- 
10; 1942; Forms A, B; $1.25 per 25; 25¢ per specimen 
set, postpaid; 30(35) minutes; Committee on Tests 0 
the School Library Association of California under the 
gie. of Jeannette Vander Ploeg; California Test 
'ureau. 


Robert A. Davis, Professor of Education, 
University of Colorado, Boulder, Colorado. Al- 
though the subtitle suggests that library skills 
are measured, the test measures primarily in- 
formation concerning library materials. It does 
not measure one's skill in the use of a library- 
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The test consists of five parts: (a) books, 
(b) the arrangement of books, (c) ‘card cata- 
log, (d) dictionaries and encyclopedias, and 
(e) miscellaneous materials. It is designed to 
reveal the student's information analytically as 
well as compositely. The various parts of the 
test consist of multiple-choice, completion, and 
true-false items, the number of items in each 
part ranging from 9 to 12, the total number 
being 51. Inasmuch as some items are of the 
recall type, scoring is not entirely objective. 

The materials of the test are based upon 
analysis of courses of study and library manuals 
used in representative schools. These materials 
were selected for appropriateness by a commit- 
tee of the School Library Association of Cali- 
fornia. No data are provided in the manual 
accompanying the test regarding the specific 
sources of material examined in formulating 
items. 

The value of this test lies in its ability to 
reveal the student’s knowledge concerning the 
resources of libraries. When all parts of the 
test are considered as a unit, it undoubtedly 
reveals considerable knowledge. The number of 
items under each part is too few to yield a valid 
and reliable measure of the student’s knowl- 
edge of that part. Only total test scores should 
be considered. 


Ethel M. Feagley, Associate Librarian and 
Assistant Professor of Education, Teachers 
College, Columbia University, New York, New 
York. Instruction in the use of the library and 
its materials varies widely in scope and empha- 
sis and seldom carries course credit. As a first 
step the librarian always needs to find out how 
much students already know about the use of 
the library so that instruction may be planned 
around real needs and repetition be reduced. 
Diagnostic tests in this field, therefore, are 
much more useful than those which measure 
achievement. The Library Test for Junior High 
Schools definitely states it is designed as a di- 
agnostic test to secure as accurate an indication 
as possible of each pupil's familiarity with li- 
brary usage. The results of this test will then 
be used by teachers or librarians in planning 
library lessons by showing them which details 
need attention. The questions are deliberately 
more general than specific in regard to certain 
library policies and book collections in order 
that they may be applicable to many situations. 
This is a weak point in most library tests, and 


it seems a high price to pay for trying to make 
a test applicable to other than local situations. 
The manual of directions reports on the con- 
tent validity, an important item in a diagnostic 
test. The present form is the result of several 
revisions. The items for the test were assem- 
bled from a careful study of manuals and 
courses of study. This is a logical and usual 
procedure. But if the existing manuals and 
courses of study fail to include many needed 
items and reflect the viewpoint of librarians 
rather than library users, the content validity 
may not be satisfactory. Here is an area de- 
manding attention from all librarians interested 
in preparing library tests. 

The present test is divided into five divisions, 
the usual ones: (a) the book; (b) arrangement 
of books; (c) the card catalog; (d) dictionar- 
ies and encyclopedias; and (e) miscellaneous 
reference books. The questions fall in three dif- 
ferent forms: matching, multiple choice, and 
true-false. The type of question varies in the 
two forms of the test. For instance, questions. 
on the card catalog in Form A are true-false, 
but in Form B they are multiple choice. In the 
sections of matching-type questions it might be 
suggested that the list of choices be given in 
alphabetical order to make the selecting easier. 
The questions on the card catalog deal with in- 
formation about it rather than skill in using it, 
a far more intricate thing to test. In all library 
tests the subjects or skills which fall readily 
into test items are tested again and again while 
the functional aspects which are difficult to 
translate into test terms are seldom included. 

The directions for administering and scoring 
the test are clear. Percentile norms are given 
for each grade. These apparently are based on 
the results of tests given to California students 
only. This test should be good for any librarian 
in a junior high school to use as a survey of 
needed library instruction and as a guide in 
planning a more detailed test suitable for the 
local situation. The weaknesses are basically 
weaknesses of the existing curriculum in library 
instruction rather than of the test techniques 
themselves. 

[ 537 ] 


X*Library Usage Test. Grades 11-13; 1940; $1.25 per 
25, postpaid; sample test free; 45(50) minutes; Elmer 
R. Smith; Turner E. Smith & Co. 


J. Wayne Wrightstone, Assistant Director, 
Bureau of Reference, Research and Statistics, 
Board of Education, New York, New York. 
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Library usage tests appear in increasing num- 
ber upon the measurement horizon. Included 
among these is the Library Usage Test. The 
test has eight subtests on (a) Using the Li- 
brary, (b) Using the Card Catalogue, (c) Us- 
ing Books, (d) Using the Dictionary, (e) Us- 
ing Reference Books, (f) Using Magazine 
Indexes, (g) Using Encyclopedias and (h) 
Library Abbreviations. 

The author of the test is attempting to meas- 
ure too much in too little space and too little 
time. Three of the subtests have only 10 items; 
the remainder average 15 items each. Diagnos- 
tic or analytical features are defeated by the 
unreliability of subtests having so few items. 
The lack of adequate norms and a manual of 
directions and interpretation is inexcusable in 
modern tests offered for commercial distribu- 
tion. 

The format of the test does not accord with 
the best principles in test construction, espe- 
cially in Subtest 1 where a list of 20 items in 
column 2 are to be matched with a list of 15 
descriptions in column 1. A preferred format 
would have been to arrange sets of 5 items in 
column 2 so that 3 of these could be associated 
with 3 items in column 1. The same criticism 
holds for the section of the test entitled Find- 
ing Books. 

In Subtest 2, Using a Card Catalog, the test 
items are entirely about the identification of 
different types of catalog cards. The test exer- 
cises would have been much more functional 
and valid if interpretation of catalog cards had 
been tested rather than mere identification of 
their types. The section on catalog arrange- 
ment, likewise, presents items in isolated and 
academic statements rather than as approxima- 
tions to a lifelike situation. It would have been 
preferable to have stressed interpretation and 
use rather than mere information about catalog 
arrangement. 

In Subtest 4, Using the Dictionary, the items 
again pertain to information about a dictionary 
rather than a measure of skills in the use of the 
dictionary. This section would have been im- 
proved by reproducing a sample page from a 
dictionary, as in the Jowa Every-Pupil Tests of 
Basic Skills: Test B, Work-S: tudy Skills, which 
measures the ability of the individual to exhibit 
skills in a realistic test exercise. 

The author has presented realistic test exer- 
cises in Subtest 6, Using Magazine Indexes, 
where he has reproduced typical entries found 


in a magazine index. These exercises involve a 
more direct use of the skills in a lifelike testing 
situation, Their use is commended. 

The test reliability would have been increased 
if the author had omitted certain subtests, 
namely, using books and abbreviations. These 
might more validly and logically be included in 
a test of work-study skills. 


[ 538 ] 
Peabody Library Information Test. Grades 4-8, 9- 
12, 13-16; 1938-40; Form A; 3 levels; $1.25 per 25 of 
any one level; so¢ per specimen set, postpaid; Louis 
Shores and Joseph E. Moore; Educational Test Bu- 
reau. : 
d) ELEMENTARY LEVEL. Grades 4-8; 1940; 30(35) min- 
utes. 
b) HIGH SCHOOL LEVEL. Grades 9-12; 1940; 30(35) 
minutes. ] 
C) COLLEGE LEVEL. Grades 13-16; 1938; 32(37) minutes. 


REFERENCES 
1, Moors, Joserx E. “The Relation Between Library In- 


formation and Elementary School Attainment." Peabody J Ed 
18:431-6 J1 40. * (PA 14:6219) 4 E 

2. DEER, Groce H. “The Peabody Library Information Test: 
A Study of Its Statistical Validity and Reliability.” J Exp Ed 
9:233-6 Mr '41. * (PA 15:4385) 

Douglas E. Scates, Professor of Education, 
Duke University, Durham, North Carolina. 
This test is not for librarians in training but is 
for regular school pupils, to show their under- 
standing of the role and the use of reference 
books—or, more generally, of the library itself. 
As such, the test falls in the general category 
of study- or work-skills tests. And, as paper- 
and-pencil tests so frequently do, it relies on 
knowledge as an indication of these skills. The 
test, being published separately on three levels, 
affords an instrument for three grade ranges at 
which schools may well wish to check on 
progress. 

The test is offered for the purpose of reveal- 
ing the need for more attention to library 
knowledge at different points in the curriculum 
and for assessing the success of teaching in 
these lines. The reviewer has more confidence 
in the beneficence of tests which are to be used 
to help define the responsibilities of the school 
and to suggest needs to the responsive teacher 
than he has in tests employed primarily to 
praise or condemn the student, frequently for 
attributes which are not of his volition. As 
educators, we are concerned with evaluation- 
fitted-into-a-plan-of-action, not simply with sci- 
entific assessment which leaves some persons 
feeling smug and others feeling hopeless. 

With this frame of reference the reviewer 
believes that the Elementary Level of these 
tests falls in an area of use likely to be con- 
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structive. The College Level, when used for 
freshman orientation and further guidance, also 
appears salutary. One suggested use of the 
High School Level, namely, for meeting the 
requirements of regional accrediting associ- 
ations, is tinctured with that oversimplification 
of thought which so quickly falls into machine- 
like standardization of education and ends in 
complete formalization. A comprehensive phi- 
losophy of education may, however, prevent the 
tests from having a significant influence in this 
direction. 

When we turn to the construction of the 
tests, we face another issue, From an inspec- 
tion of the items in the three levels, the re- 
viewer comes to the following conclusions: 

(a) The items evidence some ingenuity in 
the variety of exercises employed. Certain 
forms are, however, repeated seemingly with- 
out justification, and, in the lower level, cer- 
tain standard forms which are of value are 
neglected. 

(b) The content of the items is heavily 
slanted toward technical library information 
(e.g., the expectancy that the student will have 
memorized the Dewey Decimal Classification). 
For special purposes, this heavy emphasis may 
be desirable; for general use in schools, the re- 
viewer would say it was undesirable. This 
emphasis is, of course, wholly within the limits 
of the title of the test. 

(c) The workmanship on the items is poor. 
Far too large a proportion of them are faulty 
on the surface. In the Elementary Level, Form 
A, the reviewer is critical of nearly half the 
items on the basis of faulty construction. Some 
of the faults are in careless editing; for ex- 
ample, titles to reference books are given in 
italics in one item, in ordinary roman in the 
next item, and later in roman with quotes. A 
catalog card is presented with poor punctu- 
ation. A completion item is split at the bottom 
of the page, with 3 blanks on one page and 5 
on the other. And so on. There are similarly 
numerous defects in the logic of items. One 
matching item demands recall of the Dewey 
classification numbers without indicating that 
they are Dewey. Another matching item has a 
very heterogeneous set of choices. One multiple- 
choice item has two reasonably good answers. 
One completion item asks what a dictionary is 
about. (One wonders if the key permits “Every- 
thing" as an answer.) Most of the defects can- 
not readily be indicated briefly and so are not 


mentioned in detail here. The college level has 
fewer technical defects but is not perfect. 

The three levels of the test give the impres- 
sion of a fairly good idea poorly executed. 
Carelessness, or ignorance of what makes a 
good test item, is too prevalent. Statistical 
standardization will not rid a test of such faults, 
and the use of such tests in schools gives false 
notions of what is known and what is not be- 
cause of numerous spurious factors, 


Ed Res B 20:114 Ap 16°41. Ruth E. Seeger.* 
The directions are clear and simple, both for 
administering and for scoring. * The questions 
are well chosen to cover desired items and are 
well worded, making for clarity and exactness 
of meaning. They are designed to be used pri- 
marily to discover which students are in need 
of library instruction and upon what phases of 
library instruction they need such instruction. 
They may also be used as a measure of the ef- 
fectiveness of such instruction. All forms should 
be a definite contribution in the field of testing 
for efficiency in using libraries, particularly 
since good tests in library usage are not plenti- 
ful. They are especially timely in that library 
instruction is coming to be considered more 
and more essential in the training of youth by 
present curricular methods, and that such train- 
ing is beginning to be considered as one phase 
of standardization and evaluation of secondary 
education. 

J Higher Ed 12:217-8 Ap ’41. Ruth E. 
Seeger. * an outstanding new library informa- 
tion test * The test presupposed the use of the 
Dewey decimal classification system * 


[ 539 J 
Special Reading Test: Ohio Senior Survey Tests. 
Grade 12; 1934-38; Forms A, B; 3¢ per test; 60(65) 
minutes; S. L. Pressey and J. W. Sherburne (Form 
B); Ohio Scholarship Tests, Ohio State Department 
of Education. 


Miles A. Tinker, Professor of Psychology, 
University of Minnesota, Minneapolis, Minne- 
sota. This is one of the tests devised for use 
with Ohio high school seniors. The reading 
abilities measured are: technical vocabulary of 
English grammar ; foreign words, phrases, and 
abbreviations; use of the dictionary; and get- 
ting information from graphs, maps and di- 
agrams. The test is to be used as a point of 
departure in remedial instruction to locate stu- 
dents deficient in minimum essentials and to 
diagnose specific student weaknesses. In Ohio, 
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Form A of the test is given before the remedial 
work of the senior survey course starts, and 
Form B after the remedial program is com- 
pleted to check improvement. 

Percentile norms are given for each part and 
for the total scores on Form A and on Form B 
separately. Also item norms (ie., percentage 
answering each item correctly) are given. There 
are no data listed on techniques of standardiza- 
tion or on equivalence of the test forms. Ex- 
amination of the percentile tables reveals that 
the test forms are not of equal difficulty. There- 
fore, the interpretation of changes in test scores 
from Form A to Form B is hazardous. One 
can only note changes in percentile ranks. Since 
the two forms are apparently of unequal diffi- 
culty and we have no information on reliability, 
the meaning of the shifting in percentile ranks 
must remain obscure. No information is given 
on either the reliability or the validity of the 
test scores. In the absence of these data, no 
adequate interpretation of the test results is 
possible. It is difficult to justify the publication 
and use of a reading test without making avail- 
able data on reliability. 

The legibility of the print in the test is poor 
because of blurring of letters as if badly worn 
type had been used for the printing. In fact, 
some of the print in the maps and map expla- 
nations cannot be read at all. Directions are 
clearly and concisely worded, 

The content of this test is well selected, and 
the test items are well organized. The reading 
abilities measured are important both in school 
and in many of the nonacademic reading ac- 
tivities of the individual, 

The organization and content of this test are 
somewhat similar to those of the Tyler-Kimber 
Study Skills Test and the Chicago Reading 
Tests. Available information indicates more 
adequate standardization for the Tyler-Kimber 
and the Chicago tests, None of these tests, 
however, is entirely adequate. It is desirable 
that more satisfactory tests of this type be 
constructed and standardized. They can be very 
useful in the diagnosis of certain important 
reading abilities. 

Both the lack of data on standardization and 
the illegible print of this test will be obstacles 
to its use. 

[ 540 ] 


Study-Habits Inventory, Revised Edition, 1941. 
Grades 12-16; 1934-41; 1 form; $1.25 per 25; 10€ per 


Mn set; nontimed (10-20) minutes; C. Gilbert 
renn; Stanford University Press. 


REFERENCES 
1. Reever, C. W. "Study Habits.” Sch & Soc 42:413-5 S 21 


"35. * (PA 10:616 o: 
m ru AND Staton, THomas F. “Predictive Value 


of Various Tests of Emotionality and Adjustment in a Guidance 
Program for Prospective Teachers.” J Ed Res 32:653-9 My '39. 
= EA taa) A 
. Gorpon, H. Puorse. "Study Habit Inventory Scores and 
Scholarship." J Appl Psychol 25:101-7 F '41. * (PA 4 

4. Wrenn, C. GILBERT, AND HUMBER, Wirsum J. "Study 
Habits Associated With High and Low Scholarship.” J Ed Psy- 
chol 32:611-6 N "41. * (PA 16:2801) j a 

5. Jouwsox, A. P. The Prediction of Scholastic Achievement 
for Freshman Engineering Students at Purdue University. Pur- 
due Universit; Division of Educational Research, Studies in 
Engineering Education II. Lafayette, Ind.: the Division, May 
1942. Pp. 22, Paper. $0.35. * (PA 16:5020) 

6. Jounson, A. P. The Relationship of Test Scores to Scho- 
lastic Achievement for 244 Engineering Freshmen Entering 
Purdue University in September 1939. Unpublished doctor's the- 


sis, Purdue University, 1942. $ À 
7. MusseLman, Joun W. “Factors Associated With the 


Achievement of High, School Pupils of Superior Intelligence.” 
7 frt a rosse Syt E "A Note on the Wrenn Study Habits 
Inventory.” J Genetic Psychol 60:385-6 Je '42. * (PA 16:4555) 

Douglas E. Scates, Professor of Education, 
Duke University, Durham, North Carolina. 
This is a short test (7 or 8 minutes ought to be 
sufficient for the actual responses), well pre- 
sented as far as form is concerned, and well 
described in the accompanying manual. The 
test is recommended by the maker for clinical 
use, with the emphasis on an analysis of re- 
sponses to individual items rather than on the 
total score. 

The reviewer’s reaction is much the same as 
to all personality inventories of this general 
type. First, the respondent does not know. the 
proper answer (descriptive of himself) on 
many of the questions; second, different per- 
sons will give the same response for widely 
different reasons; third, such question blanks 
presume a degree of honesty and frankness on 
the part of the respondent which is incredible. 

The reviewer recalls an early comment of 
Goodwin Watson to the effect that certain per- 
sonality questionnaires utterly ignore the con- 
ditions under which adolescents will reveal 
their real problems. And there is also the sug- 
gestion by Gordon Hendrickson that full rap- 
port and its opposite (complete lack of con- 
cern) probably tend to produce more accurate 
responses but that in the great middle range 
of attitudes serious difficulty arises. For the 
majority of persons, so many factors operate 
that truth is likely to be not only veiled but 
effectively screened. Certainly students, like 
other persons, will seek the most respectable 
(and probably irrelevant) explanations of their 
difficulty. 

For example, the test affords a number of 
easy “outs” for the student who lacks the drive 
and possibly the ability to study. He can find a 
large number of mechanical or external reasons 
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to check and thus successfully divert the coun- 
sellor’s attention from the hard truth. Who 
would not, for example, prefer to take a formal 
scolding for the prevalent indulging in too much 
social life (Item 17) instead of facing the diffi- 
cult job of adequately summarizing, classify- 
ing, and systematizing the material learned 
(Item 23)? Or who would not prefer the ap- 
parently blameless “I get ‘fussed’ and nervous 
on examinations” (Item 18) as an excuse 
rather than admit weakness in the intrinsically 
hard matter of reciting to one’s self the ma- 
terial studied and checking all doubtful points 
(Item 3) ? What such natural and nearly uni- 
versal tendencies to avoid the truth do to the 
basic validating and standardizing of the test 
can only be conjectured. What these tendencies 
do to the responses of particular individuals is 
to give highly variable and undependable re- 
sults from one person to another. 

Aside from volitional or unconsciously eva- 
sive errors of response are the errors of in- 
terpretation arising from different bases for 
the same response. “I have to re-read material 
several times." Certainly, we all do. What does 
this response mean? It may be a function of 
the intrinsic difficulty of the material (as math- 
ematical, detailed description vs. newspaper 
stories or fiction). The response may be a func- 
tion of the type of mind (as generally resist- 
ant, critical, doubting vs. highly fluid, submis- 
sive, credulous). Or the response may arise 
from one's purpose in reading (as to get a per- 
fectly clear understanding or to memorize vs. 
to get a sketchy notion). Again, different stand- 
ards of work will cause different amounts of 
rereading, and so on. W/hy does the student 
respond as he does? Who can tell? Again, “I 
miss important points in the lecture while copy- 
ing down notes." Of course. With some in- 
structors nobody could help it. But the test is 
supposed to diagnose the student, not the 
instructor. 

Honest questions will arise in trying to an- 
swer the questions. “I spend too much time on 
some things and not enough on others.” Could 
anyone say no to this one? But apparently 
some persons are supposed to—who? “I finish 
my examination papers and turn them in be- 
fore time is called.” Suppose I do on objective 
tests, but on essay tests I always run over time. 
Then I will wish study-habit tests did not con- 
tain impossible questions! 

The statistical procedure for constructing 


the test is conventional and presumably impec- 
cable. Yet one notes that the scoring weights 
in more than one-third of the items do not run 
in a single direction but reverse themselves in 
a manner difficult to rationalize. For example, 
“Trouble in picking out the important points” 
counts —7 if it occurs sometimes; but there is 
no loss of points if the difficulty occurs often. 
To “study with others” is all right if it is done 
always, but the student who does it only oc- 
casionally shows poor study habits. Similarly, 
it is better to engage in bull sessions often than 
only sometimes. And so on. One wonders a lit- 
tle whether statistics aren’t as misleading as 
some persons claim they are. 

The scoring key reveals that only 12 of the 
28 items are discriminating in the case of 
women, Are not women human beings? Do 
they study or fail to study by means as yet un- 
discovered by men? Granted that they may per- 
plex and confound members of the opposite 
sex, but do they also confuse the statistics ? 

The items are selected and the weights at- 
tached on the basis of extreme groups—a tech- 
nique which is fairly common and generally 
accepted. But we must bear in mind that the 
bare fact that a testmaker can get two groups 
of college students of equal average intelli- 
gence, with one group doing well in its studies 
and the other doing poorly, is not adequate evi- 
dence that the two groups are obliging enough 
to differ markedly and consistently with respect 
to the trait which the testmaker happens to be 
thinking of (in this case, study habits). There 
were, certainly, hundreds or thousands of fac- 
tors besides study habits which operated to 
produce differences in college work. 

In the reviewer’s judgment, validity and other 
aspects of the standardization of such a test 
call for the selection of items and the determi- 
nation of weights only in connection with the 
most skillful interviews to determine what the 
response meant to the student and why he gave 
the response he did. Items found to mean dif- 
ferent things would then either be rejected or 
would be subdivided and defined further; the 
present test could probably be made four or 
five times its present length without the addi- 
tion of a single new question—just by making 
the present questions more specific. We have 
been taught now for three decades to utilize 
and depend on statistical methods. Perhaps the 
time has come to learn that we cannot rely on 
statistical methods—at least as they are often 
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used. Statistics suffers most at the hands of its 
friends. 


For reviews by Edward S. Jones and William 
A. McCall, see 40:1574. 


[54 ] 
Survey of Study Habits, Experimental Edition. 
Grades 8-12; 1944; 1 form; nontimed (30) minutes; 
Arthur E. Traxler; Educational Records Bureau. 


REFERENCES 


1. TRAXLER, ARTHUR E, “Some Results of an Experimental 
Survey of the. Atuar Habits of Independent-School Pupils,” pp. 
0-9. In 1944 Fall Testing Program in Independent Schools and 
'upplementary Studies, Educational Records Bulletin, No. 42. 
New York: Educational Records Bureau, January 1945. Pp. x, 
49. Paper, lithotyped. $1.00. * (PA 19:1812) 


[542] 
Test of Study Skills. Grades 4-9; 1940; IBM; 
separate answer sheets must be used; Forms A, B; 
$1.75 per 30; 14£ per machine-scorable answer sheet; 
25¢ per specimen set; 60(70) minutes; J. W. Edgar 
and H. T. Manuel; Steck Co. 


Douglas E. Scates, Professor of Education, 
Duke University, Durham, North Carolina. The 
test is well put up, makes no false claims, and 
will probably do as nearly as a paper-and-pencil 
test can, in a limited time, what it is supposed 
to do. The first part deals with the use of refer- 
ence books, indexes, and dictionary and the 
reading of graphs, tables, and maps. The second 
part deals with critical inference. ; 

For a basis for comparison, one’s thinking 
turns naturally to the test Work-Study Skills 
in the Jowa Every-Pupil Tests of Basic Skills. 
The test under review differs at once by giving 
nearly half of its items to critical thinking 
rather than all to work skills. And among the 
latter the present test seems to have certain ad- 
vantages. In the first place, it avoids the dispro- 
portionate emphasis (nearly one-third) of the 
Towa test (grades 6 to 8) on interpreting maps. 
It gives nearly half of the first part to general 
sources of information, including the use of 
reference books in general; this emphasis may 
be somewhat heavy but it is not indefensible. 
In the second place, the questions on the several 
aspects of securing information and interpret- 
ing technical material are all mixed together, so 
that they can be arranged spirally as to diffi- 
culty. They thus avoid any tendency toward 
monotony, and, inasmuch as the questions all 
relate to the same general field, this mixture 
does not seem to the reviewer to be objection- 
able. 

The second half of the test, concerning “criti- 
cal thinking in the use of printed materials,” 
represents an adaptation of the evaluation-of- 


inferences tests, developed during the Eight- 
Year Study under the influence of Ralph Tyler, 
in which one is asked to indicate whether cer- 
tain statements are true, probably true, unsup- 
ported, and so on, in the light of facts which 
are given. The present adaptation offers only 
three alternatives which include yes, no, and 
can’t tell. The adaptation is probably an advan- 
tage, especially for young children, as the five 
categories of positiveness are difficult even for 
adults to manage. Another advantage of print- 
ing the alternatives for each item is that they 
can be specially adapted to the wording of the 
item, and this is done in the test. The reviewer 
would probably prefer four alternatives, on the 
basis of a finding by Giles M. Ruch some years 
back that three-response items were somewhat 
tricky and erratic. 

One cannot fail to think of other important 
aspects of study skills, such, for example, as 
outlining and the ability to get the central 
thought in reading. These are, of course, cov- 
ered in other tests; but perhaps the name of the 
present test is somewhat broader than its actual 
coverage. If that is a sin, it is shared by other 
tests in this class. 

One notes in the sheet of directions the fol- 
lowing cautious and salutary statement concern- 
ing validity “The claim for validity rests simply 
upon the careful selection of items in the limited 
area which the test is designed to measure. It is 
believed that the test will be useful in individual 
guidance and in the motivation of learning. It is 
not designed as a pattern of either the content 
or the method of teaching pupils how to use 
printed materials. The test merely samples in- 
formation and abilities here and there in the 
total matrix of ability which the pupil has de- 
veloped.” Such frank, honest, and well-oriented 
statements are refreshing. So also is the follow- 
ing statement on interpretation: “Users of the 
test . . . should be cautioned against an over- 
emphasis upon comparisons either of individuals . 
or of groups with averages. It is easy to make 
the erroneous assumption that averages in 
some way necessarily reflect conditions that are 
desirable.” 

These quoted comments are welcome and 
helpful in mitigating the overemphasis which 
paper-and-pencil tests unavoidably place on cer- 
tain formal, highly verbalized aspects of ability. 
When more testmakers reveal and pass on to 
the user such insight and understanding, we 
may have a lessened fear of what commercial 
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tests in the hands of the teacher who lacks the 
judgment and the self-confidence to see around 
and beyond them are doing to education. The 
reviewer welcomes these comments in the man- 
ual as evidencing a more than ordinarily mature 
understanding of the nature and role of stand- 
ardized tests, 


[ 543 ] 
Use of Library and Study Materials. Grades 9-16; 
1939-40; IBM; separate answer sheets must be used; 
$1.75 per 30 tests; Mary Kirkpatrick, Lola Rivers 
Thompson, and Helen Tomlinson; Steck Co. 


REFERENCES 
_ 1. Voraw, Davin F. “A Comparison of Test Scores of Enter- 
ing College Freshmen as Instruments for Predicting Subsequent 
Scholarship.” J Ed Res 40:215-8 N '46. * (PA 21:1304) 


Ethel M. Feagley, Associate Librarian and 
Assistant Professor. of Education, Teachers 
College, Columbia University, New York, New 
York. It is always essential to know why a test 
has been prepared and what use is to be made 
of its results. The purpose of this test as ex- 
plained in the manual of directions is to provide 
some of the necessary instruments for apprais- 
ing the development of study and library skills. 
The manual adds that the test may be used as 
the basis for motivating the learning of the 
skills which it samples and may also be used as 
a follow-up test to reveal progress. It is not 
clear, therefore, whether this test should be con- 
sidered as diagnostic in nature or as measuring 
progress. Usually a test is a better instrument 
if prepared definitely as one type—diagnostic or 
achievement. 

The test is in two main parts. Part I is de- 
signed as a measure of a student's ability to 
locate information; Part II similarly tests cer- 
tain aspects of his ability to understand the 
material once it has been found. The questions 
are all phrased in multiple-choice form. Part I, 
containing 45 questions, covers the following 
groups of items: magazines, alphabetizing, call 
numbers, dictionaries, magazine indexes, the 
book, classification, card catalog, the library, 
bibliography, and specific reference books. It is 
unfortunate that the test is designed for both 
high school and college students as some of the 
questions refer to materials which would not 
usually be found in a high school library, such 
as the Art Index, Education Index, Interna- 
tional Index, and the Union List of Serials. On 
the other hand, the questions on reference books 
are too elementary for college students. For in- 
stance, out of eleven such questions three refer 
to the World Almanac. Among these 45 ques- 


tions six refer to alphabetizing and six more to 
call numbers. This seems a large proportion, 
particularly since the questions are quite similar 
and do not call for different kinds of analysis. 
Not all the questions test locating information. 
Judgment or recalled knowledge is required in 
four questions about magazines. For example, 
the student is asked to choose from five maga- 
zines that one which “has a regular department 
dealing with household devices and patents.” 
The questions on magazine indexes are quite 
limited in scope. More items should be related 
to the use of the index, subject headings, sub- 
divisions, and other details which enable the 
student to manipulate the index. Of course such 
skills are more difficult to test, and at present 
few good test items of this kind are available. 
As is so often the case, the questions seem to 
be phrased from the viewpoint of the librarian 
rather than the user of the library material. 

Part II, Interpreting Information, deserves 
favorable consideration. It contains some ingen- 
ious and useful techniques for testing the stu- 
dent’s ability to interpret and handle different 
types of facts and information. The questions 
are based on columns of statistics, indexes, 
graphs, various kinds of maps, a catalog card, 
and words with diacritical marks. Some of the 
printing on the map in Form A is difficult to 
read. The section on indexes needs more de- 
tailed questions for college students with per- 
haps references to an index for a whole set of 
books. It would be better to have a wider range 
of questions on the catalog card rather than 
limit the number by using the multiple-choice 
form. The last group in this section, dealing 
with diacritical marks, has two questions, Items 
75 and 80, which are based on the recall of 
learned knowledge rather than on interpretation 
of given facts. 

The directions for administering the test are 
clear, and the test may be scored by machine 
although a hand-scoring key is provided, Per- 
centile norms are given for Form A based on 
2,356 scores of Texas college freshmen. 

By trying to serve both high school and col- 
lege this test reduces its value to each one. A 
separate test for each group would be prefer- 
able. The test is further weakened by not clearly 
defining its aim—to diagnose or to measure 
progress. Usually different test procedures 
should be employed depending on which type of 
test is being prepared. An "all purpose" test is 
seldom very effective. This is one of the weak- 
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est phases-of library tests at present, together 
with the uniformity of test items and the small 
number of skills tested. More thought, study, 
and experimentation must be devoted to analyz- 
ing the facts and skills needed in this area and 
writing new and effective test items. Here is a 
task in test construction which still challenges 
and which will not be solved until librarians 
and test experts selected from a large num- 
ber and variety of local situations bend their 
joint efforts to the undertaking. 


[ 544 ] 
*Watson-Glaser Tests of Critical Thinking. 
Grades 9-16 and adults; 1942; IBM; Form A; 2 parts; 
separate answer sheets need not be used; $2.50 per 25 
of either part; 35¢ per specimen set, postpaid; 95¢ per 
25 machine-scorable answer sheets for either part; 
nontimed (60) minutes per part; Goodwin Watson 
and Edward M. Glaser; World Book Co. 
3) BATTERY I, DISCRIMINATION IN REASONING. 
b) BATTERY II, LOGICAL REASONING, 


REFERENCES 
1, Graser, Epwarp M. An Experiment in the Development of 


Critical Thinking. Columbia University, Teachers College, Con- 

tributions to Education, No. 843. Goodwin Watson, faculty 

Sponsor. New York: Bureau of Publications, the College, 1941. 
p. ix, 212, Out of print. * (PA 16:2057 

2. Howzir, WiLLtAM. Swirzv. "The Effects of High School 
Crane on LET Thinking." Speech Monogr 10:96-103 '43. 
the Watson Glaser Fetis ot Erding! BONO t 
43-8 Ja '45. * (PA 19:1180) 

Robert H. Thouless, Reader in Educational 
Psychology, Cambridge University, Cambridge, 
England. No one who has tried to construct a 
test of this type will underestimate its difficul- 
ties. The authors have succeeded in making a 
test which should prove useful for measurement 
and diagnosis and also of educational value to 
the testees. The tests include the ability to 
recognise the validity of arguments, the detec- 
tion of implications of statements, consistency 
of opinions, etc. In addition to a straightforward 
assessment of the ability to think critically, there 
are ingenious methods of assessing qualities of 
opinions and effects of opinion on the ability to 
think critically on particular problems. 

One of the difficulties of making a good test 
of reasoning out of material of real significance 
is that of ensuring an unquestionably right an- 
swer as standard. The authors tried to achieve 
this end by submitting all the items to a jury of 
IS persons trained in logic and scientific 
method, who apparently showed perfect agree- 
ment as to the correct responses to all items, I 
think that a more critical jury might have found 
fault with some of the remaining items but not 
many. The battery is perhaps as good in this 
respect as any that is likely to be made in prac- 
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tice though it does not appear to be perfect. 
Scoring is by stencil or by machine scoring. 

Norms and estimates of reliability are only 
tentative. The norms given are calculated from 
samples of considerably more than average in- 
telligence. More comprehensive norms are to be 
obtained in the future from reports to the 
author by users of the test. 


Ed Res B 24:139-40 My 16 '45. Harold P. 
Fawcett. * will be welcomed by all who are in- 
terested in the development of this important 
ability * a helpful and effective means of meas- 
uring abilities which are today so essential to 
competent citizenship. While they relate directly 
to the stated objectives of teachers in various 
subject-matter fields, they are particularly suit- 
able for evaluating the extent to which the abil- 
ity to think has been achieved throughout the 
school program to which all subject-matters 
contribute. If they are not used for this purpose, 
teachers of such subjects as social studies, 
mathematics, and science will find them useful. 
They are probably best suited for students on 
the senior-high school level. * 


[ 545] 
Work-Study Skills: Iowa Every-Pupil Tests of 
Basic Skills, Test B, New Edition. Grades 3-5, 5-9; 
IBM, grades 5-9; Forms L, M, N, O; 2 levels; 44¢ 
per specimen set of either level; H. F. Spitzer in col- 
laboration with Ernest Horn, Maude McBroom, H. A. 
Greene, and E. F. Lindquist (General Editor); 
Houghton Mifflin Co. 
0) ELEMENTARY BATTERY. Grades 3-5; $1.50 per 25; 
47(55) minutes. 
b) ADVANCED BATTERY. Grades 5-9; IBM; separate 
answer sheets need not be used; a per 25; 50¢ per 
25 machine-scorable answer sheets; 30¢ per set of 
machine-scoring stencils; 77(90) minutes. 


J. Wayne Wrightstone, Assistant Director, 
Bureau of Reference, Research and Statistics, 
Board of Education, New York, New York. 
ELEMENTARY BATTERY. This battery presents 
exercises which measure (a) map reading, (5) 
use of references, (c) use of index, (d) use of 
dictionary, and (e) alphabetization. The test is 
the best avaliable in the elementary school field. 
Its items are carefully constructed, and the test 
Situations are presented so that the skills are 
measured as closely as possible to the way they 
function in everyday activities of the modern 
School. The skills, furthermore, are important 
in the everyday life of the child. 

ADVANCED BATTERY. This battery measures 
skills of (a) map reading, (5) use of references, 
(c) use of index, (d) use of dictionary, and (e) 
reading graphs, charts, and tables. These are 
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skills that function in the modern classroom, 
and they are measured in lifelike exercises by 
this test. The authors are to be commended for 
the manner in which they have approximated 
real or work sample situations in the test. 

Realistic maps are reproduced, and well- 
chosen questions on interpretation are asked in 
order to sample skills in map reading. To meas- 
ure the use of an index, a sample page from an 
index is reproduced, and interpretations are re- 
quired by the test items. Similarly, well- 
constructed items measure interpretation of a 
sample page from a dictionary, and of repro- 
duction of graphs, charts, and tables in respec- 
tive subtests. In addition, an ingenious answer 
sheet and scoring device are used to facilitate 
the scoring of this test. It is recommended that 
the publisher print separate answer sheets to 
avoid mutilating the test booklet by detaching 
the title page, which includes the answer sheet. 
While this may not increase the profits of the 
publisher, it will permit the re-use of the same 
test booklet by several pupils, who would be 
supplied with separate answer sheets as re- 
quired. 

GENERAL COMMENTS. The teacher's manual 
presents concrete suggestions for the interpreta- 
tion and use of the test results. Specific skills 
tested are coded for each test item. Norms are 
presented in terms of equivalent grade scores 
and percentile scores for the various grade 
levels. 

One disadvantage of the test is the unrelia- 
bility of subtests for diagnosis of individual 
pupil achievement. Although the authors state 
in the manual that the subtests are not reliable 
enough for individual diagnosis, this assertion 
is negated by the fact that they recommend an 
individual profile card, including subtest scores 
for each child. The unreliability is more readily 
apparent when it is observed that a raw score 
change of one point on many of the subtests will 
account for a grade-equivalent score difference 
of from three to twelve months. 


The published norms for the test are derived 
mainly from the use of the test in selected com- 
munities which are not representative of the 
nation-wide population. The experience of sev- 
eral cities which have used the test leads to the 
conclusion that the norms should be revised. 
This conclusion is based upon the assumption 
that a test publisher—distributing a test na- 
tionally—has a responsibility for providing 
norms which represent a cross section of the 
national population. This responsibility cannot 
be avoided by recommending that it would be a 
better practice to obtain local norms. Most cities 
and small school systems are not equipped to 
carry out effectively such a suggestion. Further- 
more, not all purposes in a survey testing pro- 
gram can be achieved by the use of local norms. 
The method-of indicating grade equivalents is 
somewhat at variance with the usual practice; 
for example, a grade norm of third grade, fifth 
month is reported as 35 (without the decimal 
point) rather than as 3.5. This is a minor point, 
but the idiosyncrasy of the publisher runs coun- 
ter to accepted practice, and no advantages are 
cited for this unique practice, which may cause 
some confusion for the average classroom 
teacher. 

Teachers and supervisors generally have ac- 
cepted the work-study skills tests as the most 
significant contribution of the Iowa Every- 
Pupil test batteries. The use of lifelike work 
samples appeals to them. In the modern class- 


room the newer work-study skills are assuming | 


a position of equality with the more established 
skills in reading, arithmetic, and the language 
arts. 


For reviews by Frederic L. Ayer, Gustav J. 
Froehlich, and Ralph C. Preston of the com- 
plete battery, see 10. For reviews by William A. 
Brownell, J. Murray Lee, and Charles W. Odell 
of an earlier form of the complete battery, see 
38:872. 
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Reviews Bv Theodore A. Ashford, Alfred S. Brown, G. P. Cahoon, John H. Daugherty, Jack 

W. Dunlap, John C. Flanagan, Sidney J. French, Hans C. Gordon, Florence E. Hooper, C. W. 

Horton, Palmer O. Johnson, Richard H. Jordan, William B. Meldrum, Clarence H. Nelson, 

Victor H. Noll, Shailer Peterson, William Rieman III, Joseph J. Schwab, Edgar P. Slack, 
Herbert A. Thelen, Robert M. W. Travers. 


[ 546 ] 
*Ability for Science: Fife Tests of Ability, Test 
2. Entrants to secondary schools (England and Scot- 
land); 1947; 1 form; 5s. 6d. per 25; 4d. per manual; 
30(35) minutes; Frank M. Earle; University of Lon- 
don Press Ltd. 
REFERENCES 
1, EARLE, Frank M. Tests of Ability for Secondary School 


Courses. Publications of the Scottish Council for Research in 
Education, No. 10. London: University of London Press, 1936. 


-— € Pp. xiii, 138. s. * 


RANK M. Reconstruction in the Secondary School, 


2, EARLE, 
8s. 6d. 


London: University of London Press, 1944. Pp. 188. 
For a review by James Maxwell of the entire 
battery, see 8. 


[547 ] 

*Physical Science Aptitude Examination. Grades 
12-13; 1943; IBM; Form S; separate answer sheets 
must be used ; $1.25 per 25 sets of test and hand-scored 
answer sheets; $1 per 100 hand-scored answer sheets ; 
I0 per manual; 5¢ per keyi 15¢ per specimen set, 
postpaid; 60(70) minutes; C. J. Lapp, E. W. Chitten- 
den, and D. B. Stuit’; Bureau of Educational Research 
and Service, State University of Iowa. 


Jack W. Dunlap, Dunlap, Morris, and Asso- 
ciates, New York, New York. This test was re- 
leased to the public in 1943 in what is obviously 
an experimental edition. The test is designed to 
measure an individual’s aptitude for the study 
of mathematics and physical sciences and to 
predict his chances for success in work of a 
technical or engineering nature. Specifically, the 
authors claim the test can be used to (a) afford 
a basis for the prediction of the level of work a 
student will accomplish in the physical sciences, 
(5) aid in selecting and admitting individuals 
to technical and engineering courses of study, 
(c) section classes for instructional purposes, 
(d) assist in deciding how much work an indi- 
vidual should carry, (e) deal more effectively 
with individuals who are not well oriented in 
their academic work, and (f) furnish a basis for 
the diagnosis of class weaknesses. 

The test consists of four parts: mathematics, 
formulation, number series, and information. 
The test for formulation requires the subject to 
express statements of quantitative relations as 


algebraic formulas. The information test essen- 
tially is a measure of interest in the physical 
sciences. The mathematics and number series 
are typical of this class of tests. 

"There is evidence that the test as well as the 
subtests has validity for predicting course 
grades, as'the correlation coefficients of the sub- 
tests with mathematics, chemistry, and physics 
vary from .335 to .707. The subtest for mathe- 
matics correlated .707 with mathematics; the 
subtest for formulation, .595 with chemistry; 
and information, .6or with physics. Unfortu- 
nately, the coefficients are based on a single 
sample with an N of 188. While indicative of 
the value of the test, they can at best be taken 
only as presumptive evidence. It is to be regret- 
ted that four years have passed without the 
authors furnishing additional validation evi- 
dence. 

No reliability coefficient has been reported 
for the number series subtest, and for the total 
test only an estimate is available. In view of the 
simplicity with which reliability data can be 
secured, such an omission cannot be condoned. 
However, reliabilities are reported for the sub- 
tests on mathematics, formulation, and informa- 
tion as .90, .68, and .87, indicating reasonable 
reliability for the subtests. 

The directions for administering the test 
while clear give the impression of amateurish- 
ness and should be “cleaned up" before the test 
is unconditionally released to the public. The 
reader's attention is called to the possibility of 
misinterpreting the following directions for 
scoring, “but in part four the score is one-half 
the number right minus the number wrong.” 
Later the authors go on to give an example of 
how to compute the score for this section as 
follows: “The score on the test is obtained by 
subtracting the number wrong from the number 
right and then dividing by two.” 

In general this test appears to have merit, but 
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either the authors or the user will have to build 
up a more complete body of knowledge as to 
validity and reliability before the test attains 
maximum value. 


John C. Flanagan, Professor of Psychology, 
The University of Pittsburgh, Pittsburgh, Penn- 
sylvania. This examination was printed in 1943. 
However, it does not contain new material but 
is a reprinting of three subtests from the Jowa 
Physics Aptitude Test and one subtest from the 
Iowa Mathematics Aptitude Test. The Iowa 
Physics Aptitude Test was originally published 
in 1925 and a revised form was issued in 1941. 
The Jowa Mathematics Aptitude Test was first 
published in 1925 and the revision was printed 
in 1942. 

The test consists of four parts. The first part 
is called Mathematics and contains about 20 
items requiring simple algebraic skills and 5 
others involving a knowledge of decimals, per- 
centages, and square root. The second part, 
Formulation, contains 22 statements of quanti- 
tative relations expressed verbally, for which 
the appropriate algebraic expressions are to be 
written. The third part, Number Series, in- 
cludes 20 sets of numbers developed according 
to rules which the individual must determine to 
answer them correctly. The last part of the 
test, Information, is composed of 75 true-false 
items which provide “a measure of the individ- 
ual's familiarity with the general principles of 
physical science. In a sense this can be regarded 
as being a measure of interest. Even though 
the student may never have studied the physical 
sciences he may conceivably have acquired a 
good deal of the information sampled in this 
test, provided he is interested in these fields." 

The manual recommends that total scores be 
used for screening or prediction purposes. The 
test has been designed for use principally with 
persons who have had training comparable to 
that of a high school senior or a college fresh- 
man. 

Validity data are reported comparing scores 
made on the various parts of the test with 
grades in college courses. The manual states 
that “to date the validity of the test has been 
studied only in the prediction of success in the 
physical sciences. The relation between test 
scores and occupational success remains to be 
investigated.” 

The reporting of statistical information re- 
garding this test provides an example of the 


far too prevalent inadequate reporting of such 
data even by long established bureaus in reputa- 
ble universities. The validity coefficients for 188 
students in introductory engineering are mostly 
between .50 and .60 for the various parts. These 
coefficients are reported to three decimal places, 
although the standard errors are between .04 
and .o7. The intercorrelations for the parts of 
the test are not presented. The reliability co- 
efficients for this same group of students for 
three of the parts are reported as .68, .87, and 
.90. No data regarding reliability is presented 
for the other part of the test, for which validity 
data is provided on 79 freshmen taking mathe- 
matics courses. 

The reliability and validity of the total test 
score are not known, but the manual speculates 
that “total test scores should correlate well over 
.70 with grades in physical science." This sug- 
gests that the authors believe that correlations 
for parts would not be especially high. How- 
ever, the authors also speculate that reliability 
for the total test is .95. This would require that 
the intercorrelations between the subtests be 
very high. Tt is very difficult to make any checks 
of these "estimates" of the authors, since not 
only are the intercorrelations not reported, but 
there is also no report of the values of means 
and standard deviations. There is no mention 
of the method used in obtaining the reliability 
coefficients which are calculated for the three 
parts. 

Percentile norms based on 446 cases are re- 
ported. However, this reviewer was unable to 
find any definite information regarding the na- 
ture of this group. In connection with the report 
of the results of a special group of 168 students in 
an army meteorology school, it is stated that “the 
percentile scores made by this group are strik- 
ingly higher than those made by the ordinary 
college population." From this it might be in- 
ferred that the other norms refer to the “ordi- 
nary college population." 

In conclusion, it may be said that the items in 
the subtests of this examination appear to be of 
uniformly good quality and that each of the 
subtests appears relevant to the problem. In the 
absence of data regarding the intercorrelations, 
this reviewer is inclined to speculate regarding 
the likelihood that the first three parts are 
somewhat more highly correlated than would be 
desirable in a test battery for this purpose. The 
test can be expected to be found useful in pre- 
dicting grades in physical science courses. 
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Test of General Proficiency in the .Field of 
Natural Sciences: Cooperative General Achieve- 
ment Tests, Revised Series, Test 2. Grades 10-12 
and college entrants; 1942-47; IBM; Forms S, T, X; 
separate answer sheets need not be used; $2 per 25; 
254 per specimen set, postpaid; 4o¢ per 25 machine- 
scorable answer sheets; 15¢ per stencil for scoring 
answer sheets; 40(45) minutes; Form S: Charlottc 
W. Croon, Mary Willis, Alexander Calandra, Richard 
E. Watson, John G. Zimmerman; Form T: Carl A. 
aeon: Form X: Paul J. Burke; Cooperative Test 
ervice. 


Palmer O. Johnson, Professor of Education, 
The University of Minnesota, Minneapolis, 
Minnesota. These tests are designed to measure 
general proficiency or power in the broad field 
of natural sciences rather than the outcomes in 
a specific subject-matter course. The outcomes 
measured are presumed to be common to all 
science courses. In one sense, these tests are ac- 
cordingly supplementary to the specific course 
tests in the individual fields. 

Each test is made up of two sections: one 
section allotted 15 minutes consists of general 
terms and concepts; the other assigned 25 min- 
utes is made up of reading materials requiring 
comprehension and interpretations. 

The scientific terms and concepts are said to 
have been selected largely from frequency 
studies of vocabulary. One of the sources used 
was compiled over twenty years ago. That the 
terms were taken from diverse fields is indi- 
cated by the following distributions for the two 


forms: 
FORM S FORM T 
Astronomy 3 
Botany 
Chemistry .... 
Genetics ...... 


Geology 
Human physiology . 
Physics 
Psychology .... 
Scientific method . 
Zoology 


Slwonownrwoun 


Total number of terms ... 50 


Most of the terms and concepts are tested for 
by setting up items of the indirect or inter- 
pretative type rather than in the form of text- 
book definitions. 

Tn the second section, paragraphs of scientific 
materials adapted from newspaper and maga- 
zine articles or from advanced textbooks or other 
source materials are presented. In Form S, 
there are six different sets of material, in Form 
T, seven. Each set of materials is followed by 4 


to 8 multiple-choice items. These questions, 
with few exceptions, appear to call for critical 
evaluation and interpretation of the passage 
read. The student is usually required to draw 
inferences and implications from the scientific 
facts or principles presented. In the reviewer's 
opinion the materials and questions following 
them display quite generally careful selection 
and technical skill in the development of the 
corresponding questions. Gradation in length 
and difficulty of passages and in the correspond- 
ing difficulty of questions is apparent. 

The accompanying manual gives detailed di- 
rections for administering the tests and ex- 
tended discussions of the use and interpretation 
of the test results. Tables are given for con- 
version of raw to scaled scores. Percentile 
norms based on the scaled scores are furnished, 
but information such as the characteristics of 
schools and students constituting the norm 
groups is lacking. The standard errors of scaled 
scores are indicated, but there is no measure of 
the relative accuracy of measurements, such as 
reliability coefficients, reported. 

The chief value of tests of this kind to the 
classroom teacher is in the estimate of individ- 
ual growth in the ability to use the scientific 
method, particularly those aspects of the method 
related to the drawing of inferences and con- 
clusions, The availability of comparable forms 
and scaled scores makes possible an estimate of 
growth over a period of one year or over longer 
or shorter intervals of time. The ordinary in- 
structor is not skilled in building tests for this 
purpose. The increasing emphasis upon this 
growth as a major outcome from the study of 
science makes highly developed tests of this 
kind of especial importance. 


For a review by John V. McQuitty of the 
complete battery, see 3. 


BIOLOGY 


[ 549 ] 
Biology: Every Pupil Test. High school; 1946-47; 
new form usually issued each April and December; 
forms December 1946, April 1947; 2¢ per test; 1¢ per 
scoring key; 40(45) minutes; David B. Davis; Ohio 
Below stir Tests, Ohio State Department of Educa- 
ion. 
Y [ 550 ] 
ooperative Biology Test. High school; 1030-47; 
Forms P, Q, S, X; IBM ; separate answer sheets need 
not be used; $2 per 25; 25¢ per specimen set, postpaid ; 
40f per 25 machine-scorable answer sheets; 15é per 
stencil for scoring answer sheets; 40(45) minutes; 
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Form P: F. L. Fitzpatrick and S. R. Powers; Form Q: 
F. L. Fitzpatrick; Form S: Thomas F. Morrison, Vir- 
ginia F. Babcock, Donald H. Miller, Robert H. Wood- 
worth, and Carl A. Pearson; Form X: Paul E. Kam- 
bly; Cooperative Test Service. 
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Program in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 33. New York: Educational 
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$1.00. * (PA 15:4388) 


C. W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. [Review of Forms P, Q, and S.] 
"These three forms constitute a series of sound, 
presumably comparable, tests in biology at the 
high school level. All are 40-minute tests, di- 
vided, for no easily discernible reason, into 
Part I (25 minutes for 75 questions in Forms 
P and O, 69 questions in Form S) and Part 
II (15 minutes for 45 questions in Forms P 
and Q, 42 questions in Form S). Some ques- 
tions involving identification of structures 
occur at the beginning of Part II in each form, 
but otherwise (with the possible exception of 
Form S) the division into parts seems to be 
without meaning. 

"That the tests can be used in a class period 
is a point in their favor. That the questions are 
not numbered consecutively through the test is 
a minor point against them. This practice of 
beginning question numbers anew with r in 
each section of a test is a rather widespread 
but silly practice that should be abandoned now 
that machine scoring is common. It is of no 
advantage to the student and frequently is an- 
noying to the test administrator because it rules 
out the use of standard IBM answer sheets, In 
all of these tests the answers can be recorded 
in the books, or special IBM answer sheets are 
available. Raw scores on all forms can be con- 
verted into the CTS system of scaled scores. 

. Norms are available. 

Critical evaluation of the several forms as to 
coverage and question types is difficult without 
a detailed tabulation, which was not made by 
this reviewer. One gets the impression that 
Form P (authors Fitzpatrick and Powers) and 
Form Q (author, Fitzpatrick with the editorial 
assistance of others) are very much alike and 
that Form S (several authors with editorial 
assistance) has taken a somewhat different 
course. In Form S there seems to have been an 
attempt to concentrate the more specific, more 
limited “information” items in Part I and to 


present the questions involving application and 
interpretation along with identification of struc- 
tures in Part II. In Part II of Form S two 
very good problems are presented. One de- 
scribes an experimental situation in relation to 
which the student is asked what would happen, 
why certain things were done, and which of 
the suggested generalizations is supported by 
the expected result. Another involves thinking 
in terms of the organic-inorganic “cycles” and 
the interrelation of plants and animals and is 
the basis for several questions. Inclusion of 
such problems is a point of superiority over 
earlier forms. 

On the other hand, considering Form S as a 
whole, much has been lost. Part I of Form S 
is rather picayunish at points, while in Forms 
P and Q a great many of the items through- 
out the test are stated as realistic problems in- 
volving recall and use of important facts and 
generalizations. Many of them are excellent, 
and they can be called "Application of Prin- 
ciples" questions with quite as much validity as 
one can apply that characterization to the prob- 
lems of the USAFI Examination in Biology. 
In the earlier forms also, the meaning of some 
terms is very cleverly explored through de- 
scription of a situation, and several questions 
investigate the student’s understanding of cer- 
tain aspects of scientific method (but not, how- 
ever, his ability to use it). 

As a whole the series of tests is strong in 
items involving application of information in 
realistic situations in the areas of health, foods, 
and “economic” biology, but much too little 
emphasis is given to questions which explore 
the student’s ability to plan experiments, to 
judge the validity of proposed inferences from 
data, and to perform other aspects of thinking 
characteristic of scientific method. 

A decrease in amount of material to be read 
by students is noted in successive forms, The 
questions occupy about 974 pages in Form P, 
874 in Form Q, and 674 in Form S. One won- 
ders how many students would finish Form P 
in 40 minutes and whether reading speed might 
not be a factor in determining score. Judged 
from the scaled score conversion tables, Form 
S is much more difficult than Forms P and Q 
and might also prove useful at a higher level. 

If this reviewer were teaching biology, he 
probably would be willing to use any or all of 
these tests in his classes, but he would much 
prefer a series of tests each of which explored, 
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in thorough-going fashion, achievement of one 
important objective, the series to cover all the 
major objectives that could be evaluated by 
such devices, and the results to be susceptible 
to plotting in the form of a diagnostic profile. 
Is there no demand for such a series? Are 
most teachers satisfied with merely a "Biology 
Test," the prime requisites of which are that it 
samples a little of everything and can be gotten 
through with in a class period ? 


For reviews by Francis D. Curtis, George 
W. Hunter, and Ralph W. Tyler of earlier 
forms, see 38:907 and 40:1585. 

[ 551 ] 
XCooperative College Biology Test. College; 1943; 
IBM;.Provisional Form T; separate answer sheets 
need not be used; $2 per 25; 25¢ per specimen set, 
postpaid; so¢ per 25 machine-scorable answer sheets ; 


30¢ per stencil for scoring answer sheets; 90(95) min- 
utes; Carl A. Pearson; Cooperative Test Service. 


Clarence H. Nelson, Examiner in Biological 
Science, Board of Examiners, Michigan State 
College, East Lansing, Michigan. This test 
measures a representative sampling of the con- 
tent of a traditional college course in general 
biology. Tentative end-of-year norms based on 
667 cases in 16 colleges accompany the test and 
enhance its value when it is used as a basis for 
assigning marks or classifying students, particu- 
larly in a small college where the number of 
students tested is not large enough to insure a 
normal distribution. 

The test requires 90 minutes of actual writing 
time and is divided into three parts. Part I 
consists of 70 multiple-choice items, most of 
which are of the five-response type. Of these, 
approximately one-third test for simple recall of 
memorized factual information. The remaining 
two-thirds, however, involve some elements of 
understanding and, in a few instances, applica- 

.tion to new situations. Item 40 would be re- 
garded as one of the best in this section because 
. selection of the correct response is dependent 
upon a certain amount of biological information 
combined with the understanding of a vital bio- 
logical principle and the ability to apply these 
to a real situation. Part II is made up of 57 
matching-type items, the majority of which are 
definitions of terms, identification of processes, 
and classification of organisms. Selection of the 
Correct responses is dependent almost wholly 
upon recall of memorized factual information. 
The 47 items comprising Part III are based 
upon diagrams of biological structures. In each 


case one group of items accompanying the dia- 
gram call for identification of structures desig- 
nated on the sketch while the rest of the items 
involve selection of the functions of specific 
structures. Since the laboratory studies in the 
traditional college biology course attach consid- 
erable significance to this type of activity, Part 
III of the test justifiably reflects the outcome of 
that kind of teaching. 

The kind of test selected will necessarily be 
determined by the approach and emphasis of 
those who have taught the area, the outcomes 
of which are to be measured. If the learning of 
factual information has received major empha- 
sis throughout the course, the Cooperative Col- 
lege Biology Test should serve as a very 
satisfactory instrument for measuring how thor- 
oughly the course content has been mastered by 
the students. If, on the other hand, the approach 
has combined the mastery of content with the 
attempt to attain such objectives as: (a) a 
knowlege and understanding of principles, (5) 
ability to apply principles to new real situa- 
tions, (c) the formulation of hypotheses on the 
basis of experimental situations, (d) the evalu- 
ation of such hypotheses in the light of addi- 
tional evidence, (e) drawing conclusions, (p 
appraisal of the social significance of certain 
biological trends and legislation having biologi- 
cal implications, then the test should attempt to 
measure these outcomes. The Cooperative Col- 
lege Biology Test in its present form is inade- 
quate to such use. If the test author would 
undertake to develop a second companion book- 
let, of approximately equal length to this one, in 
which the items were so designed that they 
would measure the objectives listed above, the 
two booklets together might constitute a com- 
prehensive examination in the area of general 
biology. Such an examination could be used as 
a basis for granting advanced standing to col- 
lege students or prospective students irrespec- 
tive of how their biological science training was 
obtained. 


Joseph J. Schwab, Associate Professor of the 
Biological Sciences in the College and Exam- 
iner, The University of Chicago, Chicago, Illi- 
nois. In the sense that each recognized depart- 
ment and subfield of biology is represented by ah 
least one item, this examination has “coverage. 
In the sense that the aspects of each field treated 
are limited to the unexceptionable, pedestrian 
“facts,” this examination is technically sound. 


| 
| 
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No doubt, the capacity of the examination to 
provide a reasonably symmetrical distribution of 
scores without excess of platy- or lepto-kurtosis 
is, or will be, assured by appropriate trial and 
correction. In these senses, it is a good measur- 
ing instrument. 

However, as biology has outgrown naive sat- 
isfaction with accurate methods of measurement 
and has turned to the question of what it is one 
measures, so also has education in biology. And 
whether one judges the thing which this exam- 
ination measures by the criterion of the state of 
the science of biology or in respect to some view 
of the outcomes of education to be tested, this 
test is indefensible. Taken as a reflection of the 
science of biology, one would judge biology to 
be a compendium of informational tidbits with- 
out system or order and primarily concerned 
with giving names to things, parts, and proc- 
esses. Since biology is neither of these, the 
examination is a poor mirror indeed. If one 
prefers to judge the examination in terms of the 
outcomes which appear to be tested, the same 
conclusion is reached. The one outcome this 
reviewer can detect is best stated as the capacity 
to repeat tidbits set forth in an authoritarian 
manner without system and completeness on the 
one hand and without evidence or explanation 
on the other. 
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*Examination in Biology—College Level. College; 
1945; IBM; Form CBg-1-B-4 (usually called Form 
B) ; separate answer sheets must be used; $2 per 25; 
4o¢ per 25 machine-scorable answer sheets; 15¢ per 
scoring key; 25¢ per specimen set, postpaid; 120(125) 
minutes prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
PNE answer sheets; 50¢ per key; 5o¢ per specimen 
set. 


C. W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. The test is divided into three Sec- 
tions: Section I, Selection and Discrimination 
of Factual Material (70 questions) ; Section TI, 
Identification of Structures and Functions (30 
questions) ; and Section TII, Interpretation of 
Data (45 questions). Sections I and TI prob- 
ably are representative samplings of material 
commonly taught in college biology, but empha- 
sis seems very heavy on technical terminology 
and morphological minutiae. Some attempt 
should have been made to explore ability to 
apply important biological generalizations in 


realistic problem situations. Part of the space 
and student time now devoted to Section HI 
could have been used profitably for that pur- 
pose. 

Section ITI lacks logical tightness. Arriving 
at the keyed answers to some of the items ap- 
pears to be a game of “guess-what-I’m-think- 
ing” without the usual privilege of asking 
questions. Lack of defensibility arises primarily 
from semantic difficulties and uncertainty about 
the rules by which the game is being played. 
The game to be played is, in the words of the 
directions, to “confine yourself to the evidence 
given in the problem, even though you may be 
acquainted with other evidence.” The student 
is asked to classify each proposed interpreta- 
tion as either true, probably true, evidence not 
sufficient, probably false, or false. In this type 
of test the “true” and “false” items are presum- 
ably dependent on accurate reading of the tables, 
graphs and verbal descriptions. The “probably 
true” and “probably false” items depend on 
reasonable or cautious interpolation or extra- 
polation, and items which go beyond these lim- 
its are to be classed “evidence not sufficient.” 
What is, and what is not, “reasonable” inter- 
polation and extrapolation can become a point 
of contention. This test form is difficult to use, 
and the authors have not been very successful 
with it. 

An item under Problem III will serve to 
illustrate the logical difficulty encountered in 
defining the “other evidence” which we are in- 
structed to exclude. The problem gives data on 
the development (weight, length of wing feath- 
ers, feeding reflex, fear) of four starlings. 
Included in the description is the statement, 
“The average weight of the four eggs when laid 
was 7.5 grams, whereas their weight shortly 
before hatching was 6.2 grams.” Item 131 
states, “The loss of weight by the egg during 
incubation indicates that the shell is porous.” 
In judging this statement one is plagued by 
doubts, “Do the authors expect me to exclude 
all knowledge about evaporation, respiration, 
the water content of the egg, the nature of the 
shell, et cetera and mark “evidence not suffi- 
cient,” or do they expect a response of “true” 
or “probably true”? What do they mean by 
porous? Would they regard an animal mem- 
brane as porous?” The item is keyed “probably 
true.” But if one admits into the thinking proc- 
ess such a basic generalization as if a body loses 
weight something must have gotten out of it 
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and proceeds through other equally defensible 
items of common knowledge, is not the state- 
ment true? If one assumes he knows only what 
is given in the problem, is not the item to be 
keyed “evidence not sufficient" ? 

Item 132 serves as a contrast in keying. It 
states, “When the nestlings were first hatched 
they lacked wing feathers.” There is no entry in 
the table under wing feather length at the time 
of hatching or for four days thereafter. The 
reader is not informed whether observations 
were or were not made during that period. He 
knows only that on the fifth day wing feathers 
were 2 mm. long. The item is keyed “true.” 
But why not “evidence not sufficient,” or at 
most “probably true"? Absence of an entry in 
a table of data does not prove the absence of a 
phenomenon. 

Or consider an item in Problem I. The read- 
ing paragraph states, “When a worker finds a 
new source of honey as when a new species of 
flower comes into bloom, she returns to the hive 
and executes the so-called honey dance upon 
the combs. The other workers gather round 
and apparently identify the new source of nec- 
tar by its odor. They then leave in all directions 
in search of the new nectar source.” The test 
item states, “Workers cannot modify their be- 
havior to meet a change in conditions of nectar 
secretion.” Now, by the usual methods of in- 
terpreting the English language, “a change in 
conditions of nectar secretion” is described, 
and the workers did “modify their behavior” to 
meet the new conditions. The statement is, 
therefore, false. But, no, the item is keyed 
“evidence not sufficient." 

Item 116 of Problem II is stated in terms of 
“the best thing to do”; hence is a generalization 

.from the experiment reported. The data re- 
ported (percentage germination of seeds under 
several experimental conditions) are closely 
grouped at the point to which the item directs 
attention and show no consistent trend. Per- 
centage germination under the conditions re- 
quired in Item 116 falls only two or three per 
cent below germination under a widely con- 
trasting condition, yet the authors would have 
the student mark the item "false" on the basis 
of this slight difference. Are the authors not 
sensitive to the possible lack of reliability of this 
one experiment, to the danger of generalizing 
from it, or to the usual statistical approaches 
to confidence in data? Would not a skilled in- 
terpreter say "evidence not sufficient" to judge 


the truth or falsity of the proposed generaliza- 
tion? 

It is unfortunate that the test was published 
without more thorough criticism. Admittedly it 
is difficult to put into a pencil-and-paper device 
the essential elements of the processes that go 
on in the mind of a skilled interpreter of data, 
but tests using the form of response attempted 
here can be made reasonably defensible; for ex- 
ample, the P.E.A.'s Interpretation of Data (see 
535) and A. B. Crawford's verbal reasoning 
test in the Yale Educational Aptitude Tests 
(see 622). 

About 9o per cent of the items in Section IIT 
are defensibly keyed, and these do measure 
aspects of skill in reading and interpreting bio- 
logical data. Undoubtedly the test as a whole is 
more valuable than these negative criticisms 
would indicate. The bulk of the items are sound. 
A reliability coefficient of .89 for the entire test, 
based on 956 cases tested after two semesters of 
college biology, is reported by Lily Detchen of 
the Veterans Testing Service. Tentative norms 
are available. 
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X*Examination in Biology—High-School Level. 
Grades 10-11; 1945; IBM; Form SGB-1-B-4 (usually 
called Form B) ; separate answer sheets must be used ; 
$2 per 25; 50¢ per 25 machine-scorable answer sheets; 
30€ per scoring key; 354 per specimen set, postpaid; 
120(135) minutes; prepared by the Examinations Staff 
of the United States Armed Forces Institute ; published 
by the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
xe answer sheets; 50¢ per key; 50¢ per specimen 
set. 


REFERENCES 


„1. PETERSON, Suarzer. “The United States Armed Forces In- 
stitute Field Examination in High School Biology." Am Biol 
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C. W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. Designed, with the restricted Form 
A, for use in accrediting returning veterans in 
high school biology, the Form B examination 
should prove equally valuable for testing the 
achievement of their younger brothers and sis- 
ters. The test is, in this reviewer’s opinion, one 
of the few satisfactory tests—probably the best 
single test—now available for this purpose. 

The purpose, method of attack, and coverage 
of the test has been described by Peterson (1). 
The specifications that the test be interesting to 
adults and at the same time interesting and 
functional to roth and 11th grade students, 
that it sample widely from current popular ma- 
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terial, and that it avoid penalizing the student 
for lack of recent classroom and laboratory ex- 
perience seem to have been very well met. 
Construction of the test was approached 
through an excellent analysis of the objectives 
of high school biology, consultation with teach- 
ers and administrators, reference to research 
studies, and analysis of widely used textbooks 
and other standardized tests. As a realistic com- 
promise, those objectives susceptible to meas- 
urement in a pencil-and-paper test of this type 
were summarized into three broad “ability ob- 
jectives” which serve to delineate the three sec- 
tions of the test, namely : Section I, Basic Facts 
and Information (68 questions); Section 1I, 
Application of Principles (69 questions) ; and 
Section III, Reading, Interpreting Data, and 
Applying Scientific Method (27 questions). 
Each section attempts to sample three broad 
subject-matter areas: (a) structure and physi- 
ology of organisms; (5) health, hygiene, foods, 
conservation, and the interdependence of living 
organisms; and (c) genetics, heredity, evolu- 
tion, and plant and animal breeding. Part scores 
can be derived either on the basis of the ability 
objectives or on the basis of the areas of con- 
tent. 

While the fundamental difference between 
some of the questions of Section II and those 
of Section I is not apparent, most of the ques- 
tions of Section IT present a relatively complex, 
realistic problem in which prediction or ex- 
planation is required. These problems range 
from simple, direct questions (e.g., “Why is an 
explosion less apt to break one's eardrums if 
one's mouth is open ?") to lengthy descriptions 
which serve as the basis for a number of ques- 
tions. Line drawings help to set conditions in 
some of the problems. Introspection while an- 
swering the questions will show that one does 
recall and use many biological facts and gen- 
eralizations, and it seems appropriate enough to 
call the process "Application of Principles." 

Section III is given almost entirely to three 
large problems: (a) a long reading passage 
which serves as the basis for ten interpretive 
questions; (b) a graphical presentation of data 
with suggested interpretations which the stu- 
dent judges as being “supported by," “con- 
tradicted by,” or “neither supported nor 
contradicted by” the data; and (c) a situation 
involving concepts of heredity and natural se- 
lection followed by explanatory assumptions, 
each to be judged as to whether it helps or does 


not help explain the situation on the basis of 
modern biological concepts. One might question 
whether the mental operations involved here are 
not also “Application of Principles” and 
whether one can ever separate the application 
of information from “Interpretation.” These 
problems, however, do involve aspects of criti- 
cal thinking not explored in other sections, nor 
in any other high school biology test for that 
matter, and this marks a step in the right di- 
rection. 

It should not be inferred that the test is per- 
fect; a number of items appear to this reviewer 
to need patching. But, in general, the quality of 
the questions is high. It is, moreover, a genuine 
pleasure to find a test full of penetrating ques- 
tions on topics every intelligent adult should 
know about in contrast to the more typical test, 
overburdened with “naming things,” technical 
terminology, and details of interest only to a 
biologist. 

A Kuder-Richardson reliability coefficient of 
.89 based on over 1,200 cases tested at the end 
of a one-year course is reported by Lily Det- 
chen of the Veterans Testing Service of the 
American Council on Education. The test is a 
work-limit test requiring about two hours. Since 
the 164 questions require use of-both sides of an 
IBM answer sheet, machine scoring involves 
two runs of the papers. To obtain separate 
scores on the ability objectives would involve 
four runs, although this could be reduced to two 
if special answer sheets were printed. This is 
not a serious criticism. In fact, it is questionable 
whether part scores would be meaningful. No 
evidence is offered, aside from the face validity 
of the planning and of the questions that re- 
sulted from it, that part scores actually would 
be measures of relatively distinct aspects of. 
achievement, or even that they would be re- 
liable measures. More research on this point is 
needed. 

Although doubt must remain that one can 
draw any very sharp lines between “Facts and 
Information,” “Application of Principles,” and 
“Interpretation of Data,” the conscious inclu- 
sion of scientific method objectives in planning 
the test and the decision to devote separate 
sections of the test to them has resulted in a 
much better balanced test of general achieve- 
ment than can ever result when authors ap- 
proach test construction through the sterile— 
and we hope outmoded—method of merely ana- 
lyzing the information found in outlines and 


554 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 580 


textbooks. In both its planning and its execu- 
tion this test marks a big step forward, and its 
use is highly recommended. 


Richard H. Jordan, Associate Professor of 
Geology, Florida State University, Tallahassee, 
Florida, This examination in biology for the 
high school level is objective, adapted to ma- 
chine scoring, and in most respects orthodox in 
approach. It includes questions involving appli- 
cation of facts and principles as well as ques- 
tions of simple recall. The items cover the sub- 
ject matter commonly prescribed for high school 
biology, for example, as found in the New York 
State syllabus for general biology. The selection 
of items from the various phases of biology has 
been quite equally distributed and an equitable 
balance maintained between botany and zoology. 

The mechanics of good objective testing have 
been adhered to throughout the formation of the 
multiple-choice and key-list questions. The place 
of the correct response among the possible 
selections in the multiple-choice questions has 
been varied between questions in a chance dis- 
tribution ; the correct response cannot be chosen 
by the length or by qualifying phraseology; nor 
are the parts of the keys in the key-list questions 
confusing. The instructions to the student con- 
cerning procedures to be used in taking the test 
are clear and concise. 

However, the same care has not been taken 
in all cases in the wording of the items or in the 
scientific accuracy of the responses. The author 
of the test does not always say what he means. 
While it is true that in some instances the 
meaning of the question may be grasped and 
the desired response determined, the scholarship 
demonstrated in the construction is mediocre. 
A few examples may be cited. Question 7 asks 
what mineral is commonly added to the soil to 
counteract acidity. The desired response is lime- 
stone. But limestone is not a mineral nor are 
several of the foils. Question 12 asks, “Which 
of the following best describes the behavior of a 
cockroach as it runs away from the light?” 
Presumably the author wished to say, “The be- 
havior of a cockroach to light is an example 
of—" It is the cockroach's behavior to light, 
not his behavior as he runs from light, that is 
in question. The correct response to Question 76 
is C, “Pollen reaches the stigma, its sperm 
passes down the pollen tubes and comes in con- 
tact with the ovule which is thereby fertilized.” 
Aside from the confused grammar, fertiliza- 


» 2 


tion is not generally defined as the fusion of 
sperm nuclei and ovule. Some of these points 
may seem small and unimportant, but they are 
fully as important to exact science as the teleo- 
logical point which the student is asked to recog- 
nize in Question 159. 

Section IIT is titled, “Reading, Interpretation 
of Data and Applying Scientific Method." Con- 
cerning the reading passage, the reviewer be- 
lieves that such a passage, to be of value in an 
achievement test, should be so chosen that the 
student must have an understanding of the field 
of study to read it intelligently or that the items 
based on the passage should be so constructed 
that an understanding of the field of study is 
necessary for their correct solution. It would be 
interesting to know whether the reading passage 
in this examination could be handled apprecia- 
bly better by a student who had studied biology 
than by one who had not. 

Of the twenty-seven items in Section III 
there is only one which the reviewer believes 
could test anything but very limited aspects of 
the student's understanding of, or ability to 
apply, the scientific method (ie, Item 162). 
The whoie matter of testing for the scientific 
method is interesting and complex. If an at- 
tempt be made to incorporate this objective in 
a test, far more study should be placed on the 
construction of the items than appears to have 
been done here. 

And finally, Items 156 to 161 of Section III 
appear to be application of principles and thus 
more adapted to Section II, while Questions 
163 and 164 seem to require a mere knowledge 
of fact and thus would belong properly in Sec- 
tion I. 

Percentile norms and equivalence tables for 
Forms A and B have been prepared by the ad- 
ministration of the test to 1,268 students in 20 
schools. Raw scores and percentage of cases 
falling below the corresponding raw scores are 
listed. 
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*Examination in Botany—College Level. College ; 
1945; IBM; Form CBo-i-B-4 (usually called Form 
B) ; separate answer sheets must be used; $2 per 25; 
4o¢ per 25 machine-scorable answer sheets; 15¢ per 
scoring key; 25¢ per specimen set, postpaid; 120(125) 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
ES answer sheets; 50 per key; so¢ per specimen 
set. 
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C. W. Horton, Consultant in Educational 
Research, Dartmouth College, Hanover, New 
Hampshire. It is not difficult to say that this is 
the best published test available in college bot- 
any; it has no competitors. Some teachers of 
botany will like it, others will not. It exactly 
parallels, in its division into sections, and in 
number of items per section, the USAFI Ex- 
amination in Biology—College Level and pre- 
sumably was prepared by the same authors. The 
same broad criticisms apply: (a) it is heavy 
with technical minutiae; (b) it fails to explore 
ability to apply important facts and principles 
to realistic problems; and (c) relatively too 
much weight is given to the section on interpre- 
tation of data, which itself is open to criticism 
on several counts. It is, nevertheless, a reason- 
ably good test and should prove useful in differ- 
entiating between those who know much botany 
and those who know little. 

Some botanists will find details they do not 
like, for example, the technical terminology 
selected. Faced with the problem of making a 
"shotgun" test to cover study of different books, 
the authors had to compromise. They use ovary 
in one item, ovulary in another ; autophytic in- 
stead of the more widely used autotrophic; 
fibrovascular bundle instead of the more mod- 
ern vascular bundle. Perhaps this merely shows 
that, systems of teaching and botanical termi- 
nology being what they are, one can't please 
everybody. 

Others will point out what they regard as 
unfortunate choices of phraseology, items with 
more than one defensible answer, and items 
based on unimportant details. The following are 
examples: In Item 48 both pteridophytic and 
homosporous are defensible as the item is 
stated, but perhaps it is a justifiable test of 
knowledge to expect the student to select the 
more general characterization. Item 51 has two 
answers for students who have been taught that 
bacteria are fungi. Item 41 as stated has two 
plausible completions of the "because" state- 
ment differing in time of occurrence. Probably 
tendency to select the prior one is a good index 
of a thorough understanding of the situation. 
One learns from the key that Mendel's contri- 
bution to science was the discovery of "the 
independent assortment of genes" not "the 
factor hypothesis." There are those who will 
insist that the concept of the gene came much 
later in scientific history. Item 33, in defining 
placenta, forces the student to differentiate be- 


tween hilum and funiculus, details of little im- 
portance. In Item 69 the stele of the root is 
called a "fibrovascular bundle." Et cetera. 

More serious, in Section 1I one cannot tell 
where two label lines end, with the result that 
both Items 86 and 9o can be answered by either 
Choice A or Choice C. One infers from the 
answer key that "C" is meant to be the chloro- 
plast and “A” a pyrenoid, but the student does 
not have a key. Item 8o should read “a sperm 
nucleus" or “one of the sperm nuclei" not “the 
sperm nucleus," Et cetera. 

But Sections I and II are really better than 
the foregoing detailed criticisms might imply, 
because the bulk of the items are sound, and the 
sections as a whole are defensible in the light 
of the purposes for which the test was prepared. 
The defects, however, will condition acceptance 
of the test by botanists, most of whom are strict 
perfectionists. It is unfortunate that the test was 
not more widely circulated for criticisms by 
professional botanists before publication. One 
might moralize on the too frequent failure of 
test technicians to take this important step in 
preparing subject matter examinations. 

Section III is cut from the same cloth as the 
corresponding section of the USAFI Examina- 
tion in Biology—College Level and similar ex- 
amples of questionable keying could be cited, 
The bulk of the items are defensible, however, 
and the sections should give some measure of a 
student's ability to read accurately graphs, 
tables, charts, and verbal descriptions ; to recog- 
nize unwarranted generalizations from the data 
presented ; and to refrain from confusing things 
he knows, or thinks he knows, with the data 
given. These are important aspects of critical 
thinking, and teachers of botany prize the de- 
velopment of them in students. In this test 
form, however, doing them successfully does 
not depend on an understanding of the facts and 
principles of botany because the student is told 
to rule out all “other evidence.” In effect he is 
told not to use anything he might know about 
plant science. This suggests an important ques- 
tion. In testing achievement in a subject of 
study, would it not be better to use a test form 
in which the possession of accurate knowledge 
of the facts and principles of the subject actu- 
ally is helpful to the student in interpreting the 
data? Or would that have to be called “applica 
tion of facts and principles”? Whatever one 
calls it, it is missing from this test and should 
be represented. 
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The test apparently measures consistently 
what it does measure. A reliability coefficient 
of .90 based on 204 cases tested at the end of 
two semesters of college botany is reported by 
Lily Detchen of the Veterans Testing Service. 
"Tentative norms are available. The test is rec- 
ommended for use in botany classes with the 
reservation that the user may wish to decide 
which items he will count in scoring and which 
he will omit. 


Clarence H. Nelson, Examiner in Biological 
Science, Board of Examiners, Michigan State 
College, East Lansing, Michigan. This test is 
designed to measure the degree of mastery of 
elementary botanical information students have 
achieved. While this examination can be used 
to rank or classify students on the basis of their 
knowledge of elementary botanical information, 
it is not sufficiently inclusive to warrant its use 
as a comprehensive examination as the sole 
basis for assigning a final grade or for granting 
a year’s college credit in botany for informal 
training obtained elsewhere than in the class- 
room and laboratory. The examination is objec- 
tive and well adapted to machine or stencil 
scoring. If the special answer sheets are used, 
the same booklets may serve for a half dozen or 
more sections. The format could be improved 
by the printer if the type were not reduced quite 
so much. Ten to twelve multiple-choice items 
per page instead of eighteen would tend to re- 
duce eyestrain and nervousness on the part of 
the testees who must read and concentrate upon 
this material for a two-hour period. 

The examination is divided into three sec- 
tions. Section I consists of 70 four-choice items 
which involve the selection and discrimination 


of factual material—to a large extent recogni- ` 


tion of definitions. The 30 items which consti- 
tute Section II involve the identification of 
structures in diagrams and the recognition of 
their functions. These items also, like those in 
Section I, measure to a large extent the simple 
recall of memorized factual material. 

Section III consists of 45 items based upon 
graphical and tabular material and is designed 
to measure the student’s ability to interpret 
data. This section, however, does not test either 
a knowledge of or ability to apply botanical 
principles. A testee who has never had a day of 
botanical training but who is skillful in reading 
the English language and adept at following 
directions would be able to earn just as good a 


score on Section III as a highly trained bota- 
nist. This statement is not mere speculation but 
is based upon an actual tryout by this reviewer 
with people having precisely these qualifica- 
tions. One of these testees who had never had 
any botanical training answered all except five 
items out of the forty-five in Section III cor- 
rectly, and four of the five items missed by 
him are faulty items. The faulty items are 105, 
126, 131, and 132. The first three of these items 
require either interpolation or extrapolation of 
data which is not strictly in accord with the 
directions, Item 132 is unanswerable as it is 
now stated; none of the five key situations ap- 
ply. There is no doubt but that the qualities 
measured by Section III (ability to read Eng- 
lish precisely and ability to follow involved 
directions exactly) are skills which a good 
botanist ought to possess. Ordinarily, however, 
one measures these traits by means of a reading 
test. 

The United States Armed Forces Institute 
Examination in Botany, in its present form, 
would be more useful as a pretest than as a final 
examination. 


[ 555 ] 


Wisconsin Biology Test. High school; 1937-39; 
Forms A, B; Form A has the title Wisconsin Achieve- 
ment Tests: Biology; $1.25 per 25; Form A, 50(55) 
minutes; Form B, 60(65) minutes; prepared by the 
Bureau of Guidance, School of Education, and the 
Wisconsin Biology Test Committee: Vidamae Bunting, 
Ira C. Davis, Melvin Fuszard, Philip Krause, Mabelle 
Jamieson, Sylvia Jorgenson, Ruth Ledwell, H. G. Mc- 
Mullen, Elizabeth Perkins, J. W. Rhodes, H. W. 
Schmidt, O. J. Sieker, Marie Stainer, and Ruth Still- 
man; manual by T. L. Torgerson and G. J. Froehlich; 
E. M. Hale and Co. 
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XA.C.S. Cooperative Biochemistry Test. College; 
1947; IBM; Form X; separate answer sheets must be 
used; $2 per 25; 40€ per 25 machine-scorable answer 
sheets; 15¢ per stencil for scoring answer sheets; 25¢ 
per specimen set, postpaid; 100(110) minutes; pre- 
pared jointly by the Cooperative Test Service and the 
American Chemical Society through its Division of 
Chemical Education; C. W. Bennett, Lisabeth Beynon, 
E. R. Binnewies, Raymond Borcher, H. E. Carter, H. 
T. Coggin, Harriman H. Dash, Harry G. Day, Ed. F. 
Degering, Isidor Greenwald, B. L. Johnson, Harold G. 
Oddy, Imogene Manning, G. Paul, Gordan H. Pritham 
(Chairman), Florence Sargent, Margaret C. Swisher, 
E S. Vestling, and Leonard Wolf; Cooperative Test 
ervice. 


REFERENCES 


1, PrITHAM, Gorvon H. “Survey of Courses and Construc- 
mn of. Tests in Biological Chemistry." J Chem Ed 22:844- 
45. 
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XA.C.S. Cooperative General Chemistry Test fo: 

College Students. College; 1946; TBM: Form RA 
separate answer sheets need not be used; $2 per 25; 25¢ 
per specimen set, postpaid ; 40€ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets ; 110(120) minutes; prepared jointly by the Co- 
operative Test Service and the American Chemical 
Society through its Division of Chemical Education; 
Theodore A. Ashford, B. Clifford Hendricks, and F. D. 
Martin; Cooperative Test Service, 


REFERENCES 

Pag rena M S. “The 1939-1940 College Chemi. 
Med p ee eee 
pe set eR Se cer 
Program 1941-1942 I Chem Bd soar Me age 

Sidney J. French, Dean of the Faculty and 
Professor of Chemistry, Colgate University, 
Hamilton, New York. The A.C.S. Coopera- 
tive General Chemistry Test for College Stu- 
dents is by no means new. Prepared jointly by 
the Cooperative Test Service and a committee 
of the Division of Chemical Education of the 
American Chemical Society, it has been used 
for more than a decade by many teachers of 
general college chemistry. The 1946 form main- 
tains the high quality characteristic of earlier 
forms. The five parts cover: general knowledge 
and information, application of principles, quan- 
titative application of principles, scientific 
method, and knowledge of laboratory technique 
and procedure. The test provides a desirable 
basis for evaluating both the students and the 
course in general chemistry against recognized 
standards. Supplemented by the Cooperative 
Chemistry Test in Qualitative Analysis, it 
serves for those general courses which include 
consideration of material from qualitative 
analysis. 

The reviewer believes that more prominence 
might well be assigned to chemical information, 
including more realistic questions on practical 
applications. Metallic sodium, for example, is 
not prepared commercially by the electrolysis of 
molten sodium chloride as is indicated in Item 
23, but by the electrolysis of fused sodium hy- 
droxide. Likewise in Item 28, one would be 
inclined to question the omission of “phospho- 
rus” or “nitrogen” and the indication of “potas- 
sium” as the best answer to the item, “Artificial 
fertilizers are added to the soil to make up for 
deficiency of Ay 

The authors have made every effort to keep 
the test up to date with respect to the advance 
of chemical theory. On the other hand any test 
which is to be widely used must conform rea- 


sonably to the mean of such advances and can- 
not be far out in front. Possibly an optional 
section on newer developments such as the 
Bronsted nomenclature, use of dipole moment 
data, problems in electronic structure, and 
other recent concepts which are slowly finding 
their way to the elementary level might be 
helpful both to hasten such transitions as are 
worth while and to permit the further exchange 
of teaching ideas. 


Florence E. Hooper, Assistant Professor of 
Chemistry, The George Washington Univer- 
sity, Washington, D. C. This test is essentially 
a revision of the United States Armed Forces 
Institute Examination in General Chemistry 
(see 565). From 50 to 80 per cent of the items 
in the sections on general knowledge and in- 
formation, application of principles, quantita- 
tive application of principles, and scientific 
method are identical with items in the USAFI 
examination. A considerable number of items 
in the two examinations differ only slightly. 

Both tests satisfactorily cover all the theory 
which is usually studied in the first year of 
college chemistry. In both, the problems used 
have the advantage of placing the emphasis on 
handling the principles involved rather than on 
skill in arithmetic. Here as in the USAFI ex- 
amination, the number of items dealing with 
specific chemical reactions is less than the re- 
viewer considers desirable. Those reactions 
which are considered are reactions which are 
universally studied. In this respect this test is 
superior to the USAFI examination. This test 
differs from the USAFI examination in con- 
taining a 10-item section on knowledge of labo- 
ratory technique and procedure. 

The test is, in general, clearly written and 
free of errors. However, as mentioned in the 
review (see 565) of the USAFI examination, 
Items 110 to 116 (100 to 106 in USAFI test) 
will probably be confusing to the student, The 
fact that a considerable number of the items 
involve a large amount of reading may be ex- 
pected to result in a high number of reading 
errors. 

In the opinion of the reviewer, the time (110 
minutes) allowed for the test (130 items) is 
inadequate. Experience has shown that failure 
to allow sufficient time to carefully evaluate 
each item results in the acceptance of plausible 
wrong answers and even in wild guessing. It 
is interesting that less time is allowed for this 
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test than for the similar but shorter USAFI 
examination. 


[558 ] 

X*A.C.S. Cooperative Organic Chemistry Test. 
College; 1944-46; IBM ; Forms U, W ; separate answer 
sheets need not be used ; $2 per 25; 25¢ per specimen set, 
postpaid; 40¢ per 25 machine-scorable answer sheets; 
15¢ per stencil for scoring answer sheets; 100(110) 
minutes; prepared jointly by the Cooperative Test 
Service and the American Chemical Society through its 
Division of Chemical Education; C. T. Bahner, J. F. 
Bartlett, A. C. Blackwell, Theodore M. Burton (Form 
W), H. T. Coggin, Ed. F. Degering (Chairman), O. C. 
Dermer, W. T. Hatcher (Form U), W. N. Jones, Jr., 
P. J. Moore, G. H. Pritham, E. B. Reid, L. F. Supple, 
and L. V. Weeks ; Cooperative Test Service. 


REFERENCES 


1, Decertne, E. F, “Cooperative Objective Tests in Organic 
Chemistry." Letter. / Chem Ed 18:95-6 F '41. * 


Shailer Peterson, Director of Educational 
Measurements, Council on Dental Education, 
American Dental Association; and Research 
Associate, The University of Chicago; Chicago, 
Illinois. [Review of Form W.] This test is 
divided into two parts. Part I contains the 
material usually encountered in the first se- 
mester of an organic chemistry course. Part II 
covers the content of the second semester. This 
type of division is practically synonymous with 
division on the basis of the aliphatics and the 
aromatics. Each of the two parts is divided into 
three sections, namely: (a) general information 
(15 minutes), (b) application of principles (15 
minutes), and (c) problems (20 minutes). 
There are fifteen items in each of the six sec- 
tions in the test, making a total of go items. 
Each item in the available forms of the test 
may have “several answers, only one answer, 
or no answer.” Actually, in Form W, all of the 
items have at least one correct response, and 

` in five items, all of the five responses are cor- 
rect. The total go items have 227 correct re- 
sponses. The test is scored by responses rather 
than by items, so in effect the test includes not 
go items but rather 450 items of the true-false 
type. The scoring directions require a correc- 
tion for guessing. Scores on the six sections 
may be obtained by machine scoring, using two 
scoring stencils and inserting each paper in the 
machine twice. The single scoring stencil sup- 
plied by the publisher is said to be perforated 
for scoring the first three sections and has in- 
structions for changing and adapting it for the 
second grading. However, the perforations for 
part scoring have not been correctly made. No 
norms are available on Form W, the most re- 
cent and reportedly the best of the forms. 


The division of the test into parts represent- 
ing semesters is very fortunate for those pro- 
fessions requiring, for example, a half year of 
organic chemistry. This also would allow half 
the test to be used at the end of each semester. 
The plan for dividing the test into an infor- 
mation or “immediate recall” section as well as 
into sections requiring reasoning is the trend 
for all new examinations. The greatest step in 
the direction of part scores and profile records 
was probably made during World War II 
when the Armed Forces Institute Examina- 
tion Staff built their tests using this philosophy 
of testing and reporting. 

The subject matter coverage in the test is 
excellent, and a comparison of the items with 
those found in other organic tests reveals this 
to be one of the best, if not the best, organic 
chemistry test commercially available. It is 
hoped that norms may be made available on 
Form W soon, along with a description of the 
group or groups from which the data have beeri 
gathered. 


[ 559 ] 
*A.C.S. Cooperative Physical Chemistry Test. 
College; 1946; IBM; Form W ; separate answer sheets 
must be used; $2 per 25; 40¢ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 25¢ per specimen set, postpaid; 110(115) min- 
utes; prepared jointly by the Cooperative Test Service 
and the American Chemical Society through its Divi- 
sion of Chemical Education; T. A. Ashford, W. M. 
Baker, G. H. Chapman, J. S. Coles, M. Denise, D. A. 
Dreisbach (Chairman), F. B. Dutton, E. L. Haenisch, 
E. Krause, F. D. Martin, R. H. Newton, B. M. 
Norton, Maurice Palmer, W. J. L. Wallace, H. E. 
Wilcox, and P. M. Wright; Cooperative Test Service. 


Alfred S. Brown, Associate Professor of 
Chemistry, Colgate University, Hamilton, New 
York. This test provides a measure of a stu- 
dent's mastery of the subject if the course 
treatment weights the several topics appropri- 
ately. In this sense, the test may be said to 
define the course, rendering a comparison of 
class scores among different colleges of doubt- 
ful value in the usual sense. However, this in- 
version should in itself make the test of interest 
to many teachers. Significant ambiguities were 
noted in the statements of only two questions 
(in a total of 65)—Items 2 and 20. 

This reviewer believes that cooperative test- 
ing is especially significant in a field such as 
physical chemistry, where course content can 
vary so widely. For example, texts in this sub- 
ject differ much more than do texts in, say, 
general chemistry. A substantially more uni- 
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form undergraduate treatment of the divisions 
of physical chemistry will facilitate the devel- 
opment of these subjects at the graduate level. 

From the viewpoint of industrial practice, a 
test composed of multiple-choice questions (as 
is this one) is highly desirable. The industrial 
chemist is often faced with a situation requiring 
immediate action along one of several alterna- 
tives. The training value of such test questions 
is therefore unmistakable. However, this re- 
viewer believes that the choices offered the stu- 
dent in many of the problems of this test are 
highly unrealistic. One can only hope that his 
students, when called upon to use the Clausius— 
Clapeyron equation, will refer to a suitable 
source for the equation and will then solve it 
in an orderly fashion rather than set down sev- 
eral alternate rearrangements, both correct and 
incorrect, and attempt to recognize the correct 
solution. 

The reviewer must add that consideration of 
this test has been very helpful to him, and he 
recommends it for the attention of all teachers 
of undergraduate physical chemistry. 


[ 560 ] 
Chemistry: Every Pupil Test. High school; 1946- 
47; new form usually issued each April and December ; 
forms December 1946, April 1947; 2¢ per test; 1¢ per 
scoring key; 40(45) minutes; Ohio Scholarship Tests, 
Ohio State Department of Education. 

[ 561 ] 
Cooperative Chemistry Test. High school, 1939-47 ; 
IBM; Forms P, Q, S, X; separate answer A uum 
not be used; $1.75 per 25; 25¢ per specimen set, post- 
paid; 40é per 25 machine-scorable answer sheets; 15€ 
per stencil for scoring answer sheets ; 40(45) minutes ; 
Form P: Victor H. Noll and S. R. Powers; Form Q: 
Alexander Calandra; Form S: Charles L. Bickel, W. 
Gordon Brown, Robert N. Hilkert, C. S. Hitchcock, 
H. H, Loomis, and Carl A. Pearson; Form X : Paul J. 
Burke; Cooperative Test Service. 


REFERENCES 

1. Rice, Cectt L., anp Vinton, KeNNETR W. "An Experi- 
ment in Teaching Chemistry for Testing." Sch & Soc $2:43-5 
Jl 20 '4o. * (PA 14:5717) 

John H. Daugherty, Examiner in Physical 
Science, Board of Examiners, Michigan State 
College, East Lansing, Michigan. [Review of 
Form S.] This test contains a total of 76 mul- 
tiple-choice, 5-response questions divided into 
two sections, each having a time limit. 

About 30 of the 55 items in Part I are fac- 
tual but refer to well-chosen, important chemical 
knowledge. Others (15) deal with the acquaint- 
ance with definitions, general principles and 
theories and the understanding of chemical 
laws. There are 9 items involving interpreta- 
tions, laboratory techniques, and the under- 


standing of laboratory procedures, Only 2 
questions refer directly to valence and a like 
number to atomic theory, although a working 
knowledge of valence is required in 4 items in 
Part II. A few more items with direct refer- 
ence to atomic theory and oxidation might be 
desirable. However, reduction and reducing 
agents are mentioned a total of five times in 
the responses to various questions. 

Part II has 21 questions largely related to 
testing skills in the quantitative applications of 
chemical principles and interpretation of labo- 
ratory procedures. There are 6 items on bal- 
ancing equations, some in which the reactants 
are given, others where the student must com- 
plete the right-hand side. These are very well 
chosen. The student responds by indicating the 
sum of the coefficients of the substances in- 
volved. Mathematical procedures are tested in 
6 items such as percentage composition, for- 
mula determination, weight-volume relation- 
ships, and the weight of a liter of a given gas. 
As these are at the end of Part II, for which 
only 15 minutes is allowed, some capable stu- 
dents may not have time to show that they can 
do them. There are 6 questions covering labo- 
ratory techniques, several asking for the pre- 
diction of results or the prediction of the sub- 
stance causing the result. Question 4 covers 
valence changes, and Question 8, the gas laws. 

The test as a whole is good and gives reason- 
able coverage of secondary chemistry courses. 


For reviews by Charles L. Bickel, Edward E. 
Cureton, Louis M. Heil, and W. B. Meldrum 
of earlier forms, see 38:932 and 40:1592. 


[ 562 ] 
*Cooperative Chemistry Test in Qualitative 
Analysis. College; 1943-46; IBM; Form T; separate 
answer sheets must be used; Part 3, Chemical Equili- 
brium, is entitled Chemistry Examination in Qualitative 
Amalysis, USAFI Supplement Form CQ1-1-B-4 
(1946) ; $2 per 25; 40¢ per 25 machine-scorable answer 
sheets; 15¢ per stencil for scoring answer sheets; 25¢ 
per specimen set, postpaid ; 60(65) minutes for Parts 
1-2; prepared jointly by the Cooperative Test Service 
and the American Chemical Society through its Divi- 
sion of Chemical Education; Alexander Calandra, 
Fred A. Heath, and Rufus D. Reed; Cooperative Test 
Service. 
REFERENCES 


1. Ree, Rurus D.; ConrzLvou, W, P.; AND CALANDRA, ALEX- 
Axpen "Aims or Objectives of Qualitative Analysis." J Chem 


Ed 17:220-5 My '4o. * 

William B. Meldrum, Professor of Chem- 
istry, Haverford College, Haverford, Pennsyl- 
vania. This test, together with the supplemen- 
tary Armed Forces Institute test, covers quite 
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satisfactorily the usual course in qualitative 
analysis. The separation of Part 1, Descriptive 
Information, into two parts, one dealing with 
the principles of the course, the other with the 
laboratory work, might be considered. The 
representation of concentrations by two dif- 
ferent methods, by “C” in Part 2 and by en- 
closing the formula of the substance in square 
brackets in Part 3 will be confusing to the stu- 
dent; one method should be used consistently 
throughout the test. The substitution of the 
modern representation of the hydrogen ion, 
H40*, for the Arrhenius symbol, H+, is prob- 
ably advisable. Precision with which data are 
given should be checked ; it is faulty in a num- 
ber of items. A few detailed criticisms in con- 
nection with individual items follow. 

Items 5, 6, 7, 8: I suggest dissolving in- 
stead of “solution” as being less ambiguous. 

Item 9: The wording should be altered here 
to remove ambiguity. I suggest: “. . . of barium 
chloride im a solution containing a strong 
acid..." 

Item 19: “The “pH values" are not “listed.” 
I suggest: “a lower pH value than that of any 
other substance listed.” 

Item 20: The same change. 

Items 36, 40, 42: Delete “(ous)” and “(ic)”; 
the student should know the ions. 

Item 44: Delete “anhydrous” ; it is redundant. 

Item 47: Change to read: “...is 1 X 10-14 
at 25° C.” It holds only for this temperature. 

Item 48: “K,” is to be preferred over “Ka” 
for solubility product. 

Items 47, 48: Change “moles per liter" to 
“mole per liter"; the latter is correct if we are 
dealing with less than 1 mole per liter (cf. 
Item 64). 

Items 68, 69, 71: Such representations as 
"Kmuojgog" are awkward. Why not "K, for 
HC,H;0,”? 

Item 73: I suggest a change in order : “Given 
that the solubility of AgsCrO, is . . . what is 
its solubility product?” 


William Rieman III, Professor of Analytical 
Chemistry, Rutgers University, New Bruns- 
wick, New Jersey. The examination is divided 
into three parts. Part 1 consists of 43 questions 
of the multiple-choice type. These questions are 
well chosen to test the student’s knowledge of 
the facts of qualitative analysis. The only am- 
biguous question that the reviewer could find 
is Item 8: “The solution of Zn in HCl depends 


upon ." The examiners consider the cor- , 
rect answer to be Choice 3: “the formation of 
a slightly soluble gas.” However, Choice 4: “an 
oxidation reduction reaction” is equally cor- 
rect. The reviewer doubts if the allowed time 
(20 minutes) is sufficient for this part. 

Part 2 consists of six problems and seven 
questions in completing chemical equations. In 
both cases the student selects the correct answer 
from the five suggested answers. The problems 
are very simple—too simple to constitute an 
adequate test of this aspect of qualitative analy- 
sis. However, the inclusion of Part 3 in the 
test would provide a sufficient number of prob- 
lems of suitable difficulty. 

There is some possibility of ambiguity in 
Item 48: “If the Ksp for NiS is 1.4 X 107%, 
the largest possible value for the Cy;t++ in a 
solution where the Cs— equals 1 X 10-7? 
per liter is ." The words “at equilibrium" 
should be inserted after "Cy,---." 

Item 49 gives the solubility product of silver 
sulfide and asks the student to calculate the 
concentration of silver ions in a saturated solu- 
tion of this salt. The examiners consider as 
correct the answer obtained by neglecting the 
very extensive hydrolysis of the sulfide ion! It 
may be true that a problem which involves the 
simultaneous application of the principles of 
hydrolysis and solubility product is too difficult 
for the student of qualitative analysis. Then the 
data of problems on solubility products should 
be chosen from those salts which are not sub- 
ject to extensive hydrolysis. If the integration 
of the content of the various courses of a col- 
lege curriculum is desirable, should we not also 
strive to integrate the contents of a single 
course? The student should not be encouraged 
to believe that an ion gains immunity from 
hydrolysis by establishing an equilibrium with 
a precipitate. 

Part 3, Chemical Equilibrium, appears on 
the Supplement Form CQr-1-B-4. It consists 
of 17 questions and problems, all of the mul- 
tiple-choice type. In some of the problems the 
student has to compute the answer in order to 
select the correct one of the suggested five an- 
swers. In these problems he is warned of an 
arithmetical error by his failure to check any 
of the suggested answers. In most of the prob- 
lems the answers are given with the arithmeti- 
cal steps indicated but not performed, such as 


Vo.1 X 7.7 X 10-33. A student who would 


mole 
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» be unable to solve a problem in a conventional 
type of examination may be reminded of the 
correct approach by one of the five suggested 
answers. 

[ 563 ] 

X*Cooperative Chemistry Test in Quantitative 
Analysis. College; 1944-47; IBM; Forms U, X ; sepa- 
rate answer sheets must be used; $1.75 per 25; 40$ per 
25 machine-scorable answer sheets; 15€ per stencil for 
scoring answer sheets; 25¢ per specimen set, postpaid ; 
90(95) minutes; prepared jointly by the Cooperative 
Test Service and the American Chemical Society 
through its Division of Chemical Education; E. K. 
Bacon, C. W. Bennett (Form U), P. E. Clark (Form 
X), Edward L. Haenisch, J. J. Marcus (Form U), 
C. S. Martin, Gordon H. Pritham, Ralph L. Van Peur- 
sem (chairman), and Bruce V. Weidner; Cooperative 
Test Service. 

William B. Meldrum, Professor of Chem- 
istry, Haverford College, Haverford, Pennsyl- 
vania. [Review of Form U.] This test, as a 
whole, is highly satisfactory. It is well con- 
structed and well balanced and touches on the 
majority of the most important topics covered 
in the usual elementary course in quantitative 
analysis. The various items have been very 
carefully thought out and, for the most part, 
are free from ambiguity. Many of the items 
can properly be described as penetrating and 
will serve admirably to differentiate between 
the keen and thoughtful student and the one 
who is more dependent upon mere memory. 
Some items call for discriminating considera- 
tion by the student before indicating his answer, 
and so it is possible that the time allowed for 
the test is rather short. This, however, can be 
determined by actual trial. 

There are a few general criticisms of an 
adverse nature that seem called for. First, more 
items dealing with gravimetric procedures 
might be included, although it is admitted that 
they would be difficult to devise. Second, the 
complete absence of items concerning equiva- 
lent weights, an extremely important topic in 
volumetric analysis, is unfortunate. Third, 
parts of the test might well be modernized by 
using the symbols of the present-day ionic the- 
ory. Just when the old should be "doffed" and 
the new “donned” is a matter of opinion, but 
these modern tests might well consider being 
up to date in this respect. This point is men- 
tioned again in the detailed criticisms of indi- 
vidual items which follow. 

Item 7: Change to read: “ . . . may have a 
pH less than 7 at 25° C." A given pH value 
lacks precision if the temperature is not in- 
dicated. 


Item 13: To improve English, change to 
read: “If the atomic weight is 132.91 and that 
Df Glisten 

Item 15: Not good; eliminate. 

Item 18: Am not sure of the answer here. 
Certainly Choice 5 is a correct answer if it is 
assumed that the ignition is done properly. 

Item 19: Although the student should know 
the answer, this item is vague as it stands. I 
suggest: “Which of the following ions must 
be absent when determining Cl- by precipita- 
tion with Ag+ ?" 

Item 24: For precision I suggest: "The ad- 
dition of a substance containing a common 
ion..." 

Item 28: I missed the “AB” in Choice 3 and 
so will others. Why not use the more conspicu- 
ous AgCI or BaSO4? 

Item 30: Change "silicic acid" to silica in 
accord with most quantitative analysis texts. 

Item 32: To improve the English, I sug- 
gest: ^. . . in the same form as that in which 
they...” 

Item 38: Correct spelling of “phosphorus.” 

Item 41: I suggest sulfate in place of “sul- 
fur”; sulfate, not sulfur, is the substance pre- 
cipitated. 

Item 42: Since precision is being emphasized 
in various items in the test, it should be ob- 
served also by those setting the test. In this 
item to say “exactly 250 ml." doesn’t mean 
“exactly 250 ml."; it means, judging from the 
other data given, “250.0 ml," and it should 
be so given. 

Items 53-55: This type of item belongs on 
the elementary level. Also, we know that TES 
is wrong and that “H,O*” is probably right. 
In this respect the test should modernize. 

Item 56: Substitute “oxidation indicator” 
for the seldom-if-ever-used “chemical indicator.” 

Item 57: This is not clear. Iodine is evolved 
in Choices 1, 3, and 5 and could be in Choice 
2, I suggest: “Select the compound or ion 
which, in a solution containing KT, will liberate 
iodine.” 

Item 59: This also lacks clarity. I suggest: 
“A suitable primary standard for titration with 
H,SO, solution is:” 

Item 65: This one also is not clear. I sug- 
gest: "Select the substance whose solution 
would serve for comparison of the concentra- 
tions (or normalities) of NaOH and KMnO, 
solutions." 

Item 67: Not everyone uses “millimoles” ; 
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why not give moles in parentheses, thus: 
"... 4.500 millimoles (0.004500 moles) .. .” 


William Rieman III, Professor of Analytical 
Chemistry, Rutgers University, New Bruns- 
wick, New Jersey. [Review of Form U.] The 
test is divided into five sections: A, general; B, 
principles and laboratory applications of gravi- 
metric analysis; C, problems in gravimetric 
analysis; D, principles and laboratory applica- 
tion of titrimetric analysis; and E, problems in 
titrimetric analysis. Each one of the 75 ques- 
tions involves a choice of the correct answer 
from the five answers given on the examina- 
tion sheets. The questions are distributed ap- 
proximately equally among the five sections. 

The questions in Sections A, B, and D are 
well chosen, for the most part, although in 
some cases a strong argument can be presented 
in support of the contention that there is more 
than one correct answer. Thus, Item 4 consists 
in selecting the true statement about a desic- 
cator. The well-informed student should have 
no difficulty in rejecting Choices 2, 3, and 4. 
However, the choice between 1 and 5 presents 
a real difficulty. Choice 1 is “A desiccator is 
used to cool an object in a perfectly dry atmos- 
phere”; Choice 5 is “A desiccator should be 
airtight when closed.” Some colleges use alumi- 
num desiccators which are not perfectly air- 
tight when closed. The student who has used 

- such a desiccator can hardly be blamed if he 
decides that Choice 5 is wrong. Then only 
Choice 1 remains. The thoughtful student may 
be somewhat reluctant to accept Choice 1 as 
correct. However, the only objection to this 
answer is the word "perfectly." 

Again in Item 28, there are two correct or 
nearly correct answers. Choice 3 is “The solu- 
bility product constant for a salt is the square 
of its solubility in moles per liter, for a salt of 
the type formula AB." This statement is cor- 
rect unless the salt undergoes hydrolysis. 
Choice 5 is “The solubility product constant 
for a salt changes if a common ion is added." 
This statement is correct only if the solubility 
product is expressed in terms of activities. 

Item 16 is also unsatisfactory. Choice r is 
"For a satisfactory quantitative determination 
by gravimetric methods, the formula of the 
ignited precipitate must be known." This state- 
ment is incorrect because the composition of 
the ignited precipitate is of no importance if 
the precipitate is to be weighed after drying 


at 110°. On the other hand, the student can 
hardly be blamed for considering this state- 
ment to be correct. Choice 3 is “For a satisfac- 
tory quantitative determination by gravimetric 
methods, the precipitation reaction may be in- 
complete if it reaches equilibrium.” Upon super- 
ficial thought, this statement seems to be in- 
correct. Actually, however, it is correct because 
all precipitation reactions are theoretically in- 
complete. The examiners consider Choice 1 to 
be correct! : 

Coprecipitation, which is the most serious 
source of error in careful gravimetric analysis, 
is not adequately represented on the examina- 
tion. 3 

Sections C and E (problems) are not satis- 
factory. In most cases, the student is asked 
simply to select the correct mathematical ex- 
pression for the answer without performing 
any computations. In a few cases he is to cal- 
culate the correct answers and then mark the 
answer on the sheet that agrees with his. If his 
answer fails to check any on the sheet, he is 
automatically warned of an error in his cal- 
culations and thus has an opportunity to re- 
peat his calculations without suffering any 
penalty except the loss of time. 

The problems are all comparatively simple. 
The reviewer believes that more involved 
problems should be included, such as calculat- 
ing the quantity of calcium that remains in 
solution at equilibrium conditions when the 
excess of oxalate, the volume, and the hydro- 
gen-ion concentration are given. Problems of 
calculating the pH of buffer solutions should 
also be included. 

The time allowed for the test is so short that 
ability to read and think rapidly will contribute 
almost as much to the student’s success as a 
thorough knowledge of quantitative analysis. 
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*Examination in Chemistry—High-School Level. 
High school; 1944; IBM; Form SCh-1-B-4 (usually 
called Form B) ; separate answer sheets must be used; 
$2 per 25; 40¢ per 25 machine-scorable answer sheets; 
15¢ per scoring key; 25¢ per specimen set, postpaid ; 
120(125) minutes; prepared by the Examinations Staff 
of the United States Armed Forces Institute ; published 

the American Council on Education ; distributed by 
Cooperative Test Service. (Also distributed by Science 
Researclr Associates: $2 per 25; 65¢ per 25 machine- 
Ce answer sheets; 50¢ per key; so¢ per specimen 
set. 
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1. Heren, WiLLIAM, AND THELEN, Hersert A. "The High- 
School Chemistry Test of the Armed Forces Institute." J Chem 
Ed 21:507-15 O "44. * 


589 ] 


TESTS & REVIEWS: SCIENCE—CHEMISTRY 


565 


: V ictor H. Noll, Professor oj Education, 
Michigan State College, East Lansing, Michi- 
gan. The examination under review is one of 
the series produced during the war primarily 
for military use but now available for second- 
ary schools and colleges. It consists of five 
parts: Part I, Information; Part II, Skills 
Used in Solving Problems; Part III, Applica- 
tion of Principles in Familiar Contexts; Part 
IV, Application of Principles for Prediction 
and Explanation; and Part V, Application of 
Scientific Method. The procedures by which 
the examination was planned, constructed, and 
standardized are described in a general way 
in the manual accompanying all tests in this 
series and in great detail in an article (1) by 
two of the authors responsible for its develop- 
ment. All items in the test are of the multiple- 
choice type. Answer sheets, for either hand or 
machine scoring, are required. The different 
parts have no separate time limits. There are 
two forms, A and B, with a possible score of 
III and a total time limit of two hours on 
each. Percentile norms based on about 700 
cases are available for Form A, and equivalent 
scores are given for Form B. Form B seems 
on the whole to be somewhat more difficult 
than Form A, although this difference appears 
to reside almost entirely in Parts IV and V of 
the two forms. 

This examination gives the impression of 
being carefully planned and constructed. The 
five parts represent five generally accepted 
areas of accomplishment in high school chem- 
istry. Each area was analyzed for specific be- 
havior objectives, resulting in a list of 84 such. 
In most cases one test item was devised to test 
each objective, although in some instances, par- 
ticularly in Parts IV and V, certain items or 
problems carry more than one point of possible 
credit. An attempt was made to estimate the 
validity of the examination by having four ex- 
pert teachers of high school chemistry assess 
each item as to whether it measured what it was 
intended to measure. There is a high degree 
of agreement among these experts on the valid- 
ity of most items. Besides such subjective esti- 
mates, comparisons were made of the scores on 
the separate parts of the examination earned 
by the same pupils and also comparisons of 
part and total scores made by classes in chem- 
istry in widely separated schools. Great varia- 
tions are found in both types of comparisons, 
and this is interpreted as indicating that the 


separate parts of the examination "ueasure dif- 
ferent qualities, traits, or abilities This as- 
sumption, though possibly true, does not ap- 
pear necessarily so since differences in difficulty 
might also account for differences in score on 
separate parts even though these parts meas- 
ured the same objectives. No statistical data on 
reliability are given but it is stated that there 
is inferential evidence (Kuder—Richardson) of 
satisfactory reliability. 
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X*Examination in General Chemistry—College 
Level. First year college; 1944; IBM; Form CCh-2- 
B-4 (usually called Form B) ; separate answer sheets 
must be used; $2 per 25; 4o¢ per 25 machine-scorable 
answer sheets; 15¢ per scoring key; 25¢ per specimen 
set, postpaid; 120(125) minutes; prepared by the Ex- 
aminations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by Cooperative Test Service, (Also 
distributed by Science Research Associates : $2 per 25; 
65¢ per 25 machine-scorable answer sheets; 50 per 
key; 50¢ per specimen set.) 


REFERENCES 
1. Asxromp, Tuzovore AskouNEs, "The College Chemistry 


Jenn the Armed Forces Institute.” J Chem Ed 21:386-92 Ag 
"m 

John H. Daugherty, Examiner in Physical 
Science, Board of Examiners, Michigan State 
College, East Lansing, Michigan. This test has 
110 multiple-choice items. The examination is 
well arranged in four sections according to 
objectives: I, general knowledge and informa- 
tion; II, applications of principles ; III, quanti- 
tative application of principles; and IV, several 
aspects of the scientific method. 

Section I contains 35 multiple-choice 5-re- 
sponse questions, largely items testing acquaint- 
ance with important concepts and a general 
knowledge of the properties of the common 
substances. The factual information covers a 
broad field with a reasonable range of difficulty 
and does not suffer from the defect of resorting 
to isolated information of debatable importance. 

Section II contains 30 multiple-choice items 
testing the ability to apply principles in making 
simple predictions involving the understanding 
of principles and theories and their interrela- 
tions and the ability to interpret and select data 
in making interpretations. Six items involve 
the periodic system in an ingenious manner. 
Five items on the activity series are very good. 
Four items consider equilibrium conditions. 
Several of the foils here seem too easy as even 
poor students may know that an equilibrium 
is altered in some way and that all reactants 
are not affected equally. The remaining items 
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“cover a wide variety in an excellent way, with 
the possible exception of Item 48, where the 4 
incorrect responses seem so unrelated to the 
question that many students would easily dis- 
card them. 
Section III has 20 items on the quantitative 
application of principles with emphasis on han- 
“dling rather than on mathematical skill. The 
- problems include balancing equations, percen- 
rs tage composition, gas laws, weight-volume re- 
z lationships, formula determination, molarity, 
“valence, and charge. In several, the student 
merely indicates correct mathematical proce- 
dure, thus making for efficient use of time. 
Section IV, testing several aspects of scien- 
tific method, contains 25 items about experi- 
- mental evidence for statements and the theory 
explaining them; truthful bases for statements, 
whether observation, theory, or consequence of 
“a definition; assumptions for given conclu- 
sions; and experimental factors needing control. 
One type of question consists of a statement 
about which there are two questions (What is 
the best experimental evidence for the state- 
? ment? What is the best theoretical explanation 
of the statement?), both to be answered from 
the same foils, However, by eliminating the 
. apparently theoretical responses, most students 
‘could select the correct experimental ones. 
Ttems 100-106, dealing with the validity of the 
~ conclusion of an experiment, are excellent, 
One mechanical aspect of the test may be 
» open to question. The fifth response was the 
correct answer only IO times in IIO items (8 
items had less than 5 foils) ; the first only 13 
times. The B, C, D responses were used as 
correct answers in 79 per cent of the items. 
Some students might note a pattern in not 
using the first and last choices more often. 
— „The reviewer would refer the reader to an 
= interesting article (1) by Theodore A. Ashford 
—— for a detailed report on the purposes, planning, 
~ - and construction of this test. The test as a 
whole is excellent and shows the result of care- 
= ful planning to meet very worth-while objec- 
~ tives. 


y: 


Florence E. Hooper, Assistant Professor of 
Chemistry, The George Washington Univer- 
sity, Washington, D. C. This examination 
satisfactorily covers all the theory usually 
studied in the first year of college chemistry. 

_ Thirty-five items deal with general knowledge 
| and information, 30 with application of prin- 


ciples, 20 with quantitative application of prin- 
ciples, and 25 with scientific method. The prob- 
lems used have the advantage of placing the 
emphasis on handling the principles involved 
rather than on skill in arithmetic. 

In view of the fact that a considerable por- 
tion of class time is usually devoted to study- 
ing specific chemical reactions, it seems that a 
greater number of items should be concerned 
with such material Since the examination is 
to be widely used, it is important that the 
items be restricted to those reactions which are 
studied in all first year college chemistry 
courses. Several of the items in this examina- 
tion seem to have been written with a particu- 
lar course or text in mind. For example, the 
oxidizing action of nitrogen dioxide (Item 22) 
is entirely omitted in many standard texts. 
Likewise the majority of students will not have 
sufficient information to compare the reducing 
action of copper and hydrogen sulfide (Item 
32). 

The examination is, in general, clearly writ- 
ten and relatively free of errors. However, 
Items 100-106, which refer to a particular ex- 
periment, will probably be confusing to the stu- 
dent, who is expeeted to classify as assump- 
tions several statements which he knows to be 
untrue. No mention is made of the important 
assumption that the unknown does not contain 
silver, lead, or mercurous ions. It should be 
noted that both Alternatives B and D in Item 
I5, and A and C in Item 30 are correct and 
that the answer given in the key for Item 74 
is incorrect. 

It is the opinion of the reviewer that the 
120 minutes allowed for the examination will 
not permit the average student to carefully 
evaluate each alternative. Several years ex- 
perience in administering tests of this type has 
shown that too short an examination period 
results in the acceptance of plausible wrong 
answers by even the best of students. 

This examination is similar in many respects 
to the A.C.S. Cooperative General Chemistry 
Test for College Students, Form 1946. Fifty- 
Six items of the two are identical and 26 differ 
only slightly. The A.C.S. test differs from this 
examination in that it includes 10 items on 
laboratory technique and procedure. Either 
test should prove useful in measuring the 
achievement of students who have completed a 
year of college chemistry. 


+ 
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“Iowa Placement Examinations: Chemistry Apti- 

tude, Grades 12-13; 1025-44; IBM; 2 editions; G. D. 

Stoddard, L. W. Miller, and Jacob Cornog; Bureau of 

educations Research and Service, State University of 

owa. 

d) SERIES CAI, REVISED. 1925; Forms A, B ; $4 per 100; 

50g per specimen set including the other 10 tests in the 
series, postpaid; 44(50) minutes; prepared under the 
direction of C. E. Seashore and G. M. Ruch. 

b) NEW SERIES CA-2, REVISED. 1925-44; also called 

Quick-Scoring Edition; IBM; Form M; separate an- 
swer sheets need not be used; $1.15 per 25; 30€ per 
specimen set including the other 4 tests in the series, 
postpaid; $1 per 100 machine-scorable answer sheets ; 
15¢ per stencil for scoring answer sheets; 47(55) min- 
tutes; revised by Jacob Cornog and D. B. Stuit. 
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Theodore A. Ashford, Assistant. Professor 
of Chemistry in the College, The University of 
Chicago, Chicago, Illinois; and Chairman, Com- 
mittee on Examinations and Tests, American 
Chemical Society. The aptitude tests are de- 
signed to measure the student’s ability in chem- 
istry and predict probable scholastic success in 
the subject. In the Examiner's Manual, seven 
other related uses are suggested and the enter- 
prising teacher will doubtless find additional 
uses, The examinations are designed to test stu- 
dents who have had little or no formal work 
in the subject. 

Form M is the latest one of the series. The 
reviewer is of the opinion, despite some reser- 
vations, that this is the best aptitude test avail- 


able in chemistry. The test consists largely of - 
exercises testing mental skills which have been 
found, or are suspected, to play an important 
role in the mastery of chemistry, although it 
also contains exercises which can be answered — 
more readily by a student who has had previous 
training in chemistry. It is completely objec- 
tive and may be scored either by a stencil or 
by an electrical scoring machine. - 

The test consists of four parts. The mathe- . 
matics part (25 items, 15 minutes) consists of a 
items testing arithmetic and simple algebra... 
These items are well selected and represent 
the indispensable minimum of mathematical 
equipment necessary for success in chemistry. 

The formulation part (10 items, 7 minutes) 
is designed to measure ability to write alge- á 
braic expressions from verbal statements. The 
items appear valid but rapidly become complex, 
It is doubtful whether 7 minutes is sufficient to 
permit answering more than the first few sim- 
ple items. 

The reading part (30 items, 18 minutes) is 
perhaps a misnomer—it is more nearly ability 
to manipulate chemical symbolism. It consists 
of an indigestible presentation on symbols, for- 
mulae, valence, ionization, and naming of com- 
pounds, followed by exercises in writing correct — | 
formulae, given the symbols and valences of 4 
the ions and the names of the compounds. 
Practically all conceivable combinations of ions, `- 
are represented. The section is overweighted 
with two or three skills in chemical symbolism. 
It is regrettable that this section, which re- 
quires a large overhead in reading time, does - 
not include exercises testing ability to read and 
understand a scientific exposition. Moreover, 
this section might invalidate the predictive 
power of the test, in that students who have 
had previous knowledge are likely to score ab- ` 
normally high, while those unacquainted wi g 
chemical symbolism are likely to find it difi- — - 
cult to “catch on.” E 

The information part (45 true-false items, i 
7 minutes) contains a well-selected group of 
items. The majority of the items test acquaint 
ance with important principles, though several 
items test isolated and unimportant facts or 
knowledge of technical terms, and two items 
(Items 13 and 18) may be answered both ways. 
It is regrettable that only seven minutes is 
allotted to this important section. 

The reported reliability based on 369 cases —— 
is .963, with a standard deviation of 23.5 24 
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a probable error of 3.04. The test is useful over 
a wide range of abilities, the first quartile be- 
ing 36 and the third quartile 76, with 130 as a 
maximum possible score. The correlation with 
first semester grades is .50. 

Forms A and B, two parallel tests, are ear- 


~ lier forms of the series and are hand scored. 


They are on the whole inferior to Form M. 
The mathematics part is comparable to that of 
Form M. Parts 2 and 3 are designed to test 
reading ability and contain reading questions 
which are absent from Form M. On the other 
hand, these tests do not contain exercises on 
chemical symbolism. Part 4 contains 60 true- 
false items, testing general information. This 
section has much poorer selection of items 
than the corresponding part of Form M. 
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Iowa Placement Examinations: Chemistry Train- 
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and G. M. Ruch; G. D. Stoddard and J. Cornog; 
Bureau of Educational Research and Service, State 
University of Iowa. 
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Theodore A. Ashford, Assistant Professor of 
Chemistry in the College, The University of 
Chicago, Chicago, Illinois; and Chairman, Com- 
mittee on Examinations and Tests, American 
Chemical Society. [Review of Form A.] Consid- 
ered as a whole, the test measures fairly ade- 


quately the information of high school chemistry 
but falls short of testing for the understanding of 
the basic principles or the attainment of the im- 
portant skills and abilities. Thus, while seve ral of 
the 45 true-false items of Section 1 test impor- 
tant basic information, the majority of the items 
test unimportant details and at best only ver- 
bal or superficial acquaintance with principles. 
Similarly, most of the 50 true-false items of 
Section 3 test specific information on named 
processes (Solvay, chamber, contact, LeBlanc, 
Dutch, etc.) or on commercial names of mate- 
rials (cinnabar, borax, fire damp, white lead, 
etc.) or on various properties of a few sub- 
stances. The basic principles of chemistry are 
almost completely ignored. A part of the diffi- 
culty is inherent in the form of the true-false 
items—several items are not clear-cut, because 
of this limitation, More important ideas could 
be tested by changing the true-false into mul- 
tiple-choice items. But to improve the test sig- 
nificantly would require a deliberate attempt to 
test functional understanding of the principles 
and the application of knowledge to new situ- 
ations. 

A similar criticism can be made on Section 
2. At first glance the section appears to meas- 
ure ability to manipulate chemical symbolism. 
Actually this section tests memory of valence of 
specific ions or radicals (20 items) and of 
names of substances, given the formulas (10 
items). Only five items test ability to predict 
reactions and balance equations. The under- 
standing of chemical symbolism is not other- 
wise tested for. 1 

Section 4 is the most satisfactory. It contains 
12 problems testing the quantitative under- 
standing of density, the gas laws, atomic equiva- 
lent, and molecular weights and valence. The 
concepts are tested with a minimum require- 
ment of arithmetic manipulation. 

Because of Section 4, the test measures a 
little more than information and does so reli- 
ably. The reported reliability, based on 100 
cases, is .90 with a standard deviation of 30.4 
and a probable error of a score of 6.4. The pre- 
dictive power of this test is not reported, but the 
average correlation between all the training tests 
and scholastic success in the respective subjects 
is reported as .60. This implies that for predict- 
ing scholastic success or for assigning grades, 
this test should be supplemented with the apti- 
tude test as well as with an additional test 
measuring understanding of principles. 
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[ 568 ] 
Kirkpatrick Chemistry Test. First, second semes- 
ters high school; 1940-41; Forms A, B; 2 levels; 90¢ 
per 25, postpaid ; 15¢ per specimen set, postpaid ; 40(45) 
minutes; Ernest L. Kirkpatrick and H. E. Schrammel 
(manual only) ; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. 
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Theodore A. Ashford, Assistant Professor of 
Chemistry in the College, The University of 
Chicago, Chicago, Illinois; and Chairman, Com- 
mittee on Examinations and Tests, American 
Chemical Society. This test is designed to meas- 
ure achievement in a one-year high school 
chemistry course. Test I appears in two parallel 
forms, covering the work of the first semester ; 
Test II also appears in two parallel forms, cover- 
ing the work of the entire year. This review is 
based on Test II, Form A. 

The intent of the authors to base the test on 
common content of leading textbooks and 
courses of study has been fully realized. As a 
result, the test is comprehensive in coverage 
and contains a wide variety of information, 
only part of which would be emphasized in any 
one course. By this device, the test can be used 
in many schools. On the other hand, intensity 
and depth have been sacrificed. 

Although there are many excellent items, the 
majority of them are purely factual. Far too 
many items are trivial or test information of a 
specific character. Most of Part II tests knowl- 
edge of the composition of alloys (e.g., Babbitt, 
type metal, Rose metal, etc.) or knowledge of 
formulas of substances with commercial names 
(e.g. laughing gas, muriatic acid, mossy zinc, 
etc.) or historical information associating an 
idea with a person. Moreover, an uncomfortably 
large number of items are phrased so as not 
to permit a clear-cut answer. For example, 
"Hydrogen has metallic properties" is to be 
marked "true," though it is at least debatable, 
and in any event the idea could be tested better 
in another way. 

Many principles and theories are touched 
upon but are tested largely on the verbal level. 
There are no items testing depth of under- 
standing or the application of knowledge to 
new situations. There are 16 items on balancing 
equations, but they all test essentially the same 
skill with a considerable amount of duplication. 
The quantitative understanding of principles 
and concepts is almost entirely absent—there 


are only three simple problems. In this respect 
the Iowa Placement Examinations: Chemistry 
Training is to be preferred. 

On the whole, the test may be used to assay 
quickly the range of information. The reported 
reliability, based on 128 cases, is .go. Correla- 
tion between semester marks and scores on this ~ 
test, based on 111 cases, is reported as .68. 
This is quite high and suggests that the criti- 
cisms in this review should be directed to the 
courses of study primarily and only secondarily 
to this test. 
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* Wisconsin Achievement Tests: Chemistry. High 
school; 1938-39; Form A; $1.25 per 25; 50(55) min- 
utes; prepared by Bureau of Guidance and Records, 
University of Wisconsin, and the Wisconsin Chemistry 
Test Committee: Richard Butler, Ira C. Davis, C. E. 
Kommes, William Leahy, Walter Mero ee 
Peterson, E. V. Smith, Harold Wiercks, uis W. 
Willie, and C. R. Wolf; E. M. Hale & Co. 


Victor H. Noll, Professor of Education, 
Michigan State College, East Lansing, Michi- 
gan. This test was probably constructed for use 
in a particular situation and for definite pur- 
poses known best to those responsible. It is im- 
possible to evaluate it adequately because of 
lack of information. There appears to be no 
manual and no instructions for giving the test. 
except the directions to pupils which are printed 
on the title page. No information is given as 
to the purposes of the test, plan of organization, 
validity, reliability, or methods used in stand- 
ardization. 

The test consists of five parts separately 
timed but not labelled. Part I contains 26 
multiple-choice items testing information, in- 
cluding a few problems; Part II consists of 
four sets of matching items dealing with chem- 
ists and their contributions, definitions of chem- 
ical processes, formulas and names, and tests 
for various “chemicals”; Part III consists of 
various diagrams such as of a blast furnace, 
atomic structure of certain elements, and setups 
of apparatus, with multiple-choice or short- 
answer questions based on each; Part IV 
consists of 20 short-answer questions of a 
miscellaneous character; Part V consists of 32 
short-answer questions on formulas, equations, 
and chemical problems. 

The total possible score is 163, based on one 
point for each item; the time limits are 7, 10, 
15, 8, and ro minutes on each of the five parts 
respectively. Percentile norms in intervals of 
five are given for pupils in four groups, those ~ 
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with IQ's of 115 and above, between 106 and 
114, between 97 and 105, and lower. The total 
number of pupils involved is 794 and the num- 
ber of schools is 26. Nothing is said concerning 
the size, type, or location of these schools. 

Approximately one half the items are of the 
short-answer type. This rules out machine scor- 
ing and reduces objectivity. In 42 four-response 
multiple-choice items, the correct answer ap- 
pears four times in the first position; 15 times 
in the second position; 14 times in the third 
position; and 9 times in the fourth position. 

Some questions might be raised regarding the 
chemistry of the test. For example, on page 4 
in the first matching set "discovered oxygen" 
appears in one column and both Priestley and 
Scheele in the other. It is true that Priestley is 
generally credited with the discovery of oxygen, 
but it is also true that Scheele discovered it 
several years before Priestley. In this set of 
items the problem thus created could easily have 
been avoided. Again, on page 5 in the last set of 
matching items the directions state that, “Col- 
umn A lists a number of “chemicals.” The first 
item in Column A is “ammonium.” It is cer- 
tainly dubious to call this radical a chemical in 
the common sense of the term, and here again 
the problem could easily have been avoided. 

The test gives the impression of having been 
constructed without definite plan, or regard for 
the various types of objectives of a good course 
in high school chemistry. For the most part, it 
seems to test a variety of miscellaneous chemi- 
cal information. The only exception to be noted 
is in Part III where knowledge and understand- 
ing of apparatus, processes, and atomic struc- 
ture are tested. 

As stated above, the test probably was con- 
structed for their own purposes by a group of 
chemistry teachers and it may never have been 
intended for general distribution and use. It 
does not give evidence of the care in planning 
and construction which one expects to find evi- 
denced in the best available tests. 
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Cooperative General Science Test. High school; 
1939-47 ; Forms P, Q, X; IBM; separate answer sheets 
need not be used; $1.75 per 25; 256 p specimen set, 
postpaid ; 40¢ per 25 machine-scorable answer sheets; 
15¢ per stencil for scoring answer sheets; 40(45) min- 
utes; Forms P and Q: O. E. Underhill and S. R. 
Reg Form X: Paul E. Kambly; Cooperative Test 
ervice. 
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For reviews by G. W. Hunter, W. B. Mel- 
drum, and Alvin W. Schindler of earlier edi- 
tions, see 38:1125 and 40:1601. 
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XCooperative Science Test for Grades 7, 8, and 9. 
Grades 7-9; 1941-47; IBM; Forms R, S, X; scparate 
answer sheets need not be used; $2 per 25; 25¢ per 
specimen set, postpaid; 40 per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 80(85) minutes; Form R : John G. Zimmerman 
and Richard E. Watson; Form S: Carl A. Pearson; 
Form X: Paul Kambly; Cooperative Test Service. 


Hans C. Gordon, Special Assistant to the 
Director, Division of Educational Research, 
Public Schools, Philadelphia, Pennsylvania. 
[Review of Form R.] The purpose of the Co- 
operative tests is "to provide objective and 
reliable measures of the educational achieve- 
ment of the pupils tested." For this purpose 
carefully constructed tests have been provided 
with enough additional accessories to compare 
the achievement of any pupil (a) with that of . 
the average pupil completing an average course 
of study in the subject (scaled scores) and (5) 
with the amount of gain to be expected for a 
pupil (end-of-year norms on the same instru- 
ment for successive years). In this way Co- 
operative tests attempt to extract a great 
amount of meaning from scores obtained. 

The Cooperative Science Test consists of 
three parts: I, Facts, Skills, and Applications; 
II, Terms and Concepts; and III, Comprehen- 
sion and Interpretation. Parts I and II are 
multiple-choice tests on science content; Part 
III is a reading test. 

Some teachers find the long forms of Co- 
operative tests difficult to use because of the 
time required. Eighty minutes of actual testing 
is more time than many teachers are willing to 
devote to a single test. Shorter forms of tests 
with adequate reliability can be attained as is 
indicated by the Cooperative 40-minute series. 
For example, Part III, Comprehension and 
Interpretation, is timed at 25 minutes, and yet 
only 30 items are included. In this part, pupils 
have to read materials that are quite difficult for 
them, requiring close detailed attention, at the 
rate of over 200 words per minute in order to 
have half the allotted time to make responses. 
Tests can be devised to meet the same purposes 


595 ] 


TESTS & REVIEWS: SCIENCE—GENERAL 


„as Part III and yet provide more responses per 
minute and equal reliability in less time. 

Generally the selections in Part III need to 
be read carefully before the questions can be 
answered. This is as it should be. Occasionally, 
however, it is quite possible for a pupil to know 

» the answers to some questions without actually 
reading the content. Tests of comprehension of 
science materials should be carefully made so as 
to present concepts that are new to the pupil 
and yet use enough of the specific vocabulary 
and form of expression of science to make these 
tests different from an ordinary reading com- 
prehension test. The authors do not prove that 
Part III scores are significantly different from 
those of general reading tests. If they are not 
different, the administration of Part III is un- 
necessary for those pupils who have adequate 
reading test records. 

The norms show a difference of only three 
scaled-score units at the soth percentile from 
the end of the eighth to the end of the ninth 
grade. This is very small in comparison with 
the gains in scaled-score units in subjects such 
as foreign languages. This failure of the Co- 
operative Science Test to show important dif- 
ferences from one grade to the next may be due 
to the fact that many pupils are taking their 
first year of general science instruction in the 
ninth grade. The norms would be more helpful 
if we could distinguish the length of time that 
pupils have been in general science classes as 
well as their grade level. 


Herbert A. Thelen, Assistant Professor of 
Education, The University of Chicago, Chi- 
cago, Illinois. [Review of Forms S and R.] 
The items of Part I elicit a variety of behaviors 
ranging from recall of specific facts to compara- 
tively simple applications of principles. The 
number of items portraying situations likely to 
be novel to the students is relatively small. In 
Part II, Terms and Concepts, the emphasis is 
upon recognition of the correct word to apply 
in a given situation. In general, the skill is con- 
cerned with definition, but there appears to have 
been some effort to avoid the stereotyped ver- 
balizations which so often mask lack of under- 
standing of the purposes served by definition. 
Part III presents the student with six passages, 
each approximately 200 words long, and re- 
quires him to answer a number of questions fol- 
lowing and related to each. The behaviors range 
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from recognizing almost direct quotations to 
application of social criteria for the purpose of 
deciding such matters as what the author's pur- 
pose is, what the advantages of a described in- 
vention are, and the like. 

All of the items are multiple choice and de- 
signed for answering on a separate response © 
sheet. Tentative norms "based on approxi- 
mately 5,000 Junior High School students" are 
supplied as scaled scores sampling the percentile 
range in grades 7, 8, and 9. The test reliability 
is not given. The administration of the test takes 
80 minutes and the sections are timed. 

As usual, the problem of deciding whether to 
use a test which seems technically as satisfac- 
tory as this one is the problem of deciding to 
what extent the behaviors elicited by the test 
adequately represent the behaviors the teacher 
is trying to develop in his course. In the opin- 
ion of this reviewer most teachers would prob- 
ably check as desirable the objectives appraised 
by most of the items in Part III, but the same 
teachers would probably have to admit that few 
of their class activities were designed con- 
sciously to promote these behaviors. By provid- 
ing through its items operational definitions of 
some desirable specific objectives, Part III 
supplies to the teacher a valuable instrument 
which should increase the effectiveness of his 
efforts to develop learning activities consonant 
with some objectives of general education. 

Part II, which appraises the student's ability 
to use terms properly in context, is an improve- 
ment over similar cliché items in the usual sci- 
ence achievement test (cf. for example, the 
McDougal General Science Test). 

Part I, which elicits a number of different 
kinds of behaviors, might be useful for a rapid 
survey of the student's general orientation. 

In summary, this instrument offers a number 
of advantages over most similar efforts: (a) it 
provides three scores which can be interpreted 
to provide useful information about the effec- 
tiveness of the course in promoting certain 
learnings in each student; (b) it attempts to 
appraise specific outcomes which are, in general, 
clearly defensible; (c) it is a well-built instru- 
ment. If the teacher can supplement this test by 
instruments or observations designed to take 
account of other objectives recognized by the 
teacher and class and of special experiences 
unique to the particular course, appraisal should 
be reasonably adequate. 
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Elementary Science and Health: Every Pupil 
Test. Grades 4, 5, 6; 1946-47 ; new form usually issued 
each April and December ; forms December 1946, April 
1947 ; 2¢ per test; I¢ per scoring key; 40(45) minutes ; 
Ohio Scholarship Tests, Ohio State Department of 
Education. 
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XExamination in General Science—High-School 
Level. High school; 1945; IBM; Form SGSc-1-B-4 
- (usually called Form 8j. separate answer sheets must 
be used ; $3 per 25; so¢ p 25 machine-scorable answer 
sheets ; 30¢ per scoring key; 35 per specimen set, post- 
id; ORE prepared by the Examinations 
taff of the United States Armed Forces Institute; 


published by the American Council on Education ; dis- 


tributed by Cooperative Test Service. ( Also distributed 
by Science Research Associates : $3 per a 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; 50¢ per 
specimen set.) 


Hans C. Gordon, Special Assistant to the 
Director, Division of Educational Research, 
Public Schools, Philadelphia, Pennsylvania. 
This 150-minute examination is designed to 
explore as thoroughly as possible the achieve- 
ment of generally accepted objectives in gen- 
eral science. USAFI examinations are designed 
not only to furnish assistance in evaluation of 
achievement in high school but also to furnish 
in some cases the sole evidence of the attain- 
ment of objectives. For this reason, the United 
States Armed Forces Institute has produced 
one of the most thoroughgoing instruments 
available in the measurement of achievement in 
general science. 

The test consists of three parts: Section I, 
Basic Facts and Information; Section II, Ap- 
plication of Scientific Principles; Section III, 
Scientific Attitude and Application of Scien- 
tific Method. Of these three sections, Section 
II, Application of Scientific Principles, is more 
comprehensive than either of the other sec- 
tions. The scientific problems presented in Sec- 
tion II involve not only interpretation of pre- 
sented data, but also working skill with basic 
scientific facts and principles. The problems 
presented are likely to be original in the sense 
that they are not common in scientific textbooks 
or included specifically in courses of study. For 
this reason they have unusual value for meas- 
urement. 

The directions emphasize the desirability of 
developing local norms for the total and sepa- 
rate section scores. The tables of norms fur- 
nished with the test may be of help in eval- 
uating achievement until local norms are 
established, and in estimating the general needs 
of a class or community in science instruction. 


However, there is no description of the pupils | 
furnishing the data for the norms: age, grade, 
mental, socio-economic level, or geographic dis- 
tribution. According to the norms the median 
pupil does not achieve 50 per cent of the pos- 
sible score in the total or in any section. The 
test seems difficult enough to detect unusual 
ability. 

The manual suggests that the section scores 
be used to guide instructional emphasis for in- 
dividuals and for classes. If this is to be carried 
out, for some individuals there would be an 
attempt to develop a wider concept of the ap- 
plication of scientific principles, for others there 
would be an attempt to increase his knowledge 
of basic facts and information, and for still 
others emphasis would be thrown upon scien- 
tific attitudes and the application of scientific 
methods. It is not too clear how or why a 
teacher would stress knowledge of basic facts 
without also stressing application of scientific 
principles, attitudes, and methods. 


Victor H. Noll, Professor of Education, 
Michigan State College, East Lansing, Michi- 
gan. The test consists of 180 multiple-choice 
items divided into three parts: Basic Facts and 
Information (62 items), Application of Scien- 
tific Principles (94 items), and Scientific Atti- 
tudes and Application of the Scientific Method 
(24 items). A substantial proportion of the 
items involve the interpretation of diagrams. 
The time allowance is 2 hours and 30 minutes. 
Percentile norms on the separate parts and on 
the total, based on nearly 1,200 cases, are avail- 
able. The cases are not classified as to grade 
level, it being merely stated that they are high 
school level. No information regarding validity 
and reliability or purpose is given. The test 
covers a wide range of material in science and 
shows evidence of care and skill in construc- 
tion. 

Without more information one is leít with 
a feeling of uncertainty as to the merit of this 
examination. Many of the items are reminiscent 
of the type used in general classification tests in 
the armed forces, and as such their appropriate- 
ness for use in this test may be questioned. If 
the examination is designed for use at the | 
junior high school level where general science - 
is usually taught, it seems ill-suited both in con- 
tent and in difficulty. The average junior high 
school pupil in general science would probably 
be overwhelmed by it. 
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The reviewer classified the items in the test 
according to whether he considered them to be- 
long in one of four categories. A few items 
were difficult to classify but the following dis- 
tribution is approximately correct: physics, 57 
per cent; biology, including conservation, 23 
per cent; earth science, 12 per cent; chemistry, 
8 per cent. This seems somewhat high for 
physics and low for the other categories, espe- 
cially earth science. If the examination is de- 
signed to measure general scientific knowledge 
after the usual high school courses in biology, 
chemistry, and physics, the selection of content 
seems even more questionable. In such cases it 
would seem desirable to have had a much more 
nearly equal division of items among the three 
major fields. 

It is impossible to judge the extent to which 
the three parts of the examination have differ- 
ential value. Certainly Part I overlaps both 
Parts II and III, and a number of the items 
in Part III are so like those in Part II that it 
is very difficult to see any difference. 

In summary, one may say that the exami- 
nation has the qualities of accuracy and care 
in item construction; that it is long and there- 
fore probably reliable; and that it covers the 
physical sciences (taken all together) much 
more thoroughly'than the biological. For use 
in junior and senior high schools as a measure 
of achievement in science courses as generally 
offered, its value is open to question without 
more information concerning purposes, meth- 
ods of construction, norms, and other criteria. 
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*Examination in Senior Science—High-School 
Level. Grades 11-12; 1944; IBM; Form SSrS-1-B-4 
(usually called Form B) ; separate answer sheets must 
be used; $3 per 25; 40f per 25 machine-scorable answer 
sheets; 15¢ per scoring key ; 25¢ per specimen set, post- 
paid; 120(125) minutes ; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by the American Council on Education; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates : $3 per 25; 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; 50¢ per 
specimen set.) 


Richard H. Jordan, Associate Professor of 
Geology, Florida State University, Tallahassee, 
Florida. This examination in high school senior 
science is objective, adapted to machine scoring, 
and on the whole well constructed. It includes 
questions involving application of facts and 
principles, reflective thinking, as well as simple 
recall. The mechanics of good objective testing 


have been followed generally in the writing of 
the items; for example, the place of the correct 
response among the possible choices has been 
varied between questions in a chance distribu- 
tion; the correct response cannot be chosen by 
length or qualifying phraseology ; in most cases 
the possible choices are equally plausible to the 
uninformed student ; the parts of keys in the key 
list questions are not confusing; nor is there 
more than one possible answer in the multiple- 
choice questions where only one is desired. 
The instructions to the student are clear and 
concise. Care has been taken in the wording of 
the items, The test seems to be accurate from a 
scientific point of view. Many of the questions 
show thought and ingenuity on the part of the 
author. 

The content of the examination is predomi- 
nantly that of physics and chemistry. There are 
a few questions concerning biology, a very few 
dealing with meteorology, and none with 
geology. 

In the opinion of the reviewer Section III, 
Application of the Scientific Method, is the 
only section open to any serious criticism and 
then primarily because of the title. Three of 
the series of items are based upon the student’s 
ability to read graphs and use arithmetic, one 
on his ability to read a table, and the last on 
his ability to apply a physical principle. Al- 
though these are excellent exercises in think- 
ing, the reviewer questions their validity in 
the measurement of anything but very limited 
applications of the scientific method. They do 
call for exact thinking, a certain amount of 
accurate observation, and possibly suspended 
judgment, but they do not test the formation of 
(or selection of) reasonable hypotheses, the 
testing of hypotheses, ability to distinguish fact 
from theory, or a combination of these points. 
It is believed that there are questions in other 
parts of the test which measure the student’s 
ability to apply the scientific method fully as 
well as these. 

An adequate series of items that will test the 
student’s ability to apply the scientific method 
is difficult to construct. It is believed that a 
more adequate approach should be made here 
or that the title of the section should be made 
more appropriate to the included items, 

Percentile norms and equivalence tables for 
Forms A and B have been prepared by the ad- 
ministration of the test to 968 students in 27 
schools, Raw scores and percentage of cases 
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falling below the corresponding raw scores are 
listed for Forms A and B. 


[ 575 ] 
General Science: Every Pupil Test. mgh school; 
1946-47 ; new form usually issued each April and De- 
cember ; forms December 1946, April 1947 ; 2¢ per test; 
Ig per scoring key; 40(45) minutes; Walter L. Arn- 
holt; Ohio Scholarship Tests, Ohio State Department 
of Education. 

[ 576 ] 
*McDougal General Science Test. First, second 
semesters high school; 1941; Forms A, B; 2 levels ; 
90f per 25, postpaid; 15¢ per specimen set, postpaid; 
40(45) minutes; manual by H. E. Schrammel; Clyde 
R. McDougal; Bureau of Educational Measurements, 
Kansas State Teachers College of Emporia. 
a) TEST I. First semester. 
b) TEST 11. Second semester. 


Hans C. Gordon, Special Assistant to the 
Director, Division of Educational Research, 
Public Schools, Philadelphia, Pennsylvania, 
This test is issued in two levels, one for the 
end of each semester of the first year of high 
school general science, 

The author states that the test results may 
be profitably used (1) for determining student 
achievement; (2) for checking the efficiency 
of instruction; (3) for assigning school marks; 
(4) for analyzing student and class weaknesses; 
and (5) for motivation of student effort.” 

Some of the important things that one should 
like to know before selecting a test for these 
Purposes would be answers to the following 
questions : 

(a) After a half year’s or a year’s work in 
general science do pupils show greater gains in 
scores on this test than pupils who have not 
had such experience? The author furnishes no 
data about this, but inspection of the items in 
the test indicate a high loading with verbal and 
memory factors rather than evidences of growth 
in scientific reasoning, attitudes, interests, and 
skills. There is no pictorial content, problem 
material, or interpretive exercise in the test. 

(b) Are all, most, or even many of the 
kinds of learning expected as a result of the 
general science experience sampled adequately? 
This test apparently measures a very limited 
sample from the total science experience. Al- 
though the author has made an undoubted 
effort to sample the content of textbooks and 
courses of study, it is obvious that this sampling 
cannot be accurate enough to meet the needs 
of most specific teaching situations. General 
science is a subject that has not so formalized 

its subject matter that specific concepts are al- 
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located to any specific part of the course with 
anything like the same degree of regularity that 
topics are presented in sequential subjects such 
as algebra, geometry, history, physics, or even 
chemistry and biology. The approach to meas- 
uring achievement of aims in general science 
is not likely to be found in sampling only the 
facts listed in textbooks and courses of study. 

(c) Are the measured learnings clearly de- 
fined so that steps can be taken to supplement 
the test with other measures that will round 
out an adequate total picture of pupil growth 
in general science? This test is subdivided into 
three parts based on type of question as true- 
false, multiple-choice, or matching. Obviously 
this sort of subdivision has no value to indi- 
cate those general science aims that have been 
attained. 

In general, it may be said that this test may 
have value for the teacher who has compared 
carefully the content of the test and the content 
of his instruction and found them in agree- 
ment. For such teachers the test will give meas- 
ures of information remembered at the time of 
the test, although even in this respect the 
teacher would be at a loss in interpreting the 
scores in terms of the norms provided. The 
directions provide no information as to the gen- 
eral mental level, grade or age, number of 
hours of instruction, or specific content of the 
course for the group tested. Data of this sort 
are essential if the scores are to have more 
meaning than a test made by the teacher him- 
self. 


Herbert A. Thelen, Assistant Professor of 
Education, The University of Chicago, Chi- 
cago, Illinois. This test of recall of facts and 
simple generalizations from the area of “Gen- 
eral Science” deals with the usual academic 
physical and biological science at elementary 
levels. The three parts of the test require the 
student to check the accuracy of each of a num- 
ber: of suggested completions for given state- 
ments, to select the best choice from a number 
of suggested answers to simple questions, and 
to match a number of descriptive phrases to 
the nouns thus described. The two tests elicit 
160 and 170 responses respectively and have 
a reliability coefficient of “approximately 0.90.” 
The directions ask that time be called after 40 
minutes so that the complete objective of the 
tests would appear to be accuracy and speed of 
recall of a wide range of common scientific 
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* facts. In the opinion of the reviewer, the items 
are straightforward and clear-cut and should 
present little difficulty to a student who “knows 
the facts.” 

It is unreasonable to suppose that in 40 min- 
utes of testing at the end of a year one could 
get reliable evidence of the extent of pupil 
growth with respect to each of the major ob- 
jectives of the science program. It does, how- 
ever, seem reasonable to require that whatever 
tests are given should demand from the stu- 
dent intellectual skills of the kinds presumably 
facilitated by the laboratory and other activities 
of the course. A thoughtful and test-wise 
teacher would be in the best position to de- 
scribe these skills and the depth of penetration 
appropriate. No standardized test could be ex- 
pected to have high curricular validity for every 
class. The McDougal General Science Tests, 
because they deal only with commonly taught 
facts, are probably about equally unfair for 
most classes. Tt is this general usefulness for 
comparing one class with many other classes 
that makes the test in general not useful for a 
teacher wishing to appraise worth-while achieve- 
ment as a form of growth of his own pupils. 
Such a teacher would be concerned with spe- 
cific ability in applying each of the major prin- 
ciples taught during the course, in solving 
particular types of problems, in designing experi- 
ments for a described set of purposes, etc. None 
of these appraisals would be helped by collec- 
tion of the data possible from the tests in ques- 
tion. The most serious difficulty with these tests 
is not that they are poor tests for the behaviors 
they appraise but rather that the behaviors they 
appraise are per se of little significance for 
general education. — 


[ 577] 

Wisconsin General Science Test. High school; 
1937-39; Forms A, B; Form A has the title Wisconsin 
Achievement Tests: General Science; $1.25 per 25; 
50(55) minutes; prepared by the Bureau of Guidance 
and Records, School of Education, and the Wisconsin 
General Science Test Committee: Eleanor Cox, Ira C. 
Davis, Violet Dohse, D. J. English, Hazel Hyer, 
Thelma Keyes, Edwin Jensen, George C. Peterson, 
Harvey Peterson, H. W. Schmidt, and Clyde Stewart; 
manual by T. L. Torgerson and G. J. Froehlich; E. M. 
Hale and Co. 


MISCELLANEOUS 


[ 578 ] 
Examination in Astronomy—College Level. Col- 
lege; 1945; IBM; Form CAs-1-B-4 accally called 
Form B) ; no norms available; separate answer sheets 


must be used ; $1.75 per 25; 40€ per 25 machine-scorable 
answer sheets; 15¢ per scoring key; 25 per specimen 
set, postpaid; 150(155) minutes; prepared by the Ex- 
aminations Staff of the United States Armed Forces 
Institute; published by the American Council on Edu- 
cation; distributed by Cooperative Test Service. (Also 
distributed by Science Research Associates : $2 per 25; 
65¢ per 25 machine-scorable answer sheets; 50¢ per 
key; 50¢ per specimen set.) , 
[ 579 ] 

Cooperative Geology Test. College; 1940-41; 1 
form; 2 tests; $2 per 25; 25¢ per specimen set of either 
test, postpaid; G. Marshall Kay; Cooperative Test 
Service. 7 

a) HISTORICAL GEOLOGY. 1940; Form Q; 110(115) min- 


utes. 
b) PHYSICAL GEOLOGY. 1941; Form R; 90(95) minutes. 


[ 580 ] 
XExamination in Meteorology—High-School 
Level. Grades 11-12; 1944; IBM; Form Mg-1-B-4 


(usually called Form B) ; no norms available; separate 
answer sheets must be used; $2 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
256 per specimen set, postpaid ; 120(125) minutes ; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates : $2 per 2) 654 per 25 machine-scorable answer 
sheets; 50¢ per key; 50¢ per specimen set.) 


PHYSICS 


[ 581 ] 
Cooperative Physics Test. High school; 1939-47; 
IBM ; Forms P, Q, S, X; separate answer sheets need 
not be used; $1.75 per 25; 25¢ per specimen set, post- 
paid; 40 per 25 machine-scoral le answer sheets; 15¢ 
per stencil for scoring answer sheets; 40(45) minutes ; 
Form P: H. W. Farwell; Form Q: Carl P. Swinner- 
ton; Form S: Russell S. Bartlett, Lester D. Beers, 
Winston M. Gottschalk, Roberta G. Poland, Alan T. 
Waterman, and Carl A. Pearson; Form X: Paul J. 
Burke; Cooperative Test Service. 
REFERENCES 
1. Bınıncer, Mary Ler, A Study on the Prediction of Suc- 


cess in an Engineering College. npublished master's thesis, 
Clark University, 1945. (Abstracts of Dissertations . . . 1945, 


pp. 20-21.) dene" " 
2. Weaver, J. F. "The Distribution of Emphasis in Ten 


Physics Tests and in Twelve Physics Textbooks." J Ed Res 
39:42-55 S '45. * (PA 20:571) 

G. P. Cahoon, Professor of Science Educa- 
tion, The Ohio State University, Columbus, 
Ohio. [Review of Forms P, Q, and S.] These 
forms are composed of five-answer, multiple- 
choice items designed to be completed in a 40- 
minute period. Specific detailed directions for 
administering, scoring, tabulating, interpreting, 
and reporting the test results are provided in a 
separate manual. 

Form P contains 85 one-best-answer multi- 
ple-choice items, arranged in no particular 
order with reference to the major topics or 
divisions of the usual physics course (me- 
chanics, heat, electricity, etc.). Items on prob- 
lems are interspersed with those on factual 
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information. There are no items utilizing tables, 
graphs, or charts. Form Q is in two parts. Part 
I consists of 43 five-answer multiple-choice 
items in mechanics and heat, while Part II has 
42 items of the same type in electricity, light, 
and sound. All items related to one topic (such 
as heat) are grouped together. Six of the total 
85 items make use of diagrams, charts, or 
graphs. Form S consists of two parts. Part I 
(for which 30 minutes is allowed) is composed 
of 50 one-best-answer multiple-choice items ar- 
ranged, as in Form P, at random with respect 
to the major division of the usual physics 
course, Six items relating to three diagrams are 
included in this part. Part II (for which ro 
minutes is allowed) consists of five-answer 
multiple-choice items which have multiple cor- 
rect answers. One diagram is utilized for one 
of the 15 items in this part. 

The Cooperative Physics Test is probably 
among the best of the achievement tests in this 
area. The validity of the items for most teach- 
ing situations is probably high if, as the manual 
indicates, the content "is based on surveys of 
curriculum outlines and bulletins, test book 
analyses, and consultation with specialists." To 
include "only those items which discriminate 
adequately between good and poor students and 
which give a satisfactory range and distribu- 
tion of difficulty" as the manual suggests, seems 
desirable, No information or actual data rela- 
tive to these phases is furnished. For an in- 
creasing number of test users, the type and 
sampling of items in these forms wotld be in- 
adequate to give evidence of pupil progress 
toward other goals or outcomes of a physics 
course than those concerned with factual in- 
formation and problem-solving ability. While 
there has been a slight tendency to make use of 
a few diagrams and graphs in the more recent 
forms, few items have been aimed to secure 
evidence of such abilities as to apply principles 
to new situations, to understand or explain the 
operation or functioning of important natural 
phenomena or man-made devices, or to inter- 
pret data from experiments, demonstration, 
tables, and graphs. This suggests the desir- 
ability of supplementing the present forms with 
an additional type of test rather than including 
more and different items in the present ones, 
which obviously have a limited enough sam- 
pling in a test of 40 minutes to give evidence 
relative to basic facts, principles, and problem 
solving. 


It is to be questioned whether the average _ 
physics student can handle the load of approxi- 
mately 85 items on any of these forms of this 
test in a time limit of 40 minutes. The time 
which it takes simply to read and understand 
what the problem is uses up much if not most 
of the time. One has little time to think and 
“figure” and work on even a few items. For 
many students the pressure of this “time race” 
would not allow them to show their true ac- 
complishment. That this actually works out in 
practice is suggested by the “scaled score" 
values given on the keys. The 50-point on the 
scaled scores "corresponds to the expected per- 
formance of an 'average' individual with one 
year of study of the subject tested at the end 
of the 11th grade." The raw scores reported 
for this "average" performance are: Form P, 
21-22; Form Q, 12; Form S, 7. It would seem 
that these tests have too much "top" for the 
time involved, at least. 

The 15-page manual is dated 1940 and obvi- 
ously relates to Forms P and Q only. This 
gives very brief data relative to the construc- 
tion and returns of the Cooperative tests in 
general. The directions for giving and tabulat- 
ing the tests are quite specific and relatively 
easy to follow. The suggestions given for scor- 
ing are particularly pertinent and well pre- 
sented. The manual appears to be carefully 
prepared. In spite of these and other commend- 
able features, however, it will appear quite 
formidable to many test users, and it takes con- 
siderable time to figure out what is relevant and 
what irrevelant for these physics tests. The 
booklet has been prepared to cover all the Co- 
operative tests in all areas. Hence, there are 
numerous references to and special sections on 
certain phases which are not concerned with 
the physics tests. It would be simpler and more 
helpful if a booklet relating only to the tests in 
a particular subject (physics in this case) could 
be supplied. While experienced workers in test- 
ing and guidance would quickly understand and 
follow the directions, explanations, and com- 
ments concerned with such things as scaled 
scores, quartiles, percentiles, standard deviation, 
standard error, and profile charts, many begin- 
ning teachers and those inexperienced in testing 
techniques would be quite confused. This is 
particularly true with respect to the discussion 
of the scaled scores, the use and interpretation 
of which calls for some care, if not experience. 
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. For reviews by Ernest E. Bayles, A. W. 
Hurd, Andrew Longacre, Alvin W. Schindler, 
and Ralph K. Watkins of earlier forms, see 
38:1088 and 40:1608. 


[ 582 ] 

Cooperative Physics Tests for College Students. 
College; 1939-42; Forms C, D, E, F; $3 per 25; 25€ 
per specimen set, postpaid ; 215 (220) minutes ; prepared 
with the cooperation of the Committee on Tests of the 
American Association of Physics Teachers; H. W. 
Farwell, C. J. Lapp, Harvey B. Lemon, Frederic Pal- 
mer, Jr., J. T. Tate, and A. G. Worthing; Cooperative 
Test Service. 


REFERENCES 
M WANN Het Be recedures foie Pia f 
Bee Students With Advanced Standing in Physics.” Am J 

Edgar P. Slack, Professor of Physics, Poly- 
technic Institute of Brooklyn, Brooklyn, New 
York. [Review of Form F.] This test covers the 
subjects taken up in the usual course in general 
physics. The balance among the main subdi- 
visions is well established, and in each sub- 
division the topics selected cover the ground 
with reasonable thoroughness and with proper 
emphasis. In the mechanics test, one question 
deals with the heating of a barometer ; it would 
seem that this question would be better located 
in the test on heat. In the heat test, it would be 
well to include a question dealing with the re- 
lation between heat and mechanical energy; 
such a question might replace one of the five 
that deal with change of state. In the sound 
test, it is likely that if the test were to be pre- 
pared at the present time a question on super- 
sonic waves would be included. In the elec- 
tricity test, it would seem highly desirable to 
include thermionics and even to make it a 
major topic. 

In a few cases the stem might be improved 
for clearness, completness, or correctness. Thus, 
in the mechanics test, one question (Item 22) 
has the statement IC = JD associated with the 
diagram, where it may not be seen, instead of 
including it with the rest of the description. In 
another question (Item 30) on the same test, 
the curve on which the question is based starts 
noticeably off from the origin and might reas- 
onably be rejected by the student for this rea- 
son, In the heat test, one question (Item 17) 
states that when a gas expands into a vacuum 
it does so without change in temperature. This 
should be modified to refer to an ideal gas, as 
featured in the possible answers. In this same 
test another question (Item 27) states that heat 
is generated in the compression of a gas be- 
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cause the average velocity of the molecules is 
increased. I suppose that the change in velocity 
is an accompanying effect but not the cause of 
the heating. In the light test, one question 
(Item 19) uses the term "Newton's rings" in 
what is believed to be an unusual sense. This 
name is commonly given only to the circular 
pattern. In the electricity test, one question 
(Item 35) describes the electrolysis of an aque- 
ous solution of sulfuric acid as yielding hydro- 
gen ions and oxygen ions. Is it not true that in 
such a case there would be sulfate ions or hy- 
droxyl ions rather than oxygen ions? 

A few opportunities were overlooked in 
which an incorrect answer might be made very 
tempting. One of the questions in the electricity 
test deals with the current furnished by a cell 
of which the emf and terminal voltage are 
stated. The student who makes a mistake on 
this question will very likely do so by using the 
emf instead of the other value, but the result 
so obtained was not offered as a choice. 

The foregoing matters deal with the details 
only. Basically, the tests are believed to be care- 
fully constructed and useful as an index of the 
students’ achievement. 


For reviews by Paul A. Northrop and Alan 
T. Waterman of earlier forms, see 38 :1089 and 
40: 1609. 

[ 583 ] 
XExamination in Electricit 
College Level. Grade 16; 1946; IBM; Form CE1M- 
1-B-4 (usually called Form B) ; no norms available ; 
separate answer sheets must be used; $2 per 25; 40f 
per 25 machine-scorable answer sheets ; 15¢ per scoring 
key ; 25¢ per specimen set, postpaid; 150(160) minutes ; 
prepared by the Examinations Staff of the United 
States Armed Forces Institute; published by the Amer- 
jcan Council on Education; distributed by Cooperative 
Test Service. (Also distributed by Science Research 
Associates: $2.00 per 35; 65€ per 25 machine-scorable 
answer sheets; 50¢ per key; 50¢ per specimen set.) 


[ 584 ] 

XExamination in Electron Tubes and Circuits. 
Grade 16; 1945; IBM; Form CETC-1-B-4 (usually 
called Form B) ; no norms available; separate answer 
sheets must be used; $2 per 25; 50¢ per 25 machine- 
scorable answer sheets; 30¢ per scoring key; 35¢ per 
specimen set, postpaid; 150) 160) minutes; prepare 

by the Examinations Staff of the United States Armed 
Forces Institute; published by the American Council 
on Education ; distributed by Cooperative Test, Service. 
(Also distributed by Science Research Associates: $2 
per 25; 65¢ per 25 machine-scorable answer sheets ; 50¢ 


per key; 5of per specimen set.) 


[ 585 ] 
XExamination in Physics—High-School Level. 
High school; 1945; IBM; Form SPh-2-B-4 (usually 
called Form B) ; separate answer sheets must be used; 


and Magnetism— 


* 
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$2 per 25; 40¢ per 25 machine-scorable answer sheets; 
15¢ per scoring key; 25¢ per specimen set, postpaid; 
120(125) minutes; prepared by the Examinations Staff 
oi the United States Armed Forces Institute; published 
by the American Council on Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
none answer sheets; 50¢ per key; 50¢ per specimen 
set. 

G. P. Cahoon, Professor of Science Educa- 
tion. The Ohio State University, Columbus, 
Ohio. This test in physics is part of the Armed 
Forces Institute program "for use in reporting 
the educational achievements and status oí 
service personnel to the educational institutions 
from which these men and women may wish to 
secure credit" Civilian forms equivalent to 
those tests which are given to the men and 
women while in the service are published and 
distributed by the American Council on Edu- 
cation for use by high schools and colleges. As 
indicated in the mantial accompanying the tests, 
the "use of the civilian forms by educational 
institutions will enable such institutions to be- 
come familiar with the levels of achievement 
represented by the published norms, to estab- 
lish their own norms, and to relate the norms 
of both types to their own scholastic standards. 
This will facilitate the just evaluation of the 
educational experiences of returning service 
men and women." 

The test is for a two-hour period and is com- 
posed of 96 one-best-answer multiple-choice 
items. It is divided into three parts or "sec- 
tions": 37 items on information; 21 items on 
problems; 38 items on applications of princi- 
ples. Twenty-one diagrams or graphs are uti- 
lized in the total test. 

The test would appear to have high validity 
for the three objectives indicated above. The 
manual indicates that in the construction of 
such a test "the examiners, specialists experi- 
enced both in teaching and in testing in subject 
matter fields, meet with teachers of those sub- 
jects to identify the educational objectives 
which students are expected to attain. These 
objectives are defined as clearly as possible in 
terms of behavior; that is, a student should be 
able to do definite things which are indicative 
of his attainment of each objective. On the 
basis of this definition the kinds of exercises 
which test the student's attainment of a given 
objective are specified. Exercises testing each 
of the objectives are collected in a sufficiently 
large number to provide for two or more forms 
of each test.” The organization of the test into 


sections so that each “objective” can be meas- 
ured and considered is a desirable feature. — 

The time limit of 2 hours for a test of this 
nature is deemed desirable. The reading of 96 
multiple-choice items in such a specialized sub- 
ject as physics is in itself time consuming. 
Since most items are not simply a matter of 
recall but call for judgment and thought, suffi- 
cient time is needed, The present test is in con- 
trast to the somewhat similar ones of the Co- 
operative Test Service, where 85 multiple-choice 
items, including problems, are to be answered 
in 40 minutes—nearly the same number in one 
third the time. 

A feature which many will welcome is the 
emphasis which the test places upon practical 
applications of physics principles. This is evi- 
dent not only in the 4o per cent of the total 
items contained in the section on application 
of principles but also to a considerable extent 
in the nearly 40 per cent of the total items in 
the section on information. The section on 
problems is concerned largely with the usual 
academic textbook-type problem. 

While the stated purposes of most courses 
and texts include as one of the important out- 
comes the ability to apply principles to natural 
phenomena and common devices, relatively f ew 
published tests contain items designed specifi- 
cally to evaluate such an objective. The evi- 
dence concerning the pupils' functional utili- 
zation of subject-matter background and under- 
standing directed to practical application in the 
items of this section should be a desirable sup- 
plement to the usual factual achievement indi- 
cated by most test results. 

Norms are furnished based on about 1,300 
cases in 45 schools. They are given simply in 
raw scores and percentiles. The interpolated 
median raw score for the test reviewed was 45. 
Some definite cautions and excellent sugges- 
tions regarding the interpretation and use of 
the scores on the tests are indicated in the man- 
ual. Stress in the use of the test results is 
rightly related to local institutions, local stand- 
ards, local grading, diagnosis, guidance, and 
placement of individuals. 


Palmer O. Johnson, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota. This examination in high school physics 
is the civilian form of the examination pre- 
pared by the United States Armed Forces In- 
stitute. This form of the examination purports 


hia poen 


603 | 


TESTS & REVIEWS: SCIENCE—PHYSICS 


587 


to enable an institution to evaluate accurately 
* the educational experiences of returning service 
men and women. 

The examination requires two hours for ad- 
ministration. The type of test item used is the 
5-response multiple-choice. The test may be 
used with either a hand-scored or a machine- 
scored answer sheet. 

The examination is organized into three sec- 
tions so as to secure separate measures on each 
of three objectives: I, Information; II, Prob- 
lems; and III, Application of Principles. The 
traditional content organization of physics is 
followed, and the following classification of 
test items shows the representation among the 
five topics and three objectives. 


TOPIC SECT.I SECT. II SECT. 111 TOTAL 
Mechanics . . 12 8 9 29 
Light 5 2 9 16 
Heat . 9 4 6 19 
Sound | 1 1 6 8 
Electricity 

and Magnetism ....... 10 6 8 24 

Total ... 37 ar 38 96 


Whether this distribution strikes a proper bal- 
ance among topics and objectives is not estab- 
lished. The question of the most efficient samp- 
ling has apparently not received other than 
incidental consideration in the design of the 
examination. This problem was apparently left 
to the critics of the examination, who were to 
judge its adequacy with respect to compre- 
hensiveness of coverage in both content and 
objectives. Why sound, for example, should 
receive a sampling of only one item each in the 
informational and problem sections is more 
likely explained as an accident than on a prin- 
ciple of sampling of test situations designed to 
secure representativeness among the various 
topics. 

The indirect or interpretative approach has 
been used in testing the student’s knowledge of 
common terms, definitions, and principles. This 
practice should reduce the bias for a particular 
text or texts often resulting from requiring the 
student to select a correct definition or state- 
ment from a set phrased in the language ofa 
textbook. Evidence of considerable resource- 
fulness is indicated by the generally high qual- 
ity of the problems and test situations requiring 
applications of principles. Unusually few of 
the test items seem to deal with trivialities or 
superficial situations, The distinction between 
interpretative definitions and applications of 
principles is not easily discovered since many 
of the items in the two sections of the test seem 


to be very much alike. This may be due to the 
fact that in a highly developed science such as 
physics definitions are essentially statements of 
empirical laws transformed into definitions. 
Norms are provided, but the failure to de- 
scribe the population upon which they are based 
renders them of little value for comparative 
purposes, The manual urges each institution to 
set up its own norms. This suggestion would 
probably be practical only in the larger high 
schools that have a considerable number of 
service men. The uses of the test can, how- 
ever, be extended to include the estimate of the 
level of achievement of all students in the high 
school physics course. No evidence at all is 
given with respect to the relative or absolute 
accuracy with which the test measures. 


[ 586 ] 
XExamination in Physics: College Level. College; 
1945; 1 form (usually called Form B); 2 parts; no 
norms available; separate answer sheets must be used ; 
40¢ per 25 machine-scorable answer sheets for either 
part; 15€ per scoring key for either part; 25¢ per 
specimen set of either part, postpaid; 180(190) min- 
utes; prepared by the Examinations Staff of the United 
States Armed Forces Institute; published by the Amer- 
ican Council on Education ; distributed by Cooperative 
Test Service. (Also distributed by Science Research 
Associates: $2 per 25 of either part; 65¢ per 25 
machine-scorable answer sheets for either part; 50¢ 
per key; Soé per specimen set.) 
G) SECTIONS I AND II, CONCEPTS AND PRINCIPLES AND 
PROBLEMS. $1.75 per 25; 90(95) minutes. 
b) SECTION III, CRITICAL THINKING IN SCIENCE. $2 per 
25; 90(95) minutes. 

[ 587 ] 
Iowa Placement Examinations: Physics Aptitude. 
Grades 12-13; 1925-44; IBM; 2 editions; G. D. Stod- 
dard and C. J. Lappy Bureau of Educational Research 
and Service, State University of Iowa. 
a) SERIES PAI, REVISED. 1925; Forms A, B; $4 per 100; 
50¢ per specimen set including the other r0 tests in the 
series, postpaid; 50(55) minutes; prepared under the 
direction of C. E. Seashore and G. M. Ruch. 
b) NEW SERIES PA-2, REVISED. 1925-44; also called 
Quick-Scoring Edition; IBM; Form M; separate an- 
swer sheets need not be used; $1.15 per 25; 30¢ per 
specimen set including the other 4 tests in the series, 
postpaid; 50(55) minutes; revised by C. J. Lapp and 


D. B. Stuit. 

REFERENCES 
.1. Stopparp, Gzonc Dinsmore. Iowa Placement Examina- 
tions. University of Iowa, Studies in Education, Vol. 3, No. 2. 
Towa City, Iowa: the University, 1925. Pp. 103. Paper. $1.00. * 
Romer M. “The Predictive Value of the Iowa 
Physics ai ide Placement Test.” J Appl Psychol 11:381-4 


Ache 
(PA 15:4421) 

Robert M. W. Travers, Associate Professor 
of Education, University of Michigan, Ann 
Arbor, Michigan. Form M of the Iowa Place- 
ment Examinations replaces the earlier edition 
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of the series. The physics aptitude test has 
undergone considerable revision, and much ef- 
fort seems to have been devoted to the analysis 
of the skills required in mastering a first course 
in college physics. The test has a reliability of 
4915, and, according to the manual, this kind 
of examination correlates between .61 and .72 
with success in certain academic courses. How- 
ever, little specific information is supplied in 
the manual about the validity of the test in 
specific situations. 

The test is divided into four parts. The first 
part is a test of formal high school algebra. 
"The second measures the student's skill in con- 
verting verbal material into mathematical sym- 
bols. The third section, which is described as a 
reading test, is virtually a measure of the stu- 
dent's skill in handling verbal material which 
is interspersed with mathematical symbols ; and 
the fourth part samples the student's informa- 
tion in the area of elementary physics. 
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[ 588 ] 


Physics: Every Pupil Test. High school; 1946-47; 
new form usually issued each April and December; 
forms December 1946, April 1947; 2¢ per test; 1¢ per 
scoring key; 40(45) minutes; Darwin J. Kimble; Ohio 
Scholarship Tests, Ohio State Department of Educa- 
tion. 


[ 589 ] 


Wisconsin Physics Test. High school ; 1937-39; 
Forms A, B; Form A has the title Wisconsin Achieve- 
ment Tests: Physics; $1.25 per 25; Form A, 50(55) 
minutes; Form B, 60(65) minutes; prepared by the 
Bureau of Guidance and Records, School of Educa- 
tion, and the Wisconsin Physics Test Committee : W. P. 
Clarke, Ira C. Davis, A. H. Gould, Jerome Herreid, 
Charles Horwitz, M. J. W. Phillips, Severn Rinkob, 
F. W. Schuler, R. J. Suchy, J. H. Thorngate, and C. H. 
Walter; manual by T. L. Torgerson and G. J. Froeh- 
lich; E. M. Hale and Co. 


REFERENCES 


1, Weaver, J. F. “The Distribution of Emphasis in Ten 
Physics Tests and in Twelve Physics Textbooks." J Ed Res 39: 
42-55 S "45. * (PA 20:571) 


For a review by Louis M. Heil, see 40:1613. 


SOCIAL STUDIES 


Reviews nv Dorothy C. Adkins, Howard R. Anderson, Harry D. Berg, W. H. Cartwright, 

Robert. 4. Davis, Elaine Forsyth, J. R. Gerberich, Lavone A. Hanna, C. Robert Pace, Roy 

A. Price, Edwin H. Reeder, Frederick H. Stutz, Wallace W. Taylor, Agatha Townsend, 
Marguerite Uttley, Edgar B. Wesley, Ray G. Wood. 


[ 590 ] 
XAmerican History—Government—Problems of 
Democracy: Acorn Achievement Tests. Grades 9- 
16; 1942-44; Forms A, B ; $2.00 per 25; 15¢ per speci- 
men set; 40(45) minutes; Vincent McGarrett; Acorn 
Publishing Co. 


Howard R. Anderson, Specialist for Social 
Sciences, U. S. Office of Education, Washing- 
ton, D. C. Each form of this test includes 100 
four-response multiple-choice questions grouped 
under five section headings: Part I, Growth of 
a National Spirit; Part II, The Growth of 
Democracy; Part III, The Constitution; Part 
IV, Foreign Policy; and Part V, Problems of 
Democracy. The author is Mr. Vincent McGar- 
rett, Chairman, Social Studies, Andrew Jack- 
son High School, New York City. 

The following purposes of this test are given 
in the Teachers’ Guide: *(1) to measure . . . 
mastery of the content of standard courses in 
American History, Government and Problems, 
(2) to indicate those areas . . . in which re- 


medial teaching may be necessary, (3) to test 
the student's grasp of the bases of America's 
democratic way of life, and (4) to measure the 
student's ability to make sound inferences based 
on the relationships among historical person- 
ages and events." 

The author, in discussing validity, states, “In 
the preparation of the test fifty or more items 
were evaluated for each section . . . The events 
measured . . . are common to American history 
syllabi in all sections of the United States. . . . 
The results correspond very closely to those 
obtained in the American History examination 
given by the University of the State of New 
York. The administration of this test in ad- 
vance of official (Regents’?) examinations in 
American History will reveal with high fidelity 
those students who . . . will pass or fail. . . .” 

The Teachers’ Guide gives the reliability co- 
efficient of the test as .94 when “obtained by 
the odd and even scores and the Spearman- 
Brown formula." On the class record form 


— 


-—— 
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. norms are provided on a month by month basis 
for grades 8 to 14. Norms for the five parts 
of the test are also provided for grades 9 to 
14. These norms “are based on trials which 
included 5,120 students in various sections . . ." 

This reviewer doubts that this test can be 
considered a satisfactory test for "standard 
courses in American History, Government and 
Problems." In some states these courses are 
fused in a two-year sequence in grades 11 and 
12, but in other states the practice is to allocate 
two, one, and two semesters, respectively, to 
the three courses in question. This reviewer 
would classify 60 items as primarily American 
history (Parts I, II, and IV) and 20 items 
each as government (Part III) and problems 
of democracy (Part V). 

The forms need revision to eliminate “dated” 
items, such as “An outspoken supporter of 
American aid to democracies at war with ag- 
gressor nations” and “The chief purpose of the 
United States in fighting the Axis.” 

The right answer to some of the questions is 
difficult to determine: “The Latin American 
nations which hesitated the longest to join the 
Axis are: (a) Brazil and Uruguay, (0) Mexico 
and Guatemala, (e) Argentina and Chile, (c) 
Peru and Colombia” and “Which of these is 
generally not a cause for crime? (f) the auto- 
mobile, (k) divorce, (c) poverty (i) voca- 
tional training." 

The peculiar sequence of letters in the re- 
sponses to test items results from the fact that 
pupils are told to underline the right response 
to each item and the teacher is given a “code 
word” to help him score the tests quickly. 


[ 591 ] 
XCooperative Community Affairs Test. Grades 9- 
12; 1941; Form R; $1.75 per 25; 25€ per specimen set, 
postpaid; 30(35) minutes; Roy A. Price and Robert F. 
Steadman; Cooperative Test Service. 


REFERENCES 


W. H. Cartwright, Assistant Professor of 
Education, Boston University, Boston, Massa- 
chusetts. This test deserves to be widely used. 
It is suited for use in any community. It is par- 
ticularly adaptable to courses in community 
civics but can well be used in local history, 
American history, modern problems, or any 
other courses in the social studies curriculum 


in which special attention is given to com- 
munity study. 

The test consists of 40 multiple-choice and 
40 modified true-false items. The publishers 
provide no information regarding the validity 
or reliability of the test. 

The items call for specific information about 
many phases of the community, including ge- 
ography, government, population make-up, em- 
ployment, health, welfare, etc. Since the an- 
swers vary from community to community, 
they are not to be found in ordinary textbooks. 
Hence, neither scoring keys nor norms are 
available. This means that the teacher will have 
to construct his own key to fit his own com- 
munity. While constructing a key will require 
some effort, it should not tax the teacher who 
is giving adequate treatment to community af- 
fairs. This local adaptation will probably lend 
more validity to the test for use in the local 
situation. In those cases where making a key 
does prove a burden to the teacher, the experi- 
ence gained should make him aware of the 
potential value of the community as a source of 
curricular material. 

The test was originally prepared in connec- 
tion with the Regents Inquiry into the Cost 
and Character of Public Education in New 
York State but has since been revised to make 
it suitable for use anywhere in the country. An 
account of the circumstances surrounding its 
original construction, as well as a very helpful 
discussion of the problems involved in testing 
for community information, may be found in 
the Ninth Yearbook of the National Council 
for the Social Studies (1). 


J. R. Gerberich, Director, Bureau of Educa- 
tional Research and Service, and Professor of 
Education, University of Connecticut, Storrs, 
Connecticut. The Cooperative Community Af- 
fairs Test is the only standardized test of its 
type known to the reviewer. No scoring key 
is supplied, for correct answers to its recog- 
nition items necessarily depend upon such fac- 
tors as the location of the community, its 
population, its cultural offerings, its crime rate, 
and other local characteristics. Individual dif- 
ferences are probably as great among communi- 
ties as among human beings, so no scaled scores 
and consequently no norms are or conceivably 
could be provided. Part I consists of 40 four- 
response multiple-choice items and Part II in- 
cludes 40 items requesting yes, no, or inde- 
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terminate answers. The scoring formula of 
i= x for the entire 8o items appears slightly, 


although not seriously, to overcorrect for 
chance. The test format is excellent. 

Items in Part I deal primarily with such 
characteristics of the community as its settle- 
ment, source of name, location, climate, eleva- 
tion, and water supply; such population factors 
as its size, geographical origins, occupational 
pursuits, family size, age, illiteracy, and birth 
and mortality rates; such cultural factors as 
newspapers, recreational areas, library loans, 
churches, and school costs; such financial 
characteristics as wages, total incomes, and 
rental rates; and such crime indices as traffic 
violations, reasons for arrests, and juvenile de- 
linquency. Responses are well chosen and ap- 
pear to provide for any degree of community 
eccentricity in their scope, 

In Part II, 6 grouped items deal with laws 
or rules governing the people; 3 grouped items 
deal with their activities; 19 items in three 
groups are variously concerned with commu- 
nity institutions, practices, and services; 4 
items in a sixth group compare incomes and 
taxes of today with those of five years ago; 
and 8 items in a seventh group compare the 
community with the United States in general, 
A very few items in this part appear unlikely 
to function well in both large urban and rural 
communities, such as those dealing with farm- 
ers' incomes and laws concerning factory sani- 
tation. Although provision is made for re- 
Sponses to groups of items which do not apply 
to the individual community, no unambiguous 
way of responding to such scattered items as 
the above is evident. Complete disregard for 
these few items in the scoring in those communi- 
ties where they seem inapplicable would have 
no more serious effect than to reduce the total 
scoring points by a small number, however. 

It seems possible to the reviewer that the 
test fails to include certain rather important 
criteria of community culture about which high 
school pupils might be expected to have knowl- 
edge and understanding. Among these are book 
ownership and magazine Subscriptions, tele- 
phones, bathtubs, radios, etc., in the home. It 
may be, however, that the limitation of pupil 
experience to homes essentially at one eco- 
nomic or social level would to some degree in- 
validate such items, It is possible also that cer- 
tain types of individual differences, such as sex 
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and occupational, might advantageously receive * 


somewhat more emphasis. 

The reviewer is impressed with the values to 
be obtained from the use of this instrument 
with high school pupils as a measuring instru- 
ment, as a teaching device, or both. Its validity 
for measuring knowledge and understanding 
of one's community seems evident. Further- 
more, the test demands upon the part of some 
school official in any community which uses it 
a worth-while experience in the development 
of community understanding, for it would be 
a rare individual indeed who could correctly 
key the test for his community without rather 
extensive research. 


Lavone A. Hanna, Associate Professor of 
Education, San Francisco State College, San 
Francisco, California. This test consists of two 
parts. The 40 items in Part I are of the 
multiple-choice type as, for example, “The 
average annual family income in my commu- 
nity is ( ) less than $600, ( ) $600 to $1200, 
( ) $1200 to $2000, ( ) $2000 or more.” Part 
II consists of seven incomplete statements fol- 
lowed by several clauses or phrases, each of 
which is to be checked according to whether it 
is correct (Y), incorrect (N), or very doubt- 
ful or unknown (?). 

The test was originally constructed to be 
used in the New York Regents' Inquiry and 
was revised to make it suitable for nation-wide 
use. The questions call chiefly for factual in- 
formation about a community, and some of the 
items need to be revised in light of recent 
trends in employment and living costs. The test 
has value as a teaching device in introducing 
a unit on community life or as an end-test at 
the close of such a unit. No key is supplied, as 
the answers depend upon the local community. 

No statistical data as to the reliability or 
validity of the test are given, and there are no 
norms or scaled scores. Although it is intended 


that the answers be written in the test booklet, 


standard answer sheets could be used if ma- 
chine scoring is preferred. 


[ 592 ] 
XCooperative Social Studies Test for Grades 7, 8, 
and 9. Grades 7-9; 1941-47; IBM; Forms R, S, X; 
separate answer slieets need not be used ; $2 per 25; 25€ 
Per specimen set, postpaid 3 40¢ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 80(85) minutes; Form R: Agatha Townsend 
and Mary Willis; Form S: Mary Willis; Form X: 
Hany D. Berg and Elaine Forsyth; Cooperative Test 
ervice, 
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Robert A. Davis, Professor of Education, 
University of Colorado, Boulder, Colorado. 
[Review of Forms R and S.] The purpose of 
this test is to provide a survey of the pupil's 
information and knowledge in the broad field 
of the social sciences. Where the maturity of 
the student permits, the test provides a con- 
venient and valid measure of his background 
of information and knowledge in this field. 

Form R consists of three parts: Part I, 
Facts, Skills, and Applications; Part IT, Terms 
and Concepts; and Part III, Comprehension 
and Interpretation. The total time for admin- 
istration is 80 minutes distributed as follows: 
Part I, 40 minutes; Part II, 15 minutes; and 
Part III, 25 minutes. Form S, published one 
year later, is similarly constructed but differs 
slightly in that Part I is labeled Informational 
Background. This form consists of the same 
number of items and the same time allotments. 

The items are of the multiple-choice non- 
reasoning type. Almost all items deal with the 
problem of measuring information which the 
learner has obtained from a wide range of 
source materials and experiences. Historical 
materials are made more realistic by occasional 
pictures and graphs, on the basis of which the 
learner selects a response appropriate to the 
illustrative matter presented. The materials for 
the test were gleaned from a wide variety of 
sources by members of the testing committee. 

Range of scores is wide and results tend to 
follow a normal distribution. The test, how- 
ever, is too difficult for some pupils, as shown 
by the relatively small percentage of questions 
answered by groups used in the standardizing 
process. Typical seventh grade students, for 
example, answered correctly about one ques- 
tion in four; ninth grade students answered 
approximately 44 per cent. 

The test places a premium on speed. There 
are 150 items. The typical junior high school 
pupil needs approximately a third of his time 
to read the test. Owing to the difficulty of the 
material and the premium placed on speed, 
some seventh and eighth grade pupils may find 
the test too difficult, For the ninth grade, how- 
ever, the test is more appropriate. The manual 
provides no information regarding validity on 
the basis of detailed item analysis. Reliability 
coefficients are not given, but the large number 
of items should assure high coefficients, par- 
ticularly for the entire test. 

Norms are based on approximately 5,000 


junior high school pupils. Scoring is objective 
and scores are corrected for chance. Items may 
be answered either on the test booklet or on the 
answer sheet, which may be scored by machine. 
The use of the answer sheet is recommended 
since the scoring key for the test booklet is 
somewhat cumbersome. Raw scores may be 
converted to scaled scores. Percentile scores 
for individual students insure easy interpreta- 
tion. 


Edgar B. Wesley, Professor of Education, 
University of Minnesota, Minneapolis, Minne- 
sota, [Review of Forms R and S.] These tests 
each consist of 150 items, 75 being devoted to in- 
formation, 45 to terms, and 30 to comprehen- 
sion. All items are of the multiple-choice type 
with five options in each. Both forms make use 
of maps, charts, and other illustrative material, 
thus insuring a high degree of accuracy in 
testing the skills required in interpreting such 
graphic devices. Somewhat curiously, the ques- 
tions relating to the use of these graphic ma- 
terials appear in the section on information 
rather than under comprehension and interpre- 
tation where they would seem to belong. 

The first parts, devoted to information, deal 
for the most part with significant matters; the 
second sections are clear identifications of 
words; and the third sections are primarily 
reading items. Parts I and III are needlessly 
wordy and even Part IT could be condensed 
and thereby enhance its clarity. In fact, wordi- 
ness is a burden to the pupils, a time consumer, 
and a bar to simple, direct measurement. 

In Form R, Item 75 is unfortunate in its 
implications, for it refers to "free" camping 
grounds in our national parks, whereas any one 
who has ever been to one must have marveled 
at the high charge for admission. Ttem 68 
underscores the false connotation of the word 
“true.” The authors obviously meant "accur- 
ate." Option 4, assuming a good critical author, 
is of course the best answer, but the testmaker 
keyed option 3. In Part TI, Item 8 perpetuates 
the confusing and needless word “barter,” 
which in spite of all pedagogical efforts is and 
will stay dead. 

In Form S, Item 9 asks how most Ameri- 
cans made their living in 1840. It is a pointless 
question from the standpoint of achievement; 
it might have validity in an intelligence test. 
Item 48 is probably correct, but it implies that 
city voters have access to information about 
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candidates, an assumption that is contrary to 
fact. In Part II, Item 3, the phrase “freedom 
of the press” is inadequately if not erroneously 
defined. The phrase historically and legally re- 
fers to freedom from govermental restraint. 
Item 15 contains a curious reference to a primi- 
tive country as one that has not developed “its” 
civilization or industries, implying that civiliza- 
tion is there like a mineral ready to be ex- 
ploited. Item 38 says the King of England is 
the “sovereign” and so he is, but the “prime 
minister,” which is one of the options, exer- 
cises sovereignty. The item is unfortunate in 
its teaching if it teaches anything. 

In spite of these minor criticisms, however, 
both forms of this test are excellent. They de- 
serve wide use, for they are vastly superior to 
the typical test devised by the classroom teacher. 


[ 593 ] 
Illinois Teachers College Cooperative Social Sci- 
ence Test, 1942 Edition. College; 1938-42; 1 form; 
10-99 copies, 714¢ per test; 10¢ per single copy; 15¢ per 
manual, postpaid; 40(45) minutes; John A. Kinneman 
and Clarence Orr; McKnight & McKnight. 


Harry D, Berg, Assistant Professor, Board 
of Examinations, Michigan State College, East 
Lansing, Michigan. This test “has been de- 
veloped as the result of a need for a compre- 
hensive test in the social sciences which can be 
administered within a reasonable period of 
time. . . . The test should prove helpful to 
teachers in senior high schools and to instruc- 
tors of college freshmen classes who want to 
make an evaluation of student accomplish- 
ments.” This is the rather vague fashion in 
which the publication of this particular test is 
justified. Whatever may be the express or im- 
plied purpose, about the best that can be said 
of the test is that it can be “administered 
within a reasonable period of time.” Appar- 
ently the examination is intended to serve as a 
measure of general proficiency in the social sci- 
ences and as a placement device. If so, it is this 
reviewer’s belief that it falls far short of its 
purpose. For the most part the test is made up 
of highly factual “popgun” matching and mul- 
tiple-choice items which have been drawn with 
no apparent plan from the fields of govern- 
ment, economics, and history. The item “The 
last religious rites for the dying Catholic is: 
a. baptism b. celibacy c. Holy Eucharist d. ex- 

treme unction” is typical. Many of the items 
violate the most elementary rules of test con- 
struction, particularly those with regard to clues, 


verbalism, and homogeneity of foils. The whole 
test gives evidence of hasty construction. This 
is evident even in the matter of proofreading— 
at least it would be kindness to account for the 
spelling of “counterfeiting” as “counterfitting” 
in this manner. 

The test is divided into several parts, namely; 
definition of terms, classification of names, 
sequence of events, recognition of facts, number 
relationships, identification of persons, and iden- 
tification of terms. Such a classification of items 
is of little value for diagnostic purposes since 
the nature of the content covered in each classifi- 
cation is basically similar. The authors state that 
“the validity of the items and the reliability of 
the test have been determined by the chance- 
halves techniques.” The reliability of the test is 
.94 but how the validity of the test can be deter- 
mined in this manner and what it is are obviously 
not explained. Before purchasing a test for this 
intended purpose it would be well for the pros- 
pective user to examine the generally superior 
Cooperative Test Service tests. 


[ 594 ] 
Social Studies Test: National Achievement Tests. 
Grades 4-6, 7-9; 1937-45; Forms A, B; $1.75 per 25; 
15¢ per specimen set ; 35(40) minutes ; Robert K. Speer 
and Samuel Smith; Acorn Publishing Co. 


Ray G. Wood, Director, Ohio Scholarship 
Tests and Instructional Research, State De- 
partment of Education, Columbus, Ohio. Since 
no manual accompanies this test, little or no in- 
formation is had on the method of validation, 
reliability, or the specific purpose this test is to 
serve, 

GRADES 4-6, The items of the two forms are 
apparently parallel, measuring the same func- 
tions. The norms, which are only the medians 
of the total scores, are reported for each month 
for grades 3.1 through 8.1. 

These medians give helpful data for inter- 
grade comparison of the total scores, but this 
reviewer believes that this test would be much 
more useful if norms were available for each 
of the five parts of this test and if several per- 
centile values were given. 

Part I, Human Relations, Part IT, Life Situ- 
ations, and Part III, Social Problems, test the 
student's attitude on several basic social prob- 
lems by thought-type questions, while other 
general information is tested by objective-fact 
items. The results on these three parts of the 
test will depend largely upon the background 
students have in the fundamental principles of 
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good citizenship. For example, a typical item 
of Part I is: "The right of free speech (i) 
means the right to insult the President (o) 
permits citizens to express political opinions 
(q) belongs only to labor unions." The pupils" 
reaction to this type of item will be governed 
by his sense of propriety as well as by his con- 
cept of the place and use of free speech in a 
democracy. A citizen will not necessarily be 
fined or jailed if he "insults the President," 
since there is no law against such action. The 
accepted principle of respect for the chief ex- 
ecutive of these United States would be the 
basis for the individual's reactions to such 
items. A typical item of Part II is: “Clothing 
that keeps one warm in winter climates is 
usually made of (m) silk (z) cotton (t) wool." 
The validity of this type of item in a social stud- 
ies test, standardized on a nation-wide scale, is 
questionable since it is purely factual. If a 
pupil lives in a warm winter climate, he may 
give “silk” or “cotton” for the answer and be 
correct as governed by his environment. The 
answer “wool” which is the one expected will 
be given by those who live in a cold winter 
climate. 

Part IV, Products and Peoples, and Part V, 
The Meaning of Events, are made up of infor- 
mation items on political geography, history, 
and the Constitution. 

Both forms of this test are scored by one 
key word. This is a unique scoring technique, 
but it could easily destroy the validity of the 
test results if pupils obtain this “key word.” 

GRADES 7-9. The objectives tested are: Part 
I, Human Relations; Part II, Life Situations: 
Part III, Social Interpretations; Part IV, 
Value of Products; Part V, Social Ideas; and 
Part VI, Miscellaneous Facts. 

It is noted that the objectives tested in Parts 
I and II are the same as those tested in Parts 
I and II of the test for grades 4-6. The num- 
ber and sampling of items for these two parts 
of this test and those for the lower grades are 
neither parallel nor comparable. If the two 
tests would have been parallel for these im- 
portant objectives, growth or changes in ideals, 
attitudes, and information of pupils in the 
higher grades could have been compared with 
those of pupils in the lower grades in the par- 
ticular community or area being tested. 

An achievement test must be evaluated not 
only in terms of breadth of content, but it 
should also be judged by the kind of reaction 


students make in answering the test items. 
While many of the items of this test evaluate 
the students' reaction to fundamental social 
concepts, many are limited to simple recall. 
For example, Item 7, Part I, Form B reads: 
"Among the following, the country that de- 
pends most upon its colonies for raw materials 
is (f) Poland, (g) United States, (o) Eng- 
land, (w) Turkey." By process of elimination 
and the knowledge that England has a great 
merchant marine, the student would be able to 
answer this item. Item II, Part VI, Form A, 
reads: “A great rubber product manufacturing 
city is (o) Cleveland, (a) Akron, (q) Pitts- 
burgh, (u) Denver." These are typical of some 
of the rather superficial items appearing in the 
two forms of this achievement test. 

Typical of the items of this test that test for 
broader social concepts is Item 8, Part II, 
Form A, which reads: “When discussing social 
problems with a friend, you should consider 
his ideas liberal if he (b) wishes mainly to pre- 
serve the status quo (things as they are), (t) 
advocates the wider distribution of wealth, (z) 
wishes to give people jobs because of their race 
or religion, (p) attacks the right to strike and 
defends the right to work.” The student's re- 
action to this type of social studies item is 
much more meaningful for interpretation of a 
social consciousness than being able to recall 
the name of a “great rubber products manu- 
facturing city." 

Five of the six parts of this test are scored 
with one key word. This is an easy way to 
make and provide a key, but it would be disas- 
trous to the validity of the results were the 
students to get information on this key word. 
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*Stanford Achievement Test: Social Studies 
Tests. Grades 4-9; 1940-42 ; an adaptation of the social 
studies sections of the Stanford Achievement Test; 
IBM; Forms DM, EM; separate answer sheets must 
be used ; $1.90 per 25; 70¢ per 25 machine-scorable an- 
swer sheets; 20¢ per key; 35¢ per specimen set, post- 
paid; 30(35) minutes; Truman L. Kelley, Giles M. 
Ruch, and Lewis M. Terman; World Book Co. 


Ray G. Wood, Director, Ohio Scholarship 
Tests and Instructional Research, State De- 
partment of Education, Columbus, Ohio. These 
tests are carefully constructed examinations 
and represent much time and thought on the 
part of the authors. Forms DM and EM are 
machine-scorable revisions of the social studies 
tests in the Stanford Achievement Tests, Forms 
D and E. The authors have spared no effort in 
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studying results of administering Forms D and 
E and in revising them on the basis of the data 
obtained. Forms DM and EM are among the 
best social studies tests available, according to 
this reviewer's judgment. 

It does seem, however, that Forms DM and 
EM would be of more use to the social studies 
teacher and the pupil concerned if the items of 
these forms were organized under specific ob- 
jectives rather than in the present “scrambled” 
form. It would be better to have Test I, History, 
and Test II, Geography, deal specifically with 
history and geography respectively and then to 
have a Test III, General Social Studies, cover- 
ing the other areas of the social studies rather 
than to have the items of these general social 
subjects interspersed within the two special 
subjects. This would necessitate scaling the 
iteras by specific objectives rather than by the 
test as a whole. 

The 70 items of each of the two forms of 
this social studies test represent a sampling of 
a great amount of information. Most of the 
items are factual seeking, rather than problem 
solving or thought provoking. While the items 
of these two forms have been carefully and ex- 
cellently selected, a truer and better evaluation 
of a student's social studies achievement would 
be had if his reactions to practical, current so- 
cial problems were tested as fully as his knowl- 
edge of the factors requiri.g "facts and 
memory." 


For reviews by Walter W. Cook and Ralph 
C. Preston of the entire battery, see 18. 


[ 596 ] 

Test of General Proficiency in the Field of So- 
cial Studies: Cooperative General Achievement 
Tests, Revised Series, Test 1. Grades 10-12 and col- 
lege entrants; 1942-47 ; IBM; Forms S, T, X ; separate 
answer sheets need not be used; $2 per 25; 256 per 
specimen set, postpaid; 40g per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 40(45) minutes; Forms S and T: Mary Willis, 
Charlotte W. Croon (Form S), and the Committee on 
Social Studies Tests of the Educational Records Bu- 
reau: Ronald Beasley, Charles K. Cummings, Jr., 
W. Ainsworth Greene, Margaret Hastings, and Elsie 
Resor; Form X: Jeanne M. Bradford; Cooperative 
Test Service. 


Harry D. Berg, Assistant Professor, Board 
of Examinations, Michigan State College, East 
Lansing, Michigan. [Review of Forms S and 
T.] The purpose of this particular test is 
to measure general proficiency in the social 
studies at the high school and college freshman 
level for diagnostic and placement purposes. 


With this intent in mind it is natural that the 
publishers should'wish to claim that power and 
the achievement of general objectives in the 
whole field are measured rather than the acqui- 
sition of specific information. The task of con- 
structing such a test is admittedly great be- 
cause of the problems involved in establishing 
the nature of aptitude and defining general ob- 
jectives in the social studies, and then con- 
structing items which will test for these. It is 
this reviewer’s belief that the authors have suc- 
ceeded reasonably well. 

Part I, on terms and concepts, is essentially 
a test of social studies vocabulary and is drawn 
from the fields of government, history, eco- 
nomics, and geography. Since any of the items 
can be easily classified within any one of these 
fields, one is led to question the claim that gen- 
eral objectives are being tested for. It is true, 
however, that the selection was carefully made, 
that textbook definitions have been avoided, 
and that the terms and concepts are generally 
those which are useful in the daily life of citi- 
zens and for future work in the field of the 
social studies. It may be of value to know that 
the items were based on studies made by Kelley 
and Krey (Tests and Measurements in the So- 
cial Sciences) and that the difficulty of words 
was checked by reference to Thorndike's Teach- 
er's Word Book of Twenty Thousand Words. 

It is evidently in Part II, which deals with 
comprehension and interpretation, that the au- 
thors feel that power or aptitude is being meas- 
ured. These items are concerned with para- 
graph comprehension and the interpretation of 
maps, graphs, and cartoons, and are essentially 
of a basic skills nature. Pupils are asked to 
generalize, to discern motives, and to draw 
conclusions. The items in this part are of the 
multiple-choice type commonly used for this 
purpose, but since no provision is made for 
separate scores on the various skills involved, 
the diagnostic value of the test is lessened. 

As is the case with most Cooperative tests, 
extensive tryouts were given before the ex- 
aminations were issued for general use. Norms 
in terms of percentiles and scaled scores are 
furnished. These are based on a large number 
of cases and are differentiated on a basis of 
high school year-levels so that growth may be 
measured. 


For a review by John V. McQuitty of the 
complete battery, see 3. 


611 ] TESTS & REVIEWS: SOCIAL STUDIES—GEOGRAPHY 598 


ECONOMICS 


[597-1 
Cooperative Economics Test. High school 
lege; 1939-42; IBM; Forms P, pee AR 
sheets need not be used; $1.75 per 25; 25¢ per specimen 
set, postpaid; 40€ per 25 machine-scorable answer 
sheets; 15¢ per stencil for scoring answer sheets; 
40(45) minutes; Form P: H. R. Anderson and J. E. 
Fartngion; Form S: Mary Willis; Cooperative Test 
rvice. 


For a review by Edgar B. Wesley of an earlier 
form, see 40:1624. 


GEOGRAPHY 


[ 598 ] 
Emporia Geography Test. Grades 4-7; 1037; Forms 
A, B ; 75¢ per 25, postpaid; 15¢ per specimen set, post- 
paid; 30(35) minutes; H. E. Schrammel, E. J. Calkins, 
Harold Bechtoldt, Forrest Frease, and LaVerna Whar- 
ton; Bureau of Educational Measurements, Kansas 
State Teachers College of Emporia. 


Edwin H. Reeder, Professor of Education, 
University of Illinois, Urbana, Illinois. The most 
serious problems in either making or evaluating 
tests in geography arise from the same unfortu- 
nate situation, namely, that there is no agree- 
ment among theorists in education or among 
specialists in geography on the question of what 
geography is. There are at least three concep- 
tions of the nature of geography which are cur- 
rent either in theory or in practice. The first of 
these is that the aim of geography is to produce 
students who know place locations, boundaries, 
products of countries and some facts about math- 
ematical geography. From classroom observa- 
tion one might well conclude that this aim is still 
the most clearly operative of any, although no 
theorist would accept it today as embodying the 
most important goals of geography teaching. 

The second point of view is that geography 
is the study of how man and his natural environ- 
ment are interrelated ; in other words, that geog- 
raphy is human ecology. It is probable that this 
conception is held by the majority of theorists 
in elementary and secondary education. The 
third notion is that geography is a study of spe- 
cific regions from the standpoint of the ways in 
which geographical factors are operative in 
them. This view is held by a group of university 
professors of geography. It denies that there are 
any general "principles of geography" and main- 
tains that a statement of geographical relation- 
ships must always be made in terms of a particu- 
lar region. 


It seems obvious that a testmaker in geogra- 
phy should make a test which will be in harmony 
with his point of view of what geography is. But 
when one examines tests in the field, one has 
the feeling that the testmakers were apparently 
not clear in their own minds about the nature of 
the subject or how to test competence in it. That 
is certainly the case with the Emporia Geogra- 
phy Test. In their manual, the authors say, “It 
includes questions of facts, principles, informa- 
tion, causes and effects, map study, and numer- 
ous others." 

The test certainly includes many questions of 
"facts, information and numerous others." In 
these respects the test is comprehensive, and in 
the opinion of the reviewer, the test items are 
well chosen and widely distributed in range of 
geographical location. The test will, therefore, 
be helpful to the teacher who wishes to know 
whether his pupils are acquiring a wide knowl- 
edge of place locations and geographical infor- 
mation. 

Tn that sense, therefore, the test is in harmony 
with the first of the three statements of aims 
enumerated above, but as has already been noted, 
this statement of aims is not in line with pro- 
gressive thinking with respect to the chief goal 
of geography teaching. 

If one may judge from the statement quoted 
above from the manual of directions, the test- 
makers accept in part at least the concept of 
geography as a study of man's interrelationships 
with his natural environment, for they say that 
the test “includes map study” and “causes and 
effects." But the reviewer does not believe that 
ability in these fields is measured by the test. 
Certainly the map questions which constitute 
the first 20 test elements measure map remem- 
bering, not map study. No geographical reason- 
ing is required to answer these questions. 

With respect to “causes and effects,” it would 
be well to examine a sample of a test item which 
is apparently designed to test a child's ability to 
reason. There are a number of these in the larg- 
est section of the test which consists of more than 
70 true-false statements. Here is a sample, Item 
21 from Form B: “A fertile soil makes agricul- 
ture the most important industry of New Eng- 
land.” Let us suppose that a child marks this 
correctly as false. What can one conclude about 
the reason for which he marked it correctly ? 
Was it because he knows that the New England 
States are in a region of old, worn-down moun- 
tains; and because he then reasoned from this 
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fact that agriculture is not a very prosperous 
industry in such a geographical location? Or 
was it because he simply remembered that New 
England is more a manufacturing than an agri- 
cultural region? Does this test element test geo- 
graphical thinking or geographical remember- 
ing? The answer is not clear. Since a teacher 
cannot know why a class of pupils marked a test 
well or poorly, how can he know what to do 
about it in his teaching ? 

A final word may be said which needs no 
elaboration. Many of the test elements are out 
of date. For example, Item 103 on Form A reads, 
“The League of Nations meets in 1. Geneva. 
2. London. 3. The Hague. 4. Paris." If the test 
is to be used today, revisions should be made. 

To summarize, if one wishes to know whether 
the pupils in his class are memorizing a large 
number of place locations and miscellaneous 
geographical information, the Emporia Geogra- 
phy Tests will be useful. If one wishes to know 
whether children are attaining the aim of geog- 
raphy which consists in becoming geographical 
thinkers and reasoners, the test will be of little 
assistance. 


Agatha Townsend, Research Assistant, Edu- 
cational Records Bureau, New York, New York. 
This test will probably be found satisfactory for 
use in schools following a standard curriculum in 
which geography is studied as a separate sub- 
ject. The emphasis is on place and physical 
geography and on factual material in human 
geography. Only about 15 of the 120 items in 
Form A call for the application of principles 
or the exercise of judgment based on inferences 
from geographical information. In view of this 
limitation, the test will probably appeal less to 
schools with a combined social studies program, 
unless the purpose of testing is to measure 
acquisition of facts independent of other objec- 
tives of the program. 

From the more technical standpoint, the test 
has several advantages. Two comparable forms 
are available. Public school percentiles for both 
midyear and end-of-year testing in grades 4 to 7 
are provided. The reliabilities reported by the 
authors indicate that the test should distinguish 
satisfactorily among groups of pupils, though 
they are not so high as seems desirable for indi- 
vidual diagnosis. 

The date of publication of the test (1937) 
should not necessarily be taken as evidence that 
it is too old for use today. In Form A, one item 


referring to the League of Nations and one to 
World War without further identification and 
in Form B, an item dealing with Alsace-Lorraine 
are the only questions whose keyed answers 
might be in some doubt. On the other hand, 
omission of references to air transportation and 
similar developments of increasing current sig- 
nificance may put the test more markedly out 
of line with recent courses of study. 


[ 599 ] 
Fourth Grade Geography Test. Grade 4; 1940; also 
called National Council of Geography Teachers Geog- 
raphy Test; Form A; 8¢ per test; 4¢ per manual; 3¢ 
per class record sheet ; 20¢ per scoring stencils ; 30(35) 
minutes; Zoe A. Thralls, George Miller, and Mar- 
guerite Uttley; McKnight & McKnight. 
REFERENCES 

1. Nationat CouNcrL or Grocrarny TEACHERS, TESTING 
CowaMiTTEE, "Fourth Grade Geography Test." J Geog 39:269:73 
O 4o. * 

Elaine Forsyth, Assistant Professor of Social 
Studies, New York State College for Teachers, 
Albany, New York. The general excellence of 
this test makes it stand out sharply from many 
others in the same field. It has an attractive 
format. The items are varied in type, with yes 
and no responses or multiple choices with from 
three to seven foils. Vocabulary is carefully 
graded. The subject matter tested is common to 
nine widely used textbooks. Adequate infor- 
mation is given on test reliability, discriminat- 
ing power of items, and range of difficulty of 
items. Scoring is done by means of a stencil. 
Preparation of an answer sheet for machine 
scoring might increase the usefulness of the 
test. 

The most important aspect of the test to po- 
tential users is what it attempts to measure. In 
contrast to the more usual questions on iso- 
lated facts, this test attempts to measure the 
child's understanding of relationships in the 
field of natural environmental conditions and 
in the area of man's reactions to these condi- 
tions. It also attempts to measure the child's 
ability to read maps, to read pictures and text, 
and to draw reasonable inferences from them. 

Because of its centering on common subject 
matter and generally acceptable objectives, this 
test should serve as an excellent measure of 
achievement ; because of its careful and thought- 
ful construction, it should prove especially use- 
ful for diagnosis of difficulties. 


Agatha Townsend, Research Assistant, Edu- 
cational Records Bureau, New York, New 
York. This test seems to have a number of 
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. features of format and content to recommend 
it. The quality of the printing is good. The 
pictures are clear; and the maps, of good size 
and well-marked. Schools considering the use 
of the test should judge it carefully in terms 
of the objectives of the teaching done in the 
geography course and in terms of the specific 
outcomes for which they wish to test. This test 
will not measure so efficiently as, for example, 
the Emporia Geography Test or the geography 
section of the Metropolitan Achievement Test, 
the retention of facts of place and physical geog- 
raphy. It is designed to measure the ability of 
pupils to apply factual knowledge to the inter- 
pretation of a reading passage, a picture, or a 
map. The facts required for the performance of 
the tasks include knowledge of climatic condi- 
tions, identification of the main features of re- 
gions of Europe and Africa, and general aware- 
ness of the adaptations made by man in living 
under various climatic and physical conditions. 
In general the test seems well designed to 
measure the aims of most geography teaching 
included in social studies programs, and the 
content of the items is appropriate. Some teach- 
ers, however, might desire more reference to 
industrial development and commerce. 

The authors report information favorable to 
the reliability and validity of the test; the se- 
lection of questions was based on acceptable 
techniques of item analysis. A limitation of the 
usefulness of the test in its present form is the 
lack of published percentile norms and com- 
parable editions for retesting. In addition, 
some doubt may be cast on the effectiveness of 
the test from the point of view of item con- 
struction. No correction for guessing is pro- 
vided, and the inclusion of two-, five-, and 
seven-choice items in a single total score raises 
the question of the relative weight which is 
actually given to each answer. 

The general conclusion of this reviewer is 
that here is a promising test which merits fur- 
ther development. The authors might well give 
more attention to test construction techniques 
and attempt to produce forms somewhat more 
homogeneous as to the types of items included 
and the response methods used. Further study 
of the results might also clear up some of the 
questions which occur to the prospective user: 
What is the relation between acquisition of fac- 
tual knowledge and ability to interpret a hy- 
pothetical situation? To what extent does this 
test measure abilities distinct from those meas- 
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ured in a reading comprehension test? Would 
pupils in a traditional geography class do as 
well on a test of this sort as pupils following 
a social studies course? 


[ 600 ] 
Geography Test: National Achievement Tests. 
Grades 6-8; 1936-39; Forms A, B; $1.50 per 25; 15¢ 
per specimen set; nontimed (35-45) minutes; Robert 
K. Speer, Lester D. Crow, and Samuel Smith; Acorn 
Publishing Co. 


Elaine Forsyth, Assistant Professor of So- 
cial Studies, New York State College for 
Teachers, Albany, New York. The purpose of 
these tests, according to the authors, is “to test 
the pupil’s understanding of geographical ideas 
and his mastery of facts.” Of course, there may 
be differences in definition of what constitutes 
a geographic idea, but apparently the authors 
do not include in geographic ideas any con- 
cerning relationships. The items on all four 
forms require recall of fact. The skill factor is 
entirely neglected. There are no items requir- 
ing use of maps, pictures, graphs, or reading 
material. 

The following comments and illustrations 
are based on all four forms. In many items the 
information called for is of questionable value: 
"Tn which of these countries do men kiss each 
other when they meet?” The vocabulary items 
ask for definitions which may or may not be 
of geographic importance: dirigible, gyroscope, 
latex, potash. Other items ask for the uses of 
ostriches, oysters and elephant tusks and the 
location of winter sports, ancient ruins, and 
beautiful paintings. Occasionally questions are 
so clumsily worded as to be difficult to read: 
“A country the vast areas of which make it 
difficult to conquer is ? Actual errors in 
fact are promulgated by such a question as 
this: ^Which country in North America is part 
of Great Britain?" Often incomplete and even 
incorrect generalizations must be marked by 
the child in order to answer the question at all. 

The tests are scored by key and code word. 
There are no answer sheets. A table of grade 
norms is provided, but there is no information 
as to how it was derived. No information is 
given on the reliability of the test or the dis- 
criminating power of the items. 

These tests have little to recommend them 
unless someone needs a measure of ability to 
repeat verbatim isolated, fragmentary, mean- 
ingless tidbits of information of the quiz-pro- 


gram variety. 
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[ 601 ] 
*Tate Economic Geography Test. Grades 9-16; 
1940; Forms A, B; 9oé per 25, postpaid; 15¢ per speci- 
men set, postpaid; 50(55) minutes; D. J. Tateand G. A. 
Buzzard; Bureau of Educational Measurements, Kan- 
sas State Teachers College of Emporia. 


Marguerite Uttley, Associate Professor oj 
Geography, Iowa State Teachers College, Cedar 
Falls, Iowa. This test is designed for high school 
and junior college students. Reliability is stated 
as approximately .go. Percentile norms for the 
high school level are given, but they are based 
upon too few cases to be dependable, Form A 
was given to 259 students in 17 high schools, 
and Form B, to 145 in 11 high schools. The year 
of high school was not indicated. Another evi- 
dence that more scores need to be collected and 
considered is the difference in the median scores 
of the two forms. The items in the two forms of 
the test are well paired, yet the median score 
for Form A is 76 (50 per cent of the total of 
150 items) and that for Form B is 96 (64 per 
cent of a total of 150 items). 

The test has a wide coverage of subject mat- 
ter. Form A includes 139 different place names 
and 68 different commodities. The test items are 
predominantly fact questions. Few of the ques- 
tions require a consideration of such earth con- 
ditions as climate, surface and soil; and no at- 
tempt is made to introduce the important factors 
of density of population per square mile and 
standard of living. There is no provision for 
measuring skill in reading and interpreting 
maps, graphs, or tables of statistics. 

Various types of test items are used. In the 
total of 150 items, 60 are true-false, 30 multiple- 
choice, 30 matching, and 30 names of places indi- 
cated on outline maps. On the whole, the test 
items are free from ambiguity and error. In only 
two cases in the 60 true-false questions do the 
determiners "only" and "entire" appear. One 
true-false question is debatable: Item 5 in Form 
A, “Mexico contains extensive lands suitable to 
agriculture.” The key indicates the statement to 
be true. Mexico does have extensive areas of 
poor grazing lands, but only a small per cent of 
its total area is suitable for plow agriculture. 
Item rog asks for Trinidad’s chief export and 
the answer is asphalt. Asphalt undoubtedly was 
once the chief export, but recently Trinidad has 
exported more than six times as many dollars’ 
worth of sugar as of asphalt. 

The workmanship on the outline maps is 
faulty. Some of the numbers are so placed that 


it is difficult to determine which number indi- 
cates the city and which the country. For exam- : 
ple, Item 5 on the map, Part IV of Form A, is 
named Malay peninsula in the key, but this 
reviewer reads 5 as Singapore and 17 as the 
Malay peninsula. 

With a few corrections and the calculation ot 
norms for specific years of high school and col- 
lege from an adequate number of scores, the test 
can become a valuable instrument for measuring 
the retention of facts of economic geography. 


[ 602 ] 


Wiedefeld-Walther Geography Test. Grades 4-8; 
1931; Forms A, B; $1.70 per 25; 35¢ per specimen set, 
postpaid ; 60(65) minutes; M. Theresa Wiedefeld and 
E. Curt Walther; World Book Co. 


Marguerite Uttley, Associate Professor of 
Geography, Iowa State Teachers College, Cedar 
Falls, Iowa. When the Wiedefeld-Walther 
Geography Test was published in 1931, it was 
an innovation in geography testing. Abilities in 
geography other than the retention of facts are 
measured. To this date the reviewer has found 
no other geography test which is as comprehen- 
sive and as useful fpr diagnosis of pupil diffi- 
culties. 

In the manual the purpose of the test is stated, 
“Part x [Tests I-III] tests the abilities and skills 
needed in the study of geography. Part 2 [Tests 
IV-V] tests the control of geographical infor- 
mation, or knowledge of geographical facts." 

TEST I, READING. Each paragraph is composed 
of four sentences and is followed by a why ques- 
tion. The answer is one of the four sentences, but 
in no case can a child match the wording of the 
question to the wording of the sentence. He must 
use his reasoning powers in order to select the 
right sentence. Test items are carefully arranged 
in order of difficulty. 

TEST II, ORGANIZATION. A set of why ques- 
tions is given. Beneath each are several state- 
ments. The child is directed to select the general 
statement which answers the question. Then he 
is to indicate the statement which does not ex- 
plain the general statement. This is a noble effort 
to test a child's geographic mode of thinking, 
but adults have difficulty understanding the 
directions. Much depends upon the teacher's 
interpretation and how well she explains the 
example to the children. The median scores at- 
tained by various grades on this part of the test 
are considerably lower than on any one of the 
other five sections of the test. 
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TEST III, MAP AND GRAPH READING. Four maps 
of Spain are printed at the top of two facing 
pages: (a) temperature and rainfall for Janu- 
ary, (b) for July, (c) relief and ocean depths, 
and (d) political. Twenty-two completion ques- 
tions call for the reading of these maps. These 
questions cover remarkably well the reading of 
scale, directions, altitude, symbols for cities, 
isotherms, rainfall, latitude, and longitude. It is 
unfortunate that the temperature maps are built 
on data reduced to sea level. A geographer is 
concerned with actual temperatures on the 
plateau of Spain and not with the hypothetical 
map which shows temperatures at sea level. 
However, the sea-level isotherms do not inter- 
fere with the map questions in this test. A ques- 
tion on a cross section, two questions on pic- 
tures, and nineteen questions on graph reading 
complete this section. 

TEST IV, GEOGRAPHY VOCABULARY, Technical 
and semitechnical geographical terms are tested 
not by definition, but by association with particu- 
lar places or with type regions. Included in the 
list are a few foreign words which are relatively 
unimportant in a geographic vocabulary, for 
example, kayak, kufa, mantilla, sirocco, gauchos. 

TEST V, GEOGRAPHICAL RELATIONSHIPS. Three 
unbalanced matching exercises comprise this 
part of the test. In the first column are human 
activity items to be matched with natural en- 
vironmental items from the second column. In 
each exercise the list of natural environmental 
items is longer than that of human activity items. 

TEST VI, PLACE GEOGRAPHY. Four clear outline 
maps have 54 well-chosen places indicated by 
numbers. Where the number refers to a city, a 
dot is plainly marked in the appropriate location. 

It is regrettable that in the years since 1931 
few testmakers in geography have profited by 
the notable pioneering of Wiedefeld and Walther 
and advanced the frontier in the measurement 
of the geographic mode of thinking. 


For reviews by Anna Parsek and Marie E. 
Trost, see 40:1626. 


[ 603 ] 
World Geography Test: Dominion Tests. Grade 8; 
1938; Forms A, B; soé per 25; 23¢ per specimen set; 
specimen sets must be purchased to obtain the manual ; 
30(40) minutes; prepared by the Department of Edu- 
cational Research, Ontario College of Education, Uni- 
versity of Toronto; Vocational uidance Centre. 


Edwin H. Reeder, Professor of Education, 
University of Illinois, Urbana, Illinois. This test 


reveals a lack of clearness with respect to what 
it is supposed to measure. Each of the two forms 
consists of 75 multiple-choice questions, each 
with five choices. The great majority of the test 
elements require pure memory of facts in order 
to answer them correctly. To the extent that a 
teacher wishes to know whether his pupils are 
memorizing large numbers of facts about cli- 
mate, products, boundaries, locations and the 
like, the test will be of use. It should be noted, 
however, that the tests are published in Canada 
and the test questions are rather heavily weighted 
for information about the British Common- 
wealth of Nations. About one third of the items 
deal with parts of this group of nations. 

A very small number of questions in the tests 
under consideration attempt to test geographical 
reasoning. Item 61, Form B, will illustrate this 
attempt. It reads as follows: "Tasmania, unlike 
most of the states of Australia, always has suffi- 
cient rainfall. This is because (1) it is very 
mountainous, (2) it lies in the path of the 
Trades, (3) it is unsheltered by the mainland of 
Australia, (4) it lies in the paths of the Mon- 
soons, (5) it lies in the path of the Westerlies." 
Even in these test elements, however, it seems 
safe to say that remembering facts is an ability 
more required to answer them correctly than is 
reasoning from cause to effect. (For this re- 
viewer's general remarks on geography tests, 


see 598.) 
HISTORY 


[ 604 ] 
Cooperative American History Test. High school 
and college; 1942-47; IBM ; Forms S, T, X ; separate 
answer sheets need not be used; $1.75 per 25; 25¢ per 
specimen set, postpaid; 4o¢ per 25 machine-scoral le 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 40(45) minutes; Forms S and T : Mary Willis; 
Form X: Harry D. Berg; Cooperative Test Service. 
REFERENCES 

1. Linngutst, E. F. “The Form of the American History Ex- 
zminations of the Cooperative Test Service." Ed Rec 12:459:75 
9 a! Tearen, ArtHuR E. “Progressive Methods as Related to 
Knowledge of American History.” Sch & Soc 57:6403 My 29 


V TaAxteR, ArtHUR E. “Some Data on the Cooperative 


A-nerican History Test,” Pp. 46-8. In 1043 Achievement Testing 
Program in Independent Schools and Supplementary Studies. 
Educational Records Bulletin, No. 38. New York: Educational 
Pecords Bureau, June 1943. Pp. xiii, 53. Paper, lithotyped. 
$1.50. 


For reviews by Edgar B. Wesley and Mar- 
garet Willis of earlier forms, see 38:1014 and 
40:1633. 
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Cooperative Modern European History Test. 
High school and college; 1947; IBM; Form X; sepa- 
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rate answer sheets need not be used; $1.75 per 25; 25¢ 
per specimen set, postpaid; 40€ per 25 machine-scorable 
answer sheets; 15¢ per stencil for scoring answer 
sheets; 40(45) minutes; Frederick H. Stutz; Coopera- 
tive Test Service. 


For reviews by Lavone A. Hanna, A. C. 
Krey, and S. P. McCutchen of earlier forms, 
see 38:1016 and 40:1635. 


[ 606 ] 
Cooperative World History Test. High school; 
1947; IBM; Form X; separate answer sheets need not 
be used; $1.75 per 25; 25¢ per Specimen set, postpaid ; 
40¢ per 25 machine-scorable answer sheets; 15¢ per 
stencil for scoring answer sheets; 40(45) minutes; 
Wallace Taylor; Cooperative Test Service. 


For reviews by Kenneth E. Gell and R. M. 
Tryon of earlier forms, see 38: 1017 and 40: 1636. 


[ 607 ] 
*Examination in American History. Grades 11-14; 
1944; IBM; Form SAmH-2-B-4 (usually called Form 
B); separate answer sheets must be used; $2 per 25; 
4o¢ per 25 machine-scorable answer sheets; 15¢ per 
scoring key; 25¢ per specimen set, postpaid ; Xu 
minutes; prepared by the Examinations Staff of the 
United States Armed Forces Institute; published by 
the American Council on. Education; distributed by 
Cooperative Test Service. (Also distributed by Science 
Research Associates: $2 per 25; 65¢ per 25 machine- 
qnie answer sheets; 50¢ per key; 50¢ per specimen 
set. 
REFERENCES 
1. TOWNSEND, AGATHA. “The Reliability and Validity of the 


ment Testing Program MENS Acad P Sean 
per e epi iem nsa NS d, Ner 
Paper, lithotyped. $2.00. * (PA 22:463) pace cA 

Howard R. Anderson, Specialist for Social 
Sciences, U. S. Office of Education, Washing- 
ton, D. C. This examination is one of a number 
prepared by the Examinations Staff of the 
Armed Forces Institute as a means of measur- 
ing the educational achievement of service per- 
sonnel and of reporting it to "educational insti- 
tutions from which these men and women may 
wish to secure credit." 

The test being reviewed is a two-hour ex- 
amination and includes 150 multiple-response 
items. An answer sheet which permits machine 
scoring is provided; but if part scores are de- 
sired for diagnostic purposes, scoring stencils 
must be used. 

The manual describes the procedure followed 
in the construction of these tests. In brief, ex- 
aminers who were skilled both in teaching and 
in testing American history met with teachers 
to identify educational objectives. These were 
defined as behaviors, and appropriate exercises 
constructed. The test materials were tried out 
and revised, and the revised materials criticized 


by content experts before final forms were con- - 
structed. 

The test includes seven parts: Chronology 
(to 1789; 1789-1860; since 1860), Identifica- 
tion, Ideology, Interpretation and Generaliza- 
tion, and Comparison and Contrast. Percentile 
norms for both total and part scores at both 
high school and college levels are provided. 
The manual suggests that these norms are based 
on results obtained with average pupils and 
suggests that institutions which feel that their 
student body is markedly inferior or superior 
may wish to establish their own norms. 

The test items are carefully prepared and 
effectively sample concepts which are usually 
included in general courses in American his- 
tory taught at either the senior high school or 
the junior college level. The test therefore 
should be a valid measure of general achieve- 
ment in this field. 
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XExamination in Modern European History. High 
school and college; 1945; IBM; Form MEH-1-B-4 
(usually called Form B) ; no norms available ; separate 
answer sheets must be used; $2 per 25; 40? per 25 
mathine-scorable answer sheets ; vd per scoring key; 
25¢ per specimen set, postpaid ; 120(125) minutes ; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education ; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates: $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; 50¢ per key; 50¢ per specimen set.) 


Frederick H. Stutz, Assistant Professor of 
Education, Cornell University, Ithaca, New 
York. The purpose of this test is to measure 
achievement in modern European history at the 
high school or college level. A manual accom- 
panying the’ test explains in detail procedures 
used in constructing subject-matter tests, estab- 
lishment of test norms and the evaluation of 
achievement, and directions for administering, 
scoring, and interpreting scores. Because the 
general philosophy of the USAFI achievement 
testing program is well known and because the 
statistical data supporting the tests are available 
to test users, this review will be confined to a 
discussion of the strengths and weaknesses of 
the test itself. ; 

As is true of others in the series, the Examt- 
nation in Modern European History was pro- 
duced by the cooperative work of examiners, 
teachers, and subject-matter specialists. An at- 
tempt is made to measure pupil attainment in 
terms of accepted educational objectives in the 
field. The test consists of 148 items and is to 
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„be administered in a time of two hours. Of the 

148 items, 15 test chronological relationships, 
31 are matching exercises, 41 are of the mul- 
tiple-choice type, 32 are based on keys, and 29 
are concerned with the interpretation of read- 
ing passages. 

In general, the test is well planned and well 
constructed. Among the educational objectives 
measured are knowledge of historical informa- 
tion, understanding of concepts and develop- 
ments, understanding of chronological and of 
cause-effect relationships, ability to read and 
interpret historical passages, and ability to rec- 
ognize historical allusions. The items are rather 
well distributed over the period of modern 
European history. Greatest emphasis is given 
to the rise of national states and the achieve- 
ment of political rights by the peoples of west- 
ern Europe. Most of the items are so con- 
structed that in answering them the pupil gives 
evidence of using or of not being able to use 
the type of thinking which the items are de- 
signed to measure. Of particular value are the 
questions based on historical passages. These 
questions test both the student’s knowledge of 
history and his ability to read and understand 
an appropriate passage. 

The test has certain apparent weaknesses. 
An examination of the 148 items reveals that 
99 require knowledge of political or military 
history, while only 13 are directly concerned 
with economic developments and only 32 deal 
with social, cultural, and intellectual events or 
trends. In the light of modern historical schol- 
arship and of recent testing practice, this dis- 
tribution seems to give too much emphasis to 

` the political and military aspects of history. 
Other shortcomings in the distribution of items 
are evident. Though the test was copyrighted 
in 1945, no coverage is given to events after 
1940 and very light coverage to those of the 
period from 1935 to 1939. French history is 
the subject of 34 items, but developments in 
eastern Europe receive relatively little atten- 
tion. Certain periods of European history, such 
as that from 1815 to 1848 and that of indus- 
trial and scientific changes, are neglected. Cer- 
tain questions, such as those which require 
recognition of and differentiation of the pro- 
grams and basic aims of Chartists, Utopian 
Socialists, and Scientific Socialists, may pre- 
sume an understanding of nineteenth century 
history which most high school students do not 
possess. 


The test is an excellent one. It is well con- 
ceived and well built. It could be strengthened 
by the inclusion of more items dealing with 
economic and social developments and by a 
somewhat better distribution of items over the 
period. 
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XExamination in World History—High-School 
Level. High school; 1945; IBM; Form SWH-1-B-4 
(usually called Form 55 separate answer sheets must 
be used ; $2 per 25; 50¢ per 25 machine-scorable answer 
sheets ; 30¢ per scoring key; 35¢ per specimen set, post- 
paid; 120(125) minutes ; prepared by the Examinations 
Staff of the United States Armed Forces Institute; 
published by the American Council on Education ; dis- 
tributed by Cooperative Test Service. (Also distributed 
by Science Research Associates: $2 per 25; 65¢ per 25 
machine-scorable answer sheets; 50¢ per key; 50¢ per 
specimen set.) 


Dorothy C. Adkins, Chief, Test Development 
Unit, U. S. Civil Service Commission, Wash- 
ington, D. C. This test of 198 items appears to 
be comprehensive in coverage. There is no 
reason for the reviewer, a nonhistorian, to ques-_ 
tion the emphasis on the different phases of 
world history. The chronological ordering of 
the items will appeal to many persons as prefer- 
able to possible alternative arrangements. 

A general manual accompanying examina- 
tions prepared by the United States Armed 
Forces Institute describes the general proce- 
dures used in constructing subject tests. Norms 
determined on the basis of tryouts are made 
available on a separate page or leaflet printed 
for each specific test. Unfortunately such a 
leaflet was not available to this reviewer. No 
mention of reliability determinations is made 
in the general manual, but there seems no rea- 
son to doubt that a test of the length and gen- 
eral characteristics of this one could be dem- 
onstrated‘ to be adequate in this respect. 

The general manual states that the tests are 
so organized that attainment of each objective 
can be measured separately. Possibly this state- 
ment is true of the world history test, but its 
applicability is not apparent to this reviewer. 
It seems likely that this aim was sacrificed in 
order to achieve a more nearly chronological 
ordering. Another statement in the general 
manual is probably not applicable to the world 
history test—that norms are given for part and 
total scores—since the test does not appear to 
be organized in such a way as to yield mean- 
ingful part scores readily. 

The general manual emphasizes that the sub- 
ject tests measure more than memory of facts. 
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"Tt will be observed that numerous exercises 
represent problem situations, involving for their 
successful solution applications of knowledge 
and the functioning of critical reflective think- 
ing." This is probably true of the world history 
test; yet it should be noted that mere inspec- 
tion of an item often does not reveal whether it 
tests factual knowledge or "critical reflective 
thinking." 

One form of item used throughout the test 
may cause annoyance to the subject and, in 
addition, can be criticized from the measure- 
ment point of view. In this type, a question is 
presented and several (usually four) state- 
ments are presented. Each of the statements 
is essentially to be judged as true (applicable) 
or false (inapplicable). The subject does not 
indicate as his response the number(s) or let- 
ter(s) corresponding to the true statement(s) 
but must reveal his answers by a code, such as 
(A) I and II; (B) I and III; (C) I and IV; 
-(D) II and IV; (E) II, III and IV. This 
necessity for coding the response introduces an 
irrelevant and probably irritating factor. Fur- 
ther, there are 16 possible responses if all com- 
binations of the four statements are considered. 
Hence a type of scoring ambiguity is intro- 
duced. It is recognized that this device was 
probably adopted primarily in the interests of 
reducing the chance factor in two-choice items. 
And it may be that the particular combinations 
of responses presented to the subject were se- 
lected on the basis of a statistical analysis of 
the item presented in a different form. This 
possibility gains some credence when it is seen 
that for some items only 4 of the 16 combina- 
tions are presented. This shifting from four 
alternatives to five in itself may be confusing 
to some subjects. A solution to the objection 
from the standpoint of scoring ambiguity alone 
would be to limit the number of statements 
presented for each item to two. Then a four- 
choice code (neither I nor II, I alone, II alone, 
I and IT) would apply. This suggestion still 
does not eliminate the first objection to the 
requirement of coding. Coding could be avoided 
by having the subject check all applicable state- 
ments and then scoring them as one would 
score two-choice items. One might then wish 
to reduce the weight of items of this type. The 
basic data are a series of responses to essen- 
tially two-choice items. Tt is difficult to under- 
stand how the accuracy of measurement is in- 
creased by translating the basic responses 


through a code that does not allow for many: 
possible combinations of responses. 

A similar difficulty occurs in items on the 
chronological order of each event within its 
group of three events. The emphasis on order 
rather than on exact dates is desirable. But the 
common difficulty that arises in scoring ar- 
rangement items enters here. Apparently each 
item correctly indicated as first, second, or 
third within its group contributes one point to 
the score. Yet it may be argued that the per- 
son, for example, who assigns three events order 
321 when their correct order is 123 gets one 
point although he is wrong on all of the rela- 
tionships, while the person who gives the order 
312 gets zero on all three, even though he in- 
dicates correctly that 1 comes before 2. 

There is no entirely satisfactory solution to 
this problem if the tests are to be machine 
scored. The items can be recast into a form 
requiring the subject to indicate which of sev- 
eral events occurred first (or second, third, 
fourth, or fifth) or into a form requiring the 
subject to indicate (by A, B, C, D, or E) 
when a particular event occurred in relation to 
a sequence of (four) other events. It may be 
objected that these devices yield less informa- 
tion about the subject per unit time than the 
method used in the test. If machine scoring is not 
an important consideration, perhaps the best 
solution is to assign scores on each possible com- 
bination of events proportionate to its rank- 
difference correlation with the correct order of 
the events. Such a scheme does not lend itself to 
machine scoring. Another possibility that might 
be considered would be to treat each set of 
three events as a unit, giving a credit of one 
point for the correct sequence on all three and 
no credit for other responses. Although this 
solution would again result in assigning the 
same score (zero in this case) to different de- 
grees of knowledge, it probably would work 
more equitably than the present plan if the 
number of items were large enough. As the test 
stands, some persons may believe that too much 
weight is given to the knowledge involved in 
the sets of items in question, Items 49-63, 83- 
94, and 164-187. Dichotomous scoring of each 
set of three would probably result in better 
weighting for these items. 

Aside from the points made above, the item 
construction is technically very good. The test 
format is excellent and the directions clear. 
Scoring is entirely objective and may be done 
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. either by hand or by machine. The score is sim- 
ply the number of right answers. Not correct- 
ing the scores for chance success is entirely 
appropriate if the time limit of two hours is 
sufficient to allow practically all subjects to 
complete the test. 


Wallace W. Taylor, Professor of Social 
Studies; and Head of the Department of Social 
Studies, Milne School; New York State Col- 
lege for Teachers, Albany, New York. This 
examination, in general, compares favorably 
with the best achievement examinations in 
world history now in existence. The individual 
items are well constructed and are free from 
clues and irrelevant material. 

The allocation of items among the various 
fields of world history leaves something to be 
desired. For example, there are virtually no 
items testing understanding of the Far East 
and Latin America. The examination would he 
appropriate for the type of world history taught 
before the war but hardly valid for the good 
courses being taught at the present time. 

The proportion of items devoted to cultural, 
diplomatic, military, economic, and social his- 
tory seems well balanced for the areas treated 
in the examination, However, the ancient and 
medieval period is stressed at the expense of 
the modern period. For example, 64 of the 198 
questions are devoted to the ancient world 
through the Roman period, and there are only 
25 questions on the period since World War I. 
Unless it is intended that this examination be 
used concurrently with a contemporary affairs 
test, that proportion is surely unwise. 

Probably too great a stress is placed on 
chronological items, as this type comprises 25 
per cent of the total. The time-interval tech- 
nique used in the Iowa Every-Pupil tests is 
more effective as a means of testing the un- 
derstanding of cause-and-result relationships 
than the rearrangement technique used in this 
examination. 

There is also some question as to whether 
the use of three varieties of multiple-choice 
items in addition to the matching, true-false, 
and chronology types does not cause loss of 
time in reading directions and some confusion 
in the minds of students in transferring from 
one tvpe of item to another. 

With many teachers committed to a program 
of teaching basic study skills as an integral 
part of all social studies courses, it seems that 


subject examinations should include some items 
to help them evaluate the success of their work 
in teaching these skills. This examination does 
not include items of that sort. 


[ 610 ] 
Kansas American History Test. First, second se- 
mesters high school ; 1938; Forms A, B; 2 levels; 90¢ 
per 25 of either level, postpaid; 15¢ per specimen set, 


postpaid ; 40(45) minutes; Arthur Hartung, C. Stewart 
Boertman, and H. E. Schrammel; Bureau of Educa- 
tional Measurements, Kansas State Teachers College 
of Emporia. 

a) TEST I. First semester. 

b) TEST 11. Second semester. 


W. H. Cartwright, Assistant Professor of 
Education, Boston University, Boston, Massa- 
chusetts. There are two tests, one covering the 
pre-Civil War period and designed as an 
achievement test for the first semester of high 
school American history and the other con- 
cerning the entire period since 1492. Each test 
has two forms. Each form contains 75 true- 
false items, 25 revers@ multiple-choice items, 35 
matching items, and 15 items, each requiring 
the selection from four events of the one which 
occurred first. Convenient scoring keys are 
provided, as is a table of percentile scores, com- 
puted from 22,087 student scores reported 
from 757 schools. 

The test does not sample enough aspects of 
society. While more than 100 of the 600 items 
require specific knowledge of individuals, one 
looks in vain for the name of an architect, an 
artist, an educator, a labor leader, or a musi- 
cian. None of the authors or poets of the na- 
tional period are mentioned. The only woman 
included is Anne Bradstreet; the only captain 
of industry, John D. Rockefeller; and the only 
inventor, Eli Whitney, although the names of 
“Cooper’s locomotive” and “Fulton’s steam- 
boat” are required. 

Contrary to the statement of the authors that 
Test II “covers chiefly the period from the be- 
ginning of the Civil War to the present,” ap- 
proximately 60 per cent of the items in that 
test are based on the period before 1860. 

Despite the use of reverse multiple-choice 
items, the test does not sufficiently require the 
use of mental processes other than memory. Tt 
is unfortunate that the authors have included 
no questions of the best-answer type, which is 
especially suitable for testing judgment, under- 
standing, and ability to discriminate. 

Half the test is made up of true-false items. 
Such items do not test ability to discriminate 
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or understanding of relationships. Those which 
require judgment are too often controversial. 
For instance, the student is required to believe 
that the colonies were independent before the 
Declaration of Independence was adopted ; that 
the powers of the Second Continental Congress 
were not related to the Declaration of Inde- 
pendence; and that the Indians of the Old 
Southwest did not constitute so great a problem 
as those of the Old Northwest. 

The matching questions are poorly con- 
‘structed from several points of view. The col- 
umns are too long, the response columns con- 
taining from 20 to 31 items. Not only does the 
student waste his time scanning the lists, but 
his selection becomes too much a function of 
the IQ. Half of the matching exercises are 
composed of a wide variety of items. It re- 
quires no knowledge of American history to 
relate "John Smith's account of life in Vir- 
ginia” with “A True Relation of Virginia," 
when the word "Virgirfla" appears only once 
in each column or to select the answer for the 
"first case in which a Congressional act was 
held unconstitutional by the Supreme Court" 
when the only lawsuit listed in the response 
column is “Marbury vs. Madison." 

A few errors of fact have crept into the test. 
Great Britain did not “grant” the colonies their 
independence. The grandfather clauses are not 
a constitutional method of restricting Negro 
suffrage. The Mayflower Compact was not an 
agreement among the Puritans. 

Minor weaknesses exist which might have 
caught the eye of an editor. Occasionally one 
item gives the answer to another. Typographi- 
cal errors are frequent, including at least ten 
` misspelled words in Test I, Form A. 

With these shortcomings, it is doubtful 
whether the test can serve in helping the 
teacher to do more than compare the achieve- 

ment of his students with that of a great num- 

ber of others, 


For a review by Wilbur F. Murra, see 
40:1639. 
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Kansas Modern European History Test. Fi - 
ond semesters high MESE I d Forms AZ 
levels; 9of per 25 of either level, Postpaid; 15¢ per 
poma set, postpaid; 40 (45) minutes; Alvin L. 
asenbank, H. E. Schrammel, and B. A. Hamilton ; 
Bureau of Educational Measurements, Kansas State 
Teachers College of Emporia. 
a) TEST I. First semester ; 1940. 
b) TEST 11. Second semester ; 1938. 
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Frederic H. Stutz, Assistant Professor of . E 
Education, Cornell University, Ithaca, New ` 
York. The test is designed to measure achieve- 
ment in modern European history and is to be 
used in high school and college classes studying 
the subject for one year. Test I covers the 
period from the discovery of America to about 
1850, and Test II covers the period from 1850 
to the present. The publishers indicate that the 
purpose of the test is to measure "the student's 
understanding of significant movements and 
events in relation to their social and economic 
consequences." Though relatively little informa- 
tion is available to the reviewer concerning 
validity, reliability, and interpretation of results, 
the manual of directions explains that the au- 
thors made a detailed study of the content of 
books in modern European and world history 
and that items were checked by teachers, super- 
visors, and test-construction experts. A brief 
statement of test reliability is presented. Per- 
centile norms, derived from 1,095 student scores 
obtained from 44 schools in nationwide testing 
programs, make it possible to translate test 
scores into school grades. 

Test I, to be administered in 40 minutes, 
consists of 37 groups of items containing a total 
of 160 responses. All responses are to be an- 
swered as either true or false. Of the 37 groups, 
8 are concerned with historical information, 9 
with dates, 11 with concepts, and 9 with per- 
sonages. Test II, to be administered in 40 min- 
utes, contains 145 items, of which 100 are of 
the true-false type, 30 of the multiple-choice 
variety, and 15 of the matching type. 

The items written for this test are fairly well 
distributed over the period of modern Euro- 
pean history. Certain of the items are well con- 
structed and measure significant aspects of the 
subject. The test is so constructed that it can 
be easily administered and scored. Though the 
period of history covered by this edition of the 
test stops with the year 1938, it is presumed 
that later editions will bring the test up to date. 

In general, however, the test does not attain 
the standards set by comparable achievement 
tests in this field. Too much attention is given 
to mastery of factual information, while rela- 
tively little emphasis is placed on understanding 
of concepts, developments, chronological and 
cause-effect relationships, etc. The test would 
be a better one if it attempted to measure under- 
standing of history in terms of educational ob- 
jectives rather than of acquisition of historical 
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facts. Over 65 per cent of the total items deal 
with political and military history, and inade- 
quate coverage is given to economic, social, and 
cultural developments. Shortcomings in the 
choice and construction of individual items tend 
to weaken the test. Though true-false questions 
have long been regarded as among the least 
successful types for social studies testing, they 
are used liberally in this test. Many of the 
chronology items test merely the ability to asso- 
ciate a single date with an event or person and 
do not require an understanding of chronological 
relationships. The matching items lack homo- 
geneity and therefore allow the student to select 
correct responses from a rather narrow range 
of choices. In several instances the multiple- 
choice or completion items contain textbook ex- 
pressions or other clues that enable the pupil 
to select the correct response without under- 
standing the reason for his selection. 


For a review by Clinton C. Conrad, see 
40: 1640. 
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*Kniss World History Test. High school; 1940; 
Forms A, B ; $1.75 per 25; 35¢ per specimen set, post- 
S 50(55) minutes; F, Roscoe Kniss; World Book 
O. 


REFERENCES 

1, Kniss, F. Roscoe, The Construction of am Achievement 
Test in Tenth Grade World History. Unpublished doctor’s the- 
sis, Pennsylvania State College, 1937. 


a. Fowrer, Victor M. A Critical Analysis of the Kniss 
World History Test. Unpublished master’s thesis, Ohio Univer- 
s 1945, 


SYA TOA: Pp. so. (Abstracts of Masters’ Theses .. . 
p. 9. 


Dorothy C. Adkins, Chief, Test Develop- 
ment Unit, U. S. Civil Service Commission, 
Washington, D. C. This test of 130 items is pre- 
sented in 6 parts: Factual Knowledge (25 
items), Time Relationships (15 items), Contri- 
butions of the Past (20 items), Cause and Effect 
(20 items), Tying Past and Present Together 
(20 items), and Problems of Life (30 items). It 
is intended to cover the content of high school 
world history courses as commonly given in the 
toth grade. The divisions grew from a survey 
of objectives of world history courses based on 
research in social studies, textbooks, courses of 
study, and opinions of high school teachers. 
There is no obvious one-to-one relationship be- 
tween the six objectives elicited and the six 
divisions, several of which tap several of the 
objectives. Although this is not necessarily a 
criticism of the test, the manner of arriving at 
the divisions might have been elaborated upon 
in the manual. 
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-The test, published in 1940, now seems out of 
date, at least to a person who is not a student 
of world history. In fact, the impression it leaves, 
in so far as timeliness of content is concerned, 
is that it was written very shortly after World 
War I, which, of course, is not identified as such 
in the test. Probably many schools will hesitate 
to use this test until a new revision is issued. 

Some points related to the statistical data 
given in the manual are not entirely clear. Items 
for a preliminary form were gleaned from text- 
books, courses of study, teacher-made tests, fé- 
search studies, and teachers’ judgments. “In all, 
2,927 items were obtained from which the first 
experimental edition of the test was constructed.” 
Unless the 2,927 items constituted the first ex- 
perimental edition, the number of items it con- 
tained is not given. The tetrachoric r of each 
item was computed, based on 210 cases. We are 
not told what the criterion was. Presumably it 
was the score on whatever total number of items 
comprised the preliminary form. We are not 
told what the standard was for deciding when 
to eliminate or revise an “indiscriminatory” item 
nor how many or what proportion of items were 
so treated. 

A second edition was constructed with the 
items arranged by difficulty (presumably as de- 
termined by the percentages of the 210 subjects 
who answered them correctly) and with the 
proportions of items on ancient, medieval, and 
modern history corresponding to the space al- 
lotted by textbooks. It is stated that “the revision 
was administered to 244 tenth-grade pupils com- 
pleting the course in world history, and the 
results indicated not only that the test was valid 
[italics mine] but also that it was a reliable 
measure.” How the validity was determined by 
the results is not indicated. The final form of the 
test is said to be essentially the same as this 
second revision. Does this mean Form A or 
Form B, or does it mean that both were culled 
from the second revision? 

The reported data on test reliability seem to 
be satisfactory. For 150 cases selected at random 
from the “normative population” (apparently 
described later on as consisting of over 1,100 
pupils, although one wonders why so few cases 
were selected), the interform reliability for total 
scores was .90; for the 244 cases taking the 
second experimental edition, the corrected split- 
half coefficient was .95. 

Regarding the comparability of the two forms, 
it is stated that “to insure comparability of con- 
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tent, the items were distributed similarly among - 


the various phases of history covered. . . . The - 
forms were also balanced with regard to the 
difficulty of items, in terms of difficulty values 
determined experimentally. As a final check on 
equality, both forms . . . were administered si- 
multaneously in the same classes, half of the 
pupils taking Form A and the other half Form 
B." The size of the group on whom the difficulty 
values were based is not stated. Presumably the 
group referred to as giving the final check on 
equality was the "normative population" of over 
1,100. It is little short of amazing, then, to 
encounter in the next paragraph a statement to 
the effect that the distributions of total scores 
|... plotted on normal percentile charts revealed that 
Form A scores were five points lower than 
Form B scores throughout the entire range of 
scores. It will not be denied that different dis- 
tributions of item intercorrelations could affect 
the variance and the shape of the distribution of 
-... total scores. Two tests composed of items of the 
same difficulty distribution could not emerge 
‘with means differing by five points except as a 
result of instability in the original index of diffi- 
culty. But if they did differ for a new and larger 
population, would not the normal course be to 
shift items between the forms in order to equate 
-the difficulty? Then the finding that the five- 
point difference is uniform and constant through- 
out the entire range of scores appears inexpli- 
cable to me, unless it is accounted for by the 
presence in the harder form of five items that 
no one answered correctly or by the presence 
~ in the easier form of five items that everyone 
. answered correctly. This hardly seems plausible. 
The table of percentile ranks (Table I) has an 
additional peculiar feature. The entire list of 
percentile ranks for one form is exactly the same 
» as the list for the other, except for this constant 
Wine ime ont difference. That is, raw scores of 5 on 
- Form A and ro on Form B have a percentile 
`- rank of .8, raw scores of 6 on Form A and 11 on 
_ Form B a percentile rank of 1.2, and so on. For 
tests that correlate only .go for the population on 
which the ranks were based, this exact similarity 
would be impossible except by remote coinci- 
dence. 

The writer does not question that the content 
of the test represents with reasonable adequacy 
the coverage and emphasis of high school courses 
and textbooks in world history (aside from the 
previously noted deficiency with respect to up- 
to-dateness). 
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The test format is good, although one wonders 
-why provision for recording answers on a sepa- ` 
rate sheet was not made. : 

Most of the parts consist entirely of four- 
choice items, although the first part is in com- 
pletion form. The directions for scoring are 
satisfactory in most respects, but they do not 


indicate what the scorer is to do about the inis-. | 


spellings that almost certainly arise on Part 1. 
The instructions call for correction for chance 
success in scoring the four-choice items, and the 
subjects are instructed not to guess without at 
least partial knowledge. 

The phrasing of some of the items leads this 
reviewer to wonder whether the purpose of the 
test is to test or to teach. The cover page states, 


“This test is to help you appreciate and ünder- . .* 


stand better the meaning of world history." Some 


of the questions lead to the reaction that undue. "T 


effort was exerted to draw in a relationship or 


a comparison of little significance. This is espez". ~ 


cially true of many of the items in Part V, Tying* 
Past and Present Together. Many of them do 
not test powers of judgment or analytical abil- 
ity or understanding of relationships but only | 
knowledge of one of the two facts included in the 
item. For example, consider the following prem 
ise and answer : “Legislators today face the same 
conditions that Turgot did before the French 
Revolution; namely, that when attempts are - 
made to tax the privileged class they . . . suggest 
other ways of raising money." Why bring in 
Turgot? In another item, one needs no knowl- é 
edge of the early Romans to know that the de- - 
sired response is that they (like the early Ameri- 
can settlers) “were industrious and frugal.” A 
similar criticism applies to many other items in 
this part. In Part VI, Problems of Life, this 
reviewer would prefer questions stated more ob- 
jectively and not directed toward pointing a 
moral. For example, in Item 22, Form A, the 
first sentence, “The world is constantly striving 
to reach the place where just and fair treatment 
will be given to all,” is unnecessary. 

A number of items need pruning ; several have 
answers given away immediately by the similar- 
ity of their wording to that of the question 
(Examples in Form A: Item 20, Part V ; Item 
8, Part VI) and there is some undesirable over- 
lapping which results in the answer of one ques- 
tion being given away by the statement of the 
problem in another or in two interrelated items 
(Examples in Form A: Items 12 and 20, Part 
IV ; Item 2, Part I and Item 15, Part IV). 
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Perhaps the items in Part II, Time Relation- 
‘ships, place too great an emphasis on knowledge 
of exact dates. Probably the modern trend in 
the teaching of history is to stress knowledge of 
the chronological sequence of events, and espe- 
cially of related events, rather than exact dates. 

With some editing of individual items, the 
addition of items on recent world events, correc- 
tion of the unequal difficulty of alternate forms, 
and more attention to adequate description of 
the standardization and statistical procedures 
used,.a revision of this test should be worth 
while. 


Wallace W. Taylor, Professor of Social 
Studies; and Head of the Department of So- 
cial Studies, Milne School; New York State 
College for Teachers, Albany, New York. This 
world history examination was copyrighted in 
1940. An examination of the test shows that the 
items were apparently prepared several years 
before that. An item on the Saar plebiscite is the 

' only item referring to anything that happened 
in the thirties. There are only a few items deal- 
ing-with events since the first World War. In its 
presefit form, the test is'outdated and should not 
be used. 

The division of the test into six sections ‘with 
the intriguing titles of Factual Knowledge, Time- 
Relationships, Contributions of the Past, Cause 
and Effect, Tying Past and Present Together, 
and Problems of Life, is misleading. The only 
heading that is really descriptive is the one on 
Factual Knowledge. The section titled Time- 
Relationships requires little more than a straight 
memorizing of dates. The section titled Prob- 
lems of Life is just another heading for factual 
items concerned with social and economic his- 
tory. 

Entirely too large a proportion of this exam- 
ination is devoted to items of doubtful impor- 
tance. There are no items on Latin America or 
the Far East. 

It is my opinion that a teacher seeking a ter- 
minal examination for a world history course 
would do far better to use one of the tests of 
the Cooperative Test Service or of the United 
States Armed Forces Institute. 


Ed Res B 20:263-4 D 17 "41. William J. 
Jones. A thoroughly objective, standardized in- 
formation test of world history * For those who 
accept the assumption that history information 
can be standardized, it is a carefully constructed 
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instrument. The usual rigorous procedure of 
examining current research, analyzing courses 
of study (7 in all), studying textbooks (8 men- 
tioned), and summarizing questionnaires re- 
turned by rco teachers has been followed. The 
result is a 130-item test, most of which are 
multiple-choice items with a few completion and 
matching. It is divided into six parts as follows: 
factual knowledge, time relationships, contribu- 
tion of the past, cause and effect, typing past 
and present together, and problems of life. 
Validity is claimed for the test because: first, 
it discriminates between “good” and “poor” 
students; second, it includes “only items that 
cover what is being taught in world history 
courses” (textbook and course of study analy- 
ses, and so forth) ; third, it was “subjected to. 


the careful scrutiny and criticism of 80 nation- — 


ally known specialists in the fields of history and 
education”; fourth, the items are arranged ač 
cording to difficulty following a tryout with 210 
eleventh-grade pupils; and fifth, the numbers 
of items in each of the three phases of history— . . 


ancient, medieval, and modern—were made to 


correspond to the percentage of space allotted 
by textbooks to these three epochs. A study of 
the content of the test, however, reveals that the 
test does not measure what it purports to meas- 
ure. The six parts of the test are supposed to 
measure the following experimentally deter- 
mined objectives: to study the development of 
civilization, to explain the present, to foster 
patriotism, to develop intelligent citizenship, to 
broaden the pupil’s sympathies, and to increase _ 
in the pupil the powers of interpretation and 
judgment. To test these purposes, nonfunctional 
memoriter learning is called for in the test! Part 
II, for example, requires the student to match 24 
dates with 15 events and yet it is labeled “Time 
Relationships.” Part IV, by requiring the stu- 
dent to pick a “best” cause or effect out of four, 
may actually encourage thinking in a single — 
causation pattern. The test purports to measure `. 
powers of interpretation and judgment, but the 
students are not required to solve any problems, 
apply any principles, explore any relationships, 
interpret any data, or formulate any generaliza- 
tions. The test may have some value as an infor- 
mation inventory, however. * 

Teach Col J 12:70 Ja '41. Waldo F. Mitchell. 
* The 130 items cover the range of history from 
the Egyptian civilization to the present genera- 
tion. Fach of the items deals with a significant 
event, happening, or idea. Each item is well 
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stated. In only a few cases of the match list items 
could there be any doubt about the correct 
answer. A perforated cardboard key to be super- 
imposed on the answered test pages makes the 
scoring of the test easy and rapid. * This test is 
probably the best standardized test in world 
history for high school. It covers the field and it 
is excellent in form and content. 
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XSurvey Test in United States History. High 
school and college; 1943; an adaptation of the history 
test given by the New York Times in its survey of 
7,000 college freshmen in 36 colleges ; norms not avail- 
able; $1.50 per 25, postpaid; sample test free; (60) 
minutes; Turner E. Smith & Co. 


REFERENCES 


_ 1. “Are ‘Educators’ to Blame for the Poor Results Shown by 
the ‘Times’ Test?" Sch & Soc s7:560-1 My 1543. * 
- Q2 “How Do Senior College Students and Adult Groups 
Stand on the "Times' Test?" Sch & Soc s7:6s4 Je Ak * 
3. “ ‘The Néw York Times' Again Investigates the 'each- 
ing of History." Sch & Soc 57:403 Ap 10 ^43. * 
T 4 ALLEN, James E. “Another Explanation of the ‘Times’ 
Test Findings.” Letter. Sch & Soc 58:238 S 35 "43. * à 
"a » Bovp, Paut P. “The ‘Times’ Test an Our Public 
Schools.” Sch & Soc Apos My 29 '43. * 
Broupy, H. S. istory Without Hysteria.” Sch & Soc 
58:106-7 Ag 14 '43. * 
LLA, GERALDINE P. “The American History Situation in 
Few Words." Letter. Sch & Soc 58:309-10 O 16 43. * 
8. DuxBAx, WiLLis F. "Why Gur Faces Are Red: A Com- 
ment on the ‘Times’ Test.” Sch & Soc 58:265-8 O 2 43. * 
9. Fraser, Hucn RUSSELL. NAI Russell Fraser Replies 
to a Criticism." Letter, Sch & Soc 5 1504-5 D 25 '43. * 
"VIO. Fraser, Huc Russet. “The ‘Inside’ Story of ‘The 
New York Times’ Test.” Sch & Soc ge Ag 743. * 
11, Fraser, Huon RussELL. "A ‘Red Herring’ Across the 
Trail of the ‘Times’ Test.” Letter. Sch & Soc 58:190-1 S 11 
43: 


12. PCR UAR Roger. “ ‘The New York Times’ Test.” 
Letter, Sch & Soc 58:311 O 1548. *. 

13. D, JAMES Morcan. "History Versus the Social Sci- 
ences.” Sch & Soc SSMUS 4 4s. * 

14 Suaw, Romrer S. "What is History?" Letter. Sch & 
Soc $7:634-5 aye 43. * 
spid: Stumpr, Wiprert A. “On Mr. Fraser's Defense of the 
‘Times’ Test." Letter. Sch & Soc 58:348-9 O 30 "43. * 

16. LER, ARTHUR E. "Progressive Methods as Related 
o Knowledge of American History." Sch & Soc 57:640-3 My 
9 '43. " 
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XCooperative Test in American Government. 
High school; 1947; IBM; Form X; separate answer 

sheets need not be used ; $1.75 per 25; 25¢ per specimen 
Set, postpaid; 40é per 25 machine-scorable answer 
sheets ; I5¢ per stencil for scoring answer sheets ; 
40(45) minutes; John Haefner; Cooperative Test 
ervice. 
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X*Examination in Civics. High School; 1944; IBM ; 
Form SCv-1-B-4 (usually called Form B ; separate 
answer sheets must be used; $2 per 25; 40¢ per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
25¢ per specimen set, postpaid; 90(95) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates: $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; 5o¢ per key; 50£ per specimen set.) 


REFERENCES 


r. “A Note on the USAFI Tests in Civics and Problems of: 


Democracy,” pp. 59-60. In 1947 Achievement Testing Program 
in Independent Schools and Supplementary Studies. Educay 
tional Records Bulletin, No. 48. New York: Educational Rec- 
ords Bureau, June 6, 1947. Pp. xii, 66. Paper, lithotyped, 
$2.00. * 


Roy A. Price, Professor of Social Science and 
of Education, Syracuse University, Syracuse, 
New York. The United States Armed Forces 
Institute, created to provide off-duty education 
for service personnel, established an examina- 
tions staff at the University of Chicago. This 
staff undertook to build tests from which reports 
of educational achievement could be made to the 
educational institution from which service per- 
sonnel wish to receive credit. It was hoped that 
in this manner "the danger of blanket credit for 
military experience such as that which was so 
frequently given after the last war would be 
avoided." Civilian forms of the tests which at- 
tempt to measure valuable and generally ac- 
cepted educational objectives in various subjects 
are published by the American Council on Edu- 
cation for use by high schools and colleges. 

The tests were prepared by specialists after 
meetings with teachers of various subjects. At- 
tempts were made to define objectives in terms 
of behavior situations. The original exercises 
were tried out, revised, and then submitted to 
critics who were asked to check coverage in con- 
tent and objectives, accuracy, and validity. 
Norms for the finished tests are available for 
whole and part scores. Users are urged to make 
careful analysis of the various parts of the tests 
so as to facilitate diagnosis of individual weak- 
nesses as well as formation of judgments on total 
achievement. The authors warn against the un- 
critical acceptance of the norms as absolute 
standards and stress the importance of local 
norms and standards. Excellent directions for 
the administration of the test and the interpre- 
tation of test scores are also given. 

The Examination in Civics is, in the judgment 
of this writer, the finest test yet published in this 
field. The test seeks to measure an understand- 
ing of behavior and function of governmental 
officers, and the usual mystical treatment of 
government as a structure removed from human 
behavior is entirely lacking. Coverage includes 
trends in government, individual rights, the fed- 
eral government, administrative agencies, or- 
ganization and functions of Congress, political 
parties, relation of federal, local, and state gov- 
ernment, foreign relations, taxes, and govern- 
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ment aid to and control of such groups as 
agriculture, labor, and industry. 

Data on validity and reliability are not pre- 
sented, but the care exercised in the construc- 
tion of the test and the type of items used suggest 
that the test compares favorably with others. 
The writer would like to have had included in 
the tables of norms separate scores for the ques- 
tions that purported to measure the several ob- 
jectives, making possible a comparison between 
success in one skill and success in another. It is 
also hoped that in the very near future, there 
may be forthcoming comparisons between norms 
on this test and norms on comparable tests. 
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XExamination in Problems of Democracy—High- 
School Level. Grades 11-12; 1944; IBM; Form 
SPrD-1-B-4 (usually called Form B); separate an- 
swer sheets must be used; $2 per 25; 40f per 25 
machine-scorable answer sheets; 15¢ per scoring key; 
254 per specimen set, postpaid; 120(125) minutes; pre- 
pared by the Examinations Staff of the United States 
Armed Forces Institute; published by the American 
Council on Education; distributed by Cooperative Test 
Service. (Also distributed by Science Research Asso- 
ciates : $2 per 25; 65¢ per 25 machine-scorable answer 
sheets; 50¢ per key; 50¢ per specimen set.) 


REFERENCES 
1. WILLIAMS, JAY, AND ABRAHAM, Hersert J. "Evaluating 
the Course in Problems of Democracy.” Social Ed 9:167-72 Ap 


45. * 

2. “A Note on the USAFI Tests in Civics and Problems of 
Democracy," Bp. 50:60, In 1947 Achievement Testing Program 
in Independent Schools and Supplementary Studies. Educa- 
tional Records Bulletin, No. 48. New York: Educational Rec- 
ords Bureau, June 1947. Pp. xii, 66. Paper, lithotyped. $2.00. * 


Lavone A. Hanna, Associate Professor of 
Education, San Francisco State College, San 
Francisco, California. This test consists of 10 
short passages which present different points of 
view on social problems; 5 are excerpts from 
editorials, 2 are statistical tables, and 3 are quo- 
tations from a discussion between two speakers. 
The questions based on these passages purport 
to test the student’s ability to read correctly and 
his knowledge and understanding of the prob- 
lems with which the passages deal. The prob- 
lems included in the test have been well selected 
and cover topics usually included in a problems 
of democracy or American problems course, 
such as: crime, population trends, labor unions, 
housing, prices and production, education, pres- 
sure groups, “Americanism,” internationalism, 
and democracy. 

Although no norms are given for the test and 
no statistical data are furnished on their relia- 
bility or validity, the care with which the tests 
of the Armed Forces Institute were constructed 
recommends the use of this test. The manual 


states that the examiners, specialists both in 
teaching and testing in the subject field, met 
with teachers to identify the educational objec- 
tives which students were expected to attain 
and the exercises were constructed to test these 
objectives. The exercises were then tried out on 
groups of students, and, on the basis of this 
tryout, some exercises were eliminated because 
they were not discriminating, were ambiguous, 
or were of inappropriate difficulty. The test was 
then submitted to critics who checked the test 
for comprehensiveness in coverage, for accuracy 
of material, and for the validity of the items. The 
test was then revised on the basis of these criti- 
cisms. The test seems to this reviewer to be 
superior to most tests of social competence, and 
it is hoped that norms will be made available. 
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X*Mordy-Schrammel Constitution Test. High 
school and college; 1940-45; Forms A, B; 90¢ per 25, 
postpaid; 15¢ per specimen set, postpaid ; 40(45) min- 
utes; Francis E. Mordy and H. E. Schrammel ; Bureau 
of Educational Measurements, Kansas State Teachers 
College of Emporia. 


W. H. Cartwright, Assistant Professor of 
Education, Boston University, Boston, Massa- 
chusetts. This test is intended to measure knowl- 
edge of the Constitution and understanding of 
its application. It is designed for use in an ele- 
mentary course in high school or college. 

There are two forms of the test, each contain- 
ing 65 true-false items, 35 best-answer items, 
and 25 matching items. Convenient scoring keys 
are provided, as is a table of percentile scores 
based on 3,277 cases representing 126 schools. 

A desirable feature of the test is Part 3, con- 
sisting of ro items requiring the exercise of judg- 
ment in applying principles of the Constitution 
or of famous court decisions. 

It is unfortunate that more than half of the 
test is made up of true-false items. Most of these 
test only memory of fact. In a few other in- 
stances, the student can know and understand 
all the facts involved and yet not give the keyed 
answer. For instance, the president is required 
to be a natural-born citizen, therefore it is true 
that he must have been a citizen for fourteen 
years ; the House of Representatives cannot seat 
anyone it pleases; and the members of the cabi- 
net may or may not all be eligible for succession 
to the presidency. 

The matching exercises are poorly con- 
structed. They are somewhat too long, having 
from 18 to 20 items in the response column. 
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More important is the lack of homogeneity 
among the items. For too many stimuli one par- 
ticular response is dictated by grammatical con- 
struction or similarity in nature. Thus, “kinds of 
cases at court” is matched with “civil and crim- 
inal,” the only plural item in the response col- 
umn, and “a tax on goods manfactured in the 
country" is matched with “excise tax,” the only 
item in the response column which contains the 
word “tax.” 

In several of the best-answer items, the cor- 
rect option is easily distinguished by reason of 
its length. Such items, like many of those in the 
matching exercises, require native intelligence 
or practice in taking objective tests rather than 
knowledge of the Constitution. 

There are some cases in which the keyed 
answer is incorrect, and a few in which there is 
TO correct answer. A national convention for 
amending the Constitution would be called by 
Congress, not by two-thirds of the states (Item 
36, Form A), and a writ of habeas corpus is not 


- "the right to demand immediate trial" (Item 


I04, Form A). The keyed answers to Items 17, 
29, 39, 71, 85, and 98, in Form B, are at least 
questionable. 

A minor criticism of the form of the test is 
that in several instances the incomplete state- 
ment or one of the options is isolated from the 
rest of the item irt another column or on a differ- 
ent page. This causes needless confusion and 
irritation to the person taking the test. 

Despite its shortcomings, the test can serve 
to measure knowledge of the Constitution and 
to compare the achievement of students of one 
-school with that of other students in other 
schools. 

[ 619 ] 
Mordy-Schrammel Elementary Civics Test. 
Grades 7-0; 1935; Forms A, B; 75¢ per 25, postpaid ; 
15¢ per specimen set, postpaid; 30(35) minutes; Fran- 
cis E. Mordy and H. F. Schrammel; Bureau of Educa- 


tional Measurements, Kansas State Teachers College 
of Emporia: 


C. Robert Pace, Associate Director, Evalua- 
tion Service Center, Syracuse University, Syra- 
cuse, New York. The two forms of this test, 
published in 1935, are now out of date and their 
continued use in junior high schools is not rec- 
ommended until revised editions are published. 

Many of the needed revisions in the test are 


technical and editorial in nature. For example, 
the reliability, based on 115 cases, is reported 
to be .80, a figure which is rather low for an 
achievement test of 93 items. The first 47 items 
on each form are true-false items; rephrasing 
these in multiple-choice form would increase the 
reliability. Several items in both forms are atti- 
tude test items. Pupils are expected to agree 
with such statements as: “One of the best 
methods of group control is public opinion" ; 
"Washington, D. C., is one of the best-planned 
cities in the world"; and "The mayor-council 
form of city government is the best form." Items 
of this sort do not properly belong in an achieve- 
ment test. Midyear and end-of-year norms are 
given in percentiles for grades 7, 8, and 9. At 
the 5oth percentile the difference between mid- 
year and end-of-year scores is not significant in 
any of the three grades. Nor is there a significant 
difference between the soth percentile scores of 
grades 7 and 8 and grades 8 and 9. When the 
differences between midyear and end-of-year 
and between one grade and the next are merely 
chance differences, it is difficult to see how the 
test can provide a very precise measure of stu- 
dent's progress or of the efficiency of instruc- 
tion. The manual includes a table for translating 
percentile scores into school marks. The prac- 
tice of assigning marks “on the normal curve” is, 
at best, debatable and should not, in my opinion, 
be encouraged. 

Beyond these technical and editorial criti- 
cisms, a serious question can also be raised 
regarding the content of the test. Most of the 
items are concerned with simple identification 
or definition. Injunction, referendum, franchise, 
suffrage, and petit jury are typical terms in one 
large group of definition items. Another large 
group calls for knowledge of such facts as: a 
president may not be arrested while he is in 
office; the smallest unit of society is the family; 
the chief source of government income is taxa- 
tion; general election day occurs on the first 
Tuesday after the first Monday in November. 
The authors state that “the test items were 
selected from the common content of several 
leading text books in the field.” A more valid 
blueprint for test construction would be obtained 
from a study of the major objectives of junior 
high school civics courses. 
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VOCATIONS 


Reviews By Dorothy C. Adkins, George K. Bennett, Ralph F. Berdie, Robert G. Bernreuter, 
Marion A, Bills, Reign H. Bittner, Milton L. Blum, Edward S. Bordin, Paul S. Burnham, 


Harold D. Carter, E. G. Chambers, Glen U. 


Cleeton, Clyde H. Coombs, Lysle W. Croft, 


Edwin W. Davis, Gwendolen Schneidler Dickson, Stanley G. Dulsky, M. H. Elliott, George A. 
Ferguson, Norman Frederiksen, John W. French, Douglas H. Fryer, Edwin E. Ghiselli, 
Edward B. Greene, William R. Grove, Milton E. Hahn, Thomas W. Harrell, Charles M. 
Harsh, Edward N. Hay, Lloyd G. Humphreys, Thelma Hunt, Clifford E. Jurgensen, Raymond 
A, Katzell, Willard A. Kerr, Joseph E. King, John R. Kinzer, C. H. Lawshe, Jr., D. Welty 
Lefever, Herschel T. Manuel, Joseph E. Moore, Charles I. Mosier, C. A. Oakley, Jay L. Otis, 
Albert Gibson Packard, Donald G. Paterson, Shailer Peterson, James M. Porter, Jr., Alec 
Rodger, Floyd L. Ruch, C. H. Ruedisili, George A. Satter, Harold G. Seashore, R. B. Selover, 


Benjamin Shimberg, Patrick Slater, Dewe: 


B. Stuit, Donald E. Super, Erwin K. Taylor, 


Howard R. Taylor, Herbert A. Toops, Robert M. W. Travers, Arthur E. Traxler, Frances 


Oralind Triggs, Morris S. Viteles, 


illiam W. Waite, J. V. Waits, Neil D. Warren, Henry 


Weitz, John M. Willits, E. F. Wonderlic. 
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*Differential Aptitude Tests. Grades 8-12; 1947; 
IBM; 2 forms, 8 tests in 7 booklets; separate answer 
sheets must be used ; all prices include postage; $19.50 
per 25 sets of the 7 booklets; $1.50 per 50 machine- 
scorable answer sheets for any one booklet; $1.25 per 
50 individual report forms; $1.25 per set of 14 scoring 
stencils (specify whether hand- or machine-scoring) ; 
$1.75 per manual ; $2.50 per specimen set ; CE 
minutes, entire battery ; Ceu Bennett, Harold G. 
Seashore, and Alexander G. Wesman; Psychological 
Corporation. 
2 VERBAL REASONING. $2.75 per 25; 30(40) minutes. 
b) NUMERICAL ABILITY. $2 per 25; 30(35) minutes. 
C) ABSTRACT REASONING. $2.75 per 25; 25(30) minutes. 
d) SPACE RELATIONS. $3 per 25 ; 30(40) minutes. — 
e) MECHANICAL REASONING. 85 per 25; 30(35) min- 
utes. 
f), CLERICAL SPEED AND ACCURACY. $2.75 per 25; 6(15) 
minutes. 
9) LANGUAGE USAGE. Includes Spelling and Sentences 
tests ; $2 75 per 25; 35(45) minutes. 


J Consult Psychol 12:62 Ja-F ’48. The pub- 
lication of Differential Aptitude Tests is a 
major psychometric event. The battery stresses 
the significance of abilities rather than “ability” 
as the basis for prediction and guidance at the 
secondary school level. The parts, other than 
the clerical, are power tests rather than speed 
tests. Average reliabilities (except for that of 
girls on mechanical reasoning, which is .71) 
range from .85 to .93. Separate percentile norms 
are given for boys and girls from grades 8 to 
12, based on national selections of from 750 
to 2,000 cases for each grade-sex group for 
Form A, 350 to 1,100 for Form B. Profiles of 
percentiles and standard scores are drawn, and 
illustrative case studies offer some assistance 
to counselors in the use of results. The loose- 
leaf manual is convenient for reference, and for 


the addition of further data as they become 
available. Although there are many immediate 
applications for tests of this type, much research 
is needed on the validity of profiles for predict- 
ing various sorts of educational and vocational 
success. 
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*Guilford-Zimmerman Aptitude Survey. Grades 
9-16 and adults; 1947; IBM for Parts 1-2, 5-7; Form 
A; 8 tests in 7 parts; separate answer sheets need not 
be used; $1 per key for scoring any one part; 50¢ per 
key for scoring answer sheets for any one of parts 1-2, 
5-7; 2¢ per machine-scorable answer sheet for any one 
part; 25€ per manual; 143(188) minutes; J. P. Guil- 
ford and Wayne S. Zimmerman; Sheridan Supply Co. 
a) PART I, VERBAL COMPREHENSION. IBM; $2 per 25; 
25(30) minutes. 

b) PART 2, GENERAL REASONING. IBM ; $2 per 25; 35(40) 
minutes. 
C) PART 3, NUMERICAL OPERATIONS. $2 per 25; 8(13) 
minutes. n 
d) PART 4, PERCEPTUAL SPEED. $2 per 25; 5(10) minutes. 
e) PART 5, SPATIAL ORIENTATION. IBM; $3.50 per 25; 
10(20) minutes, 
f) PART 6, SPATIAL VISUALIZATION. IBM; $5 per 25; 
30(40) minutes. 
9) PART 7, MECHANICAL KNOWLEDGE. IBM ; $3.50 per 
25; 30(35) minutes. 
REFERENCES 

i. GurLromp, J. P., AND Zimmerman, Wayne S. “Some 
A.A.F. Findings Concerning Aptitude Factors." Occupations 
26:154-9 D '47. * (PA 22:2198) 


[622 ] 
*Yale Educational Aptitude Tests. Grades 9-16; 
1947; 7 tests in 2 booklets; IBM except for one brief 
section ; $1.75 per set of testing materials ; $2.50 per set 
of testing materials including the scoring and reporting 
of individual results; (45) minutes per test if practice 
booklets are administered previously, otherwise (60) 
minutes per test; Albert B. Crawford; distributed by 
Educational Records Bureau. 
REFERENCES 

1. Vaucun, K. W. “The Yale Scholastic Aptitude Tests as 
Predictors of Success in the College of Engineering." J Emg 
Ed 34:572-82 Ap '44. * 
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a. Crawrorp, ALserT B., ano BurNmam, PauL S. Forecast- 
ing College Achievement: A Survey of Aptitude Tests for 
Higher: Toy Mi hoa Part I, General Co: lerations in the Meas- 
urement of Academic Promise, pp. 134-69, 199-201, 253-71. New 


ivi onn.: Yale University Press, 1946. Pp. xxi, 291. 
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X*Clerical Aptitude ten ads National Apti- 
tude Tests. Grades 7 and over; 1943; 1 form; $2.00 

T 25; 50¢ per specimen set; 40(45) minutes; Andrew 
Kobal ae Wriehtstone, and Karl R. Kunze; Acorn 
Publishing Co. 

Marion A. Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Connecti- 
cut. This is a 3-part test of 40 minutes. The 
three parts are timed separately. They are in- 
tended to measure business practice, spelling, 
number checking, and name checking. The test 
was standardized on 576 employed persons. 
About the average relationship between test 
scores and ratings of the employees was found. 
Norms were worked out on this group and on 
a large number of grade and high school pupils. 
The scoring directions are clear and the inter- 
pretation is given definitely. 

This is a clerical, not a mental alertness test. 
The score apparently improves with training, 
and therefore previous experience of the indi- 
vidual must be taken into account in evaluating 
the score. It falls into the class of a trade test 
and should give indication of immediate ability 
to perform simple clerical operations; it should 
not be used before training to predict probable 
success after training. à 

The test is long enough to indicate somewhat 
the “staying” ability of the applicant and there- 
fore may give more information concerning the 
applicant than a shorter test of the same nature. 


Donald G. Paterson, Professor of Psychol- 
ogy, University of Minnesota, Minneapolis, 
Minnesota. This is an 8-page test composed of 
three parts and provision is made to record 
three part scores and a total score. Part 1, 
containing 27 multiple-choice items plus 13 
spelling items, is devoted to knowledge of busi- 
ness practice. Part 2 is labeled “number check- 
ing” and is composed of 4 subparts: ro items 
requiring subject to check the smallest number 
in a group of 5 (6 to 9 digits in length) ; 10 
items requiring subject to check the second 
largest number in a group of 5 (6 to 9 digits 
in length) ; ro items requiring subject to check 
the number which is the second smallest in a 
group of 5 (3 to 9 digits in length) ; and 26 
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items of the well-known number checking vari- 
ety, subject being required to compare 26 pairs. 
of numbers and to put a check mark between 
each pair which is exactly alike, Part 3 is 
labeled "date and name and address checking" 
and is composed of two subparts: 12 items re- 
quiring subject to check the date which is next 
to the latest date in each group of six dates 
(month, day, year); 25 items each consisting. 
of a complex name and number checking item, 
subject being required to compare pairs of 
three items (name of company, street address, 
and city address) and to check those pairs that 
are exactly alike. 

The test appears to be primarily a test of 
accuracy of work (40 minutes working time) 
with speed of work being secondary, although 
the authors state “the allotted time is an essen- 
tial part of qualitative and quantitative scoring." 

A 3-page manual is provided. The test is 
designed to "measure abilities necessary for a 
successful performance of skills required in 
commercial courses and subsequent clerical 
work." It is strange to see an "achievement 
type of test" included as part of an aptitude test 
as is done in Part 1. The remaining two parts 
appear on inspection to be of the aptitude type. 
The test is longer than the usual employment 
test. The directions for scoring are ingenious, 
utilizing a method of scoring by key words 
which can be readily memorized. Reliability, by _ 
the test-retest method, for a total of 4,141 stu- 
dents in grades 7 to 12, is shown to range from 
.83 to .93 for each of these grades. There is a 
systematic drop in these r's from grades 7, 8, 
and 9 to an r of .83 for grade 12. A validity 
study shows r's from .27 to .71 between total 
scores for 576 clerks and ratings obtained from 
cumulative records and ratings by supervisors. 
The r of .71 between "accuracy ratings" of su- 
pervisors and total score is remarkably high. 
The r of .62 between “speed ratings” and total 
score is also quite high. One cannot help but 
wish that these validity coefficients were pre- 
sented for the part scores as well as for total 
scores. In that way one could ascertain the 
extent to which “proficiency” measured in Part 
I and aptitudes as measured in Parts 2 and 3 
are responsible for these high r’s. 

Percentile norms are provided for total 
scores only for grades 7 to r2 separately and 
for 576 trained clerks. Failure to provide norms 
for male and female students and for male and 
female trained clerks separately is a serious 
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.drawback in view of known sex differences in 
clerical aptitude. Failure to provide norms by 
age as well as sex is also unfortunate. The 
norms should also be broken down by part 
score as well as total score, and these in turn 
should be provided for the general population, 
for different types of clerical workers, and for 
other occupational groups. Correlations should 
also be presented between part scores and total 
scores and standard intelligence tests for vari- 
ous typical norm groups. Such additional infor- 
mation is essential to an adequate interpretation 
of the test results when used for selection or 
for guidance purposes. 

In the reviewer's opinion, this is a most 
promising test of clerical aptitude (if scores on 
Parts 2 and 3 are utilized). Intensive research 
is needed to provide the data needed to evalu- 
ate the real usefulness of the test. 


Henry Weitz, Director, Psychological Serv- 
ices Center, University of Delaware, Newark, 
Delaware. This test purports “to indicate and 
measure abilities necessary for a successful per- 
formance of skills required in commercial courses 
and subsequent clerical work." The manual states 
that the test does not measure intelligence or 
personality, both of which constitute essential 
factors in clerical ability, although it offers no 
evidence to support the idea that the items are 
unrelated to intelligence or personality. 

Evidence for the validity of the test is given 
in the relationship between test scores and shop 
ratings. Correlations of .71 with supervisors’ 
accuracy ratings, of .62 with supervisors’ speed 
ratings, and .52 with over-all shop ratings are 
reported for the total scores of 576 male and 
female clerks. Since no adequate description of 
the group is given, and since the nature and 
purpose of the ratings are not reported, it is 
impossible to accept these validity coefficients 
with any degree of confidence. The fact that 
mean scores rise from grade to grade is offered 
as further evidence of the validity of the test. 
A test which indicates that a behavior segment 
becomes more efficient as additional behavior 
patterns are accumulated through age, matu- 
rity, and experience is not necessarily measur- 
ing a particular “aptitude” or ability pattern. 
No evidence is offered to support the conten- 
tion that the test scores are related to achieve- 
ment in key commercial courses. 

Reported reliability coefficients range from 
.93 for eighth grade students to .83 for a group 
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of clerks, with the median reliability of .9r. 
The standard error of a score is reported as 
3.4. If this latter statistic is applied to the table 
of norms for trained clerks, it may be said that 
the odds are 19 to 1 that a subject whose score 
is found to be 86 (the 4oth percentile for this 
group) has a truly determined score between 
93 and 80. According to the manual's inter- 
pretive scale a score of 93 falls in the “supe- 
rior" group and a score of 8o falls in the "in- 
ferior" group. 

A key-word scoring system is used in which 
an item is scored correct if any letter of the key 
word is recorded or checked by the subject. 
The publishers state that this system of scoring 
saves time and increases the accuracy of scor- 
ing and that they use this procedure in many 
of their tests. The use of lower-case letters for 
the purpose of recording responses in 40 of the 
102 items may lead to errors especially when, 
for example, the selection is between alterna- 
tives labeled “h” and “k.” Two sections of the 
test representing about one fifth of the total 
score involve the checking of matched and un- 
matched materials. The use of the key-word scor- 
ing procedure permits only correctly matched 
items to be scored. Thus, if a subject marked 
all items in these sections as correctly matched, 
he would receive a perfect score. One of the 
authors states in an unpublished communication 
that a study of 280 cases revealed no instance 
where a subject placed check marks indiscrimi- 
nately, and it was, therefore, decided not to 
deduct credit for errors in order that the key- 
word scoring procedure might be retained. 
There is reason to believe, however, that there 
is sufficient variability among subjects in the 
degree to which they mark unmatched mate- 
rials as matched to warrant a consideration of 
the possibility of including some measure of 
this behavior in the score. 

Fundamentally the test items appear to be 
most closely related to routine clerical activi- 
ties. Even the section on business practices in- 
volving some 27 items contains 14 items which 
are no more than vocabulary items, while only 
the remaining 13 require some knowledge of 
business operations and related information. 
Since clerical work is such a broad field in- 
volving so varied a configuration of skills and 
requiring so great a variety of traits, it is 
doubted that tests of this type can do much 
more than offer minor assistance in the selec- 
tion of routine clerical workers. 
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E. F. Wonderlic, Vice-President, General 
Finance Corporation, Chicago, Illinois. This 
test is composed of three parts, requiring three 
separate timing intervals. Its construction and 
standardization as well as the administrative 
directions make it much more adaptable to edu- 
cational and vocational guidance fields than to 
application in the field of personnel selection. It 
contains several original methods in the direc- 
tions for scoring. At some points these seem 
confusing to the ordinary test administrator. 
Published norms are given for only one group 
of employed clerks. Several norms are avail- 
able for high school students. Sex differences 
are not reported although most research work- 
ers in the field of clerical testing have reported 
pronounced differences in favor of female sub- 
jects both with experienced clerical workers 
and with totally unselected groups. The author 
reports that “a study of scores obtained on the 
Clerical Aptitude Test and a comparative rat- 
ing on the job for purposes of correlation war- 
rants selection of clerical employees on the 
basis of test scores” and further gives inter- 
pretative ratings by letter grades. 

This reviewer recommends considerable cau- 
tion be used before accepting these critical scores 
in other employment situations. The reported 
experience of most research workers is that 
careful standardization of tests of this nature 
are necessary in each business organization, 
and norms must be based on both applicants 
and employed workers before selection is recom- 
mended on the basis of test scores. Such in- 
formation is not readily available on this cleri- 
cal aptitude test. 


[ 624 ] 
*Clerical Perception Test. Grades 9 and over; 1947; 
1 form; 75¢ per 25; 15¢ per specimen set, postpaid ; 
15(20) minutes; G. Bernard Baldwin; Educational 
Test Bureau. 


Edward N. Hay, Personnel Officer, The 
Pennsylvania Company for Banking and Trusts ; 
and President, Edward N. Hay and Associates, 
Inc.; Philadelphia, Pennsylvania. This test was 
constructed to give an indication of an indi- 
vidual’s "ability to discriminate the small dif- 
ferences in numeral and verbal materials that 
comprise a large part of the work in offices." 
This ability has been called clerical perception 
and has already been found to play an impor- 
tant role in the prediction of machine book- 


keeping. 


At the same time the author has undertaken 
to measure ciliary muscle fatigue and has di- 
vided both number and name checking tests 
into two parts, giving the subject an oppor- 
tunity to note how many items he has attempted 
in the first period, and to estimate how many 
he expects to be able to do in the second. Em- 
phasizing speed in this way and providing 
small print and a cramped typographical ar- 
rangement probably draws attention to the 
resulting eye fatigue, the degree of which is 
reflected in the decrease in scores from one sec- 
tion of the test to the next. A small group of 
high school students, examined with and with- 
out glasses, showed significantly higher scores 
when glasses were worn, but the data are am- 
biguously presented, with no information about 
how the differences in these scores compared 
with test-retest scores of students whose eyes 
were normal. 

Because the test has been condensed to a 

single page, directions have been reduced to a 
minimum, and practice exercises are carefully 
fitted into the space remaining. The test would ' 
be easier to administer if it were less com- 
pactly arranged. A further improvement could 
be made in the directions to examiners, where 
limitations of space have again caused too great 
condensation. 
_ As a clerical test “validity was established by 
using the test to select the student clerical staff 
for scoring the tests of a six hour battery ad- 
ministered to ten thousand students in . . . 
high schools" and those chosen “from the upper 
twenty-five per cent of scores . . . required 
only half the time" of those chosen at random. 
The number of students involved in the scoring 
and the type of production record obtained are 
not mentioned. There has apparently been no 
attempt to validate the test for business situ- 
ations. 

Reliability for each of the parts in a single 
grade range is stated to be approximately .93 
using Froehlich's form of the Kuder-Richard- 
son formula. Since this does not, however, take 
into consideration the factor of speed, which 
is an important part of a test of this kind, an- 
other type of reliability coefficient should have 
been obtained. Norms for high school grades 
are provided. The author combines boys' and 
girls’ scores in a single table, since “competi- 
tion in clerical occupations is also with com- 
bined sex groups." It is questionable whether 
this is actually the case in business houses, but 
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.under any circumstances it would seem desir- 
able to have separate norms for each sex. 

The author wisely recommends that school 
counselors use the Clerical Perception Test 
only in.a battery and says that for occupational 
selection critical scores must be determined by 
job requirements. The attempt to measure eye 
fatigue is interesting and should be pursued, 
but a better clerical test would result if the two 
factors were not measured simultaneously. 


Raymond A. Katzell, Associate Professor of 
Psychology, The University of Tennessee, 
Knoxville, Tennessee. Two subtests are in- 
cluded in this test. The items of the first are 
composed of pairs of numbers which are to be 
compared with respect to identity to one an- 
other. The items of the second subtest are made 
up of pairs of names which are likewise to be 
checked for similarity. 

The test is economically designed to fit on a 
single sheet of paper. Instructions are concise, 
but in one or two places are obscure. 

In single high school grade ranges, the reli- 
abilities of the subtests are above .90, computed 
by the Kuder-Richardson formula 20. The re- 
liabilities are probably overestimates, since this 
formula is not exactly applicable to simple, 
highly speeded tests like the present one. The 
results of a single study tend to support the 
validity of the test for routine clerical duties. 
Norms in integral standard deviation units are 
furnished separately for the four high school 
grades and are based on hundreds of cases in 
each grade. 

The test is very similar in essence to the 
Minnesota Vocational Test for Clerical Work- 
ers. Like its well-known predecessor, the Cleri- 
cal Perception Test seems to measure aptitudes 
related to speed and accuracy of discrimination 
of visually presented details. The usefulness of 
such measures for the appraisal of suitability 
for routine clerical work has been established 
in numerous instances. 

In addition, the Clerical Perception Test is 
claimed to have value in the detection of de- 
fective vision, Evidence furnished for this claim 
includes the fact that 48 students, who ordi- 
narily wore eyeglasses, showed significantly 
higher mean scores when tested with glasses 
than when retested without them. No mention 
is made of checks on the extent to which the 
decrement may have been due to normal fatigue, 
loss in interest, or other extraneous influences. 


Other evidence is that these subjects showed a 
greater loss from the first to the second subtest 
when working without glasses than when 
working with them. The statistical reliability of 
this difference is not reported. 

Granted, however, that imperfect vision has 
some adverse effect on performance on this 
test, the question remains of how useful the 
test is in the detection of such defects. It is pro- 
posed in the test manual that people getting 
scores below one standard deviation below the 
mean be suspected of visual deficiency and 
given an optical examination. However, if low 
scores on the test are to be strongly indicative 
of defective vision, individual differences in test 
performance must depend primarily on visual 
factors. But there is every reason to believe 
that nonvisual aptitudes play a very significant 
role in determination of the score. It is interest- 
ing in this connection that the mean scores of 
the 48 visually defective subjects, when tested 
without glasses, exceeded the mean scores of 
the general normative population on both sub- 
tests. 

In general, then, the evidence does not en- 
sure that the Clerical Perception Test, when 
used in the proposed manner, will have much 
practical utility for the indication of defective 
vision. Nor do there seem to be any technical 
reasons for preferring this test to the earlier 
and similar Minnesota Vocational Test for 
Clerical Workers as a measure of aptitude for 
performing routine aspects of clerical work. In- 
deed the more diversified standardization and 
validation of the Minnesota test are distinctly 


in its favor. 


Erwin K. Taylor, Chief, Performance Evalu- 
ation and Criterion Research Unit, Personnel 
Research Section, The Adjutant General's Of- 
fice, Washington, D. C. Except that it contains 
half as many items, is printed in somewhat 
smaller type, and is more difficult to score, the 
Clerical Perception Test is merely a copy of the 
Minnesota Vocational Fest for Clerical Work- 
ers. A Kuder-Richardson reliability of .93 on 
an unknown number of cases is reported on the 
instruction and scoring sheet. Since the use of 
the Kuder-Richardson formulae on speed tests 
is entirely inappropriate, the reliability of the 
instrument remains an unknown quantity. The 
validity of the test, according to the manual, 
was “established” by the finding that schools 
that selected their clerical workers from the 
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upper 25 per cent of the scores required only 
half the time for the same job that was required 
when workers were selected at random. No 
mention is made of the number of cases in- 
volved, nor of any of the numerous possible 
contaminants which may have entered into the 
situation. Some data based on 48 cases indicate 
that people who need glasses do better on the 
test when they wear them than when they do 
not. Norms based on high school students are 
available on sizable samples. No data are pre- 
sented on employed groups. 

The reviewer is in full accord with the state- 
ment in the manual to the effect that critical 
scores for occupational selection will need to be 
determined by job requirements. Unless this is 
intended as mere lip service or as an appeal to 
face validity, it is essential that basic validation 
research be performed before this test, or any 
other clerical test, can be safely employed to 
select workers. The greater length of the Min- 
nesota Vocational Test for Clerical Workers, 
and the fact that considerably more is known 
of the Minnesota Test than of the present in- 
strument, would indicate that the longer test is 
to be preferred for inclusion in an experimental 
selection battery. The test in its present state of 
development is not suitable for guidance pur- 
poses. 


E. F. Wonderlic, Vice-President, General 
Finance Corporation, Chicago, Illinois. 'This 
test was standardized on the work of high 
School students. Therefore, its use in business 
and industry is limited. Standardization and 
validity in business and industrial situations 
with adult subjects are not reported by the 
author. This test is not in general use in busi- 
ness personnel situations as a selective device. 
For this purpose it is difficult to administer. Tt 
consists of four subtests, each requiring sepa- 
rate timing, and thus demands close attention 
of the test administrator. Occupational norms 
by employed clerical workers from industry are 
not available in the author's published material. 
Data in the Directions for Administration sug- 
gest that interpretative value can result from 
scores when the test is used for a measure of 
vision, with or without normal use of glasses, 
with high school groups. Sex and educational 
differences are also demonstrated for high 
school students. Its use in industry must be 
considered purely experimental. 


J Consult Psychol 11:156 My-Je '47. * Level 
of aspiration scores are obtained for Part B of 
each test, but these are not discussed in the 
manual. * 
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Clerical Test D: Extending—Verifying—Check- 
ing—Classifying. Ages 14 and over; 1922; $3 per in- 
troductory set including 10 tests, postpaid; 5¢ per extra 
test, postpaid; introductory sets may be ordered on 
approval; 12(20) minutes; Eugene J. Benge; Manage- 
ment Service Co. 


Donald G. Paterson, Professor of Psychol- 
ogy, University of Minnesota, Minneapolis, 
Minnesota. This test is printed on a 4-page 
leaflet 814 X 11 inches. The first page is de- 
voted to instructions and practice exercises. 
The next three pages are devoted to 100 items, 
presented in cycle-omnibus form, classified as 
Extending (10 arithmetical reasoning items), 
Verifying (40 fundamentals-of-arithmetic items 
presented in true-false form), Number Check- 
ing (30 pairs of numbers to be labeled Same 
or Different), and Classifying (20 topical sen- 
tences taken from correspondence, each to be 
classified as to type and kind by multiple 
choice). Working time is 12 minutes. Speed 
and accuracy are emphasized alike. A cardboard 
scoring stencil is provided. There is only one 
form of the test. 

A r-page mimeographed manual is provided, 
supplemented by an additional mimeographed 
page which purports to give as an addendum 
a “refinement in scoring” the Extending items. 
The purpose of the test is presented in one 
paragraph and the user is advised not to hire 
those making “a low score in the test," but no 
hint is given as to what “a low score" is. An- 
other one sentence paragraph states that the 
test is to be used to eliminate unfit applicants. 

Instructions are supplied for giving and scor- 
ing the test. Score is the number of items tried 
minus the number wrong, and a percentile scale 
is given for converting the raw score into a 
score by "rank." A percentage right score is 
also advocated with the statement that accuracy 
should be 95 per cent or better but that appli- 
cants for juniors should average 93 per cent in 
accuracy. 

The number and kind of applicants upon 
which the percentile rank table is based is not 
indicated. No norms for employed clerical 
workers are provided. Data regarding relia- 
bility are absent and no evidence in regard to 
validity is presented. The correlation between 
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intelligence and this test as a whole or the sub- 
tests is not given. The reliabilities and interre- 
lations of the subtests are not given. No refer- 
ences are given. 

Judging solely on the basis of the test itself 
and the inadequate manual of directions, one 
is forced to conclude that all of the aims and 
objectives of the Mental Measurement Year- 
books have been violated. Judging on the basis 
of knowledge of the field of aptitude testing, it 
is apparent that the test is a mixture of num- 
ber ability (fundamentals) measured unrelia- 
bly, of number-checking ability measured un- 
reliably, and of two components of the usual 
type of intelligence test (arithmetical reasoning 
and topical classifying) measured with un- 
known reliability. Only the uninformed and 
inadequately trained personnel worker would 
be likely to adopt this test in practice. 


John M. Willits, Associate Professor of Psy- 
chology, Graduate School of Business, Stan- 
ford University, Stanford, California. The con- 
tent of this brief test may be appropriate to a 
number of clerical situations. Its four types of 
items all have a reasonable resemblance to com- 
mon, simple clerical tasks, And all of the item 
content is general enough to make it an aptitude 
rather than an achievement test. 

Mechanically, the test is adequately designed 
and printed. There are 10 sample items and 5 
practice items, adequately covering the test con- 
tent. Scoring is mechanically simple, with a 
single strip key that is accurately fitted, legible, 
and durable. 

The chief disadvantages of the test appear to 
be: (a) the inadequacy of the data in its manual ; 
(b) the shortness of the test, for statistical re- 
liability; and (c) the lack of part-scores on the 
different types of item. 

(a) The “manual” consists of a single type- 
written (mimeographed) page, plus a one-page 
“addendum to scoring instructions” (discussed 
below). This “manual” properly cautions the 
user that job success probably requires other 
attributes besides those measured by the test; 
and it gives simple, concise directions for ad- 
ministration and scoring. But there is no men- 
tion of either reliability or validity and only a 
single abbreviated set of percentile norms, with 
population also unmentioned. From the norms 
table, which is grouped in intervals of 5 raw- 
Score points, only fourteen different percentile 


ranks can be read directly without interpo- 
lation. 

(b) The test has only 100 items, all with 
unit weight in scoring. There is no correction 
for guessing; the printed instructions properly 
advise the examinee to guess. As 70 of the 
items are in two-choice form, and 20 more are 
four-choice, both with approximately balanced 
distribution of right answers, a chance score 
of 40 points is quite possible. This fact is rec- 
ognized in the norms table, which gives the per- 
centile rank of 1 to all raw scores below 40. 
Thus, the effective range is only 60 raw-score 
points. 

Thirty of the items are number-checking 
items, much like those of the Minnesota Voca- 
tional Test for Clerical Workers. The manual 
for that test reports an odd-even reliability 
coefficient of .86 for its 200 items of this type. 
If the length were reduced to 30 items, this 
reliability would be expected to shrink to .48. 
The reliability of the 30 similar items in the 
present test is probably no higher. No two of 
these items are presented adjacent to each 
other, while in the Minnesota test all 200 items 
are consecutive. 

Ten of the items in the present test are arith- 
metic problems in the extension of invoice and 
payroll entries. They are fairly similar to at 
least part of the problems in Part V of the 
Psychological Corporation General Clerical 
Test, where 16 problems have given a split- 
half reliability of .63. The expected shrinkage 
here, in a reduction to 10 problems, would give 
a reliability of .52. 

(c) All four types of items (the other two 
are classifying topic sentences from business 
letters and verifying simple arithmetic equa- 
tions) are different enough to suggest low in- 
tercorrelations among them. If the test were 
of reliable length, an average intercorrelation 
not higher than .40 might be expected, on the 
basis of other data on somewhat similar mate- 
rials. Given a clerical force large enough to 
permit differential placement of selected appli- 
cants, reliable part scores would be valuable on 
test materials as dissimilar as these. But such 
scores would require more than the present 12 
minutes of testing time, and several times 100 
items, besides rearrangement of material and 
sectional timing. 

The “addendum to scoring directions” seeks 
to overcome one defect of many spiral-omnibus 
tests, by setting up a scale of progressive penal- 
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ties for omission of difficult items—in this case, 
items of the “invoice-extension” type. But these 
penalties, of which the examinee is not warned, 
are not in harmony with the printed instruc- 
tions which tell him, “If you cannot solve any 
item readily, pass on to the next." Also the 
scale implies a steeper difficulty gradient in 
these items than in other types, although the 
others are also evidently arranged in graded 
difficulty. The "addendum" does not state 
whether the scale is arbitrary or based, as it 
should be, on accurate time studies of actual 
working times. 


[ 626 ] 
Detroit Clerical Aptitudes Examination. High 
school ; 1937-44; 1 form; $5.40 per 100; 246 per speci- 
men set, p 30(40) minutes; Harry J. Baker 
and Paul H. Voelker; Public School Publishing Co. 


REFERENCES 

r. Anperson, Roy N. “Review of Clerical Tests (1929— 
1942)." Occupations 21:654-60 My '43. * (PA 17:3218) 

E. F. Wonderlic, Vice-President, General 
Finance Corporation, Chicago, Illinois. This 
clerical aptitude test was developed on a large 
number of school students and for school pur- 
poses may be very desirable. Reliability and 
validity in school situations is high. However, 
it has not been standardized for industrial and 
business use. Before it could be useful in the 
employment department, it would require stand- 
ardization on adults, both those working at 
clerical work and applicants for those positions. 
It is possible that it would have predictive value 
if this were done. 

The test is made up of eight parts, which re- 
quire close administrative attention and which 
would make it difficult to administer in the typi- 
cal employment office. 


For reviews by Irving Lorge and M. W. 
Richardson, see 40:1655. 


[ 627 ] 
Minnesota Clerical Test. Grades 8-12 and adults; 
1033-46; formerly called Minnesota Vocational Test 
for Clerical Workers; 1 form; manual only revised in 
1946; $1.25 per 25, postpaid; 35¢ per specimen set, 
postpaid; Dorothy M. Andrew, Donald G. Paterson, 
and Howard P. Longstaff ; Psychological Corporation. 


REFERENCES 
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19. Lozvincer, J. An Analysis of Verbal and Numerical 
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Thelma Hunt, Professor of Psychology, The 
George Washington University, Washington, 
D. C. This test is a brief, easily administered, 
easily scored measuring instrument. It is a test 
of speed and accuracy in two types of perform- 
ance—number checking and name checking. 
Underlying the speed and accuracy of perform- 
ance it can also be presumed that the test meas- 
ures qualities of attention to a stated problem, 
ability to determine small differences, and at- 
tention to detail. There are absent from this 
test the more complex performances found in 
so many other tests labeled “clerical” or "'cleri- 
cal aptitude" tests, such as the parts measuring 
spelling ability, alphabetizing ability, arithmeti- 
cal calculation or reasoning ability, reading 
ability, and vocabulary. Despite its more lim- 
ited nature, in general studies have indicated a 
relatively high validity for the Minnesota test 
as compared with the more complex tests. 
Many users should welcome this simple speed 
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„and accuracy test. The other factors so often 
covered in the clerical tests can be measured 
by separate arithmetic, spelling, reading, or 
intelligence tests where desired. 

The manual contains far more information 
about the test than is included in the usual test 
manual, a fact for which the authors are to be 
commended. The test was originally developed 
through extensive research by the Employ- 
ment Stabilization Research Institute at the 
University of Minnesota. The studies made by 
this Institute are summarized. Since the publi- 
cation of The 1940 Mental Measurements 
Yearbook, a revised manual for the Minnesota 
Vocational Test for Clerical Workers (now 
called Minnesota Clerical Test) has appeared. 
It contains a summary of several additional 
studies of reliability and validity and much 
more information on norms than was contained 
in the earlier manual. The reliability of the test, 
as attested by statistical studies presented, is 
high. A satisfactory validity is indicated by 
correlations of the test with ratings of clerical 
workers, ratings of clerical ‘abilities-of commer- 
cial students, grades of students in accounting, 
and speed of typing by typing students. Infor- 
mation is presented to indicate that the clerical 
test scores predict performances in such cleri- 
cal fields as accounting and typing better than 
do customary academic ability measures. Low 
correlations reported between the test and 
years of clerical experience, amount of com- 
mercial training, grade of school completed, and 
intelligence suggest that the test is measuring 
something other than the academic and experi- 
ence factors so often measured in vocational 
tests. 

Some comment should be made upon the 
extensive norms presented in the new manual. 
There are norms according to school grade for 
grades 8 through 12, with additional norms for 
grades 11 and 12 for New England groups. 
For adult groups there are norms for em- 
ployed clerical workers (in general and broken 
down into types of clerical workers), for cleri- 
cal applicants, for unemployed clerical work- 
ers, and for adults gainfully employed (but not 
in clerical work). All the norms are presented 
separately for men and women, and all in terms 
of percentile ratings. The extensive norms may 
seem confusing. They are of most use only 
when the user can carefully study his own 
problem and relate the performance of those 
tested to the most applicable group. In voca- 


tional guidance a counselor is chiefly interested 
in the abilities of an individual in comparison 
with the abilities of persons in general, so that 
among all the aptitudes of the individual he 
may find those which are the most outstand- 
ing. On the other hand, the employer using the 
Minnesota test is likely to be most interested 
in the comparison of the performance of his 
applicants with other clerical workers. In the 
case of the employer using this test, as well as 
any other clerical test, it should be pointed out 
that norms should be carefully scrutinized for 
comparability of the groups on which the au- 
thors’ norms are based with the employer’s own 
workers. Even with the best situation in respect 
to published norms, many users will find it de- 
sirable to set up their own norms. 

On the whole, the Minnesota Clerical Test 
can be recommended as a very usable test for 
selecting promising clerical workers, or for 
guidance in the selection of students for clerical 
training. In many instances it may need to be 
supplemented by more complex tests covering 
other qualities, but there is an advantage in 
having separately the simpler measures yielded 
by the Minnesota test. The examination has 
stood the test of wide use and acceptability over 
a period of fifteen years. 


R. B. Selover, Assistant Personnel Director, 
The Prudential Insurance Company of Amer- 
ica, Newark, New Jersey. This test was con- 
structed for selecting clerical employees and for 
advising persons who wish to secure training in 
the clerical field. The test consists of two parts, 
number checking and name checking. In each 
part there are 200 items, 100 of which are 
identical pairs and 100 of which are dissimilar 
pairs. The numbers range from 3 through 12 
digits; the names range from 7 through 16 let- 
ters. The tests are timed separately at 8 and 7 
minutes. The score is the number right minus 
the number wrong. 

The manual revised in 1946 reports infor- 
mation concerning reliability, validity, relation- 
ships with other tests, and extensive norms. 
The reliability of the total test, determined by 
administering comparable forms at one testing, 
is .go. Retest reliability coefficients are about 
.8s for combined performance and somewhat 
lower for the separate tests. High and signifi- 
cant relationships are reported between per- 
formance on this test and commercial teachers’ 
ratings of students, grades in accounting, rat- 
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ings of general clerical workers, and ratings of 
file clerks. Association is reported between test 
performance and speed of typing. Significant 
differences are reported between clerical work- 
ers and other groups. Although the two tests 
show considerable relationship, quite obviously 
separate skills are measured. The name check- 
ing test is more closely related to speed of read- 
ing, spelling, and conventional measures of in- 
telligence, while the number checking test is 
more closely related to verifying arithmetic 
computations. Studies indicate that test per- 
formance is only slightly related to experience 
on clerical jobs and amount of formal educa- 
tion. A factor analysis by one of the authors 
indicates that “Number Checking measures 
more of a numerical factor and Name Check- 
ing more of a verbal factor.” 

There is a possibility that the prospective 
user of this test may be confused by some of the 
information reported in the revised manual. 
From the fact that clerical groups frequently 
show small and nonsignificant differences in 
test performance, it is concluded “that the cleri- 
cal test used alone or in combination with in- 
telligence tests can do no more than serve as a 
guide toward or away from clerical occupa- 
tions as a group.” The reader might quite prop- 
erly ask to see relationships between test scores 
and criterion data for separate groups before 
accepting this conclusion. In explaining the 
relationships found between clerical test scores 
and intelligence for heterogeneous groups, the 
problem might be dealt with more effectively 
by reporting the authors’ previous work show- 


ing how little relationship remains when speed. 


of reading is held constant. 

It is the reviewer’s opinion that this clerical 
test is valuable for selecting certain clerical 
employees, as well as for advising students 
with respect to commercial training. Progress 
in the refinement of measures of clerical abili- 
ties will probably be made by continuing such 
attempts to measure specific abilities which 
show significant relationships to criterion data. 


Erwin K. Taylor, Chief, Performance Evalu- 
ation and Criterion Research Unit, Personnel 
Research Section, The Adjutant General’s Of- 
fice, Washington, D. C. The Minnesota Voca- 
tional Test for Clerical Workers (now called 
Minnesota Clerical Test), consisting of 200 
number checking and 200 name checking items, 
is too well known to test users generally to 


require elaborate description here. A total of 
15 minutes is required for the administration of 
the test which is hand scored with a key supplied 
by the publisher. 

While it has become an almost accepted doc- 
trine in industrial psychology that name and 
number checking constitute valid measures of 
clerical success, it appears to the writer to be 
somewhat presumptuous to imply, as does the 
title of this test, that these two elements are 
adequate for exclusive use in the selection of 
clerical workers. The prediction of clerical suc- 
cess would undoubtedly be improved if the 
name and number checking were supplemented 
by tests of verbal intelligence and computa- 
tional ability. 

Neither the reliability nor validity data pub- 
lished in the test manual can be accepted as 
adequate to justify the use of this test without 
further research in each given instance. Relia- 
bilities of .86, .93 and .90 are given for the 
number checking, name checking and a com- 
bination of the two, respectively. These are 
odd-even reliabilities based on less than 150 
cases. Since the test is essentially a speed test, 
these figures must be taken as gross exaggera- 
tions of the true reliability of the instrument. 
Test-retest reliabilities on 48 business students 
(university) and 28 individuals in a clerical 
training group range from .76 to .93, but are 
based on so small a sample as to preclude the 
drawing of any conclusions on the basis of 
these data. 

Since the validity of a test is crucial to its 
usefulness, the writer is much more concerned 
with the published data in this area than with 
reliability. Here again the outstanding feature 
of the manual is a scarcity of acceptable data. 
The product-moment correlation between per- 
sonal history ratings based on education, ex- 
perience and training, and age at leaving school 
is .65 for a group of 138 cases. These, however, 
cannot be considered valid vocational criteria. 
Where supervisors ratings were obtained, con- 
tingency coefficients ranged from .28 to .42 on 
groups varying in size from 22 to 97. The rat- 
ings, consisting of a combination of personal 
history and supervisors’ ratings, yielded con- 
tingency coefficients ranging from .54 to .64 on 
equally small groups. In no case is there a de- 
scription of the positions for which the test was 
"validated." 

Norms on the following employable groups 
are presented in the manual: (a) employed 
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„Clerical workers, 284 women and 120 men; 
(b) gainfully occupied adults, 232 women and 
500 men; (c) eastern clerical applicants, ap- 
proximately 1,500 women and 750 men; and 
(d) southern clerical applicants, 162 women. 
In addition nine sets of distributions of em- 
ployed clerical workers and six sets of unem- 
ployed clerical workers by classification are 
presented. These groups vary in size from 17 
male bank tellers to 121 female stenographers 
and typists. War Department research with 
civilian clerical employees has indicated such 
norms to be highly unstable, varying from job 
to job and between similar installations. Forty- 
four distributions of scores made by various 
groups of school children are also presented. 

Under these circumstances, it is doubted 
whether the data presented in the test manual 
can be of much real value in using the test 
either as a selection or guidance instrument. 
Surely not enough in the way of validity stud- 
ies have as yet been published to render the 
Minnesota Vocational Test for Clerical Work- 
ers a device upon which any considerable reli- 
ance can be placed for guidance purposes. The 
one possible exception occurs in the case of 
those who score in the lowest decile on the two 
tests. One could be reasonably certain that such 
individuals would be unlikely to succeed in 
clerical operations in which elements common 
to the test were heavily weighted. With respect 
to selection, the situation is somewhat happier. 
Where it is feasible to continue research for the 
selection of clerical workers, it might be well to 
include a test such as this in a preliminary bat- 
tery to be validated. 


E. F. Wonderlic, Vice-President, General 
Finance Corporation, Chicago, Illinois. This is 
the most widely used clerical test in business 
and industry. It has proved useful in many 
companies for the selection of various types of 
clerical workers. Some personnel research 
workers have found that test results are also 
predictive of success in other than clerical oc- 
cupations. The latest manual, revised in 1946, 
gives much significant data and summaries of 
experiences with this test. Psychotechnical 
standards are clearly met in this publication. 

The simplicity of the test when casually ex- 
amined may mislead the uninformed. While it 
is simple to administer and to score, its inter- 
pretation requires study and practice. Sex and 


educational differences are pronounced. Age 
factor, over 40 years, is important. 

The timing is a little awkward. Number 
Checking requires 8 minutes and Name Check- 
ing requires 7 minutes. These differences in 
time intervals are easily overlooked or reversed 
by clerical test administrators and thus cause 
errors in the final scores and resulting inter- 
pretations. 

The consistent use of this test is a satisfac- 
tory indication of its acceptance and value in 
personnel administration. 


For a review by W. D. Commins, see 

40: 1664. 
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N.LI.P. Clerical Test: American Revision. Ages 
16 and over; 1034; an American revision of the Na- 
tional Institute of Industrial Psychology Clerical Test, 
Series 25; $2.10 per 25, postpaid ; 35¢ per specimen set, 
postpaid; ( 39) minutes ; American revision by Herbert 
Moore; distributed by Psychological Corporation. 


REFERENCES 
1-2. See 40:1665. 


3. Osernetm, Grace M. “A New Use for the Institute's 
Clerical Test in America,” Occupational Psychol 16:83-6 Ap 


"4a. * (PA 1633297) Ly 
4. OBERHEIM, Grace M. “The Prediction of Success of Stu- 
dent Assistants in College Library Work." Ed & Psychol Meas: 


SESE p EA GED m qup Genge of Yon 
Children.’ Occupational Psychol 16:31-8 Ja '42. 

6. OnerHEIM, GnAcE M. “The Relationship Between Scores 
on a Clerical Test and Clerical Proficiency in Library Work.” 
J Ed Psychol 35:493-9 N '44. * (PA 19:1020) 

R. B. Selover, Assistant Personnel Director, 
The Prudential Insurance Company of Amer- 
ica, Newark, New Jersey. This test is an attempt 
to measure general clerical ability by combining 
performance from seven short subtests. The 
subtests present problems in immediate recall 
of information presented orally, classification 
of miscellaneous items according to six head- 
ings: arithmetic computation, copying names 
and symbols, checking names and numbers, fil- 
ing, and reasoning. The subtests are adminis- 
tered with separate time limits ranging from 2 
to 5 minutes each. The total test can be ad- 
ministered in 30 minutes and corrected in 5 
minutes. 

The manual reports deciles for high school 
students and college seniors. Median scores on 
the subtests are reported for high school seniors 
and college seniors. No information is given 
in regard to the reliability or validity of this 
test for selecting clerical workers. The test 
items show evidence of care in construction; 
much of the material presents considerable sur- 
face validity. 

It is not probable that seven separate abili- 
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ties are measured by the subtests, although 
several abilities are probably sampled. Speed 
is apparently an important factor in most of 
the tasks. The subtests are probably too short 
to be used individually with much confidence. 
Apparently this test was constructed by arbi- 
trarily adding together performance on the 
separate tasks without reference to either the 
interrelationships of the subtests or the rela- 
tionships between the subtests and criterion 
data. It is the reviewer's opinion that the test 
could be improved by the careful selection of 
fewer subtests of greater length. 


For a review by Donald G. Paterson, see 

40:1665. 
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O'Rourke Clerical Aptitude Test, Junior Grade. 
Applicants for clerical positions; 1926-36; 2 parts; 
$1.75 per 50 of either part, postpaid ; 25¢ per specimen 
set of either part, postpaid; L. J. O'Rourke; Psycho- 
logical Institute. 
d) CLERICAL PROBLEMS. 1926-35; Form 1; 25(30) min- 
utes, 
b) REASONING TEST. 1926-36; Form A; 20(25) min- 
utes. 


REFERENCES 

1. CorzLAND, Herman A. “Some Characteristics of Three 
Tests Used to Predict Clerical Success.” J Appl Psychol 
20:461-70 Ag '36. * (PA 11:421) 

a. ANDERSON, Roy N, “Review ot Clerical Tests (1929- 
1942)." Occupations 21:654-60 My '43. * (PA 17:3218) 

3. Woopy, Crrrromp. Guidance Implications From Measure- 
ments of Achievements Aptitudes and Interests. Universit; 
of Michigan, School of Education, Bureau of Educational Ref- 
erence and Research, Bulletin No. 156. Ann Arbor, Mich.: the 
School, September 1, 1944. Pp. viii, 162. Paper. $1.25. * (PA 
20:573) 

Raymond A. Katzell, Associate Professor of 
Psychology, The University of Tennessee, 
Knoxville, Tennessee. This is a battery of two 
tests, including a reasoning test and a clerical 
problems test. The two tests are published 
separately and could be used independently if 
desired. 

The reasoning test contains a total of 45 
items, which are grouped roughly according to 
the following types: following directions, gen- 
eral information, interpretation of proverbs, 
spelling, verbally stated arithmetic problems, 
word usage, and word relations. The test there- 
fore seems to be essentially an intelligence test 
which measures mainly verbal factors. It is 
geared in difficulty and context to an average 
adult population. 

The test is printed on both sides of a single 
sheet of paper and has no cover page. Thus, 

there are opportunities for examinees to read 
some of the items prior to the starting signal. 


This is of some consequence in view of the fact 


that there is a short time limit. Present users. 
of the test will face a problem in handling Item 
6, which is based on postal rates. These rates 
were recently changed so that the answer which 
is now correct does not appear among the five 
alternatives offered. 

The clerical problems test consists of 6 
parts: (a) alphabetical filing; (b) arithmetic; 
(c) classification; (d) checking names and 
numbers; (e) arithmetic computations; (f) 
alphabetical filing as in part (a). There is a 
single time limit during which the examinee 
is to work on the 6 parts in succession. 

A table of percentile norms is provided for 
each test separately and for the sum of the 
score on the clerical problems test plus ten 
times the score on the reasoning test. The basis 
for this weighting is not indicated. The norms 
were computed for a group of 792 "typical" 
applicants for clerical positions. Although no 
further information is given concerning the 
applicants or the jobs for which they applied, 
these norms are probably more useful for vo- 
cational guidance and selection than the oft- 
found norms based on school or general popu- 
lations. 

Unfortunately, no data are provided on the 
construction, reliability, or validity of these two 
tests. Anderson (2) cites a letter from the 
tests’ author which indicates satisfactory relia- 
bility and validity with groups of clerical work- 
ers in industry. But such information is of 
limited value unless supplied directly to pros- 
pective users. Also, unelaborated citations of 
validity coefficients are not enough; the infor- 
mation furnished should include characteristics 
of the population studied, the jobs they filled, 
and the criteria of success used. The intelligent 
selection of tests is so severely handicapped by 
the absence of these data that every author of 
an occupational aptitude test should feel 
obliged to furnish them when the test is offered 
for general sale. 

On the whole, the battery of two tests seems 
to have been designed in line with the common 
practice of measuring aptitude for clerical 
work in terms of general abstract intelligence 
and ability to perform simple detailed tasks 
speedily and accurately. There have been a 
number of demonstrations of good results 
through this approach. 

However, the question may be raised whether 
a single general alertness score like that yielded 
by the reasoning test is really the most useful 
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„for clerical placement. Since different clerical 
positions involve different relative amounts of 
verbal and arithmetical aptitudes, it would be 
advantageous to have a test that would yield 
separate scores for these two areas. Such a 
breakdown can aid in differential placement 
and also, through multiple regression weight- 
ing of the two scores, enhance the validity of 
the test for any one job. This worth-while fea- 
ture is incorporated in several recent measures 
of clerical aptitude, such as the Psychological 
Corporation General Clerical Test. 

The arrangement of the clerical problems 
test may be objectionable to those who would 
prefer a test with separate scores for each of 
the parts as an aid to differential diagnosis. 
However, the inclusion of this feature would 
necessitate the lengthening of each part and 
would add considerably to the total time con- 
sumption. Since the test as it stands probably 
measures mainly a group of aptitudes related 
to speed, perception, and manipulation of sim- 
ple details, all of which enter into a number 
of the more routine clerical jobs, it should be 
quite useful in providing a quick over-all ap- 
praisal of this important aspect of clerical 
aptitude. 


[ 630 ] 

*Psychological Corporation General Clerical 
Test. Grades 9 and over; 1944; revision of General 
Clerical Test, PCI Selection Form 20 (1942) ; 1 form; 
$3.25 per 25, postpaid; 35 per specimen set, postpaid ; 
46(50) minutes; this test is also available in two sepa- 
rate booklets (Partial Booklet A, Speed and Number ; 
and Partial Booklet B, Verbal) at $1.80 per 25; Psy- 
chological Corporation, 


Edward N. Hay, Personnel Officer, The 
Pennsylvania Company for Banking and Trusts ; 
and President, Edward N. Hay and Associates, 
Inc.; Philadelphia, Pennsylvania. This test is 
well designed for use in counseling situations. 
It gives three types of scores: an over-all score 
which, according to the authors, "may be re- 
garded as an index of general clerical aptitude 
(and which) correlates quite highly with the 
scores of general intelligence tests”; three sec- 
tion scores for “routine clerical aptitude, pro- 
ficiency in mathematics, and verbal facility” ; 
and 9 part scores, each resulting “from a spe- 
cific type of clerical performance." Theoretically, 
the counselor should be able to guide the subject 
into that aspect of clerical work for which he 
is best fitted. 

At present, however, the test offers no norms 
for specific occupations. Validity is merely sug- 


gested by the difference between percentile 
scores for high school students, industrial 
workers, and office workers—type unspecified. 
No information about interpreting scores has 
been included in the manual. 

The test is not practical for an employment 
situation. It is too long, requiring over 45 min- 
utes for administration. The authors state that it 
is permissible to administer and score the sub- 
tests independently ; but since they are printed 
together in booklet form, it is impractical to do 
so. The reported correlation of .74 between the 
over-all score and a test of general intelligence 
indicates that it would be repetitious to include 
both tests in a selection battery ; but if the cleri- 
cal test were given alone, there would be no way 
of determining how much of the applicant’s score 
could be attributed to mental alertness and how 
much to the ability to handle routine clerical 
tasks. Finally, the fact that there has been no 
validation of the test in business situations mili- 
tates against its use for employment. 

The format of test and manual are pleasing, 
and the authors have supplied additional norms 
to supplement those originally printed. They 
have also made a factor analysis, in which they 
found that Parts 1 and 2 are “true” measures of 
“clerical” ability, in that they are relatively free 
of both the verbal factor, which is loaded in 
Parts 6, 7, 8, and 9, and the numerical factor, 
loaded heavily in Parts 3, 4, and 5. The logical 
application for this information would be to 
simplify the test. 

It is unfortunate that the authors have made 
the test available before it has been validated and 
that they have not recognized the fact that the 
best procedure in employment situations is to 
use a mental ability test for range of ability and 
an indication of promotability, supplemented by 
pure tests of aptitude for handling verbal and 
numerical material, having minimal correlation 
with mental ability items. 


Thelma Hunt, Professor of Psychology, The 
George Washington University, Washington, 
D. C. This is a relatively new clerical test. In 
nature of the test items, however, it follows the 
general pattern of earlier ones of the more com- 
plex clerical tests. There are nine parts to the 
test, each timed separately. The parts involve 
identification of errors in a copy of original ma- 
terial, identification of errors in an arithmetical 
table, alphabetizing, arithmetic computation, 
arithmetic reasoning, spelling, reading, vocabu- 
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lary, and grammar. By suggestions for use as 
given in the manual, and by norms, these nine 
parts are grouped into three sections (from 
which section scores are derived) covering, re- 
spectively, routine clerical aptitude, proficiency 
in mathematics, and verbal facility. This avail- 
ability of the three section scores would seem 
to be one of the chief advantages of this test over 
other similar clerical tests. Since clerical work- 
ers constitute by no means a unitary group in 
respect to duties, many users may find it helpful 
to weight ratings on these three subparts of the 
test in proportion to their importance in particu- 
lar jobs. The authors suggest the feasibility of 
utilizing the test as a whole, or in any of its three 
sections or nine parts. Users should exercise 
some caution, however, in utilizing alone some 
of the nine separate parts, since reliabilities are 
of marginal adequacy. 

The statistical information on the test supplied 
in the manual includes: (a) intercorrelations 
between the nine parts, which are for the most 
part between .30 and .50; (b) reliabilities for the 
nine separate parts; (c) correlations of the test 
with Modified Alpha Examination Form 9; and 
(d) norms for high school senior girls (com- 

` mercial students), clerical employees, draftsmen, 
and miscellaneous industrial workers. There is 
also a statement that factorial analyses have 
indicated factors agreeing with the three divi- 
sions of the test as indicated in the section scor- 
ing. The correlation of the total clerical test with 
the Modified Alpha Examination is .74, the 
verbal subsection correlating higher than the 
other two sections with the intelligence test. No 
information is given on validity of the test in 
actual use either in vocational guidance or in 
employment work. No references to reported 
studies in the literature are given, The test is as 
yet too new to expect many follow-up studies to 
be available. It is to be feared that there is noth- 
ing unusual enough in its nature or make-up to 
stimulate rapid research on it or study of it. 

One could well afford to give the test serious 
consideration if looking for one of the more com- 
plex types of clerical tests. The directions for 
giving and scoring are clear, the test layout and 
printing are attractive and of such a nature as 
to minimize the testees’ problems in following 
through a test. To many, the advantages of the 

three sectional scores will be a desirable feature. 


Raymond A. Katzell, Associate Professor 
of Psychology. The University of Tennessee, 


Knoxville, Tennessee. This test is evidently, 
based on the concept that there exist several 
aptitudes which, in different patterns, enter into 
different types of clerical jobs. It has therefore 
been designed to measure various aspects of 
clerical aptitude and to express them in sepa- 
rate scores. This reviewer believes that such an 
approach has two very important advantages: 
it makes provision for maximizing the validity 
of the test for any one type of job by making 
possible optimum weighting of the various as- 
pects measured, and it permits differential diag- 
nosis of an individual's suitability relative to 
different types of clerical positions. 

The General Clerical Test comprises nine 
parts published together in a single booklet. Each 
part is a subtest having its own directions, time 
limit, and score. The contents of the parts may 
be described as follows: 1, checking names and 
numbers; 2, alphabetical filing; 3, arithmetic 
computations ; 4, arithmetical checking ; 5, arith- 
metical problem solving; 6, spelling; 7, para- 
graph comprehension; 8, vocabulary; 9, gram- 
mar. Past research on clerical aptitudes has 
demonstrated the usefulness of such types of 
tests. 

The printing and format of the test booklet 
are excellent. Instructions are concise and gen- 
erally clear, with the exception of those to Part 4. 
In this instance the instructions are somewhat 
lengthy, but, in spite of this, experience with 
clerical applicants in industry has shown that 
they sometimes have difficulty in understanding 
exactly what is to be done until given further 
explanation. This can probably be attributed in 
part to the rather unusual nature of the task 
itself. Part 9 also may present some problems 
to the user. A few of the items, which are of the 
free-choice type, may have more than one con- 
ceivable answer, although only one is given on 
the scoring key. Items 1, 9, and 17 seem to fall 
in this category. 

The test provides for the computation of a 
total score which is the sum of the number of 
correct answers in all parts. It will be possible, 
of course, for each user of the test to develop 
on the basis of the subtests a multiple regression 
equation for predicting success in each particu- 
lar job in which he may be interested. This 
procedure would make possible the computation 
of a weighted total score that would be a more 
valid indicator of aptitude for that job than the 
unweighted score now yielded by the test. 

Provisions are made for the computation of 
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. three subtotal or section scores. The “routine 
clerical” subtotal is the sum of the scores on 
Parts 1 and 2. The “numerical” subtotal is the 
sum of the scores on Parts 3, 4, and 5. The 
"verbal" subtotal is the sum of the scores on 
the remaining 4 parts. Two separate factorial 
analyses were performed in an effort to check 
on the authenticity of these groupings. Dr. G. K. 
Bennett kindly made available to the reviewer 
a copy of the rotated factor loadings. The results 
of the two analyses are not in very close accord, 
which is probably due in part to the relatively 
small number of cases in each sample (108 and 
120). However, the results generally support 
the legitimacy of the three groupings. The sub- 
totals overlap considerably with respect to the 
factors measured, but each does seem to empha- 
size the area of aptitude ascribed to it in its 
title. 

For practical purposes, the test is somewhat 
long and perhaps unnecessarily so in view of the 
overlap existing among several of its parts. For 
example, Parts 1 and 2 are factorially similar 
to each other. The same can be said for Parts 3, 
4, and 5; for Parts 6 and 9; and for Parts 7 
and 8. One wonders whether a test composed of 
one of the parts from each of these sets, say 
Parts 2, 3, 6, and 8, might not be substantially 
as effective as the total battery. Of course, when 
the test is validated for any particular job, the 
results will indicate which parts, if any, may be 
eliminated without seriously reducing validity. 

With the exception of Part 5, the parts all 
have split-half reliabilities above .75 (corrected 
for length). Those of Parts 1 through 4 are prob- 
ably overestimates of their true reliability, since 
these subtests are highly speeded. The split-half 
reliability of Part 5 is only .63. The reliabilities 
of the total and subtotal scores are not reported, 
but are probably all above .8. 

Norms based on several different kinds of 
occupational groups are provided for total, sub- 
total, and part scores. 

Unfortunately, no decisive data on validity 
are reported with the test. However, for maxi- 
mum utility, a test of this sort should be vali- 
dated and standardized locally for each type of 
position for which it is to be employed. Once 
this preliminary research is done, the user will 
probably be in possession of an effective instru- 
ment for the placement of clerical personnel. 


E. F. Wonderlic, Vice President. General 
Finance Corporation, Chicago, Illinois. This test 
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is made up of nine subtests, each requiring con- 
stant attention and administration. The sub- 
tests remind one of the Army Alpha. Correla- 
tions with intelligence tests are not reported, but 
it seems likely that there might be a high rela- 
tionship, thus eliminating the necessity for the 
use of such a test if a satisfactory intelligence 
test is used in the test battery. The timing inter- 
vals vary from 3 to 7 to 81 minutes for each 
group, making it particularly difficult to admin- 
ister in an employment office. 

The norms are established on 143 office work- 
ers, 195 industrial workers, and 380 high school 
students. This number of cases hardly justifies 
the serious use of this test as a selecting aid in 
business. Information in the test manual is re- 
fined from a statistical point of view, but the 
refinement is hardly justified on the basis of the 
number of cases involved. Such a test, if care- 
fully standardized on thousands of cases from 
individuals in different parts of the country and 
in different companies, might be of value to the 
personnel technician. However, its extremely 
complicated administration, because of irregular 
time intervals, makes it impractical for the typi- 
cal employment office. 

[ 631 ] 
Survey of Working Speed and Accuracy. Grades 
9-16 and adults ; 1943-44; $1.75 per 25; 35¢ per speci- 
men set, postpaid; 20(25) minutes; Floyd Ruch; Cali- 
fornia Test Bureau. 


Edward N. Hay, Personnel Officer, The 
Pennsylvania Company for Banking and 
Trusts; and President, Edward N. Hay and 
Associates, Inc.; Philadelphia, Pennsylvania. 
This test is intended for “selection, placement, 
upgrading, and transfer” of routine office 
workers and industrial checkers and inspectors 
who perform clerical operations. 

Tt consists of four parts: The first part, 
Number Checking, includes letters and symbols 
and is set up with a left-margin alignment 
which is unusual in business situations. The 
second test, Code Translation, supposedly 
“measures the ability of the applicant to learn 
and apply complicated procedures.” Next is a 
dotting test, to measure “finger dexterity as it 
is involved in filing and in the operation of a 
wide variety of office machines,” and which, 
the author says, “is also meant to measure the 
ability and willingness of the applicant to work 
at a highly monotonous task.” No one has ever 
proved that a so-called “dexterity test” will 
usefully predict success in operating office ma- 
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chines, and no evidence is offered that any use- 
ful information about temperament can be 
obtained on a test of this kind. A further draw- 
back to the finger dexterity test is that it is 
printed on the same page with Code Trans- 
lation, and the 5-minute time allowance may 
very easily lead some subjects to abandon dot- 
ting in favor of completing Test 2. The final 
test, Counting, requires the subject to count the 
vowels in each line of meaningful prose and is 
"intended to measure the type of ability in- 
volved in proofreading and in checking reports 
and statistical tables." No data are given in 
support of this claim. 

No mention is made of validity in connection 
with the Counting Test, although the other 
three tests have been found "valid," the author 
says, in a study of a group of office workers in 
an insurance company. The fact that 88 per 
cent of workers rated "superior" by their su- 
pervisors had scores on each of the three tests 
which exceeded the median scores for those 
rated "inferior," is scant evidence for validity, 
particularly since there is no indication of the 
qualities on which ratings were based; of the 
number or type of ratings; of the sex, age, or 
occupation of the workers; or even of the 
number of individuals included in the sample. 

Norms are given in percentile scores, based 
on a population of 400 males and 400 females, 
aged 16 to 5o. No data about distribution of 
ages, education, occupation, or even the source 
of subjects are given, and the author has ap- 
parently overlooked the findings of Andrew 
and others on the sex differential in clerical 
operations. 

The Survey of Working Speed and Accur- 
acy cannot be properly evaluated until more in- 
formation is given about what it measures and 
how efficiently it measures it. 


Donald G. Paterson, Professor of Psy- 
chology, University of Minnesota, Minneapolis, 
Minnesota. This test, although admittedly a 
clerical test, is correctly labeled so as to be ap- 
plicable to workers performing routine clerical 
operations in office or factory whether such 
workers are identified by a "clerical job label" 
or not. There are four subtests: Test 1, Num- 
ber Checking, 150 pairs using numbers along 
with code letters, plus, minus, per cent, and 
dollar signs to make the test more closely re- 
semble office and shop clerical procedures; 
Test 2, Code Translation or a letter-digit sub- 


stitution test with 26 pairs in the key to be 
learned and 20 lines of about 15 numbers each * 
to be filled in; Test 3, Finger Dexterity, con- 
sisting of 50 lines of small circles in each of 
which the subject is to put a pencil dot ; and Test 
4, counting the number of each kind of vowel 
in 40 lines of printed text. Time limits appear 
to be adequate, being 5 minutes for each sub- 
test or 20 minutes working time. It is possible 
that the arrangement of the four subtests on 
three pages may tend to invalidate the results 
because of the difficulty of preventing subjects 
who are unusually adept at understanding di- 
rections from working ahead on a subtest while 
others are still listening to the examiner read 
or explain the directions. 

A two-page, undated revised manual is pro- 
vided, the first page consisting of directions for 
giving and scoring the test, data on validity 
and reliability, and standards or percentile 
norms and the second page giving scoring keys 
for each subtest. 

Judged on the basis of face validity, the test 
appears to be most promising. Data on validity 
are inadequate; they show the percentage of an 
unknown number of life insurance office work- 
ers rated as “superior” who exceeded the me- 
dian of an unknown number rated as “infe- 
rior” on each of three of the subtests and on 
the total score for the three subtests, No test- 
retest reliability correlations are given, Inter- 
correlations based on only 100 cases are given 
for the four subtests. These range from .15 to 
.59, code translation and number checking 
being 59. Correlations between each subtest 
and total score and standard intelligence tests 
for the general population classified by sex or 
for employed clerical workers classified by sex 
and occupation are not given. 

Norms are given in terms of percentile ranks 
for a so-called true cross section of the adult 
population of both sexes from 16 to 50 years 
available for employment (N = 400 males and 
400 females), but the percentile ranks are given 
only for N = 800 so that any sex differences that 
may and in all probability must exist, at least 
on the number checking test, are not revealed. 
Thus, norms for age and sex, school grade, 
type of curriculum, specific occupations, em- 
ployed workers, unemployed workers, etc. are 
not given. 

In the reviewer's opinion, this is a most 
promising test of clerical aptitude and deserves 
intensive research to provide the data which 


643 ] 


_ are essential to an adequate interpretation, of 
the test results when used for employment or 
guidance purposes. 


Erwin K. Taylor, Chief, Performance Evalu- 
ation and Criterion Research Unit, Personnel 
Research Section, The Adjutant General's 
Office, Washington, D. C. In the years fol- 
lowing the first World War, the impetus 
given the testing movement by the use of tests 
in the Army resulted in the widespread pub- 
lication of numerous unvalidated and poorly 
standardized paper-and-pencil tests about which 
the authors frequently made exorbitant claims. 
This resulted, as is too well known, in an al- 
most complete discrediting of tests for use in 
industrial selection. A very real danger of a 
repetition of this situation exists today. If 
paper-and-pencil tests are to serve any useful 
purpose in the field of industrial psychology, a 
considerable amount of research must neces- 
sarily precede their publication. From all that 
appears in the revised manual, this test has 
teen released prematurely. “The validity of 
this test,” says the manual, “has been checked 
by a number of personnel people and through 
a correlation of scores with performance on 
the job as rated by supervisors.” What sort of 
checks these “personnel people” obtained is not 
specified, nor are we informed as to the mag- 
nitude of the correlations referred to or the 
number of cases in each. The only data quoted 
are on an unspecified number of superior 
clerks, 88 per cent of whom exceeded the 
median of a so-called “inferior” group on each 
of three subtests. 

No reliability data whatsoever are given. An 
intercorrelation matrix of the several subtests, 
based on 100 cases, is presented. Norms based 
on a cross section of the adult population of 
both sexes, age from 16 to 50 years, are pre- 
sented and consist of 400 of each sex. Use of 
this test on anything but a purely experimental 
basis is not recommended. 

For those who may entertain the notion of 
using this test, it may be interesting to note 
that it consists of the following parts: (a) a 
150-item number checking test, bearing the ad- 
ditional “face valid” item of decimal points, 
fractions, percentage signs, dollar signs, aster- 
isks, and other such symbols commonly found 
employed in business transactions; (b) a 20- 
line letter-substitution coding test contami- 
nated by the employment of meaningful ma- 
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terial; (c) a 2,000-item dotting test; and (d) a 
vowel cancellation test called "counting." The 
manual presents arguments rationalizing each 
of the four tasks. The validity of these argu- 
ments is not substantiated by any data. 


[ 632 ] 
Thurstone Examination in Clerical Work: Thur- 
stone Employment Tests. Clerical applicants; 1922; 
Form A; $1.60 per 25; 35¢ per specimen set, postpaid ; 
nontimed (60-80) minutes; L. L. Thurstone; World 
ook Co. 


REFERENCES 
i. THunsTONE, L. L. “A Standardized Test for Office 
Clerks,” J Appl Psychol gassi S39. 
2. STEDMAN, MzLIssA Branson. “Factors Influencing School 
Success in Bookkeeping.” J Appl Psychol 14:74:82 F '30. * (PA 


4:4556) 3 
3. JORGENSEN, C. “Analysis of Some Psychological Tests 


b 

the’ Spearman Factor Method." Brit J Ed Psychol 4:96109 F 
34. 1431 

4. DavipsoN, CuanLzs M. “Analysis of Clerical Tests." Per- 
sonnel J 16:95-8 S '37. * ae TRA 

s. Davinsow, CmamLEs M. “Evaluation of Clerical Tests." 
Personnel J 16:57-64 Jesar: * (PA 11:4713) 

6. Hates, W. M. “Clerical ‘Tests in State Reformatory.” 
Personnel J 16:316-24 Ap '38. * (PA 12:4278) 


John M. Willits, Associate Professor of Psy- 
chology, Graduate School of Business, Stan- 
ford University, Stanford, California. In the 
historical background of clerical aptitude test- 
ing, this is a highly significant test. In the 
present-day practice of such testing, However, 
it is an anachronistic survival. The test and 
its manual bear the copyright date 1922—and 
evidently neither one has been revised since 
then. When first issued, a quarter of a cen- 
tury ago, this test was a tremendous forward 
stride in the application of psychology to per- 
sonnel selection. Its author, then as now a 
leader in the testing field, has since moved on 
to other and no doubt more challenging areas 
of research and development—but the test re- 
mains as he left it. 

The continued availability of this test through 
the years might be claimed as empirical evi- 
dence of its merit. Quite likely it does have 
numerous satisfied users, who may have (and 
surely should have) individually validated and 
standardized it. But this review is intended for 
prospective users rather than for past ones, 
and they have a right to expect in the manual 
of a 25-year-old test (or of any commercial 
test) far more of objective data than this one 
offers. 

Strictly speaking, the test has no manual— 
only two pages of “directions” and a key. No 
data whatever are given on validity or reli- 
ability, beyond the statement that “it has been ` 
experimentally verified.” The only guide for 
interpretation of scores is a set of general 
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norms (on a wholly unmentioned population) 
giving 5-step adjectival ratings on "accuracy," 
"speed," and "combined speed-and-accuracy,” 
with a recommendation to use only the combi- 
nation ratings. 

Scoring is by the now largely outmoded 
"time-plus-errors" method. As only the total 
time is recorded, there is no practical way of 
utilizing part-scores on the eight widely differ- 
ent subtests. The directions ignore the matter 
of part-scores, while stating that the tests pur- 
pose is to supplement the employment inter- 
view in determining "at least roughly the ap- 
plicant's general intelligence as well as his 
accuracy and speed in doing clerical tasks." 

"The several parts of the examination are 
sample clerical tasks," the directions continue. 
“The last part of the test is a generally ac- 
cepted intelligence test." The last part of the 
test consists of ten proverb-matching items and 
one sample item. Ten items of this sort may 
have constituted “a generally accepted intelli- 
gence test” in 1922, but they would meet with 
no such acceptance today from anyone even 
slightly acquainted, for instance, with Dr. 
Thurstone’s distinguished work on the primary 
mental abilities. 

Most of the subtests seem to be well de- 
signed, as aptitude counterparts of clerical 
achievement worksamples, An exception is the 
one which requires the underlining of some 40 
misspelled words in a reading selection of 
about 800 words—a literary essay of mod- 
erately high comprehension level. This subtest 
is a worksample of one aspect of proofreading 
—but not, as is sometimes assumed, a spelling 
test. For a poor speller, it measures an unana- 
lyzable resultant of spelling and concentration ; 
for a good speller, it measures concentration 
alone. 

The scoring key is accurately spaced to fit 
the test blank. But in design it is as outmoded 
and cumbersome as the scoring method. Time- 
saving ideas such as fan keys were unknown 
in 1922, and this test is strictly a “1922 model,” 
albeit a well-built one for its day. 


E. F. Wonderlic, Vice-President, General Fi- 
nance Corporation, Chicago, Illinois. This test, 
first published in 1922, has since been used con- 
sistently in a few business organizations with 
modest success. Norms and standardization ma- 
terial and descriptive information regarding the 
method of construction are not reported by the 


author. The test consists of eight subtests, mak- 


ing it cumbersome to administer in the typical’ 


employment office. However, some of the sub- 
tests do discriminate differences in clerical abil- 
ity remarkably well. Later testing techniques 
have perhaps surpassed this test, but the types 
of questions and the underlying philosophy is 
still the basis for many psychological and clerical 
tests, Careful students of testing will study this 
form and in spite of its being over 25 years old 
will find it informative and useful. However, 
tests developed and perfected in later years, such 
as the Minnesota Clerical Test, which are sim- 
pler to administer and have norms available on 
adult performance in business and industry, tend 
to replace this work. 


INTERESTS 
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Basic Interest Questionnaire: For Selecting Your 
Vocation or Avocation. Grades 9-16 and adults; 
1938-39; replaced by the Job Qualification Inventory 
(see 639) ; nontimed (75) minutes; Keith Van Allyn; 
National Institute of Vocational Research (now Sur- 
veys, Inc.). 


Occupations 19:311-3 Ja "41. Donald E. 
Super. The Van Allyn Technique for Vocational 
Selection was developed, according to the Man- 
ual, to provide a sound basis for vocational 
guidance and selection because of the difficulties 
experienced by counselors in relating the results 
of test batteries to vocational specifications, in 
choosing appropriate tests from the great variety 
available, and in arriving at reasonably similar 
conclusions in making diagnoses. It consists of 
three items, to which a fourth is to be added, 
which are claimed to constitute a uniform tech- 
nique which will lead to uniformity in the diag- 
noses made by different counselors. One might 
question, at the outset, the validity of a uni- 
formity achieved by the standardization of tech- 
niques rather than by the development of more 
valid techniques. Validity and reliability are not, 
after all, synonymous. The first of these items 
is the Manual, which begins with a quotation 
from Emerson to the effect that each man is 
called to one vocation for which he is better 
suited than any other and in which he has no 
rival, a statement contradicted by all the facts 
of vocational psychology and to which no quali- 
fied vocational counselor would subscribe. The 
Manual is not intended “as a technical publica- 
tion,” but is meant for “everyone interested in 
guidance”; it is stated that the technique may 


ee 


645 qn TESTS & REVIEWS: VOCATIONS—INTERESTS 


_be used by “any persons with academic back- 
ground, or with special interest of a practical 
nature, in helping others find themselves," aíter 
“a little preparation.” Near the end of the Man- 
ual it is stated that courses in the use oí the 
technique conducted by Dr. Van Allyn are 
offered, a series of 10 two-hour lecture periods, 
including laboratory work. Can it be that we 
have all been wrong and that counselors can be 
made overnight regardless of previous experi- 
ence and personality? To the trained counselor 
who examines tests critically, with their validity 
in mind, the Manual is exasperatingly ambigu- 
ous and lacking in specific data. Constant refer- 
ence is made to "research" which Van Allyn has 
allegedly been carrying on for some years, but 
no data are given (with one minor exception), 
no reference to other publications are cited, and 
no other publications of this self-styled voca- 
tional psychologist (who is not a member of the 
American Psychological Association nor of the 
American Association for Applied Psychology) 
are listed in any issue of Psychological Abstracts. 
Several pages are devoted to a confused discus- 
sion of factors which differentiate vocations. 
This seems to lead to the conclusion that inter- 
ests and achievement are the best indices of 
aptitude, that personality is so modifiable and 
so inconstant as to be a negligible factor, and 
that intelligence is relatively unimportant. Ac- 
cepted methods of vocational guidance such as 
explanatory courses and the imparting of voca- 
tional information are surveyed and found want- 
ing. The Basic Interest Questionnaire and the 
Key are then explained. The Questionnaire is 
a 12-page inventory “listing all the basic ele- 
ments which constitute occupations,” a “classi- 
fication of 25 knowledge-fields . . . providing a 
means of determining the individual's dominant 
interests and general aptitudes." We are not told 
just how the basic elements of occupations were 
determined. Nor are we told how it was estab- 
lished that there are just 25 knowledge-fields 
(“factors” in psychology?). Each knowledge- 
field is “tested” by means of five questions, after 
which the examinee indicates by circling a 0, I, 
2, 3, 4, or 5 the extent of both his interest and 
aptitude in that activity. We are not told the 
reliability of the scores derived by the 10 re- 
sponses thus obtained, although we are assured 
that they are consistent. The examinee is not told 
how to judge his “aptitude,” whether this means 
native ability or present skill and knowledge, 
neither is he told how to rate his aptitude for 
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such activities as “preferring money for a dis- 
covery you made than fame you would receive” 
and “attending theatrical performances and stage 
shows,” nor does the Manual show any con- 
sideration for the deficiencies of self-ratings of 
aptitudes and interest in this test which, its 
author claims, “makes it possible for one to 
draw a sharp conclusion as to the right voca- 
tion for each individual.” The Key to Vocational 
Guidance and Occupational Placement consists 
of: (1) a list of “dominant interests” according 
to which it is claimed occupations can be classi- 
fied, including classifications ranging from “Bio- 
logical Sciences” through “Materials” to “Travel 
by Land” and “Travel by Sea and Air” ; and (2) 
a list of occupations classified according to the 
supposed importance of these dominant inter- 
ests, singly and in combination. Again, there are 
no data as to how the dominant interests were 
determined or as to how their role in each occu- 
pation was established; simply the statement: 
“All known occupations were then reviewed and 
were assigned representative letters. . . .” But 
need we continue? Perhaps the Manual is not 
meant to be technical or scientific in its presenta- 
tion, and the important scientific data are, there- 
fore, left out. But why such a manual if such 
evidence exists? Scientific and conscientious vo- 
cational counselors cannot use a diagnostic in- 
strument until they are presented evidence of its 
validity. Let us, therefore, conclude by signifying 
our agreement with at least one statement in the 
Manual, emphasizing its applicability to both the 
Van Allyn Technique and this brief review: “If 
it succeeds only in raising questions in the minds 
of those who read it, this publication will have 
more than served its purpose.” 
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Brainard Occupational Preference Inventory. 
Grades 9 and over ; 1945; revision of Specific Interest 
Inventory by Paul P. Brainard and Frances G. Stew- 
art; Form i ; separate record forms must be used for 
recording answers ; 25¢ t test, postpaid ; $1.35 per 25 
record forms, postpaid ; per specimen set, postpaid 
nontimed (30) minutes; Paul P. Brainard and Iph 
T. Brainard; Psychological Corporation. 


REFERENCES 
1-4, See 40:167: 


5. SHUTTERLY, Vancrnra. “Ts the Aptitude Test a Panacea?” 


t 1260-2 Ja '44. * (PA 18:1904 
RE etes RD Arne Reading od culty of Interest 


Inventories.” Occupations 26:95-6 N '47. * (PA 22:1722) 
Edwin W. Davis, Director of Washington 
Counseling Center and Associate Professor of 
Psychology, The George Washington Univer- 
sity, Washington, D. C. This vocational interest 
inventory replaces the four forms of the Specific 
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Interest Inventory which was reviewed in earlier 
Mental Measurements Yearbooks (see 38:1176 
and 40:1675). The new inventory is self-admin- 
istering and self-scoring, but the subject would 
do well to have his scoring and his interpreta- 
tions reviewed by a professional counselor. The 
subject expresses his preferences for or against 
140 work situations on a selí-scoring record 
form. Here, the subject may score his answers 
for 28 occupations which in turn may be com- 
bined into the 7 major occupational fields of 
commercial, personal service, agriculture, me- 
chanical, professional, esthetic, and scientific. 
"The subject records his preferences for each of 
the 140 questions on a five-point scale with the 
following numerical values : —2 dislikes greatly, 
—1 dislikes some, o neutral, 1 likes some, and 2 
likes greatly. The manual lists 378 occupations 
classified under the 28 occupations of the inven- 
tory. 

Generally the inventory is a good one. Vet- 
erans Administration Guidance Centers re- 
ported 17 per cent using it as compared to 99 per 
cent for Kuder, 70 per cent for Strong, 36 per 
cent for Lee- Thorpe and 30 per cent for Clee- 
ton.* The author reports a reliability correlation 
of .81 on over 300 cases. Intercorrelations of 
major fields for 342 boys and girls gave a median 
r of .29 ranging from .07 to .63 with the low 
correlations for unrelated fields and the highest 
correlation between the related professional and 
scientific occupational fields. Norms for the 7 
fields and 28 occupations were based on 335 
boys, 378 girls, 95 men, and 75 women. 

This test excels all others in its possibilities 
for self-analysis and for counselor analysis. Al- 
though its major fields do not give as good cover- 
age as some other inventories, its 28 subfields 
and 378 related occupations offer better possibil- 
ities for actual analysis of occupational choices 
than do other interest inventories. It has the 
unique advantage of being keyed to hundreds 
of other related occupations in the USES Dic- 
tionary of Occupational Titles, Part II. Further 
validation with adults should make it a serious 
competitor of any other interest inventory now 
in use. 


Herschel T. Manuel, Professor of Educational 
Psychology and Director of Testing and Guid- 
ance Bureau, The University of Texas, Austin, 
Texas. This inventory consists of 140 questions 


* Baker, Gertrude, and Peatman, John Gray. “Tests Used in 
Veterans Administration Advisement Units" Am Psychol 


2:99-102 Mr '47. 
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arranged in seven fields, each of which is further 
subdivided into four sections, consisting of five 
questions each. In the booklet containing the 
questions, no occupations in the sense of voca- 
tion are named; instead, one infers from the 
publisher’s catalogue, each question describes 
briefly an activity characteristic of some occupa- 
tion. The answer sheet provides spaces for re- 
cording scores for each field and section, which 
are there given names. The fields are commer- 
cial, personal service, agriculture, mechanical, 
professional, esthetic, and scientific. An example 
of the division into sections may be taken from 
the professional field: medicine, law and social 
work, education and uplift, and personnel. 

According to the manual, “it is a standardized 
interview designed to bring to the fore the 
salient facts about a person with respect to his 
occupational interests so that he and his advisers 
can more intelligently and objectively discuss 
his occupational and educational plans.” The 
question booklet ‘and the answer sheet are 
cleverly arranged to make the administration of 
the inventory easy and to make it possible for 
the student himself to score the test, construct 
profiles, and make certain records preparatory 
to interpretation. 

Preliminary “section” and “field” norms are 
given for the sexes separately at the high school 
and adult levels, and occupations are suggested 
for various combinations of highest scoring sec- 
tions. The reliability by the Ghiselli formula is 
given as .8r. Intercorrelations of field scores 
range from .07 to .63. 

In the hands of qualified counselors, the In- 
ventory can be a useful instrument. Certainly 
the mechanics of administration and scoring are 
not difficult. The publisher's term, "'self-inter- 
preting,” however, implies too much. The attrac- 
tive mechanics of a measuring instrument can 
be misleading by tending to make the process of 
vocational guidance appear to be a much simpler 
Process than it can possibly be. The authors 
themselves recognize the complexity of voca- 
tional planning, but it needs to be emphasized 
over and over until the need for specially quali- 
fied counselors is fully appreciated by school 
and college administrators and the public. 

The use of descriptive phrases Tepresenting 
activities of various occupations gives the inven- 
tory a degree of validity as an instrument for 
surveying occupational interests. Yet one can- 
not escape the impression that too much has 
been sacrificed to convenience and mechanical 
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excellence. An inventory limited to 140 ques- 
tions is entirely too brief to give adequately the 
variety of information attempted in the section 
and field summaries. Five questions can hardly 
be sufficient to give a very accurate picture of 
interests in “Law and Social Work," “Educa- 
tion and Uplift,” or “Writing, Speaking, and 
Acting.” In the long run, the fault may be more 
with the test user than the testmaker. As users 
of the tests, we are continually trying to over- 
simplify the process of dealing with human 
nature. 


For reviews by Jack W. Dunlap, Everett B. 
Sackett, and M. R. Trabue of the original edi- 
tion, see 38:1176 and 40:1675. 
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Cleeton Vocational Interest Inventory, Revised 
Edition. Grades 16 and adults; 1937-43; Form A; 
separate editions for men and women; IBM; separate 
answer sheets must be used; 10-99 tests, 734¢ each; 
4¢ per machine-scorable answer sheet; 40¢ per set 
of scoring stencils; so¢ per specimen set, postpaid ; 
nontimed (40-50) minutes: Glen U. Cleeton; Mc- 
Knight & McKnight. 
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: Edward B. Greene, Chief, Personnel Train- 
ing, Michigan Unemployment Compensation 
Commission, Detroit, Michigan; and Lecturer 
in Psychology, University of Michigan, Ann 
Arbor, Michigan. Two forms are provided—one 
for men and one for women—each of which 
consists of 700 items, divided into ro fields of 
work of 70 items each. The fields of work for 
men are: (a) biological sciences; (b) insurance, 
real estate and specialized selling; (c) physical 
sciences; (d) social sciences; (e) business ad- 
ministration; (f) literary occupations; (g) me- 
chanical occupations ; (h) financial and account- 
ing; (1) artistic, including music, art, and public 
performances; and (j) agriculture. The fields 
of occupations for women are: (a) clerical; (b) 
retail sales; (c) biological sciences; (d) social 
service; (e) artistic creative; (f) grade school 
teacher; (g) high school teacher; (4) personal 
service, including actress, dancer, and singer; 
(i) household and factory mechanics; and (j) 
homemaker (child care and household econo- 
mist). 

According to the author, each group indicates 
some of the occupations, things, activities, and 
people that are liked by successful persons ina 
particular field. The scores are simply the num- 
ber of plus marks for each section, although 
instructions asked for zeros to be placed after 
the items which one dislikes or thinks he would 
dislike. 

The methods by which the fields of work were 
determined and the items placed in each field 
are not clearly indicated. The author states that 
the present inventories were developed from 
items included in forms published by Robinson, 
Moore, Freyd, and-Ream, plus some additional 
items. In all of these scales the basis of selecting 
items has been “the extent to which people in 
different occupations report interest in specific 
items." There is no evidence, however, that the 
Cleeton forms have been validated by this pro- 
cedure. The author states “new items appearing 
in the present inventories for the first time in 
any interest test were selected by determining 
the extent to which new items agreed with the 
basic items of known vocational significance.” 

The scales have been validated by applying 
them to 1,741 persons successfully engaged in 
standard occupations. The occupations, how- 
ever, are not named. Among these persons the 
occupation actually being followed corresponded 
to the field of work given the highest inventory 
rating in 78 per cent of the cases. Figures are 
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not given to show how much higher such ratings 
were than other ratings or whether or not un- 
successful persons in an occupation would show 
similar ratings to successful persons. 

The fields of work are not intended to be 
mutually exclusive. Identical or nearly identical 
items are found in several of the sections, and 
a number of somewhat independent elements are 
found in each section. Thus, in the first section 
in the men's series, devoted to biological sciences 
and related occupations, bacteriologist, dentist, 
nerve specialist, optician, pharmacist, hospital 
laboratory technician, physician, specialist in 
mental diseases, public health officer, and sur- 
geon seem to represent occupations in this field. 
Other related occupations are chemist and scien- 
tific research worker. In addition are included 
occupations which seem very little related to bio- 
logical science, such as astronomer, sculptor, 
carpenter, explorer, cabinet maker, instrument 
maker, watchmaker, and ship’s officer. The 
writer feels that this range of occupations is 
probably too varied to give a clear picture. It is 
also noted that such occupations as biologist, 
dental hygienist, dietician, and nurse are omitted. 

Under the group of activities which indicate 
a liking for biological science work, we find such 
things as pet monkeys, cowboy movies, animal 
Zoos, detective stories, gardening, similarity in 
work, operating a small business for myself, 
chopping wood as a recreation, men who use 
perfume, people who are quick tempered, people 
who make excuses, and repairing a clock. Other 
items in this list seem to be more related to the 
biological field of work, although many of them 
are rather general, such as, working for your- 
self instead of others, emphasis on quality of 
work, can you meet emergencies quickly, work 
which is interesting with modest income, do you 
drive yourself steadily, etc. 

An analysis of other pages in the men’s and 
women’s forms shows more or less this same 
pattern. A majority of items bear directly on the 
field in question, but a large number of other 
items are of a general sort or only distantly 
related. It is believed that a good item analysis 
would yield a much shorter questionnaire which 
would be more reliable and valid. 

The phrasing of the items leaves something 
to be desired. Most of them are not more than 
four words in length, and many of them are 
single words, such as golf, poker, politics, smok- 
ers, military drill, etc. Some studies have shown 
that many persons have difficulty in making up 


their minds as to whether or not they should 
check items which are so vague or general, 
A few items stated more specifically might yield 
more meaningful results. 

The author gives odd-even reliabilities above 
.82 for each subdivision of the scale. No reliabil- 
ities are given for repeated administrations of 
the test, although he states that on a second 
administration within a month of the first, 6.1 
per cent of responses were changed from “o” to 
“plus” or from “plus” to “o.” 

The author provides a manual of 31 pages, of 
which the first 9 pages indicate the use of interest 
questionnaires and counseling, and pages 22 to 
27 give a sample record and a discussion of han- 
dling special cases. Nine pages contain norms 
for men’s and women’s scales. Separate norms 
are given for 8th, oth, roth, rrth and 12th 
grades, college freshmen, upper-class college 
students, and adults. No indication is given of the 
types of school sampled, or the selection of 
adults. In each instance a fairly sizable sample is 
reported, that is, more than 700 cases. 


C. A. Oakley, Lecturer in Industrial Psychol- 
ogy, Glasgow University, Glasgow, Scotland. 
Many think that vocational guidance should be 
based on temperament rather than on talent, but 
unfortunately temperament is much less, very 
much less, readily measured than talent. One 
aspect to which a good deal of attention has been 
given is the assessment of the subject’s major 
interests, because of the impression that he 
would be happiest, even if not necessarily most 
successful, in an occupation which he liked very 
much. The distinction is, of course, to some ex- 
tent artificial, since there must clearly be a posi- 
tive relationship between his outstanding abil- 
ities and his chief interests, But perhaps this 
point should not be emphasized, since the adoles- 
cent is immature; and some of his interests, 
influencing considerably the way he will do his 
work later on, may as yet be at an early stage 
of formation. 

The Cleeton Vocational Interest I nventory is 
well known in British laboratories, but has been 
given little practical application. Its purpose is 
to find out whether a student’s interests fit in 
closely with the patterns of people in one (or 
more) of ten basic occupational groups—the as- 
sumption, based upon previous researches, being 
that persons in particular occupations do have 
common interests—although “whether this is 
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due to training, experience or temperament is 
not definitely known." 

So far, so good. What has tended to set some 
British psychologists, who have experimented 
along these lines, against the Cleeton inventory 
is the selection of the items and, incidentally, 
also the rather complicated way in which the 
students are called upon to mark their answers. 
They are asked how they feel towards certain 
occupations, school subjects, work activities, and 
personal traits. It would have perhaps been bet- 
ter to have concentrated on one—or, possibly, 
two—of these groups rather than to have gath- 
ered them together on one comprehensive paper. 
This, incidentally, would have simplified not only 
the administration of the inventory but also the 
subject's task. He must really feel a little be- 
mused by the time he has worked his way 
through to section FRJ (or for women WHJ). 

Each of the three groups, A, B, and C, in 
which the items are arranged, seems in itself 
to be useful. But we would have preferred to see, 
for instance, all of the occupations in the ten 
Group A’s brought together to form one paper 
by themselves. By eliminating also some of the 
items in Group B, the inventory could, in our 
opinion, have been rendered more consistent 
and more suitable in length. 

The general conception of the inventory is 
ingenious. Much imagination has been exercised 
in producing the individual items, particularly 
for the Group C’s, and, if there is agreement that 
analyses of this kind should be dealt with as test 
scores, then the Cleeton inventory demonstrates 
how that can be done—we doubt if the possibil- 
ity had occurred to British psychologists before 
the Cleeton test was published. 

One point on which we agree wholeheartedly 
with Professor Cleeton is that the inventory 
should be used at yearly intervals for pupils in 
senior school classes, thus throwing light on 
changing inclinations as adolescence advances. 


Arthur E. Traxler, Associate Director, Edu- 
cational Records Bureau, New York, New York. 
The Cleeton Vocational Interest Inventory 1s 
representative of the kind of interest test in 
which the items are arranged in vocational 
groups rather than in random order. This inven- 
tory exists in two forms—one for men and one 
for women. These forms were first published in 
1937. In that edition, each form was designed to 
measure interests in nine vocational groups and 
to yield a tenth score known as S-R, or social 
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adjustment. Subsequent research by Cutler (4) 
and others casts doubt upon the validity of the 
social adjustment score. A revision of both 
blanks was published in 1943, from which the 
social adjustment section was eliminated and to 
which a tenth interest group was added. The 
items in the other nine interest groups were the 
same as those included in the corresponding 
areas of the earlier edition, The men’s blank 
yields scores for interests related to biological 
science, salesmanship, physical science, social 
science, business, legal and literary occupations, 
mechanical occupations, finance, public perform- 
ance occupations, and agricultural occupations. 
The ten areas in the women’s blank include office 
occupations, selling occupations, natural sci- 
ences, social service occupations, creative occu- 
pations, grade school teacher, high school or 
college teacher, performance and personal serv- 
ice occupations, mechanical and household occu- 
pations, and domestic interest fields. 

Each interest group contains 70 items ar- 
ranged in three groups—A, B, and C. Group A 
includes the names of 20 vocations; Group B 
contains 20 items pertaining to school subjects, 
reading materials, avocations, famous people, 
and the like; and Group C has 30 items con- 
cerned with activities and aspects of personality. 
The subject indicates each like or yes answer by 
means of a plus (+) and each dislike or no 
answer by means of a minus (—). There are no 
intermediate responses, 

The inventory may be administered for either 
hand scoring or machine scoring. About 40 or 
50 minutes, on the average, are required for 
completion of the 700 items. 

It is obvious that the arrangement of the items 
in the Cleeton inventory greatly simplifies the 
scoring as compared with that of some of the 
other leading interest tests. This simplification 
comes about in three ways. In the first place, 
the subject has to respond with plus or minus 
to each item. There is no opportunity to indicate 
indifference or uncertainty. This procedure may 
be a limitation, particularly in the case of young 
subjects who have had little or no work expe- 
rience. 

In the second place, the items, as already indi- 
cated, are grouped in occupational interest fields. 
This arrangement has been criticized by Nelson 
(see excerpt following this review) who indi- 
cated that responses to items in random order 
might be significantly different from responses 
to items so arranged that they have an obvious 
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relationship to interest fields and by Brewer 
(11) who pointed to the possible halo effect 
when similar items appeared in serial order. As 
for the correctness of the grouping, Recktenwald 
(14) applied the chi-square test to the grouping 
of the vocations in the nine categories of the 
1937 edition of the men's blank and found statis- 
tical support for the grouping. 

The third way in which the inventory is sim- 
plified is that only the plus items are scored, and 
each one is given a weight of one. The scoring 
procedure is so simple that in a small experiment 
at the Educational Records Bureau the average 
time to hand score the blank, record the percen- 
tile ratings, and draw the profile was 9 minutes, 
35 seconds a test. This is rapid for any test 
containing 700 items, and far faster than the 
time required by most of the other interest 
inventories. 

Spearman-Brown odd-even reliability coeffi- 
cients, as given in the manual of directions, range 
from .822 to .910. These reliabilities are of about 
the same magnitude as those which have been 
found for Strong's Vocational Interest Blank for 
Men, the Kuder Preference Record, and other 
leading interest inventories. 

Although several studies of the Cleeton inven- 
tory have been published, there is little available 
evidence relative to the validity of the scores. 
Congdon (1) found the inventory useful in 
guidance of students. Arsenian (3) obtained 
statistically significant correlations between 
Scores on the men's blank and certain group 
scales on the Strong blank for nien. The median 
of the correlations was a little below . 50. Hartzell 
and Murphy (9) found that the PSH and NSC 
Scores on the women’s blank were useful in iden- 
tifying the interests of young women entering a 
cosmetology training program. 

A small study pertaining to the validity of 
the Cleeton inventory was carried on at the Edu- 
cational Records Bureau in connection with the 
preparation of this review. Twenty-four women 
first ranked the ten fields subjectively in order 
of their interests and then filled out the women’s 
blank. The same procedure was used by five men 
in connection with the men’s blank. A Spearman 
rank-difference correlation was computed be- 
tween the subjective rankings and the rank order 
of the test scores of each individual. The correla- 
tions for the women ranged from .13 to 87, with 
a median of 69; those for the men ranged from 
47 to .87, with a median of .64. The median for 
the entire group was .68, which is fairly high but 
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lower than a median of .75 obtained when a sim- 
ilar procedure was applied to results for thirty- 
one members of the Bureau staff who filled out 
the Kuder Preference Record. 

On the whole, this reviewer has somewhat less 
confidence in the Cleeton inventory than in the 
Strong blank or the Kuder record, but he would 
be inclined to use it in situations calling for 
a quick, easily scored measure of vocational 
interests. 


Occupations 21:260-71 N '42. A. Gordon 
Nelson. [Review of the 1939 edition.] * This 
reviewer has found the Cleeton Inventory to be 
of value when used with other instruments for 
clinical diagnosis. The mere process of reacting 
to the many items of the scale appears to be a 
focusing experience for high school pupils who 
are vague regarding plans for the future. The 
scores on the various sections give a clue con- 
cerning the kinds of vocational information to 
which the counselee should be directed. Discrep- 
ancy between claimed and measured interests, 
and disparity between abilities and interests 
should be watched for. The Cleeton Inventory 
should be used only by persons familiar with 
the limitations of instruments of this type. A 
study of the answers to the questions on the front 
of the booklet and individual items throughout 
the Inventory may give the experienced coun- 
selor leads for interviews. The scale could be 
improved in validity if the items were arranged 
in random order rather than in easily recogniz- 
able occupational groups. This might overcome 
the tendency of some subjects automatically to 
put plus signs beside items which are obviously 
related to their claimed interests. The Manual 
contains a summary of possible uses of the In- 
ventory, but it should give the counselor more 
data concerning the derivation of the norms and 
the results of research on reliability and validity. 


For reviews by Forrest A. Kingsbury, N. W. 
Morton, Albert S. Thompson, M. R. Trabue, 
and E. G. Williamson of an earlier edition, and 
an excerpt from a review by Nora A. Congdon, 
see 38:1181 and 40:1682. 
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*Gregory Academic Interest Inventory. Grades 
13-16; 1946 ; IBM; 1 form; scoring scales are available 
for 28 interest areas: agriculture, architecture, bio- 
logical sciences, business administration, chemistry, 
civil engineering, commercial arts, electrical engineer- 
ing, elementary education, English, fine arts, geology, 
history, home economics, journalism, languages, mathe- 
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matics, mechanical engineering, military science, music, 
physics, physical education, psychology, public service 
engineering, religion, secondary education, sociology, 
and speech ; separate answer sheets must be used ; $2.50 
per 25; 3¢ per machine-scorable answer sheet; go¢ per 
scoring scale; 2-9 scales, 75¢ each; r0 or more scales, 
65¢ each; ro per manual; 2¢ per profile chart; inven- 
tories may be scored on all 28 interests for $1 each plus 
postage by sending to Institute for Psychological Serv- 
ices, 18 South Michigan Ave., Chicago 3, Ill.; nontimed 
(60) minutes; W. S. Gregory; Sheridan Supply Co. 
REFERENCES 

1. Grecory, WinBUR S. “Data Rej arding the Reliability and 
Validity of the Academic Interest Inventory." Ed & Psychol 
Meas 6:375-90 au '46, * (PA 21:891) 

Paul S. Burnham, Associate Director, Stu- 
dent Appointment- Bureau, Yale University, 
New Haven, Connecticut. In preparation since 
1938 and copyrighted in 1946, this inventory 
consists of 300 academic interest items to which 
students respond on a 5-point scale, indicating 
degree of interest in each item. Scores secured 
make it possible to compare the student’s inter- 
ests with the interest profile found to be charac- 
teristic of persons majoring in each of 28 aca- 
demic departments. 

Approximately 165 of the 300 items start 
with the word “study”; for example: “Study 
the history of architecture.” All other items start 
with some other action verb such as play, repair, 
operate, construct, measure, make, etc. One can- 
not help but wonder what effect such emphasis 
on the word “study” may have for the person 
who may be intensely interested in do`ng certain 
things but not much interested in studying them. 

Scales were developed on a basis similar to 
that used by Strong in connection with the Voca- 
tional Interest Blank. Nine hundred items were 
administered to a freshman class at the Univer- 
sity of Nebraska and to approximately 5,000 
juniors and seniors in other colleges. Scoring 
weights were derived by contrasting answers 
given by a minimum of 100 juniors and seniors 
in each major with the answers of freshmen. On 
the basis of weights so derived, 300 items were 
selected for the present form. 

The purpose is to provide a means of objec- 
tively measuring and comparing students’ inter- 
ests in various curricula in colleges and universi- 
ties in the United States. The manual claims 
that the results are useful for educational and oc- 
cupational guidance. Retest reliability (4-month 
interval) of the 28 scales is reported in the 
manual to vary from .67 to .92. Data were based 
on a retest of 100 students selected at random 
from the basic population of 1,500 University of 
Nebraska freshmen. Provision is made for both 
machine and hand scoring. 


TESTS & REVIEWS: VOCATIONS—INTERESTS 


636 


No data are given with respect to intercorrela- 
tion of scales so the user has no idea of the 
amount of overlapping. Apparently no factor 
analyses have been attempted to determine the 
extent to which the number of scales could be 
reduced. No data are given with respect to cor- 
relation with any measures of achievement or 
scholastic aptitude. Scores are converted into 
scaled scores, running from 1 through 9, with 
comparable percentage frequencies developed 
from the basic group. The author reports that 
it may be highly desirable for a school to de- 
velop norms for its own students. Scores are 
presented in the form of a profile, in an order 
intended to group scales which would seem to 
be logically related on the basis of subjective 
interpretation. 

Much work will have to be done to provide 
a better description of the functional utility of 
this interest test. The need for such a measure 
is great. 


Lysle W. Croft, Director of Personnel, Uni- 
versity of Kentucky, Lexington, Kentucky. 
There has been a need in the counseling field for 
some time for such a blank as the Gregory Aca- 
demic Interest Inventory. The inventory was 
developed as an aid to counselors in measuring 
student interests as they relate to the many 
curricula of colleges and universities. The au- 
thor feels that the inventory is helpful in four 
ways: (a) in aiding students in their occupa- 
tional planning; (b) in aiding students in educa- 
tional planning; (c) in aiding students in their 
selection of electives; (d) in aiding counselors 
with their academic problem cases. 

The administration of the inventory is simple, 
and it can be given in groups or individually. 
Although the author estimates an hour, it is 
rarely necessary to use over 45 minutes. The 
same scoring keys and norms are used for both 
men and women. A simplified scoring system has 
been developed with assigned scoring weights of 
+1 and —1. This is a needed improvement over 
the older system of weights ranging from T4 
to —4. The scoring of the schedule is simple 
but long, tiresome, tedious, and time consuming 
when used individually. It is necessary for each 
of the 28 interests to be scored twice for each 
side of the answer sheet, or a total scoring of 
four times for each interest. This accumulates to 
a total of 112 scorings necessary to complete the 
record for one individual. The best procedure is 
obviously group testing and machine grading. 
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All scores are added algebraically and recorded 
on the specially designed answer sheet. A special 
profile sheet is provided for recording the final 
scores. The raw scores are translated into scaled 
scores. Each scaled-score interval represents 
one-half standard deviation of an assumed nor- 
mal distribution. The simplicity of the recording 
of the scores into scaled scores based upon a 
normal distribution is a good feature of the 
Inventory. The per cent of distribution is blocked 
off into the upper 22 per cent which can be used 
as significant, the middle 56 per cent, and the 
lower 22 per cent. An interesting feature, similar 
to the idea used on Strong's Vocational Interest 
Blank, is the grouping of the similar interests 
on the profile chart. 

The reliability was determined by the test- 
retest method with the Pearson coefficients of 
correlation between the original and retest rang- 
ing from .92 for chemistry to .67 for physical 
education. The validity of the scoring method 
was secured by correlating scores received by the 
longer scoring method and the new simplified 
Scoring method. The range of the Pearson co- 
efficients of correlation was .95 for public service 
engineering and mathematics and .71 for home 

>= economics. 
~ The author suggests, and it is certainly ad- 
visable, that norms be developed for the indi- 
vidual school. 

As a counseling aid, the inventory has proved 
very satisfactory as a follow-up for the Kuder 
Preference Record. In the interpretation of the 
inventory, the same care should be used as is 
expected with other blanks. 

Because of the lack of published research on 
this inventory, this reviewer gave the blank to a 
class of seventeen students (juniors, seniors, and 
graduate students) in psychology. The inventory 
definitely selected the major field of twelve stu- 
dents; nine subjects scored a scaled score of 9 
(top 3 per cent of distribution) on their major. 
One student had nine fields represented in his 
scaled scores of eight or above. Three students 
seemed to be completely out of their field for 
their measured interest. This was confirmed for 
two of the students after interviews, and the 
third was on academic probation and under- 
going conferences in the University Personnel 

ce. 

Although it is felt that additional research is 
needed, particularly as to the validity, and other 
fields of interests, such as law (or prelaw) and 
accounting, should be added, the Gregory Aca- 


demic Interest Inventory is proving to be a 
valuable counseling tool. 


Herbert A. Toops, Professor of Psychology, 
The Ohio State University, Columbus, Ohio. 
This inventory consists of 28 scales, with corre- 
sponding scoring keys (more promised later). 
The 28 indices are obtained from as many scor- 
ings of a 4-page, 300-item academic "interest" 
test. The profiled results are intended for aid- 
ing a college student in choosing, within a uni- 
versity, his college or curriculum; within a cur- 
riculum, his majors; within a major, specific 
courses; and for academic guidance generally. 
That is an ambitious and worthy aim. 

The examinee marks one of five degrees of 
interest for each item, but these are telescoped, 
by integral combination in the scoring, to +1, 
—1, and “omit.” This, according to the author's 
interpretation of the evidence, does little violence 
to the measurement and enables each scale to be 
fully scored with only two stencils, by hand or 
by IBM scoring machine, a device which unfor- 
tunately has this mechanical limitation. 

It is unfortunate, too, the reviewer feels, that 
the response 1 means “very interested” and 5, 
"very disinterested,” thus reversing the exam-, 
inee's usual mental set where “thermometers” 
are concerned. The items are arranged in groups 
of fives, but the accompanying IBM answer sheet 
is not similarly sectioned ; and no alignment be- 
tween test sheet and answer sheet, to enable the 
examinee to maintain his place, is possible. 

The Strong method of arriving at scoring keys 
was employed : concretely, it was assumed that 
100 or more students in commercial arts are, in 
the main, interested in commercial arts. Nine 
hundred items were initially tried out in a num- 
ber of cooperating colleges and universities, and 
these finally were boiled down to 300, and the 
simplified scoring above described was adopted. 
Details of the method used, it is promised, will 
be revealed in future articles, The reviewer won- 
ders if, in subsequent analyses, vocationally 
"misplaced" persons could not be excluded as 
well as “bad items,” 

With three exceptions the test-retest reliabil- 
ities of the 28 scales, on only 99 subjects (am 
inadequate number), is .85 or above, and none 
are so high as .95, although half of them are .90 
or higher. This indicates a need generally for 
lengthening the examination, although few pres- 
ent-day tests would make an equal showing. The 
specificity of the individual scales—which really 
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is the issue—might better be freed, or largely 
freed, of attenuation before being ascertained 
by tryout. 

Intercorrelations of the 28 scales, based on 793 
men and 642 women, treated separately, indicate 
a very considerable specificity for at least some 
of the scales. These intercorrelations run from 
—.69 (characteristic of physics and journalism, 
also of speech and mechanical engineering, and 
also of speech and electrical engineering) to 
+.87 between mathematics and mechanical engi- 
neering. These would seem to prove the exist- 
ence of differential academic interests. The cor- 
relations for men and women in general are of 
the same sign and of approximately the same 
magnitude, so separate scoring schemes for the 
sexes are not provided, 

In general, the intercorrelations of scales in 
most instances support one’s preconceived no- 
tions of the probable interdependence of the 
academic interests ; yet a few disconcerting ones 
are reported: Why, for example, should mathe- 
matics correlate .70 with physical education and 
80 with religion, but only .52 with physics and 
.37 with electrical engineering? (Mathematics, 
on the other hand, correlates .87 with mechanical 
engineering, while electrical engineering and 
mechanical engineering intercorrelate 81. These 
relations suggest possible errors in the published 
intercorrelations.) 

No factor analysis has been made to deter- 
mine how many of the scoring keys are really de- 
sirable and are minimally necessary ; this would 
seem to be a much-needed next step. 

Once dependable profiles of the 28 scores, or 
the suggested "factors," are obtained, one would 
find it useful to make transparent stencils to 
omit, by such means as cut-out windows, for a 
given occupation, the scales of the individual 
profile not very pertinent to a given (proposed) 
curriculum of a counselee. As yet experimenta- 
tion has not gone so far. 

The research on this test is worth watching. 
It has many good points. A reviewer, however, 
cannot help but ask, “Will not such an instru- 
ment be better for prognosis than for educa- 
tional choice?” After all, one of the functions of 
a college education and of a college guidance 
program is to remove the bad effects of disabili- 
ties and inequalities of childhood environment 
and experience. If one “likes engineering” but 
“hates mathematics,” or if, alternatively, he 
“likes medicine” but “hates dissection”—both of 
which curricular “requirements” it is assumed 


by the author of the test are "just"—perhaps 
the proper action of the counselee is not to give 
up the objectives of engineering and medicine, 
respectively, but rather to seek a psychiatrist, if 
need be, to remove his disabling emotional im- 
pediments. The implication, too, that one’s aca- 
demic interests are as unchangeable, over the 
four years of college, as one’s love for fat men 
or hatred of cigar smoking may or may not bé 
borne out in practice, And the old question, “Is 
interest analogous to the steam in the boiler or 
to the recording needle on the steam gauge?” is 
not solved here. Another criticism, which falls 
particularly heavily on interest tests, applies to 
tests in general; A valid measuring instrument 
in a particular application may be used badly or 
well. Perhaps a testmaker is under obligation 
to tell us at least how no! to use it. This our test- 
maker to date has not done. 


J Consult Psychol 11:156 My-Je’47. 
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Interest Questionnaire for High School Students, 
1942 Edition. High school entrants; 1930-42; IBM; 
$4.40 per 100 machine-scorable test-answer sheets; 6 
per manual and stencils ; 95¢ per specimen set, postpaid ; 
nontimed (20) minutes; Oliver K. Garretson and Per- 
cival M. Symonds; Bureau of Publications, Teachers 
College, Columbia University. 


REFERENCES 
1. GARRETSON, OLIVER KELLEAM, Relationships Between Ex- 
pressed Preferences and Curricular ‘Abilities of Ninth Grade 
joys. Columbia University, Teachers College, 'ontributions to 
Education, No. 396. New York: Bureau of Publications, the 
College, 1930. Pp. v, 77. Out of print. * ga 4:3263) | 5 
2, SYMONDs, Pencivat M. Tests and Interest Questionnaires 
in the Guidance of High School Boys. New York: Bureau o 
Publications, Teachers College, Columbia University, 1930. 
Sa 


Pp. 61. Paper. Out of print. ( 2043) 

^. Gannerson, rivi KELLEAM; "delationshi s Between the 
Expressed Preferences and the Curricular Abilities of. Ninth 
Grade Boys." J Ed Res apres Fesi * (PA §:3837) 

Ay. Van EASTERN, Mary Erren, A Study of the Constancy of 
Individuals’ Responses to the Garretson-Symonds Interest 
Questionnaire and a Comparison of the Constancy of Responses 
to the Various Items of the Questionnaire. Unpublished mas- 


hesis, Fordham University, > 2 
ters mheis Kor avez, Van Dyke. Aptitudes and Aptitude 


Testina, pp. 360-1. New York and London: Harper & Brothers, 
GE a Ea agp tum. ue PASA, Ja, "Punie 
Appraise Vocational Interest Blanks.” Occupations 20:100:2 
"41. * (PA 16:757) 

Lysle W. Croft, Director of Personnel, Uni- 
versity of Kentucky, Lexington, Kentucky. This 
questionnaire is a two-page academic interest 
questionnaire designed to supply a counseling 
tool for measuring the academic interest of high 
school students in the academic, commercial, and 
technical curricula. The questionnaire consists 
of ten sections, a total of 234 items. The ques- 
tions are answered by L (like), I (indifferent), 
and D (dislike). 

In the interpretation of the test, the same care 


ji 
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is necessary that should be given to all interest 
blanks and they should be used only in conjunc- 
tion with intelligence and aptitude tests. 

The administration of the questionnaire is 
simple and can be given as a group test or indi- 
vidually. All information, including scores, is 
placed on the answer sheet. 

Although little research has been published on 
the questionnaire, it should prove valuable as a 
counseling tool at the high school level. Coun- 
selors, when using the blank, should develop 
their own norms. 


[ 638 ] 
XInventory of Vocational Interests: Acorn Na- 
tional Aptitude Tests. Grades 7-16 and adults; 1943; 
1 form ; $1.50 per 25; 1o¢ per manual; 5o¢ per specimen 
set; nontimed (35) minutes; Andrew Kobal, J. Wayne 
Gaeo and Karl R. Kunze; Acorn Publishing 
o. 


Marion A. Bills, Assistant Secretary, Aetna 
Life Insurance Company, Hartford, Connecti- 
cut. This test consists of 25 questions concerning 
various activities; for example, "What would 
best like to do with a motor boat?" For each 
question ten choices are allowed, three of which 
must be marked. The blank is scored for the 
centering of interest in the fields of (a) mechani- 
cal, (b) academic, (c) artistic, (d) business and 
economics, (e) farm agricultural. Since the ac- 
tivities are on a rather low level of accomplish- 
ment, it is often difficult for college students to 
find real preferences; they find it necessary to 
mark the three choices disliked least. In each list 
the first two choices indicate mechanical inter- 
est, the second two academic, etc. This makes 
scoring easy, but it also allows easy bluffing. 
The only relevant experimental data given 
are for 120 subjects apparently already em- 
_ ployed. Their stability on the job, expressed in- 
-terest, advancement on the job, and earning 
capacity were compared with the test scores. 
"These showed fair relationships. No instance is 
quoted of applicants being tested and future 
progress noted. Therefore, since several studies 
have shown that occupational interest often 
changes after employment, not too much empha- 
sis should be placed on these results for guid- 
ance purposes. The author makes the statement 
in his manual of instructions: “The validity of 
the Inventory is sufficiently high to warrant its 
application in individual diagnosis" The re- 
viewer has been unable to find evidence to sup- 
port this claim. 

The test has the advantage of being short and 


easily scored. Its disadvantages are: (a) no indi- 
cation of proper validation; (5) level of occupa- 
tion too low for college students; (c) difficulty 
in interpretation unless interests are very defi- 
nite, in which case the applicant probably can 
express them as well without a test. 


Edward S. Bordin, Director, Student Coun- 
seling Center, and Associate Professor of Psy- 
chology, The State College of Washington, Pull- 
man, Washington. The authors of this test stress 
the fact that it "avoids in its contents the expres- 
sion of preferences for vocations which are un- 
real as interests." They go on to say that instead 
of lists of occupations to be checked, which is 
the standard pattern of vocational interest inyen- 
tories, the test presents materials which “offer 
a search for the interests which are an important 
part of the aptitude rather than mere prefer- 
ences." This inventory purports to accomplish 
the above-stated goal by listing 25 different ques- 
tions covering as many different situations. 
"What would you like to do best with an auto- 
mobile?" or “Which position would you prefer 
in a newspaper establishment ?” are typical ques- 
tions. Each question has ten possible responses 
which are arranged so as to indicate defined 
occupational categories. The responses are ar- 
ranged so that the first two are intended to reflect 
mechanical interests; the next two, academic; 
the next two, artistic; then business and eco- 
nomic; and the final two, farm-agricultural. 

This reviewer finds it hard to accept this test 
situation as significantly freer of the operation 
of the subject’s preconceptions of himself than 
any of the other standard practices in vocational 
interest testing. The fact that the occupational 
types of responses always appear in the same 
order and two at a time seems to ensure that the 
subject will have every opportunity to be quite 
aware that by his responses he is saying, “I am 
the mechanical type” or “I am the academic 
type,” and so on as the case may be. These 
a priori doubts are not put to rest by the meager 
basis for validity offered in the accompanying 
manual. The only data cited are: (a) a pattern 
of scores on the five categories which is said to 
be typical for a group of 60 subjects drawn from 
laborers, shiftless though mentally normal prison 
inmates, and educationally low-graded individ- 
uals (the pattern presented is an extremely de- 
pressed one); (b) a correlation between ad- 
vancement on the job and interest expressed on 
the inventory of .65 and a correlation of .38 be- 
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tween recent earning capacity and the test varia- 
ble, both based upon 120. subjects. No further 
information is given nor is the reader referred 
to any other source for a fuller treatment of the 
studies. The mere citation of validity correla- 
tions without opportunity to check the adequacy 
of the studies on which such validities are based 
is insufficient evidence to warrant acceptance of 
the.test as a worth-while instrument. The test 
manual is also deficient in failing to cite any evi- 
dence on reliability or on the intercorrelation of 
scales. Five tables for converting raw to centile 
scores are presented. However, some of these 
are rather peculiar in that the same raw score 
conversion applies to as many as four different 
scales. Since no information is given on the 
method of standardization nor on the selection 
of the normative population, these tables must 
remain enigmatic. 


Harold D. Carter, Associate Professor of 
Education, University of California, Berkeley, 
California. This inventory is intended for use in 
secondary schools, colleges, and industry. There 
is one form, for use with both males and females. 
It is in the form of a check list, with ten items 
under each of 25 heads. The pupils are required 
to check the three most liked in each group of 
ten items. The administration and scoring are 
simple and economical, requiring no scoring 
keys. An attempt is made to get at interests with 
vocational significance, by avoiding lists of occu- 
pation names as items, and by using meaningful 
activities. The items can be classified as indicat- 
ing mechanical, academic, artistic, business and 
economic, and agricultural interests. 

The manual indicates the purpose of the test 
and gives clear and adequate instructions. for 
administration and scoring. It suggests the uses 
to which the test can be put in counseling. The 
claim is made that the tests are sufficiently valid 
for use in individual diagnosis, but the evidence 
presented is too sketchy to be convincing. There 
i$ no evidence as to the reliability of the test. 
The norms based upon five occupational groups 
seem completely inadequate as presented. The 
information given about the groups is vague 
and general. The evidence that inferior groups 
lack concentration of interests is interesting, but 
inadequate, as only sixty individuals were in- 
cluded. The validity coefficients of .65, -39 and 
.59 presented in the manual would be more mean- 
ingful if the conditions were described in more 
detail. 


The attempt to avoid the use of lists of occu- 
pation names has not been fully successful. For 
example, Item 17, dealing with federal civil 
service, includes the following ten subitems: 
transportation engineer, mechanical engineer, 
librarian, teacher of civics, administrative clerk, 
custodian of art collections, draftsman, buyer for 
the government, agriculturist, and forester. 
Other items similarly contain occupation names 
rather than “meaningful activities” to be selected 
or rejected. 

The attempt to eliminate the influence of the 
prestige hierarchy of occupations is thought pro- 
voking. One may question whether it is realistic 
to make such an attempt. It is not clearly desir- 
able at present to overemphasize the element of 
choice among uninteresting alternatives. The 
fact that interests are often unrealistic in their 
emphasis upon ambitious striving upward does 
not eliminate the significance of the prestige 
hierarchy. Perhaps a main characteristic of the 
interests of some persons is the fact that they 
reject all activities on the dish-washing and 
bottle-arranging level. * 

One can only speculate in the absence of fur- 
ther data, This test seems most likely to be 
useful in industry and in the secondary school. 
It seems unlikely that it will be found adequate 
at the college level, especially for individuals 
with aims toward the professions. It seems to 
hold promise as a step toward fulfillment of an 
obvious need for measurement of interests of the 
masses below the professional level. 


Patrick Slater, Chief Research Officer at 
the Social Survey and Honorary Psychometric 
Psychologist at St. Thomas's H ospital, London, 
England. This inventory serves to classify voca- 
tional interests under five main headings: me- 
chanical, academic, artistic, business and eco- 
nomic, and farm-agricultural. In completing it, 
subjects may reveal interests concentrated in one 
of these classes or in a combination of two. 
Answers can thus be used to differentiate sub- 
jects into fifteen classes, the combinations of two 
main interests at a time providing an additional 
ten classes, viz., mechanical combined with aca- 
demic, mechanical combined with artistic, etc. 
through to business and economic combined with 
farm-agricultural. Educationally low and men- 
tally immature persons, however, tend to reveal 
a range of interests scattered haphazardly 
throughout all five classes, and the inventory is 
therefore not intended for application to them. 
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For each of these fifteen classes a suitable list 
of occupations is provided. 

This ground plan commends itself as reason- 
able. But to develop its practical usefulness an 
exceptionally large amount of detailed informa- 
tion is needed. What predominant interest or 
combination of interests, as defined, would you 
consider appropriate to the following occupa- 
tions; athletic director, building superintendent, 
hotel manager, recreation director, radio dealer, 
scout leader? This is the list which the authors 
give as appropriate for a combination of mechan- 
ical with business and economic interests. From 
what detailed empirical evidence is this particu- 
lar list compiled? Do these occupations really 
involve a homogeneous combination of inter- 
ests? No detailed evidence is given. Although a 
a total of 117 occupations is listed, the only 
occupational study to which the manual refers 
is one of the “occupational history of 120 tested 
subjects” in unspecified occupations. 

Another question which perplexes me is what 
takes place in a subject’s mind when he is asked, 
“What would you like to do best with an AIR- 
PLANE?” The choice before him is: *(a) work 
on it as a mechanic; (5) pilot it; (c) study its 
history; (d) write about its importance in mod- 
ern warfare; (e) write a poem about it; (f) 
make drawings or sketches of it; (g) book pas- 
sengers for it; (Ah) advertise its uses; (i) con- 
struct a flying field for it; (7) use it on a big 
farm for spraying crops." For my part the three 
things I would genuinely like to do would be: 

(a) sell it; (b) if that failed, give it away; 
(c) if that failed, go away from it quietly, pray- 
ing nobody would ever find out it was mine. I 
would feel more like writing a poem after I had 
got rid of it. But to be true to form as a psycholo- 
gist what I should like best, according to the 
manual, should be to study its history and write 
about its importance in modern warfare. Regret- 
fully I must record that this inventory leaves 
me sheltering behind my ignorance from the fear 
that I may be an occupational misfit. 
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pep Qualification Inventory. Grades 9-16 and 
adults; 1945-47; 1 form; $3.75 per 25; $10 per copy of 
the manual, Job Placement Reference; 25¢ per sample 
copy; the Job Placement Reference and 5 blanks are 
obtainable on approval at a special price of $10; non- 
timed (30) minutes; Keith Van Allyn; Bureau of Per- 
sonnel Research (now Surveys, Inc.). 
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4. Triccs, Tiaras OnaLIND, "Critique ot Van Allyn's Sys- 
tem of Vocational Counseling.” J Appl Psychol 31:536-44 O 
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S. Van Attyn, B. Kerru. "Picking Good Dus Mechanics,” 
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Ralph F. Berdie, Director, Student Coun- 
seling Bureau, and Associate Professor of 
Psychology, University of Minnesota, Minne- 
apolis, Minnesota. The Job Qualification In- 
ventory is a 210-item inventory summarizing 
self-ratings in 35 different occupational areas, 
and designed to be used with its accompanying 
manual, entitled the Job Placement Reference. 
The overly ambitious purpose of the inventory 
is, according to the author, to investigate “every 
detail of an individual’s qualifications” and to 
present these details graphically in order of rela- 
tive importance. Throughout the manual, Van 
Allyn states the inventory gauges ability, indi- 
cates vocational aptitude, supplies data regard- 
ing ambitions, training, experience and accom- 
plishment, evaluates immature experience, in- 
vestigates interest, differentiates between veri- 
fied (permanent) and preference (transitory) 
interests, and detects malcontents and malinger- 
ers. More modestly, he maintains the inventory 
may be used by industry for purposes of selec- 
tion and training, and by schools for purposes 
of guidance and counseling. 

The inventory best serves as a screening 
device to aid in the selection of ability, atti- 
tude, and interest tests, and to provide orienta- 
tion in the interview. It provides a quick and 
economical method of obtaining preliminary 
information, and may stimulate the person be- 
ing tested to think in more comprehensive 
terms of occupational possibilities. 

No mention in the manual is made of the 
reliability or the consistency of self-ratings ob- 
tained on the inventory, and the only statement 
concerning the construction of the inventory 
relates it is the culmination of twelve years of 
research. A more detailed account of this re- 
search would have been desirable, as no de- 
tailed reference to it could be found in the psy- 
chological or educational literature. 

The evidence for validity consists of: (a) 
eight specimen profiles obtained by workers 
and students; (b) a composite profile based on 
scores of 42 industrial statisticians. The inven- 
tory was given to “a number of men and 
women known to be successful and happy in 
their work," and this data was used in pre- 
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paring a composite profile of all members in 
each group. No information is given concern- 
ing the sizes of the specific groups, their other 
characteristics, nor the significance of their 
inventory scores. The relationships existing be- 
tween the various ratings are not presented, 
nor are the relationships between these ratings 
and scores achieved on other tests, workers’ 
output, students' grades, or ratings of satis- 
faction. 

The manual contains three lists of a total 
of 1,603 occupations. The first list presents the 
occupations in order of the ratings so that 
occupations characterized by an obtained pro- 
file can be identified. The second list groups 
the occupations according to their basic re- 
quirements. The third list presents the occu- 
pations in alphabetical order so the profile 
characteristic of any given occupation can be 
identified. The Job Placement Reference con- 
tains more occupations than the Minnesota 
Occupational Rating Scales, which contains 
only 430, but the latter recognizes the relative 
independence of abilities and interests and pre- 
sents ratings only for abilities, whereas the 
former is in terms of ratings on the inventory. 

Until more data is made available regarding 
the Job Qualification Inventory, it can be used 
only as an informal, unproved adjunct to the 
employment or counseling process. 


Stanley G. Dulsky, Chief of Staff, Chicago 
Psychological. Institute, Chicago, Illinois. The 
job placement technique developed by Van 
Allyn involves the use of two tools: the Job 
Qualification. Inventory; and the Job Place- 
ment Reference, an encyclopedia. of job specifi- 
cations developed specifically for use with the 
Job Qualification Inventory. 

The Job Qualification Inventory (JOT) “pro- 
vides for a detailed analysis of an individual's 
actual and potential job qualifications.” It is 
concerned with vocational interest, “but per- 
forms the much broader function of verifying 
the expressed interests and revealing actual job 
qualifications—potential as well as achieved.” 
The JQI is concerned only with “sustained, 
verified interest, and not mere expression of 
preference at the moment of testing.” R 

The inventory consists of 210 questions, six 
questions for each of 35 areas or fields. Ques- 
tion 1 refers to preferences (do you enjoy such 
and such an activity) ; Question 2 refers to 
actual performance; Question 3 asks about 
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education and training in that area; Question 
4 determines vocational ambition for that area ; 
Question 5 inquires about paid experience in 
the field; and Question 6 refers to evidence of 


.accomplishment or recognition to suggest un- 


usual proficiency. Each question is answered by 
circling NO, ?, YES. Thirty-five scores are 
obtained and graphed. From the graph a “key” 
is obtained which refers to appropriate occu- 
pations in the Job Placement Reference. These 
jobs are keyed to the Dictionary of Occupa- 
tional Titles. 

It is apparent that the JOI is a much broader 
instrument than an interest inventory. It is at 
once an interest blank (Questions 1, 2, and 4) 
and a personal history or application blank 
(Questions 3, 5, and 6). Much important in- 
formation.can be obtained by studying all the 
responses. 

The integration of interest and background 
with specific occupations is very helpful to the 
counselor. The Brainard Occupational Prefer- 
ence Inventory attempts the same thing by 
keying the sections to the Dictionary of Occu- 
pational Titles. The Job Placement Reference 
goes one step further—it lists the specific oc- 
cupations as well as keying them to the Dic- 
tionary of Occupational Titles. 

This device, and the accompanying Job 
Placement Reference deserves a period of exper- 
imentation by the vocational counselor. Informa- 
tion obtained from it should be supplemented 
by tests and other pertinent data as well as 
the interview. Used judiciously by synthesis 
with other facts, it may prove to be another 
weapon in the battle to eliminate vocational 


misfits. 


Mech Eng 67:864-5 D '45. Donald E. Gorse- 
line, * the author does not give a clear-cut 
outline of the procedure he used in selecting, 
testing, and validating the contents used in the 
job-qualification inventory, a practice which is 
common procedure with most rating-scale or 
test builders. One has a right to ask, "Tust why 
were the particular 35 areas of occupations 
used? Why not 25 areas, or 39 areas ?" Likewise 
in the job-placement reference, the encyclopedia 
of jobs, what technique was used to determine 
the numbers and names of jobs listed on the 221 
pages? Inclusion of two pages of simple de- 
scription of the methods used to construct the 
materials used in the technique would have ob- 
viated the almost painful effort used to reassure 
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the reader of the excellence of the system. In 
the second place, the manual probably would 
have been improved by terse staccato sentences 
summarizing the methods of giving and scoring 
the job-qualification inventory and applying the 
findings. This should have been done for the 
sake of custom, since test users expect instruc- 
tions to be streamlined, if for no other reason. 
Constructively, much can be said for the Van 
Allyn procedure. The reader cannot help but feel 
that in spite of the almost extravagant claims 
made for it, the author, backed by 12 years of 
research, can prove his statement that this repre- 
sents an important forward step in the science of 
properly selecting, placing, and training em- 
ployees. This reviewer, who tries to keep abreast 
of movement and devices in the counseling field, 
can think of no other test or rating device that 
so skillfully extracts so much reliable infor- 
mation about interests and experiences of indi- 
viduals in so brief a time. The method used to 
avoid falsification about qualifications of appli- 
cants is excellent, too. While the technique will 
be more useful to a personnel worker clinically 
trained in psychology than to an inexperienced 
counselor, yet it is simple enough to be applied 
by the average industrial employer. An added 
point in its favor is the fact that it probably 
has better predictive value for adults and older 
people; a group often neglected by test builders 
in the past. This reviewer is inclined to the 
opinion that this technique will have wide usage 
in industry. Unquestionably there is no other 
tool available that will give so much informa- 
tion on so wide a variety of industrial personnel 
problems ; particularly re-employment and place- 
ment of war veterans, upgrading, downgrading, 
employment terminations, transfer and promo- 
tion requests, apprentice and training selection, 
prescreening of potential malcontents, proper 
placement, etc. as this one does. While it is by 
no means a panacea for all employment selec- 
bi it should prove to be a distinctly useful 
tool. 


Sch R 53:433-4 S "45. Robert C. Woellner. 
* an excellent example of current trends in 
matching jobs and men, and it will serve as 
a counselor's reference manual for helping ad- 
visees determine the occupations in which they 
should be most successful and contented. This 
reference manual is used in conjunction with 
the Job Qualification Inventory—a type of 
questionnaire consisting of eight pages to be 


filled in by the advisee. The purpose of this 
inventory is to obtain data relating to the 
advisee’s ambitions, training, experience, and 
accomplishment. On the basis of the data thus 
assembled, a counselor can have before him a 
profile of an individual’s relative ratings in 
thirty-five general categories of traits which are 
supposedly basic elements and the different 
combinations of which are essential to success 
in specific occupations. The highest points in 
the profile serve as a letter combination for the 
Key Index, which in turn is used to locate the 
jobs which are listed in the Job Placement 
Reference section of the publication. In order 
to assist the counselor in locating specific and 
appropriate jobs which fit the ambitions, train- 
ing, experience, and accomplishment of a given 
individual, this book contains a Reference Sec- 
tion which makes use of the Key (derived from 
the Job Qualification Inventory) and the code 
number given by the much-used Dictionary of 
Occupational Titles compiled by the United 
States Employment Service, along with the titles 
of the jobs in the particular occupational 
groups. * Counselors will find in this publica- 
tion valuable assistance through its presenta- 
tion of (1) an inventory which provides in an 
organized fashion the aspirations, training, ex- 
perience, and accomplishment of an advisee; 
(2) a readily usable method for comparing the 
composite of these qualifications with a vast 
variety of jobs; and (3) a full description of 
the technique used. The effectiveness of the 
technique suggested by Van Allyn depends, to 
a great extent, on how the inventory is filled 
in. Though the questions in the inventory are 
chosen and phrased to encourage objectivity, 
there remains much opportunity for subjective 
evaluation by the advisee. If the individual is 
assisted in filling out the inventory by being 
supplied as much evidence as is available con- 
cerning him upon the basis of test results, ob- 
jective observations of others, etc., this tech- 
nique should prove useful. Counselors who 
deal with youth having little or no occupational 
experience, with veterans, and with adults voca- 
tionally disoriented by conditions of the recon- 
version period will find worth-while assistance 
in Job Placement Reference. Tt is unfortunate 
that, because of its price, this publication may 
not be given as wide use as it merits, 
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Kuder Preference Record. Grades 9-16and adults; 
1934-48; IBM; 1 form; 2 editions; separate answer 
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sheets or pads must be used; 95¢ per 25 job charts; 
nontimed (50) minutes; G. Frederic Kuder; Science 
Research Associates. 
@) FORM BB: EDITION FOR SELF-SCORING. 48¢ per test; $2 
er 25 answer pads; So¢ per 25 profile sheets for either 
ys and girls or men and women; 75¢ per specimen set. 
b) FORM BM : EDITION FOR MACHINE-SCORING. IBM ; 35¢ 
per test; $2.35 per 100 machine-scorable answer sheets ; 
$7.50 per set of scoring keys; so£ per 25 profile sheets. 
C) FORM BI [A SHORT FORM FOR INDUSTRIAL USE]. 1948 ; 
IBM ; separate answer sheets must be used; 45¢ per 
test booklet; $2.10 per 25 answer pads; $1.35 per 25 
backboards; $2.35 per roo machine-scorable answer 
sheets ; $3 per set of machine-scoring keys; 5o per 25 
profile sheets; o5¢ per 25 job charts; 75¢ per specimen 
set; (30) minutes. This form is not reviewed below. 
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Ralph F. Berdie, Director, Student Coun- 
seling Bureau, and Associate Professor of 
Psychology, University of Minnesota, Minne- 
apolis, Minnesota. This inventory has been 
found clinically useful as a vocational counsel- 
ing aid in secondary schools, colleges, and uni- 
versities, in the Army, the Navy, and the 
Marine Corps, in the Veterans Administration, 
and in industry. It is easy to administer, only 
slightly monotonous to the individual tested, 
„and convenient to score. Time for administration 
is usually less than one hour. Hand scoring 
requires about ‘five minutes. Large groups of 
Form BM can be scored in even less time by 
running each answer sheet through the IBM 
test scoring machine eighteen times. Interpre- 
tation of the test is not difficult, and the clas- 
sification of occupations according to major 
interests in the manual facilitates this. 

Although the methods used in constructing 
the Kuder Preference Record were not as 
rigorous as those methods we would have pre- 
ferred, the final judgment as to the value of the 
test rests not upon how the test was constructed, 
but rather upon how well the test serves in 
clinical practice and how well it predicts those 
occupations in which individuals will derive 
the most satisfaction. Since the publication of 
the latest edition of this test in 1942, it has 
become one of the two most extensively used 
vocational interest tests, and data are now ac- 
cumulating which are relevant to this question 
of validity. 

The 1946 revision of the manual, which con- 
tains an abundance of well presented informa- 
tion, summarizes the important validity studies 
which have appeared. In presenting the case 
for the validity of his test, Kuder, as Strong 
does with his test, places most emphasis upon 
the fact that people in different occupations, 
when tested, obtain profiles characteristic of 
people in those occupations. The scores are 
presented in tabular form, and the profiles pre- 


sented graphically. Mean scores and standard 
deviations for each of the nine scales are given 
for men and women in 72 different occupations, 
where Strong, in his latest publication, presents 
data for only 56 different occupations. How- 
ever, no men’s occupational group studied by 
Strong had fewer than 147 people in it, and 
the sizes of the groups varied from 147 to 585. 
For these groups information is available con- 
cerning age, professional experience, and edu- 
cation. 

The men’s occupational groups studied by 
Kuder, ranged in size from 16 to 185, and no 
information is presented concerning these re- 
lated factors. Strong has shown that in empiri- 
cally constructing an interest test, criterion 
groups of 400 are desirable, if not necessary, 
and that groups of 200 may be deemed neces- 
sary. This generalization cannot be applied 
without experimental evidence in the valida- 
tion of previously constructed scales, but evi- 
dence for validity based upon only 16 cases, or 
even 60, must be considered extremely tenuous. 
The results presented by Kuder do show, how- 
ever, that each of the 72 occupational groups 
obtains mean scores on one or more of the nine 
scales which are statistically different from the 
mean scores on those scales for a base group 
of 2,667 men and another base group of 1,429 
women. 

Kuder also presents profiles originally ob- 
tained from a study by Barry, for groups of 
women students preparing to enter 24 different 
occupations. Here again the groups are small, ` 
ranging in size from 18 to 128, but again the 
obtained profiles were generally consistent with 
the occupation chosen. 

The reliability of the test is satisfactory, with 
test-retest reliabilities for the 9 scales ranging 
from .81 to .98. These compare favorably with 
test-retest reliabilities, ranging from .84 to .95, 
reported by Burnham for rr scales of the 
Strong Vocational Interest Blank. As the reli- 
ability of “test-interpretation” is not the same 
as the reliability of a test score, and as inter- 
pretations based upon profiles of test scores are 
different, we hope more reliable, from interpre- 
tations based upon single scores, these relia- 
bility coefficients must stand in their own frame 
of reference. 

As with other interest tests, only low cor- 
relations have been found between the Kuder 
Preference Record and achievement tests, such 
as the Cooperative English Test and the Iowa 
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High School Content Test. None of the re- 
ported correlations exceed .59, most are much 
lower. Even lower correlations have been found 
between the Kuder Preference Record and 
ability tests. A study of the relationships be- 
tween the Tests for Primary Mental Abilities 
and the Kuder Preference Record shows only 
one correlation coefficient of above .30. A simi- 
lar study between the Tests for Primary Men- 
tal Abilities and six scales of the Strong 
V ocational Interest Blank shows only one corre- 
lation coefficient of above .30. The intercorrela- 
tions between scores on the nine scales are also 
low. In a group of six studies, a total of 216 
intercorrelations are presented between the 
scales, and 36 of these are above .30, 14 above 
.40. As compared to other interest tests, the 
scales of the Kuder Preference Record meas- 
ure relatively independent functions. 

Many counselors find the specific occupa- 
tional scores obtained with the Strong Voca- 
tional Interest Blank are more useful and that 
the pattern analysis introduced by Darley gives 
a more complete picture of the personality than 
can be obtained by using a limited number of 
scores of a more general nature. The Kuder 
Preference Record, with its more general 
scores, however, does provide a quick and eco- 
nomical point of departure in helping students 
select occupations and the development of re- 
gression equations that will, on the basis of 
the nine scales, provide scores on specific occu- 
pations, should enhance the test’s clinical value. 


E. G. Chambers, Assistant Director of Re- 
search in Industrial Psychology, University of 
Cambridge, Cambridge, England. This test is 
an ingenious attempt to obtain an objective 
score for a person’s occupational preferences 
with the aim of helping in vocational guidance. 
The subject is presented with 169 groups of 
occupations: there are three occupations in 
each group and the subject is required to rank 
all three in order of preference. From the an- 
swers given, a score is computed for each of 
nine occupational spheres, including mechanical, 
persuasive, musical, etc. 

The reviewer’s small acquaintance with the 
use of the Kuder Preference Record suggests 
certain criticisms. Firstly, the subject often has 
to express a preference between activities for 
which he knows he has no aptitude. This puz- 
zles some subjects since they feel sometimes 
that they are being asked impossibilities. Cer- 
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tain subjects are constrained in their answers 
by a knowledge of their own abilities or lack of 
abilities; others make an imaginative effort and 
give imaginative answers which may or may 
not have some value. Secondly, the subject is 
forced to make a choice in every instance. Thus, 
although he may dislike all three occupations 
in any one group, he still has to choose one 
as being the most preferred, and this “prefer- 
ence” carries equal weight with a preference 
which he feels positively and strongly. This 
counting of a dislike of varying degree as a 
positive preference appears to the reviewer to 
be unwarranted and a weakness in the test. 

It is rather difficult to see exactly what is 
meant by the “validity” of a test of this de- 
scription. Surely it is a vast assumption to take 
workers in a particular occupation as neces- 
sarily having strong occupational preferences 
for that sphere of employment? Very often it 
happens that individuals become experts in 
some career upon which they embarked through 
economic pressure or in some haphazard way 
and, though experts, their genuine preferences 
may be for something very different. Thus, a 
highly successful salesman may have a strong 
yearning to be a musician or an artist. 

More important from the point of view of 
practical vocational guidance is that the Kuder 
Preference Record places the emphasis upon 
preferences rather than upon capacities. The 
reviewer is strongly of the opinion that capaci- 
ties are much more important than prefer- 
ences in vocational guidance. Many careers are 
wrecked for individuals who follow their pref- 
erences without having the requisite abilities. 
On the whole, therefore, the reviewer feels that 
the information obtained by the use of this test 
is open to some doubt and tends to overempha- 
sise the preference factor in vocational guid- 
ance. It seems likely that a skilled interviewer 
could elicit important preferences more quickly 
and more reliably by normal interview meth- 
ods than by the use of this lengthy preference 
record. 


Donald E. Super, Associate Professor of 
Education, Teachers College, Columbia Uni- 
versity, New York, New York. The simple 
scoring of the Kuder Preference Record gave 
it a great advantage over Strong’s complex 
stencils when it was first published. The fact 
that Kuder had previously conducted research 
with his inventory for a period of six years 
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seemed something of a guarantee of validity. 

Unlike Strong’s blank, which had been de- 
veloped on occupational groups and so had vo- 
cational norms, the Kuder record had been 
developed on student groups, and the keys had 
been constructed on the basis of internal con- 
sistency and independence. For this reason its 
occupational significance was an open question 
until the inventory was administered to adults 
judged successful in various occupations, or 
students were followed up in order to ascertain 
the relationship between their interest scores 
and success in various vocations. 

Data have been accumulated which now give 
one some basis for interpreting Kuder scores 
in vocational guidance. The Kuder Preference 
Record has been used in twenty-two published 
studies to date, and its author has accumulated 
scores for persons in a relatively large number 
of occupations. These and related material have 
been published in a manual which stands out 
because of the breadth and detail of material 
covered. 

Enough data are on hand for the Kuder to 
be used with some confidence in vocational 
guidance, although the actual numbers in any 
one group are still generally small. Norms are 
for 2,667 adult men, classified according to 44 
occupational groups, and for 1,429 women in 
29 occupations. The groups range in size from 
16 (men English teachers and women language 
teachers) to 185. (men meteorologists). The 
patterns of interest scores tend to differ ac- 
cording to expectation. Accountants, for ex- 
ample, are significantly high on computational, 
literary, and clerical interests. 

Triggs compiled norms and developed an oc- 
cupational scoring formula for 826 nurses, as a 
group and according to specialties, such as 
public health nursing and supervision. The 
close agreement of Triggs’ means and stand- 
ard deviations with those in Kuder’s manual 
gives one more confidence in the validity of the 
data for his smaller groups. By means of such 
occupational indices it is possible to obtain 
Kuder scores which have the same type of pre- 
dictive value as Strong’s scales. In addition, 
Kuder's technique preserves the purity of traits 
which results from internal consistency and in- 
dependence and makes the development of new 
occupational indices less laborious than Strong's 
technique. Occupational indices are now avail- 
able only for nurses and for accountant- 
auditors, but additional occupational indices 


will be developed. Kuder assumes a responsi- 
bility for the continued development of his in- 
strument which has characterized the best tests 
of the past. 

One of the principal weaknesses of Strong’s 
blank has been the presence of a factor which 
makes scores as office worker, nurse, elemen- 
tary school teacher, and housewife so common 
as to render impractical the use of these keys. 
That the Kuder does differentiate nurses and 
stenographers from women in general suggests 
that it may be more valuable for use with 
women than the Strong. 

A survey of the literature shows that grades 
tend to be related to appropriately measured 
interests in some cases, but not in others, de- 
pending upon whether or not there is sufficient 
range of interest in the group in question. 

No studies appear to have been made which 
correlated Kuder scores with achievement or 
continuation in a vocation, as Strong has done 
with his inventory. 

Various investigators have studied the rela- 
tion of Kuder scores to Strong’s scores. In gen- 
eral the relationships were disappointingly low 
for those who have thought of the Kuder as 
a cheaper substitute for the Strong blank, for 
they ranged from .28 (literary-author) to .73 
(scientific-chemist) for the supposedly related 
scales. To interpret Kuder scores on the basis 
of Strong validations would be quite unjusti- 
fiable. Even in the case of scales which do tend 
to agree, e.g., the persuasive and the business 
contact, the correlation is far from perfect, and 
one occasionally finds surprising lack of agree- 
ment. 

More research needs to be done before the 
record can be considered a well-understood in- 
strument, but it is already a valuable tool in 
the counselor’s kit. More studies which will 
throw light on what the Kuder is measuring 
and how it differs from Strong’s blank need to 
be made. At the same time the pioneering 
which has characterized Kuder’s work should 
be encouraged. 


Occupations 21:267-9 N '42. Arthur H. 
Brayfield. * Kuder attempts to identify certain 
generalized activity patterns which are psycho- 
logically meaningful. This approach to the 
measurement of interests or preference is in 
line with factor analysis studies of the problem 
as well as with clinical experience in using in- 
terest inventories. It is consistent with the 
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"pattern analysis" interpretation of other in- 
terest inventories. A point favoring the Kuder 
blank is the extreme care taken in its construc- 
tion and standardization. The author stresses 
that it is still in an experimental stage and that 
more studies are in progress regarding its use 
and validity. For example, Kuder reports in 
the Manual an exploratory study using the dis- 
criminant function to identify special occupa- 
tional interest groups. The reviewer recom- 
mends the cautious use of the Preference Record 
with the feeling that, at the present time, it 
should be used to supplement but not to replace 
available interest measures which have been 
more adequately validated. Counselors should 
add to their experience with the Record before 
using it widely, Experience seems to indicate 
that significant scores on the Kuder artistic key 
are “easier to get” than significant ratings on 
the Strong artistic key. The Kuder artistic 
scale may make it rather easy for people with 
avocational interests in this area to score high. 
Studies quoted above show promise in the 
Preference Record for identifying interests 
which have curricular significance. Such hy- 
potheses will be clarified with more research. 
Studies on the occupational significance of the 
test need to be made. This review applies only 
to the Form A series. There is little informa- 
tion available regarding the new Form BB. 
Norms for the new test are inadequate. The 
claims of the publishers regarding both forms 
should be evaluated carefully in the light of 
the present research. 


For reviews by A. B. Crawford and Arthur 
E. Traxler of an earlier edition, see 40:1671. 


[ 641 ] 
*Motivation Indicator. Grades 10-12; 1947; IBM; 
1 form; separate answer sheets must be used ; $1.25 per 
25; 256 per 25 machine-scorable answer sheets; $1 per 
set of scoring stencils; 25¢ per specimen set ; nontimed 
(45) minutes; G. Bernard Baldwin; Educational Test 


Bureau. 


Norman. Frederiksen, Associate Professor of 
Psychology, Princeton University ; Research As- 
sociate, College Entrance Examination Board ; 
Princeton, New Jersey. The Motivation Indi- 
cator is an interest test designed for high 
school students. Its intended use is mainly to 
reveal to the classroom teacher the predomi- 
nant curricular interests of each student in 
order that they may be used as a basis for mo- 
tivating that student to greater effort and im- 
proved learning. It is also suggested that the 


test may be used, along with other information, 
in vocational counseling. 

The test yields fourteen scores. Nine scores 
are related to the high school curriculum (bio- 
logical sciences, physical sciences, social sci- 
ences, literary arts, graphic arts, industrial 
arts, agricultural arts, clerical work—verbal, 
and clerical work—number). Five scores are 
related to “the motivation of social behavior” 
(altruistic, promotional, administrative, dis- 
tributive, and creative planning). Exactly how 
these interest areas were arrived at is not clear 
from the description given in the manual. An- 
swers are recorded on a separate answer sheet 
which may be scored either by hand or by 
machine. 

The test involves a “multiple-matched com- 
parison technique." The part of the test in- 
volving curricular interests is a double 8% X 
11 inch page on which are printed 100 boxes 
in ten columns. In each box is printed a brief 
description of a “thing to do.” The directions 
use the analogy of a map showing city blocks 
with streets. The student is directed to start in 
the top street and go from left to right, pausing 
at every second intersection. At each of these 
intersections he reads the four “things to do” 
in the four adjacent blocks, chooses the two he 
likes best and indicates them by blackening 
spaces on a separate answer sheet. When he 
has finished the horizontal streets, he traverses 
the vertical streets in the same manner. When 
the test is completed, each item or “thing to 
do” has been compared with the eight items 
which surround it, and it could have been 
chosen four times. Since there are nine items 
in each area of interest, the maximum score for 
any area is 36. The arrangement is similar for 
the social interest items, except that there are 
fewer items and the maximum score is 20. The 
testing method is ingenious, but one wonders 
how much difficulty the students find in follow- 
ing the rather brief directions. 

Two paragraphs in the manual are devoted 
to a discussion of the necessity for choosing 
items which will serve as stimuli for “recalling 
to the student his own satisfying experiences.” 
Certain of the items, such as “Study new 
methods of helping young people make wise 
vocational choices” and “Assist in a revision 
of the items of an aptitude test,” seem rather 
remote from the experience of most high school 


students. : . 
The reliabilities of the nine scores dealing 
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with curricular interests are in general satis- 
factory, while those for social interests are in 
general too low for use in individual guidance. 
The reliabilities were computed by the test- 
retest method with a six-day interval and are 
based on 75 high school sophomores. The nine 
reliability coefficients for curricular interests 
vary from .87 to .96, with a median of .94. The 
reliabilities for the five social interests vary 
from .64 to .87 with a median of .84. 

As evidence of the validity of the Motivation 
Indicator, a study is cited which is based on a 
comparison of mean test scores obtained by a 
high-interest criterion group and a low-interest 
criterion group. Students were assigned to one 
or the other of these groups, with respect to 
each interest category, on the basis of criteria 
involving curriculum selection, hobbies and 
activities and achievement. About half the popu- 
lation could not be assigned to either the high or 
the low category. The critical ratios of the dif- 
ferences between mean interest test scores of 
high and low criterion groups in the various 
areas ranged from 9 to 13. In the absence of any 
report of the size of the groups used, it is impos- 
sible to evaluate such evidence of validity. If N is 
sufficiently large, even a small difference might 
be highly significant. It is stated only that junior 
and senior students in two high schools were 
used. 

The test is also alleged to be valid on the 
basis of evidence pertaining to internal con- 
sistency. Items were selected on the basis of 
high intercorrelations among the items in each 
area. It is stated that the “median coefficient of 
intercorrelation between items in an area was 
.72,” which would seem to indicate high inter- 
nal consistency, The median intercorrelations 
for the various areas are not given, and no 
statistics are presented to show that items in 
one area have low correlations with items in 
other areas. It is stated that the fourteen areas 
were chosen because they “were most clearly 
differentiated from each other.” The intercor- 
relations of scores in the various areas are not 
presented. 

A table of norms is included, based on some- 
what more than a thousand students in twenty 
high schools in the state of Washington. Repre- 
sentative sampling is said to have been secured 
by “proportional inclusion of agricultural, in- 
dustrial, and cultural populations.” Grade norms 
are not presented, since means vary but little 
from grade ten to grade twelve. Although sex 


differences are marked, no sex differentiated 
norms are presented. 


Arthur E. Traxler, Associate Director, Edu- 
cational Records Bureau, New York, New 
York. This new inventory is designed to meas- 
ure interests in nine curricular areas and five 
social motivation areas. The curricular groups 
are biological sciences, physical sciences, social 
sciences, creative writing, graphic arts, indus- 
trial arts, agriculture, clerical work (verbal), 
and clerical work (number). The social areas 
are altruistic, promotional, administrative, dis- 
tributive (sales), and creative (planning). 

The curricular division of the inventory con- 
tains 81 items, and the social division has 25 
items. Each item contains statements of four 
activities from which the subject is required 
to choose the two he likes best. Through an 
ingenious “block” arrangement of the booklet, 
each activity in the curriculum motivation divi- 
sion is matched with eight others surrounding 
it—one from each of the other eight areas, The 
social-motivation part is arranged in a similar 
manner, except for the fact that each activity 
is matched with activities from only four other 
areas. All responses are entered on separate 
answer sheets which may be hand scored or 
machine scored. 

There are 212 responses in the entire inven- 
tory, of which 162 are in the first division and 
50 are in the second. The Indicator is adminis- 
tered without a time limit. Most individuals 
would probably require 20 to 30 minutes to 
complete it. 

Since each of the nine activities in each area 
of the curriculum section and each of the five 
activities in each area of the social motivation 
section has an opportunity to be chosen four 
times, the highest possible score is 36 for each 
curriculum area and 20 for each social area. In 
the manual of instructions, Baldwin reports 
test-retest reliabilities of the area scores based 
on results from high school sophomores. The 
reliabilities of the curriculum areas range from 
.873 to .957 with a median of :94; those for the 
social areas range from .64 to .87 with a median 
of .84. With the exception of the coefficient of 
64 for creative initiative, these reliabilities 
seem definitely high for the test-retest method 
when applied to subtests containing only 36 
and 20 responses respectively. If these coeffi- 
cients are verified in subsequent studies, the 
Motivation Indicator will be among the most 
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reliable of all tests of similar length. At the 
time this review was written the test was not 
available in quantity for use in checking this 
point. 

In general, a good case can be made for the 
comparison technique employed in items in 
such inventories as the Baldwin's Motivation 
Indicator and the Kuder Preference Record. 
This procedure involves the comparison of each 
activity with a variety of other activities and 
thus allows the ranking of the activity among 
diverse experiences. Thus, the procedure prob- 
ably results in a more precise indication of 
one’s interest in a particular activity than can 
be obtained from an inventory which requires 
the subject merely to say whether he likes, dis- 
likes, or is indifferent to the activity. On the 
other hand, within a given period of time, in- 
terest in a larger number of different activities 
can be sampled by the latter method. 

One weakness in the use of the comparison 
type of item is that unless great care is exer- 
cised in the construction of the items, activities 
which are in reality not comparable will some- 
times be paired. For example, the activities in 
one of the items of the Motivation Indicator 
are: (1) “prepare the script for an amateur 
radio program”; (2) “study the construction 
plans for an airplane”; (3) “improve livestock 
by careful breeding”; and (4) “illustrate the 
title page of the school annual.” Activities 1, 2, 
and 4 are short-term projects requiring from 
a few hours to perhaps a day or two. Activity 
3, however, is a long-term project calling for 
years, or even a lifetime, of work. An individ- 
ual who had an interest in agriculture might 
not choose number three simply because it 
seems to involve such a long commitment in 
comparison with the other three. 

Answer sheets for the Motivation Indicator 
were filled out by two members of the staff of 
the Educational Records Bureau. There was 
agreement that the setup of the booklet is rather 
confusing to one taking the test and that if the 
Indicator were administered to a group careful 
supervision would be required. The pattern of 
the area scores and percentiles agreed with sub- 
jective impressions, although it was felt that in 
some instances the differences between areas 
were exaggerated by the Indicator scores. The 
scoring keys definitely could be improved to 
facilitate speed and accuracy of scoring. The 
publisher has attempted to reduce the number 
of scòring sheets to be handled by printing the 
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keys for two curriculum areas on each of four 
sheets. This arrangement is very confusing to 
the scorer. There should be a separate key for 
each curriculum area. 

The Motivation Indicator is an interesting 
and novel addition to the available interest in- 
ventories, but more research data are needed 
on which to base a recommendation concerning 
the use of this new instrument in a guidance 
program. 


J Consult Psychol 11:157 My-Je '47. * By 
means of an ingenious and somewhat complex 
system of administration, each response of an 
examinee is a choice among four of the items, 
of which two are chosen as preferred activi- 
ties, * 

[ 642 ] 
Occupational Interest Blank for Women. 1931; t 
form; scoring scales are available for 10 occupations : 
bookkeepers, grade school teachers, high school teach- 
ers, office clerks, office managers, private secretaries, 
retail saleswomen, sales proprictors, stenographers, 
trained nurses; 3¢ per blank; $1 per set of scoring 
scales; Grace E. Manson; University of Michigan 
Press. 
REFERENCES 
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44 Laveen, Grace ErrzamerH, The Vocational Interests of 
High School Girls: As Inventoried by the Strong and Manson 
Blanks, Columbia University, Teachers College, Contributions 
to Education, No. Bszi Harry D. Kitson, faculty sponsor, New 
York: Bureau of Publications, the College, 1942. Pp. vil, 102. 
$1.60. * (PA 16:5023) 


Gwendolen Schneidler Dickson, Chief, Edu- 
cational Counseling Division, Veterans Adminis- 
tration, Washington, D. C. As indicated by the 
test author, “The present project is but another 
link in the long chain of investigations into the 
subtleties of interests and personality traits 
(2)." It should be recognized that the Occu- 
pational Interest Blank for Women represents 
one of the pioneer studies in the development 
of instruments for measuring the occupational 
interests of women. 

Other workers, such as Strong and Kuder, 
however, have since produced superior and 
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more comprehensive instruments which are 
available to counselors and which should be 
used in place of this earlier test when it is de- 
sirable to measure women’s interests. 


Frances Oralind Triggs, Educational Records 
Bureau, New York, New York. This test, copy- 
righted 1931, consists of 160 items, all names 
of occupations, The responses are made in a 
five-degree scale: like very much, like, no de- 
cided feeling, dislike, dislike very much. The 
inventory may be scored for ten occupations : 
private secretary, office manager, bookkeeper, 
stenographer, office clerk, high school teacher, 
grade school teacher, trained nurse, sales pro- 
prietor, retail saleswoman. The scoring is simi- 
lar to that used to score Strong’s Vocational 
Interest Blank. Each item has ten numerical 
weights differentiating the value of the score 
according to the ten occupations for which the 
inventory may be scored. These weights were 
arrived at by comparing the responses of 150 
persons in each occupation to a group of people 
in general. 

The study, of which this test is an outcome, 
Was a very ambitious one. It was published in 
the Michigan Business Studies and will stand 
as a classic in the history of objective measure- 
ment of interests (2). 

There have been some excellent evaluations 
of the results of using this test. For instance, 
there are data to indicate the extent of inter- 
correlation of scores on the test, the relation- 
ship of scores to age, the relationship of scores 
to general and scholastic aptitude and to other 
instruments purporting to measure interests, 
the relationship to vocational choices made by 
persons taking the test, the relationship to suc- 
cess or degree of satisfaction in a chosen edu- 
cational or vocational field, and the relation- 
Ship to sociological factors. Spearman-Brown 
reliabilities of the scores on the scales are re- 
ported by the author and range from approxi- 
mately .73 to .98. 

In the light of the work which has been re- 
ported in the measurement of interests of 
women, this test cannot be too severely criti- 
cized. However, certain comments might be 
made. The fact that the items on the test are 
based only on the names of occupations “com- 
monly open to women” is a restricting factor 
from the outset. Women are not entering only 
occupations “commonly” open to them in our 
Present society. The fact that the test is scored 


for these occupations only does not allow the 
necessary breadth for a counseling tool. There 
is some indication in the literature that there 
may be a selective factor operating in the norm 
groups used as a basis for the present stand- 
ardization of the test. 

It should be said again that historically this 
instrument and the research based on it con- 
tinue to have value in the study of interest 
measurement in general. Any use of this in- 
Strument as a counseling tool should be ex- 
perimental only, as it has probably been re- 
placed by others which have more value. 


[ 643 ] 
*Occupational Interest Inventory. Grades 7-9, 10- 
13 and adults; 1944-46; IBM ; 1 form; 2 levels; $175. 
per 25; 25¢ per specimen set of any one level, postpaid ; 
7€ per copy, machine-scoring edition; 2¢ per machine- 
Scorable answer sheet; nontimed (30-40) minutes; 
UE A. Lee and Louis P. Thorpe; California Test 
ureau. 
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Edward S. Bordin, Director, Student Coun- 
seling Center, and Associate Professor of Psy- 
chology, The State College of Washington, 
Pullman, Washington. This is still another 
vocational interest test which attempts to edge 
into the field covered by the Kuder Preference 
Record and Strong's Vocational Interest Blank. 
Unfortunately, like so many of the other re- 
cent tests, its authors have not yet paid the 
price in research necessary to obtain unquali- 
fied acceptance of their instruments. 

The manual accompanying the test does not 
even pretend to present empirical evidence of 
its validity. Instead its validity is argued on 
logical grounds, namely, that the Inventory was 
constructed by following admirable procedures 
in the selection and framing of items, Esti- 
mates of the reliability of each of the ten meas- 
ures derived from the test are provided as well 
as percentile norms by sexes and for composite 
populations based upon substantial numbers of 
cases. One would like to know more about the 
composition of the norm groups for the ad- 
vanced form, but the manual simply states that 
the sample included high school and college 
students, veterans, and adults, without specify- 
ing the proportions. 

The format of the test and manual are rea- 
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sonably effective and attractive. The descrip- 
tions of what each scale measures are clear, and 
the idea of using illustrative cases to aid in 
clarifying the interpretation is a good one. But 
without adequate studies to buttress them, these 
materials cannot be accepted. Unfortunately, 
the sales of such tests are too often a function 
of the merchandising methods of the publisher 
rather than of the quality of the test. 


Stanley G. Dulsky, Chief of Staff, Chicago 
Psychological Institute, Chicago, Illinois. This 
inventory measures interests in six fields: 
personal-social, natural, mechanical, business, 
the arts, the sciences; it determines three types 
of interest: verbal, manipulative, and computa- 
tional; and it reveals three levels of interest: 
low (simple, routine), medium (skilled) and 
high (original, creative). 

The method for determining interest is the 
paired comparison method. The authors state, 
“There are no purely objective criteria for es- 
tablishing the validity of an occupational inter- 
ests inventory. . . . However certain factors 
may add or detract from the validity. . . ." 
They have tried to make their test valid by 
carefully selecting the items from the Diction- 
ary of Occupational Titles so as to get a repre- 
sentative sample of occupations; they have 
designed the items to avoid reactions to occupa- 
tions and to get instead reactions to groups of 
activities; they have tried to achieve a g 
balance among the items by presenting 240 
groups of activities associated with six major 
interest fields, 40 of them in each major field. 
The 40 choices in each field fall into three levels 
so that an item in one field at a low level is 
compared with an item in another field at a low 
level; they have presented the items by the 
paired-choice method. 

Reliability of the scale has been determined 
by repeating it after four weeks on 100 twelfth 
graders in the case of the Advanced Series and 
on 180 ninth grade students in the case of the 
Intermediate Series. Coefficients of reliability 
on the Advanced Series range from .88 to .93 
for fields, types, and levels. In the Intermediate 
Series the reliability coefficients range from 84 
to .93. 

Percentile norms were obtained from over 
1,000 twelfth grade students in a number of 
California high schools fer the Advanced Series 
and from over 1,000 ninth grade students in a 
number of California secondary schools for the 


Intermediate Series. The authors claim that the 
norms for the Advanced Series may be used 
“not only for high school and junior college 
students but also for the general adult popula- 
tion.” For the Intermediate Series they claim 
that the norms are suitable not only for high 
school students "but also for the general adult 
population, especially those who lack technical 
or professional training.” An investigation of 
the norms themselves shows very little differ- 
ence between the Intermediate and Advanced 
Series. 

In the reviewer’s opinion each of the six 
fields is too large and complex and contains too 
many diverse occupational activities. For ex- 
ample, the personal-social field contains activi- 
ties such as domestic service, personal service, 
teaching, law and law enforcement, social serv- 
ice, and health and medical service. A person 
interested in teaching would probably score 
high on teaching and social service and low on 
the other four sections, thereby lowering the 
score on the field as a whole. A similar condi- 
tion exists with the business field. For this 
reason total score might mask an important 
occupational preference that could be detected 
only by a careful analysis of the individual sec- 
tions making up the field. 

The analysis into types of interests—verbal, 
manipulative, and computational—is good and 
will help to overcome some of the objections 
mentioned above. In the reviewer's opinion 
the level of interest is the least important of the 
data yielded by this inventory. The level of 
interest at which an individual wants to work 
is determined primarily by intelligence. Even 
though a very intelligent boy who is capable 
of working at a professional level indicates a 
low level of interest, no one would guide him 
into a vocation that did not make full use of 
his mental ability. Similarly, if a dull boy ex- 
presses a high interest in creative and original 
work, he would still have to be guided into a 
skilled or semiskilled occupation. In using this 
inventory with a number of high school stu- 
dents, the reviewer has never encountered a 
level of interest below the medium, or skilled, 
level. 

The authors have given no information re- 
garding the interests of adults who are working 
in specific occupations. Can we assume that a 
high school boy interested in mechanical and 
scientific fields also possesses the same kind of 
interests as a group of mature engineers or 
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people entering the engineering profession? On 
what grounds do the authors state that the 
norms obtained on high school students are 
also suitable for the general adult population? 
They present no evidence to indicate that this 
is true. In presenting information on the norms 
the authors should have indicated the exact 
number of students on whom the test was 
standardized and the kind of high schools from 
which they were selected rather than to give a 
blanket statement that the norms were obtained 
“from over 1,000 inventories given to twelfth 
grade students in a number of California high 
schools.” Certainly the norms would be affected 
by the type of school and the type of curricu- 
lum offered to the students used in the stand- 
ardizing population. 

In the reviewer's opinion the Kuder Prefer- 
ence Record is superior to this inventory for 
revealing broad occupational interests, not only 
of high school students, but also of adults. 


[ 644 ] 
*Occupational Interests: Self Analysis Scale. Ap- 
plicants for employment; 1943; separate scales for men 
and women; $3 per introductory set including ro tests, 
postpaid ; 4¢ per extra test, postpaid; introductory sets 
may be ordered on approval; nontimed (8-12) min- 
utes; Eugene J. Benge; Management Service Co. 


Stanley G. Dulsky, Chief of Staff, Chicago 
Psychological Institute, Chicago, Illinois. This 
is one of those “quickies” that can be concocted 
as fast as a stenographer can type. Nothing will 
injure the field of interest determination so 
rapidly and completely as the production of 
armchair inventories of this kind. Any resem- 
blance between this scale and a scientific instru- 
ment is purely coincidental. Saying any more 
about this scale would constitute a gross waste 
of space. 


[ 645 ] 
*Primary Business Interests Test. Grades 9-16 and 
adults; 1942; IBM; $1.65 per 25 machine-scorable 
test-answer sheets; 35¢ per set of hand-scoring sten- 
cils; $8 per set of machine-scoring stencils; so per 
specimen set; nontimed (20) minutes; Alfred J. Car- 
dall, Jr.; Science Research Associates, 


REFERENCES 
1. CARDALL, ALFRED J. A Test for Primary Business Inter- 
ests Based on a Functional Occupational CI lassification, Unpub- 
lished doctor’s thesis, Harvard University, 1941. 
2. CARDALL, ALFRED J. “A Test for Primary Business Inter- 
ests Based on a Functional Occupational Classification." Ed & 
Psychol Meas 2:113-38 Ap '42. * (PA 16:3767) 


George K. Bennett, President, The Psycho- 
logical Corporation, New York, New York. 


The 75 items comprising this interest inventory 
are fairly specific titles descriptive of business 


activities. For example: “Pay bills and bring 
back receipts”; “Set up new system of ac- 
counts.” Great emphasis is placed by the author 
upon the systematic selection of these items 
from job-analysis data. The 2,000 odd descrip- 
tive terms so obtained were then reduced to 
245 items upon which a "cluster analysis" was 
carried out. As a result, 75 items were retained 
and five scoring scales constructed. The items 
included in each scale were selected on the 
basis of high mutual intercorrelation. Weights 
were assigned to each item in proportion to its 
estimated importance. 

This interest check list may well be useful in 
guidance if the counselee has had some experi- 
ence with all or most of the activities listed and 
where it is important to locate an area within 
the field of business. 

Some portions of the manual are misleading, 
others are confusing, and at least one is clearly 
incorrect. 

Reference is made in the manual and in the 
descriptive journal article (2) to the rigorous 
search for validity. Upon examination it ap- 
pears that the author is satisfied that an item 
(and consequently a score) is "valid" because 
the item appears frequently in job analysis and 
also correlates positively with other similar 
items. No evidence is given regarding the pre- 
dictive efficacy of any scale. 

Since only a few items are scored for more 
than one scale, the number of items per scale 
is very small, ranging from a maximum of 23 
for the accounting scale down to 6 for collect- 
ing-adjusting. Nevertheless split-half reliability 
coefficients of .73 to .92 are reported. These 
coefficients have apparently been computed on 
data from the same group as was used in the 
item selection procedure. F urthermore, several 
of the items within any scale are overlapping 
or paraphrases of each other. The propriety of 
this type of reliability determination is open to 
serious question. 

In an attempt to demonstrate the discrimi- 
native power of the test, the manual cites criti- 
cal ratios between commercial and academic 
high school seniors ranging from 1.23 to 13.15. 
The numbér of commercial seniors is 323, no 
number or standard deviation being given for 
the academic group. Aside from the dubious 
practice of using the critical ratio as an index 
of discriminating power, the interpretation of 
the critical ratio is in error. Referring to four 
CR's of 3.3 and above, the manual states, 
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“These ratios are unusually satisfactory, indi- 
cating that there is less than one chance in 
1,000 that there is any likelihood of a student's 
score in the commercial group falling below 
the mean in the noncommercial group." Tak- 
ing the mean scores for the academic group 
given on page 6 of the manual and comparing 
these with the percentile distributions for the 
commercial group appearing on page 5, the 
absurdity of this high-flown statement is at 
once apparent. The percentage of commercial 
students falling below the mean of academic 
students becomes, for each scale, the following: 
accounting, over 20; collecting and adjusting, 
about 40; sales-office, over 40; sales-store, over 
30; steno-filing, over 10. 

One further defect warrants mention. Norms 
appear to include persons of both sexes. Since 
other research on vocational interest has dis- 
closed striking sex differences (e.g, very few 
boys are interested in stenographic activities), 
the present author may be considered negligent 
in failing to mention this factor if, indeed, he 
has given it any consideration. 


Glen U. Cleeton, Director, Division of Hu- 
manistic and Social Studies, Carnegie Institute 
of Technology, Pittsburgh, Pennsylvania. The 
purpose of this test according to the compiler 
is "to measure an individual's preference for 
the specific job activities which characterize be- 
ginning business jobs. These immediate and 
specific preferences point to the initial job, 
predict the individual's interest or boredom in 
his first activities, and determine to some ex- 
tent his progress in his work. Scores indicate 
the relative extent of interest in five functional 
occupational classifications." 

The five classifications are: (a) accounting, 
(b) collections and adjustments, (c) sales- 
office, (d) sales-store, (e) stenographic-filing. 
To the extent that the test measures such inter- 
ests, it obviously would be useful in educational 
and vocational counseling and in placement in 
commercial employment. 

In developing the test, experimental test 
items, selected by job analyses from work 
diaries of ro6 individuals, were subjected to 
frequency study. Of 2,000 test items thus ob- 
tained, many were discarded and a residual of 
115 were used to prepare an experimental 
edition of the test which was administered to 
285 individuals. Intercorrelations were then 
computed, and five occupational patterns iso- 
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lated. In the process the number of items was 
further reduced so that only 75 are included in 
the published edition of the test. 

The usual limitations characteristic of all in- 
terest tests with reference to evidence of valid- 
ity apply to this test. However, it would seem 
that the chief weakness of the test is the small 
number of items included for certain occupa- 
tional patterns. There are 23 items in pattern 
1, 6 items in pattern 2, 11 items in pattern 3, 
30 items in pattern 4, and 9 items in pattern 
5. With such a small number of items, relia- 
bility coefficients ranging from .73 to .92, com- 
puted on the basis of the split-half method, are 
not statistically defensible. The author argues 
that retention of a greater number of items to 
assure higher reliability "would have been at 
the expense of the validity of the test." This 
contention is, of course, likewise debatable. 

Only one reference to published studies re- 
lating to this test is included in the manual, 

Percentile norms based on scores of 323 high 
school commercial seniors and 304 business 
college freshmen are included in the manual of 
directions. Data are also included to show dif- 
ferences between scores of commercial high 
school seniors and academic high school seniors. 

Several occupations, classified on the basis 
of the initial data used in determining the five 
basic patterns, are set forth in the manual as 
an aid to interpretation. A case study is also 
reported as an illustration of the procedure 
recommended in counseling persons to whom 
the test has been administered. 

The test is self-administering and can be com- 
pleted in 20 to 25 minutes. An answer form 
for use in machine scoring is provided together 
with a perforated stencil for hand scoring. 


George A. Ferguson, Assistant Professor 
of Psychology, McGill University, Montreal, 
Canada, This test is designed to systematically 
appraise an individual’s preferences for selected 
job activities or work elements which occur with 
greatest frequency among the lower wage level 
business jobs. The author recommends its use 
by placement officers in the allocation of em- 
ployees to jobs that coincide with their specific 
business interests and by school administrators. 
in the allocation of commercial students to dif- 
ferent training programs. 

Reliability coefficients calculated by the 
method of split-halves are reported to range 
from .73 to .92 for scores obtained on the dif- 
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ferent groups of items. Whether any meaning 
can be attached in this context to coefficients 
thus computed is debatable. Some exploration 
of the stability of the scores and resulting in- 
terest profiles over even a relatively short 
period of time would have proved more infor- 
mative. It is not improbable that at least some 
of the items on this test are hypersensitive to 
minor occurrences in the experience of the in- 
dividual and reflect transient work attitudes 
rather than any basic and persisting interest 
component. 

To evaluate the use of the test in the coun- 
seling of students, the author tested a group of 
commercial and a group of noncommercial 
students. The differences between mean scores 
obtained by the two groups on the different 
interest areas may be regarded with one ex- 
ception as statistically significant, the critical 
ratios varying from 3.3 to 13.15. The author 
concludes that, "These ratios are unusually 
satisfactory indicating that there is less than 
one chance in 1,000 that there is any likelihood 
of a student’s score in a commercial group fall- 
ing below the mean in the non-commercial 
group.” This is an erroneous inference and 
represents an unfortunate misinterpretation of 
the facts. To illustrate, on collections and ad- 
justments the mean scores obtained for the 
commercial and noncommercial groups are re- 
spectively 13.2 and 11.12. The difference in 
means is 2.08, and the critical ratio 3.3. From 
these data we may guess roughly that the 
chances are perhaps about 400 in 1,000 that a 
student’s score in the commercial group will 
fall below the mean of the noncommercial 
group. The author is using an incorrect cri- 
terion to evaluate the discriminatory power of 
his test. In such cases highly significant differ- 
ences between means may be obtained, and yet 
a test may be of very little practical value in 
discriminating between groups. 

We may conclude that the author’s claims 
for his test are not proved. Considerably more 
research data are required, and a more rigor- 
ously critical attitude applied to the interpreta- 
tion of such data, before this test can be recom- 
mended for general use. 


[ 646 ] 
*Thurstone Interest Schedule. Ages 9-16 and 
adults; 1947; replaces Thurstone's Vocational Interest 
Schedule; 1 form ; $1.50 per 25, postpaid ; 35¢ per speci- 
men set, postpaid; 10(15) minutes; L, L. Thurstone; 
Psychological Corporation. 


[64 ] 

Vocational Interest Blank for Men, Revised. Ages 
17 and over ; 1927-38; revised scoring scales are avail- 
able for measuring maturity of interest, masculinity- 
femininity, studiousness, thirty-nine occupations, and 
six occupational groups (I, II, V, VIII, IX, X): 
Group I: artist, psychologist, architect, physician, 
dentist, osteopath ; Group II: mathematician, engineer, 
chemist, physicist; Group III: production manager; 
Group IV: farmer, aviator, carpenter, printer, mathe- 
matics-physical science teacher, policeman, forest serv- 
ice man; Group V : Y.M.C.A. physical director, person- 
nel director, public manager, Y.M.C.A. secretary, social 
science high school teacher ; city school superintendent, 
minister; Group VI: musician; Group VII: certified 
public accountant ; Group VIII : accountant, office man, 
purchasing agent, banker, mortician; Group IX : sales 
manager, ema salesman, life insurance salesman; 
Group X: advertising man, lawyer, author-journalis 
Group XI: president of a manufacturing concern; 
form; $2 per 25; 2¢ per report blank; $1 per scoring 
scale ; 2-9 scales, 8o each ; 10 or more scales, 70 each; 
75€ per 25 machine-scorable answer sheets; 756 per 
25 answer sheets for use with Hankes Scoring Ma- 
chine; 15¢ per specimen set; nontimed (40) minutes; 
Edward K. Strong, Jr.; Stanford University Press, 
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144. Borvtx, EnwAzD S. "A Theory of Vocational interests 
a Me BOR? Phenomena.” Ed & Psychol Meas 3:49.65 9b "A * 


(PA 17:3240) 


145. BRE! Joun M. MGlMsificoon of Teens in Ine 
jes.” tions 21:448-51 F '43. * 17:2159 
Tov ERE UMAN, 3 AMUEL F. Hests Scores and Graduation,” 


146. PUNAN Vtt, P- « (P4 17:1354 
OE RUE. Serer. The Stron Vocational Interest 
Blank as an Aid in Kocman Prognosis. Unpublished master's 

i lumbia University, 1943. 
DÀ Columbia a K., Ja. Vocational Interests of Men 
and Women. Stanford University, Calif.: Stanford University 
Press, 1943. Pp. xxix, 746. $6.50. * (London: Oxford Univer- 


. 40s.) (PA 18: 

Lo Weider, Je un TRUM MATS Auer qu 
D. “ 'omparison of Interest Measurement 

hee Te n reference Record ‘and the Strong Vocational In- 


terest Blanks for Men and Women.” Ed & Psychol Meas 3:239- 


"43. * (PA 18:26; 
iyu AOI RALER b "Factors Related to Vocational In- 
terests.” Psychol B A187; Mr '44. * (PA 18:2558) 

151. Bernie, RALPH F. “he Prediction of College Achieve- 
ment and Satisfaction.” J Appl Psychol 28:239-45 Je '44. (PA 


18:3277 y z 
2. ‘Canter, Haroto D. Vocational Interests and Job Orien- 
tation: A Ten-Year Review. Applied Psycholo, Monibgrapha 


he Ameri Association for Applied Psychology, 
Stanford "University, Calif.: Stanford University Press, May 
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1944. Pp. 85. Paper, lithotyped. * (London: Oxford University 
Press, 1 95. 2) (PA 18:3616 

153, UNLAP, Jack W., AND Harper, Berta P. “Profiles 
e HUS Scores.” J Higher Ed 15:159-60 Mr '44. * (PA 
18:25 

154. Kuper, G. Frevertc. “Note on ‘Classification of Items 
3) nterest Inventories’.” Occupations 22:484-7 My '44. * (PA 
18:3294 

155. T'riccs, Frances OmaLiND. "A Further Comparison of 
Interest Measurement by the Kuder Preference Record and 
the Strong Vocational Interest Blank for Men." J Ed Res 37: 


18:3307) 
HARLES G. “Vocational Interests of Voca- 
ion Officers." Occupations 24:25-32 O '45. * 


(PA 20:533) "Er 

15 . Lone, Louis. “Relationship Between Interests and 
Abilities: A Study of the Senne Vocational Interest Blank and 
the Zyve Scientific Aptitude Test.” J Appl Psychol 29:191-7 
Je '4s. * (PA 19:3175 

158. SEAGOE, May V. “Prognostic Tests and Teaching Suc- 
cess." J Ed Res CAU My ^5 * (PA 19:3184) 

189. SuuLTZ, Irvin T., AND BanNamas, BENTLEY. “Testini 
for dership in Industry." Trans Kan Acad Sci 48:160-4 
"as. * (PA SD 
, 160. Strone, Epwarp K., Jr. “The Interests of Forest Serv- 
ice Men.” Ed & Psychol Meas s:157-71 su '4s. * (PA 20:1243) 

161. STRONG, Epwarp K., Jr. "Interests of Public Adminis- 
trators.” Pub Personnel R 6: oor "as. * 

162. STRONG, EDWARD K, Jr. eighted vs. Unit Scales.” J 
Ed Psychol 36:193-216 A) 45, * (PA 19:2379) 

163. SUPER, NALD E. "Strong's Vocational Interests of 


"RES dede Women: A Special Review." Psychol B 42:359-70 Je 

45. 

164, TOWNSEND, AcATHA. "Achievement and Interest Rat- 

ings for Independent-School Boys," pp. Ds In 1945 Achieve- 
chools and Supple- 


tion of Salesmen b Psychological Tests,” 1127-. 
e note niat 20:1606) yt gi J Psychol 21:127-50 
168. Paterson, DoNALD G. “Vocational Interest Inventories 
in Selection.” Occupations 25:152-3 D "46. * (PA 21:1286) 
169. STRONG, Epwarp K., Jn. "Interest of Senior and Junior 
Public Administrators.” J Appl Psychol 30:55-71 F '46. * (PA 


ee ap A In i945 Fall 
‘choo! y 
Educational Records Bulletin, NOS PEN works 


Educational Records Bureau, Janu: 6. Pp. xiii : 
ithe ei. DI T eum 3) ary a p. xiii, 66. Paper, 

; Bordin, Epwarp S. “Relative idi - 
fessed Interests to Kuder and Strong Interest lex Son" 


Program in Independent 


tional Interest Blank for Men.” J Appi 


175. Super, Donato E, “Vocational Interests and Vocati 
hoice: Present Knowledge and Future Research in. Tet Rel 
Psychol Meas 7:375-83 au "47. * (PA 


22:3502) 


For reviews by Harold D. Carter, John G. 
Darley, and N. W. Morton, see 38:1178 and 


ee For related reviews, see 648, 650, and 
52. 


[ 648 ] 
*[Re Vocational Interest Blank for Men. Danrtey, 
Joun G. Clinical Aspects and Interpretation of the 
Strong Vocational Interest Blank. New York: Psy- 


chological Corporation, 1941. Pp. 72. Paper. $1.20. * 
(PA 16:328) 2 Dg 


J Ed Psychol 33:556—7 O '42. Bertha Peter- 
son Harper. * This well-written little book 


should serve as an informative source not only 
for the beginning student in this branch of psy- 
chology but also for the more experienced clini- 
cian, * Sound advice is provided those desiring 
to do actual counselling on vocational interest 
problems in a list of specific “‘do’s” and “dont’s” 
with helpful explanatory discussion. Darley 
finds that in vocational interviews it is far more 
effective in most cases to work "through the 
pattern analysis procedures and eventually ar- 
rive at the specific high letter grade, as coun- 
selling progresses." The author's suggestions 
are given more concrete form in the presenta- 
tion of a number of well-chosen illustrative 
cases; he also treats the case with no primary 
interest pattern. * Darley's concluding contri- 
bution in this monograph is a challenging list 
of problems that remain to be investigated in 
the field. Psychologists should find this work a 
most worth while and useful addition to the 
literature on the topic of interest measurement. 

Occupations 20:240-1 D "4r. Domald E. 
Super. This is the type of book about which 
the reviewer wants to write, not a mere review, 
but another book. It says so many things which 
should have been said some time ago, so many 
of the things that one has wanted to say oneself, 
and it says them so well that one is tempted 
to say them over again in his review. * an at- 
tempt to summarize extended clinical experi- 
ence with....Strong’s Vocational Interest Blank 
* an extremely important book. Darley has 
taken time not only to record his clinical obser- 
vations, but also to quantify his data whenever 
possible. Many users of Strong's Blank have 
made observations similar to many of Darley's, 
and some have made quantitative studies of 
specific questions; but few have organized their 
material so well for other workers. * four con- 
tributions will be noted. First, and perhaps 
most important, is the analysis of occupational 
Scores according to occupational families. * 
Darley has taken this type of analysis further 
than other users of Strong's Blank, and has in- 
cluded several statistical analyses of this prom- 
ising interpretive approach. A second important 
contribution is the chapter on the principles of 
counseling on the basis of Strong's Blank. All 
counselors, especially those with relatively 
little counseling experience, would do well to 
read and ponder these suggestions for test in- 
terpretation in the interview. The origin and 
development of interests are considered, the 
conclusion being that interests are the out- 
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growth of personality. Too little evidence is as 
yet available, but the suggestions are worth 
while. The clinical syndromes which Darley de- 
scribes are evidence of the type which is needed 
and should be helpful to practical counselors. 
Some interesting illustrative cases are included, 
together with a discussion of cases with no pri- 
mary interest pattern. Here, as in the discus- 
sion of other topics, Darley has pointed to chal- 
lenging and important research problems and 
has given useful suggestions for present prac- 
tice. All in all, the monograph is a valuable 
illustration of the contribution which can be 
made by the technician-counselor ; it is to be 
hoped that soon, more and longer books of this 
type will be available to guide the practicing 
counselor in the use of his most valuable tools. 


[ 649 ] 
Vocational Interest Blank for Women, Revised. 
Women ages 17 and older; 1933-47 ; the 1947 revision 
replaces Forms WA (for women who have out 
of school several years) and Form WB (for students) ; 
IBM; Form W; scoring scales are available for 
masculinity-femininity and 24 occupations : artist, au- 
thor, buyer, dentist, dietician, housewife, lawyer, labo- 
ratory technician, librarian, life insurance saleswoman, 
nurse, occupational therapist, office worker, physician, 
psychologist, social oie stenographer-secretary, 
teacher in elementary grades, teacher of English in 
high school, teacher of home economics in high school, 
teacher of mathematics and physical sciences in high 
school, teacher of physical education in high school, 
teacher of social sciences in high school, and Y.W.C.A. 
secretary ; $2 per 25; 1¢ per report blank; $1 per scoring 
scale; 2 to 9 scales, 8o¢ each; 10 or more scales, 70€ 
each; 75€ per 25 machine-scorable answer sheets; 75€ 
ner 25 answer sheets for use with Hankes Scoring 
Machine; 15¢ per specimen set; nontimed. (30-60) 
annie Edward K. Strong, Jr.; Stanford University 
ress. 
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45. Cawrzv, Anne Mary. “A Study of the Vocational In- 
terest Trends of Secondary School and [EE Women." Ge- 
netic Psychol Monogr 35:185-247 My '47. * (PA 22:450) 

Gwendolen Schneidler Dickson, Chief, Edu- 
cational Counseling Division, Veterans Adminis- 
tration, Washington, D. C. [Review of Form 
WA.] The ingenious technique which Dr. 
Strong has developed to obtain an individual's 
expressions of likes and dislikes and then to com- 
pare these responses with the distinctive patterns 
of interest which have been found to be charac- 
teristic of various occupational groups has been 
more effective when applied to men than to 
women. Whatever the reasons may be, experi- 
ence with this inventory shows that many 
young women do not possess distinctive pat- 
terns of likes and dislikes, as measured by this 
test, which are characteristic of specific occu- 
pational groups of women, such as physicians, 
Social workers, authors, librarians, artists, and 
lawyers. Instead, their interests are not differ- 
entiated from those of the general population 
of women, of housewives, and of occupational 
groups which do not possess distinctive interest 
patterns, such as stenographers, clerical work- 
ers, nurses and grade school teachers. Because 
of these limitations, the vocational counseling 
of young women with the aid of this test is 
sometimes more difficult than the counseling of 
young men. This does not mean, however, that 
the counselor should not attempt to use this 
instrument to discover the existence of such 
well-developed interest patterns as do occasion- 
ally appear in young women. 

Despite these limitations and others which 
have been pointed out by the test author and 
previous reviewers, this test is one of the best 
tools available to counselors for attempting to 
determine objectively the vocational interests 
of women. It should be used with care and 
clinical judgment and preferably in conjunction 
with another test of interest such as the Kuder 
Preference Record, which, by requiring expres- 
sions of preferences for activities, yields scores 
on several general interest dimensions which 
are as distinctive and reliable for women as 
they are for men. 


For reviews by John G. Darley and Ruth 
Strang, see 40:1681 and 38:1179. For related 
reviews, see 650 and 652. 


[ 650 ] 
*[Re Vocational Interest Blanks for Men and 
Women.] Carter, Haroto D. Vocational Interests 
and Job Orientation: A Ten-Year Review. Ap- 


plied Psychology Monographs of the American Asso- 
ciation ps Applied Psychology, No. 2. Stanford 
University, Calif.: Stanford University Press, May 
1944. Pp. 85. Paper, lithotyped. $1.50. * (London: Ox- 
ford University Press, 1944. 9s. 6d.) (PA 18:3616) 


Pub Personnel R 5:247 O '44. Edward B. 
Greene. * by far the most readable and au- 
thoritative work of its size that has come to 
hand on this subject. It is entirely devoted to 
examining the most important pieces of re- 
search in relation to vocational interests which 
have been published during the last ten years. 
It contains no case histories or illustrative 
tables, but it has documented references to 262 
technical articles. The author's knowledge and 
technical adequacy are probably second to none 
in this field. He and his associates have con- 
tributed important theories and facts over a 
period of r2 years. * divided into eight chap- 
ters of which the most important seem to be 
the second, "Nature of Concepts"; the fifth, 
"Validity and General Usefulness of Available 
Measures of Vocational Interests" ; the seventh, 
"Individual Differences in Interests"; and the 
eighth, “The Problem of Vocational Choice" 
* The book is very much needed to counteract 
the present high-pressure sales activity of those 
who claim that they can contribute greatly to 
the selection of industrial or civil service work- 
ers, sales managers, or executives by a stand- 
ard set of questions dealing with interests or 
emotional adjustments. At present the best in- 
terest questionnaires are about as “reliable” as 
the best general tests of intelligence. Interest 
inventories are useful “in the prediction of both 
educational choice and vocational choice. They 
are much less useful in predicting educational 
success or vocational success. . . . No objective 
definition of success seems to stand the strain 
of comparison of persons in various occupa- 
tions.” 


[ 651 ] 

*[Re Vocational Interest Blanks for Men and 

omen.] Hankes’ Answer Sheet for Vocational 
Interest Blank. The answer sheets must be scored on 
a speek scoring machine developed by E. J. Hankes 
and owned by Engineers Northwest; 1-49 copies, 3¢ 
cach; 50-90, 234€ each; 100-499, 234¢ each; 500-099. 
24%4¢ each; 1000 and over, 2¢ each; 70¢ for scoring one 
Hankes’ Answer Sheet (Men are scored for 42 scales, 
no group scales, and women for 19 scales) plus 10€ for 
each order of 5 tests or less; 9oé for scoring directly 
from test booklet; 80¢ for scoring from IBM answer 
sheets; E. J. Hankes ; Engineers Northwest. 


[ 652 ] 
X[Re Vocational Interest Blanks for Men and 
Women.] Stronc, Epwarp K., Jr. Vocational Inter- 
ests of Men and Women. Stanford University, 
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Calif.: Stanford University Press, 1943. Pp. xxix, 
S * ru Oxford University RS 405.) 
A 18:313 : 


Am J Sociol 49:378 Ja '44. Robert J. Havig- 
hurst. * Of most interest to sociologists are 
Strong's findings on differences of interest pat- 
terns between various social groups. * 

Ed Res B 24:136 My 16 *45. William H. 
Stone. * The presentation is well organized to 
meet the needs both of those readers who want 
only summaries of findings and of those who 
are interested in the technicalities of statistical 
research employed. * one is not likely to over- 
estimate the importance of this volume both to 
education and to guidance. 

J Appl Psychol 28:532-4 D '44. John G. 
Darley. * [this book] is not easy reading, but 
it is essential reading for those who must un- 
derstand the individual's motivating forces, of 
which interest is a significant one * While [the 
book's] organization seems straightforward and 
logical, it has occasionally led to some confu- 
sion and duplication in specific topics. For ex- 
ample, the topic of classifying occupations into 
families appears in Chapters Three, Eight, Ten, 
Fourteen and Twenty-two. Parallel to this, it 
is disconcerting to find early references to 
tables and discussions occurring much later in 
the book, as in the case of the text and many 
footnotes in the chapters comprising Part one. 
The author, by virtue of the amount and scope 
of his empiric data, is in the embarrassing posi- 
tion of having planted so many trees that he 
may not recognize the size and shape of his 
forest. The task of reporting the sheer number 
of researches, with the ramifications of the 
various studies, precludes the broad generaliza- 
tions and summary hypotheses one might wish 
to see. This criticism, however, should not be 
allowed to obscure the fact that in no other 
source can the research worker or clinician find 
so much that is provocative, essential, and sig- 
nificant in the field of interest measurement. 
The task of the reviewer and reader is merely 
made more difficult by the close attention re- 
quired to encompass all that the author has dis- 
cussed. * These comments do not do full jus- 
tice to the amount of material Dr. Strong has 
assembled, nor to the dispassionate and de- 
tached manner in which he presents it. One 
could almost wish he had overgeneralized his 
interpretations or been guilty of special plead- 
ing, so that the reviewer could denounce and 
defend with a fine show of critical skill. But 
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the book almost defies criticism first because it 
is a genuine and long-needed contribution to an 
important area of research and second because 
it deals almost entirely with an overwhelming 
mass of empirically derived data that speaks 
for itself over the entire range of problems in 
interest measurement. If the research that will 
follow after this volume is not significant and 
productive, it will be only because research 
workers have failed to study and understand 
the yeoman service Dr. Strong has performed 
for applied psychology. 

J Consult Psychol 8:51-2 Ja-F '44. Har- 
old D. Carter. * The presentation is such as 
to interest serious students of counseling, 
especially teachers, vocational advisors, and 
psychologists concerned with the problems of 
personality, motivation, and happiness. Strong 
reports penetrating research on a vital problem. 
* well written and attractively arranged * It 
lays a wealth of data before the reader, facili- 
tating his work by means of complete, accurate, 
and convenient arrangements of references, 
index, and tables. The general style and con- 
tent of the book will combine to make it attrac- 
tive to a wide audience. Strong has accepted 
the contributions of other workers enthusiasti- 
cally but critically, has assimilated them, and 
himself has grown in the process. Here he pre- 
sents the results of research, including his own, 
with ready admission of limitations, with im- 
personal scientific emphasis upon the truth, 
and with a positive and constructive attitude. 
Stimulating suggestions for further research 
are numerous, and are to be found stated ex- 
plicitly or contained implicitly in nearly every 
chapter. The reviewer accepts all the major 
conclusions presented by Strong in this book 
as true and significant. It is a pleasure to read 
a book in which the viewpoints are so closely 
in accord with the facts gathered from an im- 
posing array of significant publications. Teach- 
ers in courses on general, developmental, social, 
educational, and applied psychology will find 
Strong’s Vocational Interests of Men and 
Women a valuable reference or source book. 
Tt deserves to be read and discussed by every 
psychologist and guidance worker. C ontribu- 
tions of such importance appear but rarely. 

J Ed Psychol 35:186-8 Mr ’44. Stephen M. 
Corey. * Strong's entire argument bears upon 
two different problems involved in interest 
measurement: first, the differentiation of oecu- 
pational groups; and, second, assigning indi- 
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viduals to one or more groups depending upon 
their interest scores. The author believes that 
his interest scale does both of these tasks rea- 
sonably satisfactorily. The reliability of the in- 
strument, of course, has been well established, 
and the coefficients reported usually fall be- 
tween .85 and .9o. Even a reliability of this 
magnitude, however, leaves much to be desired 
when the scale is used for individual prognosis. 
Many questions of fundamental importance in 
appraising the interest inventory were answered 
by correlation data. * The reviewer at times 
was dissatisfied with the interpretation of co- 
efficients of a certain magnitude. For example, 
when the author described the relationship be- 
tween scores on his interest inventory and 
measures of intelligence as being about the mag- 
nitude of .40 (p. 49) the contention was that 
this represented a slight degree of association. 
Later on it was contended that a correlation 
of .73 was usually considered a high coefficient. 
Any criticism of this sort is not so serious as 
it might be in view of the fact that the author 
always gave the raw correlation so that a care- 
ful reader could make his own interpretation. * 
the evidence is clear that an adult can, if he 
wishes, check those interests, which will give 
him a high rating on a desired occupation ( p. 
686 f.). Strong's attitude toward this problem 
is expressed by the contention that “under nor- 
mal conditions there is little or no apparent 
temptation to fudge." (sic) (p. 687). The ar- 
gument behind this hope was not too convinc- 
ing to the reviewer. It reminded him again of 
the great gap that exists between highly refined 
psychological tests like Strong's questionnaire 
and the measuring instruments that are em- 
ployed by the physical scientists, The latter 
would have little confidence in a scale that 
would let a man weigh what he wanted to weigh 
even if under most circumstances men would 
want to know the truth (?) about their avoir 
du pois, The student who wants to bring him- 
self abreast of an important area which has 
not been summarized since Fryer’s Measure- 
ment of Interests, 1931, will find Strong’s vol- 
ume indispensable. Not only is it a 750-page 
storehouse of research data, but it also is full 
of hypotheses and questions which will be 
most provocative to research students in the 
field. 
Occupational Psychol 18:148-52 Jl ^44. 
Charles S. Myers. * Interests in occupations 
are, for Dr. Strong, determinable by likes, in- 
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differences and dislikes. He disarms criticism 
as to the psychological identity of "interests" 
with "liking" by stating that his aim has always 
been a practical one in the "atmosphere of ap- 
plied psychology. . . . Some 'pure' psycholo- 
gists,” he naively suggests, “should investigate 
experimentally the nature of interests and how 
they develop in early life" (p. viii). Although 
he devotes an entire chapter to the nature of 
interests, perhaps his position is not so dif- 
ferent from that of the testers of intelligence: 
we may not know enough to define its nature, 
but we may nevertheless contrive to assess it, 
and we may differentiate various kinds of it, 
with practical profit. * The candidate for guid- 
ance is required—surely unnaturally—to mark 
each of roo occupations enumerated on the 
Blank with an L, I or D, according as he likes, 
is indifferent to, or dislikes each of these hun- 
dred occupations. But this Blank is by no means 
confined to "interests" nor to "occupations." 
Of the 400 items to be checked therein, only 
100 relate to occupations, as such. * a large 
number of Dr. Strong's conclusions rest on the 
acceptance of the statistical manipulations to 
which he submits his data. Sometimes he seems 
to be aware of the difficulties of such accept- 
ance. Thus in endeavouring to assess, and to 
establish a belief in, the permanence of late 
adolescent and adult interests, he obtains a cor- 
relation of 0.75 in the case of college seniors 
with a second scoring obtained from them ten 
years later. But, as Dr. Strong observes, "it 
must not be overlooked that these measures 
represent the average person—for some, per- 
manence is much greater, and for some it is 
much less. We can never be certain, in the 
case of any one individual, that his interests 
will be as permanent as these figures sug- 
gest. . . ." (p. 51). Despite this caution Dr. 
Strong is prepared only to concede that “per- 
manence of intelligence-test scores is somewhat 
greater than for interest-test scores" (p. 359). 
* Many would doubt the value, for any given 
occupation, of an average score on “interests,” 
character and temperamental qualities thus cal- 
culated from several hundreds say of physicians 
(presumably including surgeons, so differerit in 
mental make-up!). They would also doubt 
whether interests, etc., do not notably change 
as, say, a medical student passes through his 
six or more student professional years. And 
they wonld doubt whether a college youth's 
"interests" inducing him to enter on a given 
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career are associated with adequate knowledge 
of what will later be required of him and of 
what will be required of him in the other 99 
listed occupations. Occasionally Dr. Strong an- 
ticipates such objections. Thus—rather sur- 
prisingly—he warns us “No one should blindly 
accept the guidance indicated by the interest 
test until he has ascertained what is required 
in the indicated occupation and has determined 
that he has the abilities and interests required 
for the work” (p. 55). Yet his great. faith in 
his life’s work persists unabated. For he con- 
tinues—“Knowledge of one’s interest score 
makes such a determination possible, for now 
one or only a few occupations need be investi- 
gated instead of all of them." * To estimate the 
validity of predictions....the criterion adopted 
by Dr. Strong is that of being still engaged 
in the occupation at the time of the follow-up. 
No attempt is made directly to ascertain either 
efficiency or happiness at the work. Thus the 
lowest criterion is chosen—that of discharge 
from the work in the case of inefficiency or of 
leaving in the case of discontent. * Dr. Strong 
presents an elaborate table of 703 inter-corre- 
lations between 38 occupations on the surely 
insecure basis of the interest scores of 285 col- 
lege seniors on pairs of occupational scales. 
Thus he is led to classify 36 of these occupa- 
tions into eleven groups, four of which, how- 
ever, contain only one occupation each. One of 
these groups consists of artist, psychologist, 
architect, physician and dentist. We may well 
wonder why physicians “score most like artists,” 
overlapping in scores by 37 per cent, “whereas 
accountants and bankers overlap with artists 
by only 7 and 8 per cent, respectively” (p. 
110). More reasonably, mathematician, physi- 
cist, engineer and chemist form another group; 
and lawyer, author-journalist, and advertising 
man form yet another group. On the other hand, 
another strange group consists of aviator, 
farmer, carpenter, printer, mathematics-science 
teacher, policeman and forest service. What 
can there be here psychologically in common? 
* Enough has been said to confirm the descrip- 
tion of its author on the paper cover as "the 
man who has done more than anyone else to 
develop [along special lines] the practical ap- 
plication of vocational-interest testing." But 
how far the lines on which he has proceeded 
will prove to be of permanent value must be 
left for the future to decide. Statistical meth- 
ods, even when justified statistically, are far 


from being always justified psychologically. 
Certainly, according to the statistical proofs of 
their validity adduced by Dr. Strong, his 
methods seem to work moderately well, en 
masse. They have the advantage of objectivity 
and may well have advantages for group-testing 
when the time available for vocational guidance 
and personnel selection is short, even although 
his scheme is a lengthy and expensive way of 
arriving at the information procured, and how- 
ever imperfect that information may be from 
the psychological standpoint. The value of re- 
sponses to questionnaires of this type is greatly 
enhanced if they are regarded as providing a 
basis for later discussion between the individ- 
ual applicant and his vocational adviser. And 
the value of such questions themselves arises 
partly from the fact that they can hardly fail 
to make the applicant think. 

Occupations 22:455 Ap '44. Donald E. 
Super. Dr. Strong's work in the measurement 
of vocational interests has for nearly twenty 
years been a landmark in the field of voca- 
tional testing, and a guidepost for vocational 
counselors. Any competent publication. which 
throws light on the nature of vocational inter- 
ests and on the use of Strong's Blank is there- 
fore likely to be of considerable interest to all 
counselors who are alert for opportunities to 
improve their insight and techniques. When 
the publication is by Dr. Strong himself, when 
it deals in detail with the numerous investiga- 
tions of the nature and' measurement of inter- 
ests which have been completed during the past 
twenty years, and when it is written in a clear 
and easily read style, it becomes a “must” book 
for every vocational counselor's library. * The 
organization of the book seems confused when 
the table of contents is first studied. This is 
primarily a typographical problem, however, 
resulting from the number of subheadings. 
These make the book more easily used, and 
need not discourage the reader. Although full 
of tables and figures, the material is clearly 
and interestingly presented. Any college grad- 
uate should be able to follow the argument and 
understand the conclusions. Certainly anyone 
qualified to use Strong’s Interest Blank should 
be able to read the book with ease, and every- 
one who uses the test or any other interest test 
should read it carefully and with considerable 
profit. Although Dr. Strong's treatment of in- 
terest testing is broad and by no means limited 
to his own work, there are some important 
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omissions. * Dr. Strong brings out some 
hitherto neglected facts on the nature of inter- 
ests. One of these is the basic similarity of 
interests regardless of occupation, age, or sex; 
group differences are important, but limited in 
range. Another is the importance of occupa- 
tional or socio-economic level, and the possi- 
bility of measuring vocational interests at the 
lower levels by taking the level into account. 
The author is to be congratulated on the thor- 
oughness with which he has studied his test 
and the factors which it measures, and on the 
clarity and detail in which he presents his find- 
ings. It is a sign of maturity in vocational test- 
ing that such books are beginning to appear. 
Prison World 8:35 S-O ’46. * Has consid- 
erable and illuminating sections on abilities and 
achievement as related to interests, differenti- 
ation of occupations, guidance problems and 
procedures, differentiation of superior and in- 
ferior members of a group, skilled trades, etc. 
Psychol B 42:359-70 Je '45. Domald E. 
Super. * Strong has brought together the re- 
sults of nearly twenty years of work with his 
Vocational Interest Blank. Although it deals 
largely and at length with his work on his test, 
the book also brings together the results of 
other studies using his interest inventory and 
of work with the inventories to which his is 
related. Other instruments for the inventorying 
of interests are considered when work with 
them throws additional light on problems which 
Strong considers pertinent, but not otherwise: 
for example, the Kuder Preference Record is 
barely mentioned, the Cleeton Inventory not at 
all. Despite recent important work with infor- 
mation tests of interest, interest tests, as con- 
trasted with inventories, are dismissed in less 
than three pages in which the approaches of 
Burtt, Flanagan, Super, and Wyman are briefly 
discussed, and the rather different approaches 
of R. B. Cattell and E. B. Greene are not men- 
tioned. Even some studies with Strong's in- 
ventory are excluded, as in the case of the re- 
viewer's investigation of the relationships of 
vocational and avocational interests, when they 
do not seem to contribute directly to the main 
currents of the book. * Strong has wisely de- 
voted the first hundred pages to a review of 
the development, nature, and use of the test, 
and has reserved for the last six hundred pages 
the detailed discussion of matters covered more 
generally at first. Thus the clinician or con- 
sultant who uses the test in his regular work 
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and who is interested only in understanding 
the inventory as a diagnostic or predictive in- 
strument can, with occasional reference to later 
sections, quickly read the relevant part of the 
book: the style is clear and concise, the lan- 
guage not too technical and the presentation 
suitable for those who have not specialized in 
measurement. * when, on page 414, Strong 
rejects the statistical combination of test scores, 
he is guilty of a basic inconsistency. If his in- 
ventory technique, consisting of a statistical 
summation of interests in specific occupations 
and activities, provides useful information not 
made available by a clinical study of responses 
to the specific items in the inventory, why 
should not a statistical summation of scores on 
the interest scales provide data as useful as the 
clinical study of those test scores? This sugges- 
tion of the reviewer's should not be taken as a 
suggestion that the clinical study of the scale 
scores is not valuable: it is, and so is the study 
of responses to specific items in the inventory, 
as Shellow has reported. Knowing the relative 
physicist and physician scores of an embryo 
psychologist, for example, could be useful in 
deciding between emphases in experimental and 
clinical psychology. But a statistical combina- 
tion of these test scores could, as Strong's own 
arguments for the consideration of primary, 
secondary, and tertiary interests imply (432- 
448), provide an even more valid interest score 
for any given occupation. As Strong points out, 
it is dangerous to recommend an occupation on 
the basis of a high score in that occupation 
alone. This score, he states, should be supported 
by high scores in related occupations. Why not 
indicate this support by a combination of scores 
on related occupations? * In spite of the few 
defects discussed....Strong’s work shows a re- 
markable freedom from statistical weaknesses, 
especially in view of the long period over which 
it was spread and the variety and complexity 
of the problems studied. * Strong and his pub- 
lishers, to whom applied psychologists are be- 
coming increasingly indebted, have done an ex- 
cellent job of preparing the manuscript and 
putting it into print. The table of contents is 
well organized and very detailed, the topic 
headings are numerous and meaningful, the 
index adequate and accurate. In view of the 
vast amount of material that has been correctly 
reproduced in this lengthy volume, it seems 
like the act of a petty critic to point out a few 
errors that have been noted. * The volume is 
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a gold mine for students in search of thesis 
topics. But one cannot hope to deal adequately 
with a classic in a single review, especially 
when the subject of the book is twenty years 
of research. If some idea of the importance of 
the work has been conveyed, if interest has 
been fostered in a few problems, and if the 
reader has been stimulated to a thoughtful 
study of Strong’s book, this review has accom- 
plished its purpose. 

Psychometrika 9:145-6 Je ’44. G. Frederic 
Kuder. The Strong Vocational Interest Blank 
has made important history in the field of meas- 
urement. Since the appearance of the first form 
in 1927, the test has probably been the subject 
of more investigation and research than any 
other single test. And this research has been 
largely conducted or inspired by Strong him- 
self. * the field of the measurement of interests 
through use of the Strong Vocational Interest 
Blank is covered systematically, completely, and 
in an interesting style. The data are reported 
in an impartial vein; the con’s as well as the 
pro’s bearing upon the various procedures used 
by Strong are reported in fine good humor. The 
reader may not always agree with the conclu- 
sions reached by Strong, but in any case he 
must recognize that the book is a model of sci- 
entific reporting. No one can consider himself 
abreast of the interest field if he is not familiar 
with the contents of this book. One fundamen- 
tal point, perhaps, deserves particular mention. 
Thurstone’s application of factor analysis to 
Strong’s interest scales in 1931 pointed to the 
possibility of a new technique in obtaining 
scores for various occupations. Since Thurstone 
demonstrated that the variance of the occupa- 
tional scores could be almost entirely accounted 
for by four factors, it followed logically that 
formulas could be developed by multiple cor- 
relation for obtaining occupational scores from 
measures of these four factors or from any set 
of a few selected measures which would ac- 
count for the variance of all the scales. The 
actual scoring could then be reduced to the 
fundamental measures used. Later research by 
Strong, using more scales, increased the num- 
ber of factors involved, but the implication of 
the findings is the same. It has been a matter 
of surprise to some people in the field that 
Strong has not followed up this lead. Strong s 
reasons for not doing so, as given in his book, 
are not convincing. The first and less impor- 


tant reason is that "occupational scores can be 
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obtained by machine scoring with no less trouble 
than with calculations involving multiple cor- 
relations." I am inclined to doubt this asser- 
tion. Strong is comparing a streamlined scoring 
method done on a mass production basis with 
a straight computing machine job of calculat- 
ing scores using regression weights. There is 
no reason why the application of a standard 
series of weights can not also be streamlined 
and done on a mass production basis. Strong's 
other reason is by far the more important, since 
it concerns validity. Strong cites Dwyer's data 
to the effect that "the multiple correlation 
scores will correlate ‘80 or better’ in only the 
majority of cases with the results obtained by 
use of occupational scales.” At this point we 
should stop to consider the purpose of the 
scales. If one particular scoring method is taken 
as the criterion, then we are doomed to labor 
in vain to develop a more valid method, for we 
can not hope to improve on what is implicitly 
assumed to be perfect to begin with. A more 
logical basis of judgment appears to be to de- 
termine which method is more successful in 
differentiating those in an occupation from 
men-in-general. It has not been demonstrated 
that such differentiation is accomplished better 
by scoring items than by assigning multiple 
regression weights * to a few selected measures 
which account for most of the interest variance 
in the occupations studied. Scores obtained by 
the latter procedure should be more valid than 
those obtained by the item-scoring method. Use 
of the item-scoring method carried no assurance 
that the optimal weight is given to each factor, 
since the number of items representing each 
factor is not controlled and item intercorrela- 
tions are not allowed for in determining item 
weights. Theoretically, it is impossible for an 
occupational scale obtained by Strong’s method 
to differentiate better than weighted selected 
measures, if the measures used account for all 
the variance of the occupational scales. There 
is no reason to believe that the theoretical indi 
cation will not be borne out by the evidence, 
when obtained. 

Pub Personnel R 5:114-5 Ap 44. Cade 
Shartle. * a valuable piece of work which 
should be read, at least in part, by persons in 
the personnel and vocational guidance field, and 
should be read in detail by those who have 

* It should be noted that these regression weights must be 
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problems in regard to the measurement of in- 
terest. 
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Vocational Interest Schedule. Grades 13-16; 1935— 
$ IBM; 1 form; replaced in 1947 by the Thurstone 
interest Schedule (see 646) ; nontimed (10-30) min- 
utes; L. L. Thurstone; Psychological Corporation. 
REFERENCES 

1-4. See 40:1683. » 

5. Lentz, Tueo. F. “The Opinionnaire Technique in Voca- 
tional Guidance.” Abstract. Psychol B 37:451 Jl '4o. * (PA 
14:5712, title only) 5 

6. Reen, Homer B, “The Relation of Bernreuter Personality 
and Thurstone Vocational Interest Scores to Each Other and 
to Scholastic and Mechanical Achievement.” Abstract. Psychol 
B 37:449-50 Jl ’40. * (PA 1438716, title only) 

7 Wicker REDERIC. “The Interrelationships of Some 
General and Specific Preferences.” J Social Psychol 11:275- 
302 My "40. * (PA 14:5149) » 

8. Gorpon, Hans [ol AND HERKNESS, Wa ter W., Ja. “Pupils 
Appraise Vocational Interest Blanks.” Occupations 20:100-2 N 
"41, * (PA 16:757) 

9. Reen, Homer B. “The Place of the Bernreuter Personal 
ity, SR Mechanical Aptitude, and Thurstone Vocational 
Interest Tests in College itrance Tests.” J Appl Psychol 
choy O "41. * Abstract Psychol B 38:711 O "41. * (PA 
16:120; 

10. Briccs, Lrsrre. "The Thurstone Vocational Interest 
Schedule and Students' Actual Vocational Choices." Trans 
Kans Acad Sci 45:272-3 '42. * (PA 12 3708) 

11. Gordon, Hans d, AND Herkness, Warrer W., Jr. “Do 
Vocational Interest Questionnaires Yield Consistent Results?” 
Occupations 20:424-9 Mr '42. * (PA 16:2059) 

12. JENSEN, Mitton B., AND RorrEn, JuLIAN D. “The Value 
of Thirteen Psychological Tests in Officer Candidate Screen- 
ing." J Appl Psychol 31:312-22 Je '47. * (PA 21:4107) 

Donald E. Super, Associate Professor of 
Education, Teachers College, Columbia Uni- 
versity, New York, New York. This interest 
inventory has been reprinted in a new form to 
make possible the reproduction of the blank as 
a part of an IBM answer sheet. This makes 
it more economical in every respect than the 
original four-page blank with large stencils. 
Unfortunately this change in format represents 
the sum total of progress that has taken place 
in the development of Thurstone's Vocational 
Interest Schedule since the original work was 
done. At that time it was an important contri- 
bution to the field because it illustrated and 
stimulated the factorial approach to the study 
and measurement of vocational interests; as an 
experimental instrument for use in research it 
had a place in the vocational psychologist's kit. 

During the 12 years that have elapsed since 
the first publication of the schedule, little re- 
search has been done with it. Those who were 
challenged by Thurstone's factorial approach 
apparently recognized that a reliable measure 
of interest would require far more items than 
the 72 contained in this one, and that the items 
would need to be less transparent than the oc- 
cupational titles used exclusively by Thurstone. 
The scores are not reliable: changing two 
“likes” concerning which the writer was doubt- 

ful because of stereotyping to "dislikes" (his 


more considered reaction) changed his physical 
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science interest score from .g1 (considerable 
interest) to .55 (slight interest). The writer 
has found no published studies which throw ` 
light on the validity of Thurstone's schedule in 
vocational guidance or selection ; it is not likely 
that it would be found valid, when scoring keys ~ 
are based on as few as nine items and the reli- - 
abilities are as low as that number of items, 
implies. 

The manual reflects this lack of occupational 
research with and consequent lack of knowledge - 
about the test. It does not mention validity, It - 
does not even mention reliability. Interpreta- 
tion is discussed only in terms of factor pro- < 
jections: With keys based on from nine to 
twenty occupational items, this is tantamount 
to stating that people who*say they would like 
to be physicians should consider medicine as a 
possible occupation. 

Strong’s blank, better understood now partly 
as a result of Thurstone's work, and Kuder's, 
are more reliable and less transparent. There is - 
no possibility that so brief an instrument as this — 
can become as useful as these two longer in- 
ventories, even if more were known about it. 
The publishers have therefore rendered a serv- 
ice to the profession by replacing this blank with 
a completely revised 1947 edition, the Thurstone’ 
Interest Schedule (see 646), which was re- - 
ceived too late for review. 


For reviews by Harold D. Carter, J. B. 
Miner, and N. W. Morton, see 38:1180 and 
40:1683. 

[ 654 ] 
Vocational Interest Test for College Women. - 
1930; 1 form; $10 per 50; 25¢ per sample set; non- - 
timed (30-40) minutes; Kathryn McHale; American - 
Association of University Women. 


REFERENCES 
_ 1. McHarz, Katuryn. “An Experimental Study of the Voca- 
tional Interests of a Liberal Arts College Group.” J Appl Psy- 
chol SEATS Je'24. * 

2. McHate, Katuryn. “An Information Test of Interests.” 
Psychol Clin 19:53-8 Ap '3o. * (PA 4:5135) 

Frances Oralind Triggs, Educational Records 
Bureau, New York, New York. According to 
the author, the purpose of this information in- - 
terest inventory, copyrighted in 1930, was to — 
"formulate a working idea to guide an objective 
study of vocational interests... ; the idea be- 
ing, that if one made an information test based .— 
on interests, perhaps one would arrive at some- 
thing more tangible by way of a guidance tool | 
(1)." The test consists of 246 multiple-choice - 
items divided into seven groups: law, business, — 
medicine and science, homemaking, costuming 
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and sewing, agriculture and horticulture, and 
teaching. 

This test is now only of historical interest as 
one type of instrument—an information test— 
for the measurement of vocational interests. 
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Vocational Inventory. pu. 8-16 and adults; 1940; 
1 form; $2.50 per 25; $1.25 per 25 individual analysis 
reports; 25¢ per specimen set, postpaid; nontimed 
(150) minutes; Curtis G. Gentry; Educational Test 
Bureau. 
REFERENCES 

1. Gentay, Curtis G. "Knoxville's Seven-Point Vocational 
Guidance eee Sch Executive 59:10-2 Ap '4o. * (PA 14: 

2. Gorpon, Hans C., AND HERKNESS, Warren W., Jr. “Pupils 
Appraise ur Interest Blanks.” Occupations 20:100-2 N 

3. FRoEnLicR, CLrrronp. “A Study of the Gentry Vocational 
Inventory.” Ed & Psychol Meas 2:75-82 Ja '42. * (PA 16:2449) 

.4. Gorvon, HANS C., AND HERKNESS, Watter W., Jr. "Do 
vicational Interest Ovestionnaires Yield Consistent Results?” 
Occupations 20:424-9 Mr '42. * (PA 16:2059) 

Edward S. Bordin, Director, Student Coun- 
seling Center, and Associate Professor of Psy- 
chology, The State College of Washington, 
Pullman, Washington. Among current voca- 
tional interest inventories this one is unique in 
that it includes information as well as prefer- 
ence items in its total of over four hundred 
questions. Considering the fact that the former 
type of item was advocated, tried, and aban- 
doned in the developmental stages of interest 
testing and that the author of this inventory, 
unlike authors of other inventories, claims that 
his test measures aptitudes as well as interests, 
one would wish to see the results of extensive 
research as a basis for judging the effectiveness 
of the instrument, The manual refers to a fol- 
low-up study of 1,000 Knoxville high school 
graduates which appears to have been the pri- 
mary basis for the validation and standardiza- 
tion of the inventory. However, only fragmen- 
tary reports of the study are presented, and no 
references are given to sources for more com- 
plete reports. Considering the sensational claims 
for validity based on this study, it is a distinct 
disservice to the field of vocational interest 
testing not to make the complete report available 
so that critical verification would be possible. 

In addition to an incomplete presentation of 
validity studies not expected in a test manual, 
the manual for the Inventory makes the glaring 
omission of normative tables. There are data 
on the distribution of highest scores for the 
1,000 high school students, but nowhere can 
one find data on what were the average scores 
for this group on each scale and in what way 
the scores on each scale were distributed. Nat- 


urally, there are no tables for converting the 
raw scores to any form of derived scores. 
Without such information a test cannot be 
used. 

In terms of format, the inventory is more 
cumbersome than most to score because it not 
only fails to provide for use of a machine scor- 
able answer sheet but actually calls for comple- 
tion or other types of write-in responses. Fur- 
ther, since the answers are written on the test 
booklet, it necessarily increases the cost of 
testing. 

An Individual Analysis Report booklet has 
been prepared to accompany the inventory. It 
has many features which, in the hands of well- 
trained teacher-counselors, probably prove use- 
ful for aiding high school students to become 
more completely oriented in vocational terms. 
However, many will deplore the advocated 
practice of turning over test scores of this sort 
to the student to be kept in his permanent rec- 
ords and to be used or misused without super- 
vision at some future date. Unfortunately, this 
is a practice which is all too common. 

This reviewer finds by far the most serious 
offense in the publisher’s statement on the 
folder enclosing the test materials. It follows: 
“Extensive knowledge of the guidance field is 
not necessary for deriving the benefits from 
and making use of the Vocational Inventory. 
The manual and accompanying accessories give 
necessary information for the proper use of 
the Inventory and the interpretation of the 
results.” Misuse of tests and general counsel- 
ing malpractice can be the only results of this 
type of test merchandising. 


Harold D. Carter, Associate Professor of 
Education, University of California, Berkeley, 
California. This inventory is intended to sup- 
plement the usual school records as a basis for 
educational guidance in high school and col- 
lege. It is also intended for general personal 
guidance and job placement and for assisting 
employers in personnel work. 

The inventory provides an indication of one's 
general field of vocational interest and of likes 
and dislikes for activities involved in the occu- 
pations. The student's strengths and weak- 
nesses are classified with reference to the fol- 
lowing eight major occupational groups: social 
service, literary work, business, law and gov- 
ernment, art, mechanical designing, mechanical 
construction, and science. 
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As evidence of validity, the manual presents 
results of a follow-up study of 1,000 high school 
graduates, showing that 84 per cent were at 
work in the occupational groups in which they 
rated highest on the inventory. They were 
found to be well adjusted vocationally, as 
judged by satisfaction expressed by employers 
and employees. The manual also presents tabled 
data showing the tendency of persons to suc- 
ceed in courses taken in school when those 
courses were in agreement with measured in- 
terests. Although these data do not provide a 
clear quantitative indication of the degree of 
validity, they do show that the test has some 
validity. 

The reliability of the test is given in terms 
of per cent change of returns between grades 
9 and 12 for 412 students who were retested. 
The data indicate a marked tendency for voca- 
tional interests to remain stable, but again there 
is a lack of the usual evidence in correlational 
form. One does not know how reliable the test 
is in terms of the customary criteria. 

For normative data, the manual presents the 
score frequency distributions for each of the 
eight vocational fields for 1,000 high school 
seniors. In addition, sample tabled data present 
the basis for profiles of interests of 16 students. 
The manual explains the use of the accessory 
materials which come with the test. It also 
explains how projects in homeroom guidance 
can be built around the use of the inventory. 

The test contains a mixture of interest items 
and information items. It is practically self- 
administering and can easily be used in test- 
ing large groups. It appears likely to be useful 
for self-exploration of interests as a basis for 
occupational adjustment. Probably its other 
uses will be markedly restricted because about 
two and one half hours are required for taking 
the test and because the scoring is tedious. In 
view of the length of the test, it is probably 
highly reliable. 


Donald E. Super, Associate Professor of 
Education, Teachers College, Columbia Uni- 
versity, New York, New York. This inventory 
yields two types of scores, one for a number 
of occupational fields such as social service and 
business, and one for personality in terms of 
introversion-extraversion. This would lead one 
to expect a Strong- or Kuder-type inventory of 
vocational interests and a Bernreuter-type in- 
ventory of personality, something like the Clee- 


ton before the personality section was found 
useless and dropped. The expectation is war- 
ranted in the case of the personality section, at 
least for the first 25 items, although the last 24 
are a mixture of self-ratings of abilities which 
might be classified as introvert or extrovert and 
of biographical data such as the frequency of 
parental disciplining (“seldom corrected” by 
parents is weighed for introversion, on the 
basis of no available evidence). In the former 
case, however, the expectation is not warranted, 
Instead, one finds items of the Strong and 
Kuder varieties, mixed in with items which test 
vocational vocabulary and information, self 
ratings of skill and knowledge in school sub- 
jects, and biographical questions. There are 
even some questions which may have been in- 
tended as information questions but which are 
more probably attitude items, for instance, Item 
375, "Recently a factory owner retired and 
turned over his prosperous business to his 
employees. Was this wise? (Yes or No).” The 
answer is “yes,” with credit for social service. _ 
This may be justifiable, in the light of evidence 1 
from other work, but there is no evidence in _ 
the manual or in published studies of this in» — 
ventory to indicate that this is the answer given 
by persons engaged in social service occupa- - 
tions or even that people who answer “yes” to 
this item tend to make high scores on the other 
social service items (no internal consistency or 
item validation data are even referred to in the 
manual). 

The use of a variety of types of items is due 
to the fact that Gentry has attempted to make 
his "inventory" not a test or inventory in the 
usual sense, but rather a complete inventory of 
the individual, including his preferences, his 
manifest interests (shown in specialized knowl- 
edge), aptitudes (self-rated), and personality 
traits. He has, in fact, attempted to make a vo- 
cational guidance test comparable to the apti- 
tude tests used in selecting students for pro- 
fessional schools, but covering all the major 
fields. This might be an indication that Gentry 
has shown a high degree of sophistication in 
test construction, basing his work on the most 
promising developments in the fields of interest 
inventories, general information tests, special- 
ized vocabulary tests, and personality inven- 
tories. If this were indeed the case, the instru- 
ment is one which should be widely studied 
and, when well understood, put to use in voca- 
tional counseling. 
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Unfortunately, however, Gentry has not mani- 
fested a high level of sophistication in test 
construction and has not based his work on 
appropriate advances made by others. The 
Vocational Inventory bears the earmarks of 
amateurishness in many ways, some of minor 
importance and some quite crucial. Under the 
former head might be cited poor mechanics, 
illustrated by having the examinee actually 
write yes or no in the test booklet instead of 
marking a multiple-choice answer and by a 
scoring booklet which has to be matched with 
the test booklet so that both may be read in 
scoring instead of using a machine or stencil- 
scored answer sheet. Such a publication in 1940 
reflects both on author and on publisher. 
Among the major defects in test construction 
is the lack of evidence of item analyses for 
either internal consistency or external valid- 
ity, such as Kuder (internal consistency) and 
Strong (external validity) used in the con- 
struction of their interest inventories and such 
as work in personality and in aptitudes has 
repeatedly demonstrated is essential for validity. 
If this were in fact a soundly built aptitude 
battery, it would not have a separate score for 
the personality section: instead, these items 
would be weighted for various occupational 
fields, just as the preference, information, and 
other items are, except that there would be 
statistical evidence from item analyses to jus- 
tify the weights, The grouping of items at some 
points in such a way as to make their relation- 
ship obvious to the examinee and thus spuri- 
ously to increase reliability and decrease valid- 
ity through halo effect, as demonstrated in 
studies of the organization of test items, is an- 
other major defect. In a professionally ethical 
worker, such lack of analysis of a test is an 
index of technical naiveté. 

In the light of the above and other similar 
defects, it is not surprising that Froehlich, in 
one of the few published studies of the Gentry 
inventory, concluded that "this test should be 
more carefully standardized and evaluated 
before it is used in a counseling situation" (3). 
This reviewer is inclined to wonder why Froeh- 
lich thought it necessary to make his study in 
order to demonstrate this. When such serious 
defects are so patent, it is certain that other 
less obvious defects exist. The time and effort 
of test users is then better spent in further 
validating other more promising instruments, 
and the energies of test constructors are better 
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devoted to developing new instruments based 
on more sophisticated evaluations of current 
research. Gentry's basic idea is sound, but its 
execution is naive. This reviewer therefore con- 
siders the Gentry Vocational Inventory an in- 
strument which should be withdrawn from cir- 
culation until adequate analysis of the test has 
been made and revision and restandardization 
have been completed. The norms based on 
1,190 junior and 960 senior high school boys 
and girls which the publisher promises for the 
immediate future are useless without such an 
analysis and standardization. 


Occupations 21:266-7 N '42. Arthur H. 
Brayfield. A thirty-page booklet containing two 
sections, a Vocational Inventory and a Per- 
sonality Inventory. * The Personality Inven- 
tory’ purports to measure introversion-extro- 
version, but since there are other inventories 
for the same purpose which seem to. have been 
better constructed and standardized, the Per- 
sonality section will not be reviewed here. * 
appears to be an attempt to construct a single 
instrument which will be an over-all interest, 
aptitude, and information test. The author does 
not make a clear distinction among these three 
areas in his explanatory statements. The re- 
viewer found 64 per cent of the items to be 
questions of information. * The author does 
not make clear what he is attempting to meas- 
ure other than the individual's "general voca- 
tional picture.” Insufficient information is given 
regarding the method of construction and 
standardization. The information whith is given 
casts doubt upon the adequacy of the methods 
and techniques used. The attempts to get a 
measure of validity are inadequate. The reviewer 
is in complete sympathy with the author’s at- 
tempt to provide an instrument useful in mass 
guidance. However, this appears to be an over- 
simplification. The evidence points to the de- 
sirability of adhering to certain technical con- 
siderations in test construction in order that 
the instrument may be meaningful as well as 
practical. The inventory might have some value 
as an interest measure. Obviously it is not the 
“all-around test" it attempts to be. 


MANUAL DEXTERITY 


[ 656 ] ey 
Benge Han-Dexterity Test. Adults; 1043; indi- 
vidual; $5 per testing outfit; worklimit ( 10-20) min 
utes; Eugene J. Benge; Management Service Cs 
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C. H. Lawshe, Jr., Professor of Psychology, 
Purdue University, Lafayette, Indiana. This 
test is designed as a rough measure of the speed 
of eye, hand, and arm movements. It purports 
mainly to eliminate, for industrial positions in- 
volving rapid hand movements, those employees 
who have poor or slow finger and hand dexter- 
ity. Jobs involving inspection, assembly, and 
repetitive machine operations are indicated. 

The test is of the work-limit variety, con- 
sisting of a board with ten pegs, upon each of 
which the applicant places ten disks, one at a 
time. Four trials are taken—right hand, left 
hand, both hands simultaneously, and both 
hands in alternation—total time for the four 
trials representing the total score on the test. 
Percentile norms for the complete test, based 
apparently upon the performance of semiauto- 
matic machine operators, are provided. 

No reliability and limited validity data are 
available. The test was tried on 173 employees 
of the Sonoco Products Company of Hartsville, 
S. C., manufacturers of paper specialties for the 
textile industry. Of 73 rated satisfactory, 78 
per cent made an average score or better on 
the test. Of the 94 unsatisfactory workers, 72 
per cent failed to attain the average score. The 
test was purportedly used successfully in com- 
bination with vision and intelligence tests in 
identifying satisfactory present and potential 
employees. 

Administration is simple, r5 minutes being 
required for each test together with the scoring. 
Directions are, however, meager. Such details 
as the exact method of the alternate hands trial, 
what to do about dropped disks, and the vol- 
untary or involuntary moving of the pegboard 
by the applicant are not sufficiently standard- 
ized. The necessary counting of the disks is a 
distraction to the applicant that might be elimi- 
nated by designating the upper limit for ten 
disks on each peg. The norms provided fail, of 
course, to discriminate between handedness. 
There is no evidence in the accompanying liter- 
ature that the present length of the component 
parts of the test are optimum for any industrial 
purpose. 

While the limited amount of validity data 
supplied by the publisher seems promising, 
users of the test should proceed with caution 
and most certainly should prepare standard 

directions so as not to introduce undesirable 
variabilities into the administration procedure. 


Users should also inquire into the reliability of 
the test inasmuch as the publisher presents no 
such information. 


Joseph E. Moore, Head, Department of Psy- 
chology, Georgia School of Technology, At- 
lanta, Georgia. “This is a short test of eye-hand 
coordination," to quote the test constructor. 
The test consists of 10 wooden pegs set into 
a board and a package of 100 fiber disks. The 
subject goes through the test four times, plac- 
ing 10 fiber disks one at a time over each peg. 
No standard size box is provided to hold the 
washers or disks. The author suggests a tray 
approximately one inch deep by eight inches 
square. The test is to be taken with the sub- 
ject sitting down. The board is to be placed at 
a height of 28 inches from the floor—no higher 
than 30 inches is suggested. 

The test proper consists of four different 
operations. The subject uses his left hand, his 
right hand, then both hands simultaneously, 
and then both alternately. No penalty is indi- 
cated for errors in counting. 

Decile norms are provided on the test. Nei- 
ther the number nor the sex of the employees 
on whom the norms were secured is given. 
There is no reliability given for this test. The 
validity of the test also seems to have been 
overlooked. A reprint, not a part of the manual, 
tells how the test was.applied to 173 employees 
in a paper manufacturing plant. These em- 
ployees were previously rated either satisfac- 
tory or unsatisfactory on a formal merit rating 
system. The test is reported to have selected 
78 per cent of the satisfactory employees, but 
28 per cent of the unsatisfactory workers 
scored above average and 22 per cent of the 
satisfactory employees dropped below average 
on the test. The author has this to say about 
his test, “The HanDexterity Test has evolved 
over seven or eight years. We originally used 
a larger disk and a larger board than is now 
the case. Hence, our original standardizations, 
including validity coefficient, are not pertinent 
to the present board.” 

The data given by the testmaker are far 
from adequate on validity, reliability, and test 
norms. The information on how the norms 
were gathered and what the norms mean is 
entirely lacking. A purchaser of this test has 
little to go on to tell him whether or not he is 
getting sound value for his dollars, 
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[ 657 
*Benge Two Hand tien. Test. Adults; 
1943; individual; $7.50 per testing outfit; 6(15) min- 
utes; Eugene J. Benge; Management Service Co. 

Milton L. Blum, Assistant Professor of Psy- 
chology, The City College of New York, New 
York, New York. This is the eleventh in 
a series of tests devised by the Management 
Service Company. The advertising material of 
the firm announces that the test is to be used 
for selecting applicants with good eye-hand-arm 
coordination. The applicant is to plunge a rod 
held in each hand into two holes of one diam- 
eter and then into two holes of smaller diam- 
eter. A mechanical counter automatically totals 
the correct thrusts. 

Checking the psychological literature for the 
past seven years reveals no reference to the 
Benge Two Hand Coordination Test. The au- 
thor in private correspondence states: “All 
that we have on the tests so far are percentiles. 
Administration has not been sufficiently stand- 
ardized so that we get high reliability from one 
situation to the other. Several validity studies 
have looked promising but we also get such 
high correlations between this test and our 
“handexterity” test that I am coming to be- 
lieve that the simple handexterity test is suffi- 
cient in most situations.” 

Since one does not normally expect a high 
correlation between a test of finger dexterity 
and an eye-hand coordination test, it would 
appear that either both tests are measuring 
finger dexterity, or both tests are measuring 
eye-hand coordination. This is not the first in- 
stance of an author naming a test on the basis 
of supposed validity and then finding that the 
theoretical basis for the test is not substanti- 
ated through validation. 

From the author's statement it can be seen 
that the test is relatively unreliable and has 
questionable validity. The claims of the author 
that it can select machine operators and me- 
chanical apprentices have not been substanti- 
ated. This test should not be used as an aid to 
select employees even in jobs requiring two- 
hand-eye-arm coordination. i 

It surely is unpleasant for the reviewer to 
write this type of review but if psychological 
testing in industry is not to receive 2, ‘black 
eye” it would be better that npe Si 
ployers and potential test users know the ax 
It may well be that further work on this tes 
will result in improvement and modification 1n 
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design and that the test will ultimately be ac- 
cepted as a standardized test. At present, it is 
not. In a brochure the claim is made that 
“Benge tests . . . provide sharp instruments for 
eliminating unfit candidates." This test is not 


"sharp." 
[ 658 ] 


xCrawford Spatial Relations Test. Grades 9-16; 
1942 ; individual ; 1 form; no longer available; work- 
limit (3-12) minutes; John Crawford and Dorothea 
Crawford ; formerly distributed by Psychological Cor- 
poration. 

REFERENCES 

1. Crawrorp, Jous Epmunp. "A Test for Tridimensional 
Structural Visualization: A New Test for Mechanical Insight 
Designed Primarily to Measure Ability or Aptitude in Draft- 
ing." J Appl Psychol 24:482:92 AG "40. (PA 15:82) 

2. Crawrorn, Joux Epmunv. Measurement of Some Factors 
Upon Which Is Based Achievement in Elementary Machine De- 
tail Drafting. Unpublished doctor's thesis, University of Pitts: 
burgh, 1941. (Abstracts of Theses. . » 1941, 1942, [us ssp 

LE DENNETT, George K., AND CmurxsuANK, RurH, M. A 
Summary of Manual and Mechanical Ability Tests, Preliminary 
Form, i 27-8. New York: Psychological Corporation, 1942. 
Pp. v, 80. Paper, lith . $0.70. * 

4. Crawronp, Joux ÉoMUND. "Spatial Perteption Tests for 
Dee Drafting Aptitude." Ind Arts & Voc Ed 31:102 

a 42. 

Ps Esres, Sranrey G. “A Study of Five Tests of ‘Spatial’ 
Ability." J Psychol 13:265-71 Ap'42. * (PA 16:3771 

6. Jaconsry, Eroon E. "An Evaluation of Certain Tents in 
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Meas 3:259-67 au '43. * (PA 18:2537) 3 
WAA cena a nieto ue 
Ap 44. * (PA 18:2936) 

William R. Grove, Psychologist and Director 
of Child Study Service, Phoenix. Elementary 
Schools, Phoenix, Arisona; formerly Professor 
of Psychology and Director of the Division of 
Psychological Services, The University of Pitts- 
burgh. In the words of the test authors, “The 
scheme of this test is a large tridimensional puz- 
zle so designed that the manipulative factor is 
small compared to the insight or structural vis- 
valization factor (1).” The test is indeed a clev- 
erly designed construction puzzle consisting of 
nine pieces. It fits compactly into a neat case 
which is approximately ten by ten by two inches, 
There are excellent photographs of the test 
materials in the August 1940 issue of the Journal 
of Applied Psychology (1). 

The original test was constructed of poplar 
and painted brown. Later, the Crawfords had 
their test cast in aluminum. It was this equip- 
ment which is described in Bennett and Cruik- 
shank's A Summary of Manual and. Mechanical 
Ability Tests (3). The castings are well made 
and fit together neatly, but with sufficient tol- 
erance to make them easy to manipulate, The 
test materials currently available from the Psy- 
chological Corporation are somewhat less than 
satisfactory. These are made of soft wood which 
has been cheaply stained, permitting the grain 
to show through. The pieces have been care- 
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lessly glued and would tend to come apart with 
use. These are matters that can be readily 
remedied by more careful gluing and by pro- 
viding a more adequate paint job which should 
completely cover the grain of the wood. The 
description of the test materials in the 1946 
catalog of The Psychological Corporation is 
not accurate. à 

When assembled, the test forms a flat round 
disk a little more than six inches in diameter 
and extending three-quarters of an inch above 
the base plate. In appearance it closely resem- 
bles a much older test formerly sold by the 
C. H. Stoelting Company. It is probable that 
the basic idea for this test was suggested by the 
older test. The test referred to was too complex 
to be of much practical value and never came 
into very general use. 

That tests of this general type have a basic 
validity in measuring one aspect of mechanical 
ability has been demonstrated repeatedly. The 
Crawfords' data show that draftsmen and 
machinists exceed other workers in their per- 
formance on this test to the extent represented 
by mean scores which fall respectively at the 
93rd and 7oth percentiles of the Crawford 
norms. 

The norms provided for the test are practi- 
cally meaningless, however. They are based 
upon the scores of 427 men "employed in in- 
dustrial and technical jobs ranging from me- 
chanical engineers to handymen in machine 
shops, including designers, detailers, lay-out 
men, machinists, general mechanics, and other 
skilled and semi-skilled workers." Such norms 
are probably a conglomeration of accumulated 
records which may or may not approximate in- 
dustrial norms for a particular industry or group 
of related industries. Normative data should be 
described adequately enough for other investi- 
gators to duplicate them by testing a compar- 
able sample. Otherwise, they are of very limited 
usefulness. 

In spite of some extremely incredible coeffi- 
cients of correlation reported by the Crawfords, 
it is the opinion of this reviewer that this test 
will be found quite deficient in reliability. It is 
my considered judgment that a test of this type 
consisting of nine pieces and administered on 
a single-trial basis in an average of five minutes 
or less cannot constitute a sufficient sample to 
be used reliably for individual diagnosis. 

In summary, I wish again to quote the test 
authors in a statement which seems to me to 


= 


be entirely accurate and well formulated : “Little 


“weight should be given any score on this test 


in the absence of confirming evidence from 
other sources; but either a very high or a very 
low score is probably a valid basis upon which 
to seek such confirming data” (1, italics mine). 


[ 659 ] 
*Hand-Tool Dexterity Test. Adults; 1946; indi- 
vidual; $16.50 per testing outfit, postpaid ; worklimit 
(10-20) minutes; George K. Bennett; Psychological 
Corporation. 
REFERENCES 


1. Bennett, Grorce K., anp Fear, RicuaxD A. “Mechanical 
Comprehension and Dexterity.” Personnel J 22:127 My '43. * 
(PA 17:2843) 

2. BENNETT, Grorce K., anp WESMAN, ALrxANDER G. “In: 
dustrial Test Norms for a Southern Plant Population.” J App 
Psychol 31:241-6 Je '47. * (PA 21:4095) 


C. H. Lawshe, Jr., Professor of Psychol- 
ogy, Purdwe University, Lafayette, Indiana. 
Here is a test that seeks to combine meas- 
urements of achievement based on past experi- 
ence in handling tools and mechanical apti- 
tude. A wooden frame comprising two uprights 
attached to a horizontal baseboard is used. 
Twelve bolt, washer, and nut units of varying 
sizes are fastened securely through holes bored 
in the right upright. The subject is to transfer 
the units by means of tools (crescent wrench, 
end-wrenches or screwdriver) from one upright 
to the holes in the opposite upright. Care has 
been taken to standardize the procedure into 
an efficient motion pattern. Score is the amount 
of time taken by the subject to remove the nuts 
and bolts from the right upright and to fasten 
them on the left. Total time required for one 
administration. does not normally exceed 15 
minutes. Directions in the manual are precise 
and simple. Presumably the test could be ad- 
ministered to small groups, although it is pri- 
marily an individual test. 

Percentile norms based on the performance 
of 1,123 male factory workers of the Union Bag 
and Paper Company, Savannah, Georgia, are 
available. A reliability coefficient of -9I has been 
computed by the retest method. A validity co- 
efficient of .46 is reported for scores made by 
male machine tool operators and one of . 51 for 
female aircraft construction riveters with fore- 
men's ratings as a criterion. Again using the 
machine tool operators, a multiple correlation 
coefficient of .67 was obtained for the Hand- 
Tool Dexterity Test, the Bennett Test of Me- 
chanical Comprehension, Form AA, and fore- 
men's ratings. All the above test Scores were 
obtained before hiring and the foremen's rat- 


687 ] TESTS & REVIEWS: VOCATIONS—MANUAL DEXTERITY 


660 


ings were made after the subjects had had four- 


months or more experience on the job. The 
relation between specific job experience and 
test scores is not reported. 

The possible effect of such factors as intelli- 
gence and mechanical understanding upon the 
manipulative skill measured apparently is slight 
inasmuch as test scores are found to correlate 


“reasonably low with both factors, .26 with the 
Revised Beta Examination and .42 with the 


Test of Mechanical Comprehension, Form AA, 
both for 66 machinist trainees. Later analysis 
gave correlations of .36 with the Revised Beta 
Examination and .28 with the Test of Mechani- 
cal Comprehension for 927 and 1,109 adult 
white male applicants respectively. 

While “it is apparent that the Hand-Tool 
Dexterity Test in selecting industrial trades- 
men, machine operators, and industrial appren- 
tices is best utilized in a test battery also con- 
taining a measurement of mechanical compre- 
hension, its value would appear to be greatest 
in helping select workers for those industrial 
jobs on which speed of operation is particularly 
desired. 


Neil D. Warren, Associate Professor of Psy- 
chology and Dean of Men, The University of 
Southern California, Los Angeles, California. 
This is a worksample test designed to measure 
proficiency in the use of ordinary mechanical 
tools such as the wrench and screwdriver. The 
subject is required to use these tools as well as 
the unaided fingers of both hands to remove a 
set of nuts and bolts from one upright and in- 
stall them in another. It is obvious that the task 
demands highly complex movements involving 
digital dexterity, gross arm coordination, and 
eye-hand coordination. The very complexity of 
the task places a considerable burden upon the 
directions for administration to provide stand- 
ardized procedures at all times. While the au- 
thor specifically recognizes this difficulty in the 
manual of directions, it is believed that the in- 
structions do-not fully prevent rather great 
variation in the methods used by different sub- 
jects. For example, while a screwdriver is pro- 
vided and may be used, there is no reference 
to it in the directions, nor does the task make 
it essential to use it at all. Three wrenches are 
provided which may be used in a variety of 
ways. 

Although the specific applicability of the test 
would seem to be limited to occupations requir- 


ing routine manipulation of hand tools, the 
“author has implicitly extended its use in a se- 
ries of studies of validity for such jobs as turret- 
lathe operator, precision grinder, and riveter. 
These studies involved the use of the Hand- 
Tool Dexterity Test as a part of a battery. The 
validity coefficient against foremen's ratings of 
machine tool operators was .46; against similar 
ratings of women riveters in aircraft construc- 
tion it was .51. The article which reports these 
data fails to provide adequate information for 
complete interpretation of the figures. 

The only norms provided in the manual are 
based on the performance of a large sample 
(N- 1,23) of male bag and paper factory 
workers. The nature of the group and the jobs 
represented are not reported. The usefulness 
of such norms is clearly questionable. 

The reliability of the test is given in the man- 
ual as .91. This was determined by the test- 
retest method, but there is no indication of the 

nature or size of the population upon which 
the study was conducted. 

The test has the advantage of considerable 
"face validity" in that the materials and tools 
are standard mechanic's devices and the task is 
much like many practical job situations. In the 
absence, however, of further normative infor- 
mation and validity data it would appear that 
its indiscriminate use in vocational selection is 
open to question. Moreover, the problem of 
standardization of instructions, of the nuts and 
bolts, and of the tools has not been fully solved 
by the author of the test. It is to be hoped that 
the results of further research on the test will 
soon be reported in the literature. The test is 
well made and, as long as the nuts turn freely 
on the bolts, offers few mechanical difficulties. 
It fills a real need for a test by which skills with 
tools can be measured. 


[ 660 ] 
Kent-Shakow  Formboard: Industrial Model. 
‘Adults ; 1928; 1 form; revision of the Worcester Form- 
board; $60 per testing outfit; Grace H. Kent and 
David Shakow; Sven G. A. Nilsson, 16 Maverick 
Road, Worcester, Mass. 


REFERENCES 

1-9. See 40:1401. 

19 eve, Witam R. Modification of Kent-Shakow Form- 
board Series. Unpublished doctor's thesis, University of Pitts- 
burgh, 1937. Pp. iv, 70. (Abstracts of Theses . . . 1937, PP- 
175-83. 

n. move, WrirLrAw R, “Modification of the Kent-Shakow 
Formboard Series.” J Psychol 7:385-07 Ap '39. * (PA 13:5884) 

1:2. BENNETT, Georce K., AND CRUIKSHANK, RuTH M. A 
Summary of Manual and Mechanical RD) Tests, Preliminary 
Form, pp. 35-6. New York: Psychological Corporation, 1942. 
Pp. v, 8o. Paper, lithotyped. $0.70. * ( PA 16:2432) 

13. TEEGARDEN, Lorene. “Manipulative Performance of 
Young Adult Applicants at a Public Employment Office—Part 


I.” J Appl Psychol 26:633-52 O "42. * (PA 17:2907) 
he 


S 


661 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


14, TEtGARDEN, Lorene. “Manipulative Performance of 
Young Adult UE. at a Public Employment Office, II." 
J Appl Psychol 26:754-69 D '42. * (PA 17:2908) 

15. EBERT, ELIZABETH, AND SIMMONS, KATHERINE. The 
Brush Foundation Study of Child Growth and Development: I, 
Psychometric Tests. MER of the Society for Research in 
Child Development, Vol. 8, No. 2. Mosen D. C.: the So- 
ciety, National Research Council, 1943. Pp. xiv, 113. Paper, 
Hep $1.50. (PA 18:3322 $ i 

16. TEEGARDEN, Lorene, “Occupational Differences in Ma- 
nipulative Performance of Applicants at a Public Eopiaynens 
Offices" J pel Psychol a7:416-37 O "ax. * (PA 18:272 

17. Wilson, Airce W. Educational Norms for High School 
Boys on the Modified Kent-Shakow Formboard Series. Unpub- 
lished "HER, thesis, RARIOR SR CRAS 1944. (Ab- 
tract. seed . 326-35. 

MUR tee o "Cant A Study of Reliability of the 
Modified Kent-Shakow Formboard Series. Unpublished doctor's 
thesis, University of GIVAUE 1944. 

19. NEWMAN, Joser. “The Prediction of Shopwork Per- 
formance in an Adult Rehabilitation Program: The Kent- 
Shakow Industrial Formboard Series." Psychol Rec 5:343-52 
Mr '45. * (PA 20:2339) P ps 

20. Wess, Ruru C. “Reliability of the Grove Modification 
of the Kent-Shakow Formboard Series.” J Appl Psychol 31: 
155-9 Ap '47. * (PA 21:2804) 

Milton L. Blum, Assistant Professor of Psy- 
chology, The City College of New York, New 
York, New York. This test is available in two 
models; the larger model is the industrial form, 
and the smaller one, the clinical form. The 
formboard consists of five recesses that are 
quadrilaterally symmetrical. The sets of blocks 
furnish eight different tasks of increasing dif- 
ficulty ; each task involves one problem repeated 
five times. The repetition is intended to reduce 
the possibility of a chance solution. The first 
task, used for practice and checking the under- 
standing of directions, consists of two blocks 
for each hole. The most complicated task, chal- 
lenging to superior adults, requires fitting five 
blocks into each hole. In addition to the per- 
ception of form and position, this test in its 
upper levels requires resourcefulness and 
adaptation of procedure. 

The K-S formboards have been used in a 
number of different situations. Shakow and 
Pazeien (9) have reported decile norms for the 
clinical formboard on groups of adults of both 
sexes, Teegarden (13, 14, 16) in a series of 
studies reports percentiles based upon large 
groups of adult males and females who were 
applicants at a public employment office. Her 
work on this test is interesting. She has pro- 
posed the division of the tasks into two series. 
The three simple tasks (2D, 3S, 4S) show no 
sex differences and are essentially a dexterity 
or speed group. The remaining tasks (3D, 4D, 
4DD, 5D) might be called a capacity or adap- 
tive group since they apparently require more 
than manipulative ability. This group of com- 
plex tasks shows a sex difference with males 
performing more rapidly, possibly because they 
have more experience in constructing, assem- 


bling, and putting things together. Grove (11) 
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has proposed a modification of the formboard 
and has developed percentiles on a group of 
delinquent adults. This test has been used also 


on psychotic and brain-injured cases as well as i 


in child guidance clinics. 

The test is sufficiently challenging to allow 
the examiner to make qualitative observations 
of the subject's performance. It has been sug- 
gested that an experienced examiner can ob- 
serve the subject's emotional reaction to success 
or frustration and his adaptability and resource- 
fulness in varying or trying new methods. 

Some of the research on this test indicates 
its reliability and validity. Ebert and Sim- 
mons (15) retested subjects after one or more 
years had intervened. They report correlations 
between .62 and .80. Newman (19) has used 
the Industrial Series for the prediction of shop- 
work performance in an adult rehabilitation 
program. A correlation of .76 is reported be- 
tween shop ratings in accuracy, speed, and con- 
structive thinking and time scores on the form- 
board. 

"However, there is a void in the research 
done on this test. It is unfortunate that a series 
of studies have not been conducted comparing 
test performance with job success in various 
types of mechanical and manipulative industrial 
jobs of both the simple and the complex order. 
Considering the use made of this and other 
tests, the reviewer believes that too little work 
has been done on this test since it first appeared 
in 1928. The fact that it has not disappeared 
from. use is good. People working with tests 
on both the vocational guidance and vocational 
selection level are encouraged to use this test 
whenever it is possible to spend about thirty 
minutes in measuring performance on two- 
dimensional spatial relations. It has the advan- 
tage of measuring a considerable range of abil- 
ity both in the simple manipulative functions 
and in the more complex aspects. The form- 
board also allows the trained examiner to make 
astute observations of a clinical nature. 


For a review by Lorene Teegarden, see 
40:1401. 
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MacQnuarrie Test for Mechanical Ability. Ages 
16 and over; 1925-43; 1 form; $1.75 per 25; 25¢ per 
specimen set, postpaid; Spanish edition available at 
Same prices; 12(25) minutes; T. W. MacQuarrie; 
California Test Bureau. 
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John R. Kinzer, Assistant Professor of Psy- 
chology, The Ohio State University, Columbus, 
Ohio. This paper-and-pencil test is widely used 
to furnish a rough appraisal of both manual 
and mechanical aptitudes. It is an easily ad- 
ministered group test and may be given in 
about a half hour. The test is in reality a bat- 
tery of seven subtests which may be analyzed 
separately. The total score is much less valu- 
able than the evaluation of the various sub- 
scores. 

The various parts are called: tracing, tap- 
ping, dotting, copying, location, blocks, and 
pursuit. The tracing test requires the subject to 
draw a line through small openings in a series 
of vertical lines. The tapping test requires the 
subject to make pencil dots as rapidly as pos- 
sible. The dotting test requires the subject to 
place one dot in each of a series of small circles 
spaced irregularly. The three subtests just de- 
scribed have the greatest manual dexterity ele- 
ment. The next four tests correlate highest with 
intelligence (in this order: blocks, pursuit, lo- 
cation, and copying) and are assumed to meas- 
ure the ability to learn mechanical skills, The 
copying test requires the subject to copy a series 
of simple designs. In the location test the sub- 
ject locates points drawn on a large scale and 
transposes them into an area drawn on a smaller 
scale. The blocks test requires one to visualize 
space by telling how many blocks in a pile 
touch a given block. The pursuit test requires 
the subject to follow a line in a maze, by eye 
alone. 

MacQuarrie (2) reported reliabilities on the 
separate tests as follows: tracing, .80; tapping, 


.75; dotting, .74; copying, .86; location, 72i 
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blocks, .80; pursuit, .76; and a total score re- 
liability of .90. The reviewer, in his own ex- 
perience, has found some interesting correla- 
tions with other tests. The highest correlation 
of any test among the first three subtests and 
the Minnesota Rate of Manipulation Test is 
between the dotting test and the placing part 
of the Minnesota test. The correlation there is 
only .24. It, therefore, might be assumed that 
these three MacQuarrie tests measure a dif- 
ferent kind of manipulatory ability than that 
measured by the Minnesota test. Of the last 
four tests (copying, location, blocks, and pur- 
suit), blocks correlates highest with intelli- 
gence as measured by the Otis Group Intelli- 
gence Test (Intermediate Form). The corre- 
lation coefficient between blocks and Otis is 48 
(based on 230 industrial adult males). A slight 
sex difference has been found by the reviewer 
which favors men and also the younger age 
groups. This probably should be taken into con- 
sideration by industrial users of this test. 

The MacQuarrie test is a valuable aid in 
selecting trainees for mechanical occupations, 
particularly when it is used with other tests 
and with adequate application blank data. 


C. H. Lawshe, Jr., Professor of Psychol- 
ogy, Purdue University, Lafayette, Indiana. 
The MacQuarrie Test for Mechanical Ability 
is a paper-and-pencil test comprised of seven 
parts or subtests—tracing, tapping, dotting, 
copying, location, blocks (tridimensional space 
perception), and pursuit, Certain advantages 
are apparent. It is adapted to group testing. 
Because of its brevity and simplicity of ad- 
ministering, the test has enjoyed widespread 
use in a variety of industrial selection and vo- 
cational guidance situations, Moreover, the sub- 
tests, which can be given separately, can each 
be correlated against a criterion. Optimum 
weighting of the parts as well as the whole in 
a test battery for specific predictive situations is 
thus possible. 

In selecting radio assembly operators a mul- 
tiple R of .46 (with optimum weighting) of all 
subtests of the MacQuarrie against a work- 
sample criterion was obtained. The total test 
Score gave a simple correlation coefficient of 
-42, but a good time saver was achieved in 
obtaining a multiple R of -41 for the subtests of 
tracing, location and blocks against the criterion 
(37). Pursuit, tracing, and dotting “came 
through” in Predicting probable success in typ- 


ing in the case of Hunter College students, and 
the pursuit and blocks subtests were successful 
in combination with other stenographic apti- 
tude tests in predicting Superior stenogra- 
phers (36). Blocks, tracing, pursuit, location, 
and copying selected individuals with an apti- 
tude for mechanical drawing (4) ; the first four 
of those subtests also picked out aviation me- 
chanics with fair success (21). The complete 
MacQuarrie was also instrumental in selecting 
aircraft inspectors in a battery including other 
manipulative tests and the Otis Self-Adminis- 
tering Test of Mental Ability (35). An adapta- 
tion of the tracing subtest into a sewing oper- 
ation selected good experienced sewing machine 
operators when attempts to predict good oper- 
ators from scores on the whole MacQuarrie had 
been unsuccessful (30). In an early report by 
MacQuarrie (2) validation and reliability data 
were established as follows: Scores on the test 
correlated less than .20 with an unspecified 
group mental test. Retest reliability for the 
whole test was .9o. Reliability for the subtests 
varied from .72 for location to .86 for copying. 

New norms based upon scores of 1,000 males 
and 1,000 females 16 years and over are sup- 
plied in the new edition for the seven subtests 
and the whole test. Norms for high school stu- 
dents aged 14 to 16 have also been worked 
out (39). 

A working idea of the MacQuarrie's various 
possible uses and limitations may be obtained 
from careful scrutiny of the validation reports 
cited above. It is apparent that MacQuarrie's 
"mechanical ability" does not refer to the un- 
derstanding of mechanical principles or famil- 
iarity with tools so much as to manipulative 
skills involving finger and hand dexterity, visual 
acuity, muscular control, and spatial relations. 
The test seems to merit consideration wherever 
measures of aptitude for jobs requiring the 
above worker skills and characteristics are 
wanted. 


Alec Rodger, Senior Psychologist to the Ad- 
miralty; on leave from the National Institute 
of Industrial Psychology; London, England. 
This test is over twenty years old. It has, there- 
fore, an old-fashioned look. We find in it a trac- 
ing test, a tapping test, a dotting test, a copying 
test, a location test, a blocks test, and a pursuit 
test—all to be performed with paper and pencil 
alone. But it is not to be criticised on that ac- 
count. The important question is, Does the test 
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usefully*and economically add io Gur other 
means of predicting success in jobs requiring 
manipulative skill? So we turn hopefully to the 
section of the manual of directions headed “some 
typical experience of users,” and to the separate 
bibliography prepared by the publishers. Do we 
find convincing evidence of the value of the 
test? Unfortunately, we do not. It has appar- 
ently been given to more than five million 
people, but very few of them seem to have con- 
tributed to the validation data. 

But we must be cautious in our criticism 
even on this point. Those of us who work in 
the field of industrial psychology know how 
difficult it is, very often, to collect evidence of 
the usefulness of our techniques. Probably 
many of us use procedures which, we are cer- 
tain, have value but which we would find it 
hard to justify to the statistician. However, this 
provides us with no excuse for presenting our 
data in the way in which it is presented in the 
MacQuarrie manual. What is the enquirer to 
make of entries of the following kind? “(6) 
CAN PACKERS: Dotting, .33; Tapping, .25. 
Source: Stead and Shartle. (7) ADDING 
MACHINE OPERATORS: Tracing, .38; 
Tapping, .29. Source: Stead and Shartle.” Dr. 
MacQuarrie may in fact have amassed exten- 
sive and convincing evidence of the value of 
his test for the selection of can packers and 
adding machine operators, but the statements 
we have quoted cannot be regarded as adequate 
indications of this. And what is the seeker after 
information expected to conclude from another 
statement: “(10) MACHINIST AND TOOL- 
MAKER APPRENTICES: 8o per cent to 
90 per cent of boys satisfactory if total score is 
75 or better”? 
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*Mellenbruch Curve-Block Series. Adults; 1046; 
originally called Miami-Oxford Curve-Block Series; 
individual; $18 per testing outfit; 10(15) minutes; 
P. L. Mellenbruch; the Author, 253 Rodes Ave., Lex- 
ington, Kentucky. 
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William R. Grove, Psychologist and Director 
of Child Study Service, Phoenix Elementary 
Schools, Phoenix, Arizona; formerly Professor 
of Psychology and Director of the Division of 
Psychological Services, The University of Pitts- 
burgh. Those who are familiar with the O'Con- 
nor Wiggly Block will be interested to learn that 


ph. Vp 
Dr. Mellenbruch had devised a somewhat simi- 
lar test prior to the appearance of the O'Connor 
test. This was the predecessor of his present 
curve-block series. The existence of this original 
curve-block had not been reported in the litera- 
ture until recently (1) so far as this reviewer is 
aware. The outstanding weakness of the O'Con- 
nor Wiggly Block is its extreme lack of reli- 
ability, a fact which seriously limits whatever 
validity the test may have. The original curve- 
block of Mellenbruch probably had the same 
basic weakness. His present test is a graded 
series of wiggly blocks which should materially 
increase the reliability of the test. 

In its present form the test consists of six 
rectangular blocks, each two and three-fourths 
inches square by eight inches long. Each block 
is cut lengthwise along wavy lines into sec- 
tions. The blocks of the series are made pro- 
gressively more complex by simply increasing 
the number of sections. The easiest block is cut 
into three sections, while the most difficult con- 
sists of twelve. Each block is painted a differ- 
ent color. A clever device that facilitates ad- 
ministration of the test is the numbering of each 
section on the end to indicate its standard posi- 
tion for presentation to the subject. 

Although the test author has given thought 
and study to the structural design of the series, 
this reviewer believes that the test could be 
somewhat further improved from a structural 
standpoint. This should be done before an at- 
tempt is made to standardize the test. Specifi- 
cally, the second block of the series should be 
modified so as to eliminate the possibility of an 
incorrect solution that is so close to the correct 
solution as to cause confusion for even very 
intelligent subjects. At other points also a care- 
ful study of the reaction of subjects would re- 
veal that certain errors are too plausible and 
thus weaken the test by introducing too much 
of the puzzle element into its solution. 3? 

The test is as yet unstandardized. Prelimi- 
nary data on 93 cases are reported, but whether 
these are children or adults, college students, 
or day laborers, is nowhere stated. Probably 
they represent an ill-defined accumulation of 
cases that have little or no meaning as norma- 
tive data. It is to be emphasized that although - 
this test series seems to have some possibilities 
of merit, it will require a great deal of pains- 
taking research to develop those potentialities 
to the point where they will have any practical 
utility. 


^ 


663, THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 692 


Willard A. Kerr, Associate Professor of 
Psychology and Education, Illinois Institute of 
Technology, Chicago, Illinois. This test is 
among the most promising of the tests which 
attempt to measure complex spatial ability 
along with the speed and accuracy of eye-finger 
coordination. Mellenbruch pioneered in the de- 
velopment of this type of three-dimensional 
dexterity test in 1926, antedating by two years 
O'Connor's somewhat similar Wiggly Block 
Test. 

Mellenbruch has proceeded from his original 
curve-block to the production of a series of 
such blocks, according to these principles: (a) 
The blocks in the series should range from very 
simple to very difficult. (b) Steps of increasing 
difficulty from one block to the next should be 
approximately equal. (c) The total series 
should not be too cumbersome but should be 
large enough for the smallest parts to be readily 
perceived and easily handled. (d) The series 
should be simple enough in administration to 
be practically self-administering. (e) The test 
should reduce to a minimum the factor of 
Chance success or failure. 

Although the six assembled blocks are iden- 
tical in shape and size, their parts are so de- 
signed that no inside surface is flat or straight 
—and in no instances are the sections in any 
one block identical or reversible, nor are the 
sections of different blocks interchangeable. The 
Six curve-blocks, each of a different color, con- 
stitute an attractive series, a fact which should 
help elicit interest and motivation of individ- 
uals about to be tested. 

Mellenbruch reports a split-half reliability 
Coefficient of .75. No validity data are offered, 
but the author states a plan to run validation 
studies. 

This spatial-manipulative test should be a 
useful tool in industrial research and possibly 
in selection of certain types of personnel. 
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Minnesota Rate of Manipulation. Adults ; 1931-46 ; 
revision of Minnesota Manual Dexterity Test; ind 
vidual; 1 form; $18 per testing outfit; worklimit (10- 
I5) minutes; revision by W. A. Ziegler; Educational 
Test Bureau. 
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25. Sr^sHORE, Hanon G. "The Improvement of Perform- 
ance on the Minnesota Rate of Manipulation Test When 
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Edwin E. Ghiselli, Associate Professor of 
Psychology, University of California, Berkeley, 
California. This test, which in spite of some 
deficiencies has been accepted as standard equip- 
ment for psychometricians, has been revised by 
the addition of three new tasks. The original 
tasks of placing the blocks into the holes by 
one hand and turning over the blocks by both 
hands are retained. Following these tests the 
subject performs three more tests as follows: 
the displacing test wherein he moves each block 
in turn from one hole to the next hole above, 
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the one-hand turning and placing test which is 
similar to the original placing test except that 
each block is turned over before it is inserted, 
and finally, the two-hand placing and turning 
test which involves simultaneous motions of 
both hands in the two tasks of turning and 
placing. 

The reliability coefficients of the various 
parts of the test appear to be of the order of 
.9o as estimated from a single administration. 
This would suggest a test-retest reliability of 
80 to .85. The intercorrelations among the 
various parts are approximately .50. Relation- 
ships of this order are too high if the different 
parts of the test are to be considered as meas- 
uring different functions. Under the circum- 
stances, then, if total scores are to be used, the 
new additional three tests probably add very 
little. 

It is reported that the boards and blocks are 
carefully constructed within close tolerances. 
This will certainly be a distinct gain since there 
has been marked variation in dimensions among 
various copies of the original board. 

For the original test, general population 
norms were available from a University of Min- 
nesota study which stood as a classic in good 
sampling procedure. While norms are presented 
for the various parts of the revised test, they 
cannot be evaluated since no information is 
given concerning the characteristics of the nor- 
mative sample. The lack of breakdown of scores 
by age and by sex is indicative of the inade- 
quacy of the norms. 

In the use of the original test, some psy- 
chometricians have stressed the value of the 
qualitative data obtained by observation of the 
testee. The addition of the new tasks in the re- 
vised edition of the test presumably would in- 
crease its value in this respect. It has been 
the experience of the reviewer, however, that 
with normal persons the qualitative ratings are 
closely related to the test scores and contribute 
very little. On the other hand, with persons 
with physical handicaps which possibly might 
affect manual dexterity, qualitative data ob- 
tained from observations appear to be very use- 
ful. It is to be noted that the displacing and 
turning tests have been used with the blind and 
satisfactory norms are available for such per- 
sons. 

A survey of the validity coefficients of the 
original test indicates no significant differences 
between the turning and placing tests. On the 


average, for various packing operations and 
jobs involving placing movements, coefficients 
of the order of .20 to .30 may be expected. The 
validity for assembly operations is disappoint- 
ingly low, being of the order of .10 to .15. The 
manual for the revised test gives validity coeffi- 
cients of .57 for one-hand placing and turning, 
and .33 for two-hand placing and turning, for 
60 men engaged in machine operations in a 
paper plant. This is the only statistical evi- 
dence cited for the validity of the new test. The 
present publishers seem to be more impressed 
with incidental observations of individual cor- 
respondence between test scores and perform- 
ance. It would seem that the test is a wholly 
adequate instrument for selecting championship 
heavyweight boxers and basketball teams. 

The reviewer gained the impression that the 
publisher is much more interested in making 
sales to untrained and unsuspecting persons 
than in developing a scientifically sound instru- 
ment. In light of considerations mentioned 
above, the revised test would appear to have 
little or no advantage over the original except, 
perhaps, in the precision of its physical con- 
struction. 


John R. Kinzer, Assistant Professor of Psy- 
chology, The Ohio State University, Columbus, 
Ohio. This series of tests is intended to measure 
rapidity of movement of the hands, fingers, 
and arms. The chief uses have been in the se- 
lection of both office and shop workers for 
occupations requiring gross manipulatory move- 
ments, The tests are all performed on a long, 
narrow board which contains 60 holes in four 
rows of 15 each, into which are fit 60 cylindri- 
cal blocks. 

The revised (1946) examiner’s manual de- 
scribes standard procedure for administering 
five separate tests and gives norms for two, 
three, and four trials. The five tests are called: 
(a) Placing, (b) Turning, (c) Displacing, 
(d) 1-Hand Turning and Placing, (e) 2-Hand 
Turning and Placing. This series of five tests 
yields rate of manipulation scores involving a 
variety of manipulative movements, Arm move- 
ment predominates in the placing and displac- 
ing tests. Coordinated arm, finger, and rotating 
wrist movements in both right and left arms 
predominate in the turning test. In the 1-hand 
turning and placing test, coordinated finger and 
arm movements in the preferred hand predomi- 
nate; similar movements made simultaneously 
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by both arms are included in the 2-hand turn- 
ing and placing test. 

The reliability and validity data on the tests 
are meager. The 1946 Examiner's Manual re- 
ports certain observations of test users and a 
study by Clifford E. Jurgensen (13) to indi- 
cate reliability and validity. These data are ex- 
tremely inadequate. A user of the tests must 
set up norms and critical scores for his own 
population. The necessity for this would be sug- 
gested also, from the reviewer's experience with 
the tests, because of irregularity in the con- 
struction of the boards. The manufacturer 
claims that the boards are constructed with 
"close tolerances," but in spite of that claim 
there is much variation in hole and block sizes. 
The soft material used in the boards allows the 
holes to become oversize quickly. Also, warp- 
ing has been a prevalent source of difficulty. 

In spite of certain physical variations from 
board to board the tests are excellent for selec- 
tion and placement of workers in simple ma- 
nipulatory occupations. The tests are valuable 
in clinical and guidance work in schools. A skill- 
ful examiner can make some very useful clini- 
cal observations of the subject's reaction to 
frustrating and fatiguing elements of the tests. 


J Consult Psychol 11:157 My-Je ’47. * sum- 
marizes in convenient form much of the work 
that has been done on the test since its intro- 
duction in 1933 * The “stanine,” a useful 
9-point standard score originated in the Air 
Forces, is here given what is probably its first 
civilian application. 


For reviews by Lorene Teegarden and 
Morris S. Viteles, see 40:1662. 
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Minnesota Spatial Relations Test. Ages 11 and 
over; 1930; revision of H. C. Link's Spatial Relations 
Test; individual ; $48 per testing outfit ; worklimit (1s- 
45) minutes; Donald G. Paterson, Richard M. Elliott, 
ibs Dewey Anderson, Herbert A. Toops, and Edna 
Heidbreder ; Educational Test Bureau. 
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cores to Scho- 


Milton L. Blum, Assistant Professor, The 
City College of New York, New York, New 
York. This test was originally part of the bat- 
tery developed at the University of Minne- 
sota under a grant to study mechanical abil- 
ity (2). The forerunner of the test was Link's 
Spatial Relations. Test. Reliability coefficients 
of about .80 and validity coefficients of about 
-50 are reported for the Minnesota Spatial Re- 
lations Test. 

The test measures two-dimensional spatial 
relations. It does not have the element of com- 
bining and integrating as does the Kent-Shakow 
Formboard. It is therefore a test of spatial re- 
lations on a more simple level. People with 
rather poor ability can successfully perform the 
test. Speed is the most important element in this 
test. Accuracy is self-corrected since wrongly 
sized or shaped blocks do not fit. 

The test consists of four boards, each con- 
taining 58 oddly shaped and differently sized 
spaces. Boards A and B use the same set of 
blocks to be placed first in Board A and then 
in Board B. The arrangement of the blocks 
prior to placement is standardized. Board B is 
used to place the blocks in the proper starting 
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position for Board A. The completion of Board 
A makes the blocks ready for Board B. The 
examiner, therefore, is not troubled with ar- 
ranging and rearranging the blocks. Boards C 
and D are slightly different and require an- 
other set of blocks; the principle is the same 
as in Boards A and B. 

Board A is used as a practice run, and the 
score on the test is the time taken to complete 
Boards B, C, and D. The authors have pub- 
lished norms for men, women, and boys. The 
reviewer has found that it saves time to elimi- 
nate the last two boards and modify the direc- 
tions slightly so as to give Boards A and B as 
the test. Percentile nórms are available on a 
large group of people between the ages of 16 
and 24 seeking guidance, and such will be fur- 
nished upon request. 

Most of the 58 blocks come in series of three: 
a small, a medium, and a large block of the 
same shape. The blocks in each series fit rela- 
tively close to one another on the board. False 
size discrimination occurs more often than false 
shape discrimination. Some subjects gain al- 
most immediate insight into this size relation- 
ship and proceed in methodical fashion. Most 
subjects begin by filling in the holes with blocks 
in random fashion or in a position sequence and 
change to the three-block procedure. Very few 
perform the entire test without shifting their 
method. If they do not shift, their scores are 
very poor. Y 

The test with its large number of holes and 
blocks looks complicated. Actually it is not; the 
reviewer has never tested a person who was not 
able to complete the task. This serves a useful 
purpose when a poor subject's morale needs 
bolstering during a test battery. Of course there 
is a considerable time difference between the 
fastest and slowest person. 

The test is simple to administer. It is self- 
scoring except for timetaking and is useful in 
a battery for vocational guidance when some 
indication of mechanical ability related to two- 


dimensional spatial relations is desired. For the . 


measurement of tridimensional spatial relations, 
the Wiggly Block Test or Crawford Spatial 
Relations Test should be used. The test meas- 
‘ures very little finger or hand dexterity, and 
O'Connor's Finger Dexterity Test or the Plac- 
ing Test in Zeigler’s Minnesota Rate of Ma- 
nipulation Test would be better measures of 
these. 


For a review by Lorene Teegarden, see 
40: 1663. 
[ 665 ] 
*Pennsylvania Bi-Manual Worksample. Ages 17 
and over; 1943-45; individual; 1 form; $15 per test- 
ing outfit; worklimit (10-20) minutes; John R. Rob- 
erts; Educational Test Bureau. 
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Edwin E. Ghiselli, Associate Professor of 
Psychology, University of California, Berkeley, 
California, This test departs from the usual 
measures of finger dexterity in that it de-empha- 
sizes the insertion of pegs into holes and the 
placing of washers around pegs. The task set by 
the test involves the simple operations of select- 
ing a bolt with one hand and a nut with the 
other, assembling the two objects and placing 
the assembly in a receiving hole. Upon com- 
pletion of 100 such operations, the second part 
of the test, which consists of disassembling the 
nuts and bolts and returning them to their 
proper dishes, is administered. Two scores are 
obtained, one for assembly and the other for 
disassembly. In both cases the score is the time 
to complete 100 operations. For normal persons 
the outside time for the assembly test is about 
10 minutes and for the disassembly test about 
4.5 minutes, with mean times of approximately 
5 and 2.6 minutes, respectively. The disassem- 
bly appears to contribute little except reduction 
of work for the administrator. 

Reliability of the test is given as .95 by the 
split-half method. Particularly for speed tests 
of this sort the split-half method gives a spuri- 
ously high estimate of reliability. With the test- 
retest method, reliability undoubtedly would be 
of the order of .80 to .85; and, while perhaps 
leaving something to be desired, nevertheless, 
it is satisfactory in comparison with other meas- 
ures of dexterity. The assembly and disassem- 
bly parts are correlated to the extent of .55. 
Correlations with other types of dexterity tests 
are of the order of .40. 

Evidence relative to validity in terms of cor- 
relations with measures of job success is com- 
pletely lacking. The author of the test points 
out that the large bulk of successfully employed 
industrial workers engaged in tasks demanding 
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dexterity make scores above the median of un- 
selected individuals. This, however, cannot be 
accepted as satisfactory evidence of validity. On 
the other hand, the author emphasizes the 
qualitative observations relative to the work 
methods of the testee that can be gained during 
the administration of the test. On the surface 
this would seem to be true when the nature of 
the task is considered, and indeed, a descrip- 
tion of work methods is given in the manual. 
Nevertheless many testees fail to understand or 
follow the standard method of work prescribed 
by the manual, and this fact introduces varia- 
tions of unknown extent in administration. The 
reviewer has received favorable comments from 
psychometricians concerning the usefulness of 
the test as a means for obtaining qualitative 
data. 

Norms are given separately for both parts ot 
the test by age and sex. It is stated that these 
norms are based upon an unselected sampling. 
However, even though the numbers of cases 
for both sexes are approximately 2,000, since 
the classes of individuals constituting the nor- 
mative sample are not given, the distribution 
of scores cannot be considered wholly adequate 
as general population norms. Useful norms are 
given for several hundred persons successfully 
engaged in manipulative industrial jobs. Of 
considerable significance is the fact that distri- 
butions of scores are given for blind and par- 
tially blind individuals of both sexes and broken 
down into those successfully employed in ma- 
nipulative industrial jobs and those without 
such work history. While the numbers of cases 
on which the distributions are based are rela- 
tively small, it is important to know that there 
has been successful experience in administering 
the test to the blind. 

An additional advantage of the test is that 
the task required is something within the ex- 
perience of all testees. Tt has been independently 
reported to the reviewer that persons who are 
test-shy feel quite comfortable with the Penn- 
sylvania Bi-Manual W orksample. 


Thomas W. Harrell, Associate Professor of 
Psychology, University of Illinois, Urbana, 
Illinois. This test purports to measure a com- 
bination of “finger dexterity of both hands, 
gross movements of both arms, eye-hand co- 
ordination, bi-manual coordination, and some 
indication of the individual’s ability to use both 
hands in cooperation.” 


The test objects are a board, 105 bolts, and 
105 nuts. The board is 2 feet X 8 inches in 
which there are 100 holes and 2 trays. The 
holes, 14 inch apart, are arranged in 10 equal 
rows. There is a tray at each end of the holes; 
one tray contains nuts, the other bolts. The 
holes are 4 inch in diameter; the nuts are 14 
inch in diameter and the bolts are 174 inches 
long. 

The task is first one of assembly and then of 
disassembly. In assembly, a bolt with a nut on 
it is placed in each of the holes. Twenty holes 
are used for practice, 80 for the test. Disassem- 
bly is taking nuts off the 100 bolts and putting 
them in the appropriate trays. Assembly and 
disassembly scores correlate .55. 

Score is obtained in time in seconds for as- 
sembly and .for disassembly separately. Norms 
are given for converting the time scores into 
percentiles and into transmuted scores based 
on the standard deviation. 

Norms are given for an “unselected sampling” 
of 1,793 males and 2,186 females; for 275 high 
school boys and 275 high school girls; for 200 
men and 200 women of an industrial group who 
had done manipulative tasks in industry for six 
months. Their jobs or occupations are not given 
and might be meaningful. Distributions of time 
scores are given for each of the above groups 
in comparison with 19 blind men and 3 blind 
women with successful employment in indus- 
try; 118 blind men and 63 blind women with- 
out industrial experience; and 23 men and 7 
women who could barely see. On assembly 
young people are better than old and men 
slightly better than women. 

Reliability for the whole test using the Spear- 
man-Brown formula is .947 for assembly. 

No definite validity results are given although 
the general statement is made in the manual 
that "Data . . . indicate clearly its value in 
differentiating the good workers on manipu- 
lative tasks, similar to the way tests of general 
mental ability differentiate the good student 
from the poor student . . . numerous small in- 
plant studies of successful workers on highly 
repetitive manual work have consistently 
yielded averages equal to a transmuted score 
of 6 or better when compared with the general 
population, and rarely do we find a single 
worker in these groups with a transmuted score 
of less than 5." 

This seems to be a good test. The reliability 
is high. Satisfactory validity is recited but is 
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not given in anything like the detail to which 
test users are entitled. 


Albert Gibson Packard, Assistant. Director 
in Charge of Aptitude Testing, Public Schools, 
Baltimore, Maryland. The manual for this 
worksample states that it was developed to ful- 
fill the need in the vocational field for a test 
situation which would combine the basic ele- 
ments inherent in a relatively simple work sit- 
uation, One of the most difficult problems in 
test construction is that of determining content 
for the test. Without a systematic analysis of 
many simple work situations and a summariza- 
tion of the basic elements that operate or func- 
tion in these situations in terms of their total 
frequency, there is little or no assurance that 
these basic elements are measured as they may 
or may not function. This procedure is not in- 
dicated by the author. 

"Moreover, observation of the individual at 
work will reveal certain qualitative aspects of 
his performance which are important over and 
above native speed and dexterity." This point 
is well taken, but the procedure requires sub- 
jective interpretation. This writer has observed 
qualitative aspects of performance on a number 
of tests but has found no way to eyaluate them 
objectively. 

""The worksample should be considered a job 
and, as such, needs no validation." Where the 
worksample duplicates or is a miniature of a 
specific job, there is some chance that the above 
statement would be justified. However, the 
chances are remote that any industrial or voca- 
tional psychologist would find jobs that dupli- 
cate or are miniatures of this worksample. This 
leaves the problem of validation of this work- 
sample for closely related jobs and other jobs 
that seem to require measures of this sort in the 
hands of those who wish to experiment with it. 

Standard scores are given for interpretation. 
There is no indication as to how the classifica- 
tions were determined in terms of work situa- 
tions. Normal or average performance on a test 
is usually accepted as a band between plus and 
minus one standard deviation or between plus 
and minus one-half standard deviation. The in- 
terpretation given classifies all cases on this 
test of 5.0 standard score and below as “prob- 
ably cannot compete" or "should not be con- 
sidered,” and those above 5.0 standard score 
as “skillful” and “extremely fast and accurate." 


No justification for this method of interpreta- 
tion is indicated. 

Reliability reported by the split-half method 
on II2 vocational school boys is .897 and by 
application of the formula for a lengthened test 
as .947. This reliability coefficient would justify 
the use of this test as an individual measure for 
this group. Does this reliability hold for other 
groups? 

Correlations are given in the manual between 
the worksample and the Minnesota Rate of 
Manipulation, the Revised Beta Examination 
and the O'Connor Finger Dexterity Test. They 
indicate a certain degree of uniqueness in this 
test, for while positive they are not high (.35 
to .46). The populations vary for each coeffi- 
cient, which leads one to raise the question as 
to what they might be if all were based on the 
same population. 

The physical make-up of the test can be 
classified as very good. The directions for ad- 
ministering are clear and adequate. 

Norms are given for men and women for 
both aspects of the test (assembly and disas- 
sembly). The assumption must be made, how- 
ever, that these groups are representative of 
the general male and female populations. De- 
scriptions of these groups based on other fac- 
tors would be most helpful. 


Neil D. Warren, Associate Professor of Psy- 
chology and Dean of Men, The University of 
Southern California, Los Angeles, California. 
Little information about this test is available in 
publications other than the manual prepared 
by the author of the test. It is relatively new 
(1943), convenient, and neatly designed ; hence, 
it would appear probable that reports of its use 
should begin to appear in the literature in the 
near future. 

The test has been devised to provide a com- 
plex work situation involving both gross move- 
ments of the arms and fine coordination of the 
fingers. Both hands must be used to perform 
the task. This worksample requires the assem- 
bly and disassembly of nut and bolt units. The 
bolts secured from one tray and the nuts from 
another are assembled and placed in rows of 
holes before the subject. In the disassembly 
task the nuts and bolts are separated and re- 
turned to their respective bins. The scores are 
in terms of the time required to complete each 
of the separate tasks. It may be questioned 
whether such a task can be described properly 
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as a worksample. Certainly as a standard task 
its applicability to specific work situations is 
somewhat limited. 

The nature of the task as such is to require 
relatively long periods for both assembly and 
disassembly. This undoubtedly contributes to 
satisfactory reliability for the test. In the attrac- 
tive and comprehensive manual accompanying 
the test reliability is given as .897 (Spearman- 
Brown correction: .947). The norm groups 
reported in the manual include a large (N = 
3,979) group of “unselected adults” and ade- 
quate groups of high school students and in- 
dustrial workers. The latter group is differen- 
tiated by having a history of at least six months 
of successful employment in manipulative tasks. 
The norms are listed in standard scores as well 
as in percentiles, Preliminary data on groups 
of blind subjects are also reported. 

The author's evidence of validity is confined 
to correlational data showing the relationship 
of scores on the test to those on such tests as 
the Minnesota Rate of Manipulation Test, the 
O'Connor Finger Dexterity Test, and the Ben- 
nett Test of Mechanical Comprehension. These 
correlations range from .17 to .46. The data 
reported also provide an indication of the dif- 
ferences between unselected adults and indus- 
trial workers, There are, however, few pub- 
lished reports indicating the application of the 
test to specific occupations. Until the validation 
is expanded to include studies of the relation- 
ship of test scores to occupational criteria and 
the test is given further statistical elaboration, 
such as factor analysis, psychometrists must 
consider the test a Promising but not yet 
wholly acceptable measuring instrument. 
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*Purdue Pegboard, Grades 9-12 and adults; 1941; 
$12.50 per testing outfit; gog per 25 score sheets; 214 
(10) minutes; prepared by the Purdue Research 
Foundation; Science Research Associates, 
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553, Trade edition, $5.35; textbook edition, $4.9 T Gey ets 

Edwin E. Ghiselli, Associate Professor of 
Psychology, „University of California, Berke- 
ley, California. Unlike most tests of finger dex- 
terity, the Purdue Pegboard is a time-limit 


rather than a work-limit test. This character- 
istic, however, appears to be its only advantage. 
The test consists of four parts, insertion of pegs 
with the right hand, the left hand, both hands 
simultaneously, and finally, a simple assembly 
involving the placing of a washer and a collar 
over the peg. The first three parts require a 
half minute each and the last part one minute, 
the scores on all parts being based on the num- 
ber of operations completed in the time allotted. 

A most unfortunate characteristic of the test 
is its physical, construction. The base which 
contains the holes into which the pegs are in- 
serted is made of wood. Insertion of the metal 
pegs causes sufficient wear so that after the test 
has been administered to a few hundred per- 
sons the holes increase in size and the test scores 
are significantly changed. This means that 
norms and data relative to reliability and valid- 
ity must be considered with caution. In addi- 
tion, the holes are drilled at different depths so 
that some testes lose time by trying to force 
a peg into a shallow hole. This differential depth 
varies from board to board and apparently is 
not intentional inasmuch as three or four con- 
secutive holes may be of one depth while the 
following holes are of another. 

The test can be administered by giving each 
part once, twice, or three times. For the first 
three parts, involving simple peg insertion, 
adequate reliability of about .85 is not achieved 
unless they are administered three times. For 
the assembly part this reliability can be achieved 
with two administrations. Norms are available 
for several hundred industrial workers, but the 
constitution of the groups is not given and 
therefore the distributions cannot be consid- 
ered adequate, 

The authors of the test state that the scores 
have ‘satisfactory validity but that for reasons 
beyond their control they are unable to present 
the exact data. Under certain circumstances, 
correlations as low as .20 may be considered as 
evidence of adequate validity for a test of this 
Sort. Since neither the coefficients nor informa- 
tion concerning the specific types of jobs on 
which the test was employed are cited, conclu- 
sions relative to validity must be held in abey- 
ance. In light of this and other matters consid- 
ered, the Purdue Pegboard would appear to be 
of doubtful value. 


Thomas W. Harrell, Associate Professor of 
Psychology, University of Illinois, Urbana, 
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Illinois. No definite validity results are given 
in the preliminary manual or on the score sheet. 
The manual states that “considerable data have 
been. accumulated as to the validity of this test 
in respect to a number of jobs in one of our 
large ordnance plants. Government regulations, 
however, prevent the release of these data and 
comparable studies are now being made else- 
where.” Tiffin has written, in a letter to the 
reviewer, that it was impossible to obtain the 
ordnance data. Accompanying the Tiffin letter 
was a curve showing a close, but statistically 
undetermined relation between assembly scores 
and production index for approximately 30 
quillers. 

The score sheet states that “High scores on 
the Pegboard have been shown related to suc- 
cess in the majority of occupations requiring 
motor coordination. Such occupations include: 

. ." and a list of 16 occupations follow. Data 
cannot be located to support this statement. 

No correlation is given between insertion and 
assembly. Because of the lack of results it is 
impossible to appraise the importance of these 
separate scores which purportedly measure dex- 
terities that differ in an important way. 

An advantage of this test is that it has a time 
limit whereas many similar appearing tests 
have work limits; consequently the Purdue 
Pegboard can conveniently be given simultane- 
ously to more persons per examiner. If two 
important different dexterities are measured, 
the test has the advantage of measuring them 
both with the same board. The disadvantage of 
the test is that with the exception of a small 
number of quillers with an undetermined valid- 
ity coefficient there is no validity demonstrated. 


Albert Gibson Packard, Assistant Director in 
Charge of Aptitude Testing, Public Schools, 
Baltimore, Maryland. Many tests have been 
designed for the measurement of motor skills. 
Studies indicate considerable specificity (low 
correlation) between them, which raises the 
question as to the application of such measures. 

The preliminary manual for this test states 
that it can be applied to many job situations 
where the particular aspects of motor dexterity 
measured by the tests are required. There 
is no quarrel with this statement; but in order 
to apply a test of this type with any degree of 
confidence, except experimentally, it is neces- 
sary to know its predictive efficiency, an ade- 
quate description of the job situation in which 


TESTS & REVIEWS: VOCATIONS—MANUAL DEXTERITY 


667 


it was validated, and a description of the group 
upon which it was established. The preliminary 
manual indicates that such information is not 
available at this time. 

Psychologists have been stumbling along for 
many years with so-called measuring instru- 
ments which do not meet the requirements of 
a measuring instrument. One of the prime 
requisites is that of consistency. For individual 
test use the desirable reliability coefficient has 
been set at not less than .95. The coefficients 
reported do not meet this requirement. How- 
ever, by increasing the length of the test for 
the first trial, it is quite likely-that this relia- 
bility may be obtained. 

The physical make-up, compared with some 
performance tests that have been placed on the 
market, is very good. 

It would be most helpful if the manual re- 
ported intercorrelations between the right 
hand, left hand, both hands, and assembly ad- 
ministrations of the tests; also correlations 
between the various administrations of this test 
and the numerous other dexterity tests used in 
industrial plants. From these intercorrelations 
it may be possible to determine the commu- 
nality or the uniqueness of this test in order to 
match it experimentally with any particular in- 
dustrial job situation or to determine the appli- 
cation of this test in a composite of traits for 
job differentiation and selection, Considerable 
evidence has been accumulated which indicates 
that job situations require a composite of char- 
acteristics which function as a pattern. 

The norms for women for the one-trial ad- 
ministration were based on 954 applicants; for 
two-trial administrations they were based on 
100 applicants; for the three-trial norms, on an 
interpolation from the two-trial norms. The 
assumption one must make in using these 
norms is that they represent the general female 
adult population. Are these women all of one 
age? Is there a significant difference between 
age groups? What factors other than race, 
previous occupation, and geographic locality 
were considered that would indicate that this 
is a representative population? 


[ 667 ] 
*Small Parts Dexterity Test. Adults; 1946; indi- 
vidual ; $25 per testing outfit, postpaid ; worklimit (10- 
25) minutes; John Crawford; Psychological Corpora- 
tion. 


Joseph E. Moore, Head, Department of Psy- 
chology, Georgia School of Technology, At- 


x 


lanta, Georgia. This test consists of four parts, 
a one-inch plywood board containing 3 circular 
bins for 45 metal collars, 45 pins, and 45 
Screws; a metal plate with 42 holes for the pins 
and 42 holes for the screws; and 2 metal trays, 
one to receive the pins, the other to receive the 
screws. A slot in the board serves as a recep- 
tacle for a screwdriver and a pair of tweezers, 
both of which are included. 

The test is taken in two parts, Part 1 starts 
with a practice trial in which the subject, after 
being shown how to proceed, picks up a pin, 
inserts it in one of the holes, then picks up a 
flanged collar, and places it over the protrud- 
ing end of the pin, flange down. Each pin must 
have a collar placed on it before the subject 
proceeds to the next one. He does 6 pins in the 
practice trial and is timed on doing 36 in the 
test proper. Part 2 consists of picking up a 
screw with the fingers, giving it “just enough 
of a twist to start it in the threads,” and then 
using the screwdriver with both hands to screw 
it all the way through to the tray below. The 
practice trial for Part 2 is given over 6 screws. 
The test proper is the time necessary to turn 
36 screws through tlie plate into the tray below. 

The author certainly did not understate it 
when he put at the top of the manual the words 
“Preliminary Manual,” It seems that everything 
about this test is preliminary or tentative. 
There are no data on reliability or validity. In 
the author's own words, which express a hope 
rather than a verified conclusion, he states that 
“performance on this test is expected to be re- 
lated to such jobs as wiring of intricate devices, 
radio tube manufacture, fine inspection work, 
and the adjustment of motors, clocks, watches, 
typewriters, office machines, and other instru- 
ments,” The buyer has no basis of knowing 
what the test measures from the manual. The 
author even failed to include “the small quan- 
tity of validating data obtained with a pilot set 
of the test,” The statement that “the accumu- 
lation of really significant data waits upon the 
cooperation of the test’s industrial users” is to 
the reviewer a regrettable one. Why should an 
industrial user have to pay a high price for an 
instrument that has nothing to Support it except 
the preliminary form and a “hope” from the 
Saker that maybe he has developed something 

value? 

The “tentative norms,” if one wishes to use 
the term tentative in its loosest sense, were ac- 
cumulated from “470 adult males seeking in- 
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dustrial employment, unselected as to education 
and previous experience.” The norms are in 
three groups, high, average, and low. Separate 
norms for the pins and collars and those for 
the screws are given. The range is given. No 
other statistical data whatsoever are given. 

This instrument and the superficial job of 
“validating” it and developing its wholly inade- 
quate norms reflect adversely, and justly so, on 
the testmaker and the Psychological Corporation 
which prematurely distributed it. If the test- 
maker knew no better how to develop and val- 
idate a test, certainly those who passed on his 
material in the Psychological Corporation should 
have known better. 

Test users should expect from a testmaker 
and a test distributor alike a serious and sincere 
effort to show what, if anything, their instru- 
ment can do that other tried tests are not doing. 
The failure on the part of the maker of the 
Small Parts Dexterity Test and of the distrib- 
utor in meeting the foregoing standards makes 
a sad and regrettable episode in the field of 
test development, 


MECHANICAL ABILITY 


[ 668 ] 
Detroit Mechanical Aptitudes Examination, Re- 
vised. Grades 7-16; 1928-39; Form A: $3 per 100; 
15¢ per specimen set, postpaid ; 31 (40) minutes; Harry 
J. Baker, Paul H. Voelker, and Alex C. Crockett ; Pub- 
lic School Publishing Co. 
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Lloyd G. Humphreys, Supervisor of the 
Testing Division, University of Washington, 
Seattle, Washington. There is little that needs 
to be added to Lorge's earlier review of this 
test. To summarize briefly, mechanical aptitude 
is presumably measured by the 8 subtests of 
(a) tool recognition, (b) “motor” speed (mark- 
ing X's in circles), (c) size discrimination, 
(d) arithmetic fundamentals, (e) disarranged 
Pictures, (f) tool information, (g) belt and 
pulley motions, and ( h) digit-letter substitution. 
No satisfactory explanation for this particular 
analysis of mechanical ability is given. The in- 


adequate correlational data presented indicate 
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that the parts are, as expected, quite diverse. In 
addition, 7 of the subtests are combined into sup- 
posedly meaningful groups as follows: motor— 
Tests 2 and 8; visual imagery— Tests 3, 5, and 
7; mechanical information— Tests 1 and 6. 
Though the absence of the complete table of 
intercorrelations leaves this matter somewhat 
in doubt, it is probable that only the last of the 
groupings can be firmly supported empirically. 

Validation data are largely lacking. Scores 
on an earlier form of the test were correlated 
with shop grades, the number of cases not speci- 
fied. The resulting correlation was .64. A cor- 
relation of .65 is also reported between this test 
and the Detroit Advanced Intelligence Test for 
185 twelfth grade pupils. This finding is not 
evaluated by the test authors. It is, of course, 
too high to make the test very useful. It is also 
needlessly high; i.e., certain sections of the 
mechanical test undoubtedly overlap much 
more with the intelligence test than do other 
sections. The mechanical information group— 
Tests 1 and 6—probably has a lower correla- 
tion than most of the other subtests with the 
intelligence test. Since this group also has a 
reasonably high degree of homogeneity and re- 
liability, it is probable that it contributes the 
most useful score to be obtained from this test. 
This hypothesis obviously needs empirical con- 
firmation before the counselor can proceed to 
use this score with confidence. 

The reviewer wishes to condemn particularly 
the sort of validation presented on the next to 
the last page of the manual. A table shows the 
necessary levels of “mechanical ability” and 
“mental ability” for about seventy occupations. 
These levels are based on “opinions of ap- 
proximately two hundred Detroit counselors 
and teachers.” Such data are no substitute for 
correlation coefficients between test scores ob- 
tained before training and later criteria of suc- 
cess, 


Dewey B. Stuit, Professor of Psychology 
and Dean, Student Personnel Services, The 
State University of Iowa, Iowa City, Iowa. 
This test is designed to measure mechanical 
aptitude. The eight subtests are as follows: (1) 
tool recognition, (2) motor speed, (3) size 
discrimination, (4) arithmetic fundamentals, 
(5) disarranged pictures, (6) tool information, 
(7) belt and pulley, (8) letter and number 
combinations. The titles of the first six tests are 
self-explanatory. Test 7 requires the examinee 
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to deterfnine the speed and direction of the 
second of two pulleys. In Test 8 the examinee 
places a number (taken from a key at the left 
of the page) under each of eight letters placed 
at the top of the page. The test appears to 
measure the speed with which this operation is 
performed. 

The examination is easy to administer and 
not particularly difficult to score except that it 
has no separate answer sheet and, of course, is 
not machine scorable. There is some possibility 
of misinterpretation of directions in Test 7; it 
would have been better if the directions had 
stated that the arrows referred to the move- 
ment of the upper halves of the pulleys. 

Validity data on the test are conspicuous by 
their absence, Under the heading of “valida- 
tions” the authors report a correlation of .65 
with the Detroit Advanced Intelligence Test. 
Evidently the authors believe that this correla- 
tion indicates that the test is not measuring the 
same factors as those measured by intelligence 
tests. Actually this correlation seems quite 
high, indicating considerable overlap with in- 
telligence tests. Correlations with such criteria 
as shop grades are not reported in the test 
manual, 

Considerable space is devoted in the manual 
to the subject of age norms. These are provided 
for the subtests as well as for the test as a 
whole. Provision is also made for converting 
the scores for each half-year group, ages 10 to 
16, into letter ratings A, B, C+, C, C—, D, and 
E. Separate norms are not provided for girls 
and boys. 

The reliability coefficient for the total test, 
computed by the test-retest method for 259 
eighth grade pupils is 898. Reliability coefficients 
for the subtests range from .571 for Test 2 to 
875 for Test s. Intercorrelations are not re- 
ported except for Tests 1 and 6, the coefficient 
being .615. 

The authors state that the test should be of 
value in a school's program of guidance and 
counseling. Some subjective data are presented 
indicating the level of mechanical ability (and 
general intelligence) required for success in 
various occupations, Unfortunately the authors 
fail to present the necessary evidence to indi- 
cate that this test does measure the mechanical 
aptitude deemed necessary for successful par- 
ticipation in these occupations. Until data are 
presented showing that this test will predict 
success in shop courses or in mechanical occu- 
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pations it would appear that the test would 
prove of limited usefulness in guidance and 
counseling. 


For a review by Irving Lorge and an ex- 
cerpt from a review by J. Wayne Wrightstone, 
see 40:1656. 

[ 669 ] 
*Mechanical Aptitude Tests: Acorn National 
Aptitude Tests. Grades 9-16 and adults; 1943-45; 1 
form; $2.00 per 25; 10€ per manual; 50é per specimen 
set; 45(50) minutes; Andrew Kobal, J. Wayne 
irisbinons, and Karl R. Kunze; Acorn Publishing 


Reign H. Bittner, Director of Personnel Re- 
search, Owens-Illinois Glass Company, Toledo, 
Ohio. The test is designed as an indicator of 
aptitude for a mechanical trade for pupils in 
the seventh to twelfth grades of school. It is 
also intended for industrial use in the selection 
of apprentices, mechanic learners, and em- 
ployees in semiskilled mechanical occupations. It 
consists of four subtests as follows: (a) Com- 
prehension of Mechanical Tasks; (b) Use of 
Tools and Materials [Verbal] ; (c) Matching 
Tools and Operations; (d) Use of Tools and 
Materials [Nonverbal]. The first three subtests 
consist of verbal material; the last subtest is 
presented pictorially and, although named the 
same as Subtest 2, appears on inspection to 
measure a skill somewhat different from that 
tested in Subtest 2. 

A well-written manual of directions is pro- 
vided. Inspection of the manual indicates that 
the publisher has attempted to present in an 
unbiased manner all the known facts about the 
test. A class record sheet is also provided 
which includes directions for administration 
and scoring as well as a form for recording test 
scores. This record form is deficient in one re- 
spect; no space is provided for recording the 
percentile rank, only the raw score being pro- 
vided for. 

The test appears to be easy to administer. 
The format is generally satisfactory but could 
be improved. Answers are recorded by under- 
lining the alternative chosen whereas a simpler 
and more easily scored response could be used. 
Because of the format of Test 3, underlining 
should make this especially difficult to score. 
The format of Test 4 presents the directions 
after the sample question which may be con- 
fusing to the subject. 

The reliability of the test is reasonably satis- 
factory. An odd-even reliability coefficient of 


.83 is reported for the sum of the three verbal 
parts. A coefficient of .86 is reported for the 
grade 9 normative population, using the Kuder- 
Richardson Case IV method of computation. 
Presumably, this refers to the total score al- 
though it is not stated in the manual. It is the 
opinion of the reviewer that if the Kuder- 
Richardson Case II method were used to esti- 
mate the reliability a more satisfactory estimate 
would have resulted. 

Data on the validity of the test are meager. 
Only one study is reported. At the Colt Fire- 
arms Company, correlations between test scores 
and various criteria for young employees in 
machine shop, sheet metal, welding, and vari- 
ous semiskilled machine operations were as fol- 
lows: shop rating, .68; success in industrial 
training, .64; advancement record, .52. Details 
of the study are not available so that it can be 
properly evaluated, but these results look prom- 
ising. It is regrettable that the test was not 
more extensively validated before publication. 
Whether or not this test is a valid measure of 
mechanical aptitude must be determined on the 
basis of careful empirical studies. Without fur- 
ther evidence of this kind, it is impossible to 
judge whether the test is generally applicable 
as a measure of mechanical aptitude. 

Percentile norms are presented for separate 
school grades from seventh to twelfth and for 
mechanical and nonmechanical industrial em- 
ployees. Each school grade population included 
about 1,000 cases. The mechanical industrial 
group included about 400 applicants for jobs in 
machine shop, metal work, and carpentry, and 
the nonmechanical group was composed of 
about 350 employees in office work, business 
and other nonmechanical occupations. Separate 
norms should probably be given for males and 
females since there are usually considerable sex 
differences on this type of test. 

Directions for administration of the test do 
not appear complete enough to insure uniform 
testing procedure. In addition, the directions 
given in the manual differ from those printed 
on the class record form in important respects. 
Directions for scoring Part 2 of the test also 
differ in the manual and on the class record 
form: in one case all three required responses 
to an item must be correct before credit is 
given; in the other each required response 
marked in accordance with the key is consid- 
ered correct. 

An attempt has been made to simplify scor- 
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ing by using a scoring key word for each sub- 
test. Alternatives are lettered, and the alterna- 
tive which is identified by one of the letters in 
the key word is correct. This does away with 
scoring stencils, strip keys, etc. In the opinion 
of the reviewer this complicates rather than 
simplifies scoring since it makes the scoring 
operation a mental rather than a mechanical 
task. The scorer must remember the letters in 
the key word and compare mentally the letter 
of the response marked with the key letters 
to decide whether the item is marked correctly. 
It is believed that this will make scoring slow 
and subject to errors. Another criticism of this 
type of scoring key is that the key may easily 
be "stolen" and given wide circulation. Once 
the simple key word for any subtest is obtained 
by unauthorized persons the answers to the 
whole test are given away. Printing the key 
words on the class record form appears to be 
especially hazardous from this point of view. 


James M. Porter, Jr., Associate Professor of 
Psychology, Rensselaer Polytechnic Institute, 
Troy, New York. This test consists of four 
subtests: (a) Comprehension of Mechanical 
Tasks, 15 minutes, 30 mechanical problems 
which require thinking along practical lines; 
(b) Use of Tools and Materials [Verbal], 12 
minutes, 20 questions on the use of tools for 
stated jobs; (c) Matching Tools and Opera- 
tions, 8 minutes, 40 problems requiring judg- 
ment concerning tools and mechanical devices 
with relation to operations; (d) Use of Tools 
and Materials [Nonverbal], ro minutes, 20 
groups of five pictures in which the examinee 
is to select the item out of place in each group. 

Reliability and validity indices are reported 
only for the first three subtests. Standardization 
has been more adequate than is frequently the 
case. Study of the items leads one to doubt that 
the battery measures more than mere informa- 
tion since the testee is not required to demon- 
strate understanding of how mechanical de- 
vices work. 


Alec Rodger, Senior Psychologist to the Ad- 
miralty; on leave from the National Institute 
of Industrial Psychology; London, England. 
The most striking feature of this test is the 
unsatisfactory printing. The sans-serif type is 
oppressive to the eye and far more suitable for 
advertisement display than for a 45-minute ex- 
amination paper. Moreover, most pages are 


cramped (one very badly) and, in the review- 
er's copy at least, the inking has been unevenly 
done. These faults are all avoidable, and they 
are not altogether trivial, because they may pro- 
duce a strongly unfavorable impression on the 
potential purchaser. 

The underlying notion is to gauge “mechani- 
cal aptitude” by testing everyday mechanical 
knowledge. Recent research lends general sup- 
port to the view that this is a more profitable 
approach than one which makes use of tests 
involving such processes as tapping, tracing, 
and dotting, but where is the evidence that this 
particular test is valuable? It is certainly not 
offered in the manual of directions. The para- 
graph on validity is extremely inadequate. It 
gives three coefficients of correlation with 
vague criteria of occupational success, and one . 
coefficient of correlation with the results of 
scores in other (unspecified) tests of mechani- 
cal aptitude. The numbers involved are not 
mentioned, nor is there any satisfactory indi- 
cation of the nature of the jobs covered. 

The critical student of tests, looking into the 
credentials of the Acorn Mechanical Aptitude 
Test, will reach a conclusion he reaches all too 
frequently. He will say, This may be a good 
test, but nobody seems to have shown that it is. 


[670] 
*Mellenbruch Mechanical Aptitude Test for Men 
and Women. Grades 7-16 and adults; 1944; Forms A, 
B; $2.85 per 25; 50¢ per specimen set; 35(40) minutes ; 
P. L. Mellenbruch ; Science Research Associates. 


Lloyd G. Humphreys, Supervisor of the 
Testing Division, University of Washington, 
Seattle, Washington. The test consists of two 
forms of 84 items each. Items are arranged in 
7 sets of 12 pictures of common mechanical 
objects. For each set of 12 pictures, there is a 
corresponding group of pictures of 14 mechani- 
cal objects, of which 12 are related to the first 
group. The examinee's task is to make the 
correct matchings. 

Items were selected from an original list of 
425 paired photographs. The original group of 
items was presented to men and women in 
shops and in shop-training schools. Items were 
discarded primarily on the basis of the relation- 
ships between individual items and ratings ob- 
tained from the foreman or teacher. Items 
which differed markedly in difficulty for the 
two sexes were also discarded. The test shows 
only a 6-point difference between comparable 
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samples of men and women. Though a stand- 
ard deviation is not given, this difference is 
probably small for a test of this type. No data 
are presented, however, to support this pro- 
cedure ; i.e., it is possible that a mechanical test 
should give maximum sex differences. Validity 
may be reduced, for both men and women, at 
least for most occupations, by including femi- 
nine material. 

Reliability is highly satisfactory. The corre- 
lation between comparable forms, generally a 
conservative estimate, is .87 for 169 unselected 
cases. 

Norms are adequately varied, but the num- 
ber of cases seems to be small, particularly for 
the industrial groups. (I am reasoning from 
mean scores listed in one part of the manual 
with accompanying N's. The table of norms 
does not list the N of any group.) 

Validity data are sketchy. One should not be 
too critical of the present test, however, since 
itis probably no worse than average with re- 
spect to the relative lack of validity coefficients. 
The correlation between test scores and teach- 
er's rank in engineering drawing for 57 women 
is .59. The correlation with degree of past par- 
ticipation in 30 mechanical activities of 430 un- 
selected men and women is .60. While this lat- 
ter finding constitutes validity of a sort, it is 
not necessarily the same as predicting future 
success in mechanical occupations. The method 
of item selection indicates some degree of use- 
fulness for the tests, but no data are presented 
from which a validity coefficient could be esti- 
mated. 

Correlations presented indicate substantial re- 
lationships between this test and two other me- 
chanical tests—.so with the “Stenquist (rev.)" 
for 100 Air Force officers and .61 with the Air 
Force Mechanical Information Test, a verbal 
test of mechanical information, for 98 Air 
Force employees. Since the latter test was 
found to be valid for pilot selection, as well as 
for the more obviously mechanical occupations 
in the Air Corps, similar validities can be as- 
sumed for Mellenbruch's test. 

For two relatively unselected samples of 131 
and 127 cases, the correlations with “Army 
Alpha (Rev. and F.5)" are .33 and .29. The 
degree of relationship indicated by these data 
is gratifyingly low. The reviewer has the im- 
pression from taking the test, however, that 
this relationship could be appreciably reduced 
by appropriate jtem analysis. "Several: items 


seem to measure abstract reasoning rather than 
mechanical information. If the items were eor- 
related with the Army Alpha, a considerable 
spread of item coefficients would probably be 
found. Items with high values could then be 
discarded. It is possible that the poor reproduc- 
tion of many objects and the use of many obso- 
lete objects also contribute to the correlation 
with Army Alpha. Such factors, at any rate, 
probably contribute unwanted variance to the 
total test scores and should be corrected in a 
revision. 


C. A. Oakley, Lecturer in Industrial Psy- 
chology, Glasgow University, Glasgow, Scot- 
land. This test is of a type which has become 
quite well known in recent years. It is of a 
pictorial character and seeks to pick out me- 
chanically inclined people by finding whether 
they know what various mechanical or semi- 
mechanical things are used for. Drawings of 
these things, which are described in the manual 

s “common objects and devices,” are set out 
in two columns, the person doing the test hav- 
ing to state for each picture in the first column 
which picture in the second column “goes with 
it best.” Provision is made for 84 of these 
matchings. 

Criticism of the general character of tests of 
this kind is usually concerned with two aspects: 
(a) Is the general conception sound? and (b) 
Are the articles to be paired well selected? The 
answer on the first point is in the affirmative. 
Candidates for employment are given a better 
opportunity in this sort of group test than in 
most other tests for showing whether in their 
everyday life they are really interested in me- 
chanical things. In many branches of personnel 
selection and vocational guidance this is helpful 
information. 

The other point raises, however, more doubt. 
How well have the various articles been se- 
lected? It is scarcely possible for someone from 
Great Britain to be specific in expressing an 
opinion, for some of the objects and devices are 
of American usage. Nevertheless, the selection 
does seem rather odd, for it is difficult to detect 
much consistency in the choices, articles of real 
mechanical interest appearing in the columns 
besides nonmechanical things such as jewelled 
rings and shoes. Undoubtedly something of in- 
terest has been learned when it is discovered 
that a candidate can differentiate, for instance, 
between a badminton and a tennis racket. But 
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does it have much bearing on his mechanical 
inclination? 

Although the manual does not report that 
the test has been tried out on large groups, 
some of the conclusions are interesting—par- 
ticularly the low but positive correlation found 
with intelligence. While intelligence is unques- 
tionably called for in doing the test, Mellen- 
bruch seems to have been reasonably successful 
in keeping it to relatively small proportions. 


[ 671 ] 


Minnesota Assembly Test. Ages 11 and over ; 1930; 
a revision of Stenquist Mechanical Assembling Test by 
John L. Stenquist; sometimes called Minnesota Me- 
chanical Assembly Test; regular and abridged forms; 
$35.50 per set of testing materials ; $29 per set of test- 
ing materials for abridged form; revision by Donald 
G. Paterson, Richard M. Elliott, L. Dewey Anderson, 
Herbert A. Toops, and Edna Heidbreder ; C. H. Stoelt- 
ing Co. 


REFERENCES 
. 1. Anperson, L. Dewey. “The Minnesota Mechanical Ability 
Tests." Personnel J 6:473-8 Ap '28. * (PA 2:2642) 
_ 2, Riley, G. "Stanford Binet ‘Indicators’ of Mechanical Abil- 
ity." Psychol Clinic 18:128-32 My-Je '29. * PA TECH) 

3. Paterson, Donar G.; ELLIOTT, KicHARD M.; ANDERSON, 
L. Dewey; Toors, Hersert A.; AND HEIDBREDER, Epa. Minne- 
sota Mechanical Ability Tests. Minneapolis, Minn.: University 
of Minnesota Press, 1930. Pp. xxii, 586. $5.00. * (PA 5:1268) 

4. Harvey, O. L. “Mechanical ‘Aptitude’ or Mechanical 
‘Ability’?—A Study in Method.” J Ed Psychol 22:517-22 O 
"31. * (PA 6:910) d 

5. FRANDSEN, Arpen N. “Mechanical Ability of Morons,” 
7 Appl Psychol 193778 Ag 3t. * (PA 10:1000) à 

6. BiNGHAM, WaLrER Van Dyke. Aptitudes and Aptitude 
Testing, pp. 294-309. New York and London: Harper & Broth- 
ers, 1937. Pp. ix, 390. $3.00; 12s, 6d. * (PA 11:2885) 

7. Stanton, Mitprev D. Mechanical Ability of Deaf Chil. 
dren. Columbia University, Teachers College, Contributions tc 
Education, No. rats Rudolf Pintner, faculty sponsor. New 
York: Bureau of Publications, the College, 1938. Pp. ix, 65. 
$1.60. * (PA 13:144) 2 

8. HanRELL, WiLLARD. “A Factor Analysis of Mechanical 
aniy Tests.” Psychometrika 5:17-33 Mr '4o. * (PA 14:4285) 

9. Brusn, Epwarp N. “Mechanical Ability as a Factor in 
tngneenng andes J Appl Psychol 25:300-12 Je '4r. * 

1514377 : 

1o. BENNETT, Grorce K., anp Crurksmaxk, Rum M. A 
Summary of Manual and Mechanical Ability Tests, Prelimi- 
nary Form, pp. 43-4. New York: Psychological Corporation, 
1942. Tp: v, 80. Paper, lithotyped. $0.70. * (PA CHI 
“tr, Mykrenuss, Hermer R. "A Study of the Usefulness of 
Objective Measures of Mechanical Aptitude in Guidance Pro- 
grams for the Hypacousic.” Am Ann Deaf 91:123-50, 205-25 
Mr, My '46. * (PA 20:3901) 


William R. Grove, Psychologist and Director 
of Child Study Service, Phoenix Elementary 
Schools, Phoenix, Arizona; formerly Professor 
of Psychology and. Director of the Division of 
Psychological Services, The University of Pitts- 
burgh. This test is a revision of the Stenquist 
Mechanical Assembling Test, which it was de- 
signed to supersede. Most of what has been said 
in reviewing the Stenquist test is equally ap- 
plicable to this one, and the reader is referred to 
that review (see 679). 

The Minnesota revision of the Stenquist test 
increased the reliability of the latter from a 
raised odd-even coefficient of .72 to one of .94 
in a population of junior high school boys. This 


increase was accomplished by lengthening the 
test. While the Stenquist test had consisted of 
IO mechanical models in one box, the Minne- 
sota revision consists of 33 models in three 
boxes. The Stenquist series could be adminis- 
tered in 30 minutes, but the revision requires 
about twice that long. 

The basic validity of the method incorporated 
in this test for measuring the mechanical ability 
of junior high school boys was first established 
by Stenquist and fully substantiated by the 
Minnesota research. The principal advantage 
of the Minnesota test over its predecessor is the 
extensive research program of which this test 
was an important part. Not only did this re- 
search provide in this test an instrument of 
more adequate reliability and carefully tested 
validity, but it also provided data on other 
tests of mechanical ability validated and stand- 
ardized on the same junior high school popula- 
tion. It was found possible by statistically com- 
bining this test with several others to predict 
a carefully constructed criterion of mechanical 
ability to the extent represented by a correla- 
tion coefficient of .65. This is a notable accom- 
plishment when we consider the relative unique- 
ness of mechanical ability as contrasted with 
academic ability. 

This test is also available in a short form 
which consists of twenty of the mechanical 
models of the long form assembled in two boxes. 
The short form requires about forty minutes 
to administer. Its raised split-half reliability 
coefficient is only .77, which is not too satis- 
factory. Because of this fact, individual scores 
obtained from the short form must be inter- 
preted with extreme caution. 

Research with this test has indicated that it 
is not a particularly suitable test for use with 
girls or women. It is notably less reliable when 
used with women and probably also less valid. 
Girls and women on the average obtain much 
poorer scores on this test than do boys and 
men. 

Although adult male norms have been pro- 
vided, the test was found to be somewhat too 
easy for use with mechanically inclined men. 
Evidence of the validitv of the test for measur- 
ing an aspect of mechanical ability was obtained 
in the form of differential occupational norms. 
However, the test is much less reliable when 
applied to men than it is with junior high school 


boys. 
All the weaknesses noted in discussing the 
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Stenquist Mechanical Assembling Test apply 
equally to this one. In addition, another im- 
portant practical weakness should also be 
mentioned. Tt is the difficulty of keeping the mate- 
rials in good working order. Parts of the sev- 
eral mechanical models tend to become lost or 
mixed up. Some parts are prone to become 
broken or sprung with use. Models which the 
test authors bought for a few cents tend to be- 
come quite expensive when ordered from a test 
publisher. Nor is the expense limited to the 
actual cost of the replacements, but it includes 
the time and bother of a trained person in see- 
ing that the replacement needed is properly 
ordered. 

Part of these criticisms could have been ob- 
viated if the test authors had provided more 
adequate descriptions and fuller specifications 
for the exact mechanical models used, together 
with complete pictures of each of them assem- 
bled and disassembled. It would have helped 
if some method of substituting alternates for 
outmoded models could have been provided. 
The test in its present form is already becom- 
ing somewhat out of date. 


[ 672 ] 
O'Rourke Mechanical Aptitude Test, Junior Grade. 
Grades 7-12; 1926-40; Forms A, B, C; $3.25 per 5o, 
postpaid ; 25€ per specimen set, postpaid; 55(65) min- 
utes; L. J. O’Rourke; Psychological Institute. 


REFERENCES 

I. ScuppzR, CHARLES ROLAND, AND RAUBENHEIMER, A. S. 
“Are Standardized Mechanical Aptitude Tests Valid?" J Juve. 
nile Res pear Ap '30. * (PA 5:897) 

2. Bincam, WaLrER VAN Dyke. Aptitudes and Aptitude 
Testing, pp. 318-20. New York and London: Harper & Brothers, 
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3. BENNETT, GEORGE K., AND CruiksHank, Ruru M, A Sum- 
mary of Manual and Mechanical Ability Tests, Preliminary 
Form, p pst. New York: Psychological Corporation, 1942. 
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Minnesota EU Form Board Test and the O'Rourke Mechani- 
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2 
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Trainees.” J Appl Psychol 27:399-406 O ’43. * (PA 18:265) 
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the Use of Meci 1 Aptitude Tests in the Selection of 
Trainees for Mechanical Occupations.” Ed & Psychol Meas 
tor au '44. * (PA 19:1776, 

7. TUCKMAN, J^con. “The Correlations Between ‘Mechanical 
A Vbi and Cra Se Sera: Further 

ervations." Occupations 22:244-5 Ja '44. 18:1914) 

8. Santatn, A. O. “The Use of Certain Standardized Peay 
in the Selection of Insnectors in an Aircraft Factory." J Con- 
sult Psychol 9:234-7 S-O "4s. * (PA 20:1241) 


Jay L. Otis, Director, Personnel Research 
Institute and Professor of Psychology, Western 
Reserve University, Cleveland, Ohio. These 
tests were reviewed by Herbert A. Landry in 
The 1940 Mental Measurements Yearbook 
(see 40:1668) and the present writer finds him- 
self substantially in agreement with Mr. Lan- 
dry's excellent review. Apparently one form 


(Form A) has been revised, since it now bears 
a 1940 copyright date. The forms are essen- 
tially tests of mechanical information; and, as 
Mr. Landry pointed out, they measure only 
one aspect of mechanical “aptitude.” Although 
names may not be too important, it would help 
considerably if these tests were called mechani- 
cal “information” tests. This would help to 
minimize any possible misinterpretation on the 
part of persons with limited training in psy- 
chometrics, 

General norms based on 9,000 out-of-school 
males, ages 15 to 24, are now presented with 
Form A. Presumably, these norms also hold 
for Forms B and C. Of this group, 41 per cent 
completed one year of high school, 31 per cent 
completed only elementary school, and 28 per 
cent had not completed elementary school. A 
set of norms based on 70,000 “workmen who 
applied for mechanical jobs” is presented for all 
three forms; and, in addition, average scores, 
standard deviations, and the range for the mid- 
dle two-thirds of the scores are presented for 
33 mechanical occupation groups. 

Validity coefficients ranging from .64 to .84 
are reported. These represent correlations be- 
tween test scores and ratings in vocational 
training and shop courses. 

It is unfortunate that Mr. Landry’s construc- 
tive criticisms made in 1940 have not been 
heeded. What appears to be a good mechanical 
information test of considerable usefulness in 
the vocational guidance or industrial situation 
still contains illustrations which are much too 
small and sometimes confusing. Also, the relia- 
bility of the test is still unknown, except by 
those users who have conducted their own reli- 
ability studies, 


George A. Satter, Assistant Professor of Psy- 
chology, University of Michigan, Ann Arbor, 
Michigan. Each of the three forms of the 
O’Rourke Mechanical Aptitude Test consists 
of two parts. Part I requires that line drawings 
of tools, parts, and materials (a nut, a threaded 
section of pipe, a cotter key, etc.) be matched 
with similar drawings of other tools, parts, and 
materials with which they are commonly used; 
in a similar fashion, in this part of the test, 
statements of function are to be matched with 
drawings of the tools and materials used in the 
previous matching exercise. In each of the 
forms, 102 matchings of the above variety sam- 
ple those basic knowledges of the carpenter, 


— E 


—— ———————— 


707 ] TESTS & REVIEWS: VOCATIONS—MECHANICAL ABILITY 672 


the electrician, the plumber, the auto mechanic, 
and the machinist which the individual with the 
requisite interest and aptitudes might have ac- 
quired incidentally in making household re- 
pairs, in tinkering with his car, and in observ- 
ing the skilled craftsman at work. 

Part II of each of the forms consists of 60 
five-alternative, multiple-choice items presuma- 
bly designed to sample all areas of applied 
mechanical knowledge from carpentry and 
plumbing to auto mechanics and machine shop 
practice. 

Uniformly, the items present the appearance 
of having been carefully constructed and of 
having a high degree of face validity. Several 
of the items in each of the forms (the old fash- 
ioned, pear-shaped incandescent bulb and the 
coil box and timer of the Model T, for exam- 
ple) definitely label the tests as products of the 
twenties; the content of these items does not 
appear to be appropriate for a test which pre- 
sumably measures mechanical aptitudes by us- 
ing the indirect device of measuring incidental 
knowledges. 

The forms are printed in six-page booklets 
which provide, along with the conventional 
spaces for personal data, directions to the testee, 
and sample exercises, items of less commonly 
found subject matter— Instructions to the Ex- 
aminer, a list of the “O’Rourke Series of Aids 
in Placement and Guidance," directions for 
scoring, norms, and certain data descriptive of 
the validity of the test. Thus, while the layout 
is in general good, the inclusion of subject mat- 
ter, more appropriate for an accompanying 
manual, crowds the test materials and demands 
that the drawings be unnecessarily small and 
that special directions be given the testee con- 
cerning the irrelevant sections which appear on 
the first and last pages of the test booklet. 

There are situations (e.g., frequently in in- 
dustrial selection) where individuals must be 
enticed into taking tests. Face validity helps in 
many cases to overcome this initial prejudice; 
so does the appearance of the test booklet. The 
O'Rourke tests “score high" on the first count ; 
“low,” on the second. 

The items of the two parts of the test are 
scored independently on a unit basis; no correc- 
tion is made in scores for chance successes. 
Before they are summated with those of Part 
I, scores of Part II are multiplied by four. 
Whether this procedure is followed in order to 
equalize the dispersions on the two parts, to 


maximize the predictive efficiency of the parts, 
or for both purposes is not explained in the 
manual. - 

Percentile norms, based on the test perform- 
ance of 70,000 men who applied for mechanical 
jobs and the average scores of workmen in 33 
occupational groups, are provided. The useful- 
ness of norms of this sort, without accompany- 
ing data descriptive of the groups from which 
they were compiled, is obviously quite definitely 
limited. 

Materials which are ordinarily the subject 
matter of an accompanying manual are to be 
found in several places: (a) "Instructions to 
the Examiner" in administering the test ap- 
pear on the first page of the test booklets; 
(b) "Directions for Scoring," on the final page; 
and (c) the scoring key, norms, and data de- 
scriptive of the test's validity appear on a sepa- 
rate sheet. 

Other materials which the present-day test 
user has come to expect in manuals are con- 
spicuous by their absence. No statement is 
made concerning the rationale underlying the 
construction of the test, the types of groups 
the test is intended for, and the types of jobs 
it can do. No word is said concerning the pro- 
cedures used in standardization or in item selec- 
tion. The internal characteristics of the test can 
only be surmised from the none-too-detailed 
statements of validity. 

Tn the reviewer's opinion lack of organization 
in the manual materials and the apparent neg- 
lect in presenting data, which test users have. 
come to demand, may discourage some who 
initially are attracted by the content of the test. 

The test author reports that “Correlations 
. . . between test scores and shop ratings by 
different investigators vary considerably . . . 
Correlations reported between test scores and 
ratings in vocational training courses are as 
high as .84; between test scores and ratings as 
machinist apprentices .64; between test scores 
and ratings in school vocational classes .83.” 
No information is given about the size or com- 
position of the groups providing these coeffi- 
cients. 

The average scores reported for various oc- 
cupational groups may be used to support the 
contention that the test can make valid dis- 
criminations. It is apparent from the statistics 
provided that the test does a reasonably good 
job of discriminating between apprentices and 
journeymen and between the highly skilled 


S o 


‘THE THIRD MENTAL MEASUREMENTS YEARBOOK [ 708 


craftsman (machinist, electrician, millwright, 
mechanic) and the members of -oceupational 
groups which are typically less highly skilled 
(watchmen, timber cutters, plasterers, painters, 
etc.) The reviewer would doubt, however, 
whether the ability to make discriminations of 
this sort is of practical usefulness in the selec- 
tion situation; standardized oral trade ques- 
tions are unquestionably more appropriate. 

Prospective users of the test may be a bit 
disturbed by the lack of statistics descriptive of 
thes internal characteristics of the test and of 
the dimensions of mechanical aptitude which 
this test measures. 

It is the reviewer’s opinion that the O’Rourke 
Mechanical Aptitude Test will be found to be 
most useful as part of a vocational or educa- 
tional guidance battery and in selecting candi- 
dates for apprenticeship training courses. Its 
length (the administration time is roughly one 
hour) may limit its usefulness in many indus- 
trial employment offices. 


For a review by Herbert A. Landry, see 
40: 1668. 
[ 673 ] 
Perceptual Mechanics Test. Ages 16and over ; 1938; 
$3 per introductory set including 10 tests, postpaid; sé 
E extra test, postpaid ; introductory sets may be or- 
ered on approval; (10) minutes; Eugene J. Benge; 
Management Service Co. 

Charles M. Harsh, Associate Professor of 
Psychology, The University of Nebraska, Lin- 
coln, Nebraska. This test has incomplete shaded 

` drawings of 50 mechanical devices—auto sup- 
~ plies, builder's and carpenter's tools, electrical 
household equipment, office equipment, and 
plumber's tools, Performance involves recog- 
nizing an incomplete object in a group of five 
or ten and finding its name in a somewhat 
longer list of objects on the next page. This 
applies the clever idea that speed of recognition 
is related to familiarity with objects. The five- 
minute time limit supposedly reveals this speed 
of recognition, but the rather awkward Scoring 
System puts a premium on speed of finding the 
appropriate name in a long list. Thus a clerical 
ability factor may seriously mask the mechani- 
cal familiarity factor, especially for persons 
whose mechanical ability is greater than their 
verbal or “school” ability. The manual suggests 
that the vocational counselor will note the sub- 
ject’s familiarity with the six different types of 
devices, but norms are available only for the 
total score on all devices, 


‘The "Management Service Company offers 


this as one of a battery of eleven sélection tests. 
The advertisements (and the test directions) 
suggest that it measures mechanical aptitude for 
shop training. Test scores are reported to cor- 
relate .68 with a “mechanical assembly test" 
(assembly test and population unspecified). In 
view of the shortcomings of assembly tests, 
this is weak evidence that the test measures 
mechanical aptitude. Familiarity with ‘these 
common devices may possibly be related to 
mechanical aptitude, but there is need for more 
convincing evidence that persons who score 
high in the Perceptual Mechanics Test turnout 
better in shop work than those who score lower. 
If such evidence could be provided, the brevity 
of the test would make it most attractive for 
screening purposes. s 


[ 674 ] 
XPrognostic Test of Mechanical Abilities. Grades 
7-12 and adults; 1947; IBM; Form A; $2.50 per 25; 
25 per specimen set, postpaid; 7¢ per copy; machine- 
scoring edition; 2¢ per machine-scorable answer sheet ; 
38(45) minutes; J. Wayne Wrightstone and Charles F. 
O'Toole; California Test Bureau. 


J Consult Psychol 11:224-5 Jl-Ag "47. This 
test, based on an analysis of the requirements 


of courses of study in mechanical trades, con- 


sists of five parts: arithmetic computation, 
reading drawings and blueprints, identification 
and use of tools, spatial relationships, and check- 
ing measurements. Total score reliabilities range 
from .89 to .92 for single-grade ranges; subtest 
reliabilities are not given. Total scores corre- 
late .78 with systematic instructors" ratings in 
a course in aviation trades. Norms, based on 
5,268 boys in grades 7 to 12, are given in terms 
of subtest and total score percentiles for each 
grade. Some- diagnostic implications for guid- 
ance are drawn from the profile of subtest 
Scores. 
[675 ] 

XPurdue Industrial Training Classification Test. 
Grades 9-12 and adults; 1042; preliminary manual; 
Forms A, B; $1.65 per 25; so¢ per specimen set ; 35 (40) 


minutes; C. H. Lawshe and A. C. Moutoux ; Science 
Research Associates. 


REFERENCES 
1. Lawsuz, C. H., Jm. “The Purdue Indüstriai Training 
gunian Test.” J Appl Psychol 26:770-6 D '42. * (PA 17: 
245 
2. Lawsur, C. H., Ta AND THORNTON, G. R. "A Test Bat- 
tery, for Identifying Potentially Successful Naval Electrical 
Trainees.” J Appl Psychol 27:399-406 O '43. (PA 18:265) 


D. Welty Lefever, Professor of Education, : 


The University of Southern California, Los 
Angeles, California. This test was developed to 
be used as an aid in the selection and classifi- 
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cation of persons for admission’ to industrial 
training programs. Although originally planned | 
for applicants conipleting the sophomore level 
of high school, further experinientation has in- 
dicated some. predictive value in the selection 
of adults for industrial training on the pre- 
employment level. 

The scope of the test is by no means as broad 
as its title might suggest. The test items consist 
almost entirely of elementary arithmetic exer- 
cises functioning in a setting supplied by a 
series of working drawings of simple objects. 
The ability to read dimensional data from such 
drawings is required. The basic computational 
exercises: appear to have considerably greater 
face validity than they would possess in the usual 
type of arithmetic test. The use of a pictorial 
setting: seems to represent a distinct advantage 
for the” test but its fundamental predictive 
power probably lies in the close correlation be- 
tween arithmetic skill (along with the pre- 
requisite intelligence) and the ability to handle 
technical problems successfully. 

One of the criteria employed in the construc- 
tion-of items is somewhat misleading: “Each 
item shall be of an appropriate level of diffi- 
culty to sample adequately the mechanical 
knowledge of individuals composing the group 
with which the tests will be used." The ability 
areas covered by items involving simple meas- 


' ürement, reading a drawing, and the fundamen- 


tal operations of computation limited to whole 
numbers, fractions, and decimals would by no 


.. means measure adequately the broad field. of 
~ "mechanical knowledge." 


As a predictive device for selecting candi- 
dates for shop training, the validity of the test 
seems to be rather clearly indicated by the 
studies reported in the preliminary edition of 
the test manual. As early as possible, follow-up 


"studies should be made in which the validation 


criterion will be actual trade success. 
Charles I. Mosier, Director of Research, Per- 
sonnel Research Section, The Adjutant Gen- 


 eral's Office, Washington, D. C. This test is 


designed for use as one component of a battery 
to select students for industrial training courses 
involving the use of shop mathematics. It is 


- intended that it be supplemented with measures 


of other essential functions. 

The. 14 arithmetic. operations sampled (out 
-of an original list of 26) are listed in the man- 
ual. The: hems. are pictorially presented with a 


minimum of yerbal directions and in simple 
words. In fact, the lack of genetal "directions 
may adversely affect scores ‘of individuals who 
are not sophisticated i in taking objective tests. 
The problems are in groups, each based upon a 
diagram. Interdependent items have been suc- 
cessfully avoided. Items in the final forms were 
selected on the basis of difficulty and item reli- 
ability. Reliability is satisfactory and relatively 
high for so few items. The comparability of the 
two forms is close, with means on the parallel 
forms less than one point apart. The number 
of items in each form is very small against 
usual standards, so that only a few discrimina- 
tions are possible within the range. If the test 
is used only to determine admission to or ex- 
clusion from a training course, this is not a 
handicap. 

The data reported in the manual show ‘statis- 
tically significant differences between the highest 
and lowest rated quarters of groups of students ~ 
in industrial training classes. No relationship to 
actual probability of success or failure is given, 
nor are critical scores indicated. 

Norms are given for six groups, varying 
widely in composition and ability. The test is 
relatively easy and affords little discrimination 
for the upper quarter of such groups as high 
school sophomores. 

Although the test has been in use since 1942, - 
the manual, still in preliminary form, has ap- 
parently not been revised since 1944. Only two 
validity studies are summarized in the manual, 
both using instructors ratings and contrasting 
extreme groups. No references to technical ar- 
ticles are given in the manual. 

The test is useful as a measure of mastery of * 
the elementary arithmetic operations, in spatial 
rather than verbal content, for use as a partial 
predictor of success in training courses involv- 
ing such mastery. Its value for either purpose, 
particularly prediction of job success, is un- 
known to the reviewer. 


[ 676 ] 
*Purdue Mechanical Adaptability Test. AS 15 
and over; 1945-46; Forms A (for men), T (for 
women) ; $2.50 per 25; 256 per specimen set, postpaid; 
15(20) minutes; C. H. Lawshe, Jr., and Joseph Tiffin; 
Division of Applied Psychology, Purdue University. 
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Reserve University, Cleveland, Ohio. This test 


is offered as an aid in selecting men for jobs or - 


training programs calling for mechanical inter- 
ests and abilities. It is not described as an apti- 
tude test but simply as a measure of “experien- 
tial background in mechanical, electrical, and 
related activities." The test consists of 60 rela- 
tively simple information items in the electrical, 
carpentry, machine-shop, and auto mechanics 
areas. It is essentially a true-false examination 
with provision for a "don't know" response to 
each item. The test is easy to administer and to 
score. The format and style of type used are 
excellent. 

The authors report a correlation with intelli- 
gence of .33 with a group of 600 industrial 
applicants, and they feel that this is low enough 
to justify the conclusion that the test is suffi- 
ciently unrelated to intelligence. The reliability 
(stepped up from odd-even correlation) is re- 
ported as .84. 

Four validity studies based upon the follow- 
ing are reported in the manual: 14 ice-company 
mechanics, 12 machine-shop apprentices, 6 time- 
study men in a musical instrument manufac- 
turing plant, 46 machine operators in a screw 

* manufacturing company. For the ice company 
mechanics, the rank-order correlation between 
supervisor's ratings and test scores was .81; 
for the machine-shop aprentices it was .48; and 
for the time-study men it was .83. A validity 
coefficient is not reported for the machine- 
operator group. However, the authors present 
a graph showing the per cent of machine oper- 
ators rated "high" (on a standard merit rating 
System) when successively higher critical test 
“scores are applied. 

The norms issued with the test are based on 
667 industrial applicants. 

It is pleasing to note that this test is appar- 
ently the result of a rather lengthy validation 
study and a thorough item analysis. Originally, 

. 400 items were administered in a series of ex- 
perimental forms to high school and trade 
school students, and 60 items were finally se- 
lected which comprise the present form of the 
test. Another worth-while feature is the test 
manual, which, although still in "preliminary" 
form, presents complete information and refer- 
ences on reliability, validity, and derivation of 
the scoring method. One might quibble over 
the number of cases upon which a validity coeffi- 
cient is based or the magnitude of a reliability 
coefficient, but at any rate, the authors do pre- 


sent this essential information forthrightly, so 
that those considering the test for use in the 
industrial or guidance situations can make a 
fairly accurate evaluation before purchasing in 
quantity. 


Dewey B. Stuit, Professor of Psychology 
and Dean, Student Personnel Services, The 
State University of Iowa, Iowa City, Iowa. 
This test is designed to aid in identifying indi- 
viduals who are mechanically inclined and who, 
therefore, are most likely to succeed in jobs or 
training programs requiring mechanical abili- 
ties and interests. The test is essentially a meas- 
ure of the individual’s experiential background 
in the mechanical and electrical fields and is 
based on the assumption that those persons 
who have been most observant of and who have 
profited most from past experience in these 
fields will make the best progress in mechanical 
jobs or training programs. 

The sixty items comprising Form A were 
selected from 400 original items constructed 
to sample the general fields of mechanical and 
electrical activities. These items were arranged 
in four experimental forms and administered 
to students in grades 10, 11, and 12 in two 
comprehensive high schools. A test of general 
mental ability, the Adaptability Test, was ad- 
ministered to the same population. Items show- 
ing a high correlation with general mental abil- 
ity were immediately eliminated. The roo items 
which discriminated best between the upper 30 
and lower 30 per cent (after eliminating items 
correlating highly with general mental ability) 
were retained as Form R. This form was ad- 
ministered to 250 high school students in grades 
IO, 11, and 12 and minor adjustments made in 
certain items. Experimental Form S (a slight 
modification of R) was then administered to a 
population of 462 men in industrial situations. 
The Adaptability Test was also administered to 
these men. Items showing high correlations 
with general mental ability were eliminated, 
and the 60 items discriminating best between 
the upper and lower 30 per cent of the experi- 


mental population were retained to comprise | 


Form A. 

The responses to the items are recorded in 
the test booklet in boxes labeled Yes, No, Don’t 
Know. Scoring is done by means of a stencil. 
The raw score in the test is based upon a modifi- 
cation of the standard correction for guessing 
(R-W) formula, where two choices exist. Raw 
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scores are converted to percentiles by means of 
a table of norms based upon the performance 
of 667 men in industrial situations. 

The validity of the test has been studied in 
a variety of industrial firms. Results are re- 
ported in terms of coefficients of correlation 
(rank order) or in terms of percentages of high 
and low men (in the test) rated high or low in 
performance by foremen or supervisors. In gen- 
eral there appears to be a substantial relation- 
ship between scores in the test and success on 
the job. The chief criticism to be made of these 
studies is that they involved fairly small groups, 
three numbering only 6, 12, and 14 men. The 
largest group was made up of 46 men. 

The reliability of the test, computed by the 
split-half method and corrected by the Spear- 
man-Brown formula, is .84; the population 
used consisted of 487 men, all applicants for 
industrial jobs. Correlations with mental ability 
tests are quite low, e.g., .32 with the Adapta- 
bility Test for the group on which the coeffi- 
cient of reliability was computed. For a group 
of 173 college men, using the same two tests, 
the correlation was .17. Correlations with the 
California Capacity Questionnaire, the Otis 
Self-Administering Test of Mental Ability, 
Higher Examination, and the Minnesota Paper 
Formboard were found to be low. The highest 
correlation was that with the Bennett Test of 
Mechanical Comprehension, .71. 

The authors appear to have developed a test 
which is useful in the selection of men for 
skilled or semiskilled jobs or for training pro- 
grams leading to such jobs. The test seems to 
measure factors largely uncorrelated with gen- 
eral mental ability. The selection of items was 
carefully done although this reviewer did find 
seven items containing so-called “specific de- 
terminers” such as all, any, most, and always. 
In each case the correct response could be de- 
termined on the basis of the “specific deter- 
miner” alone, Little evidence is presented to 
indicate that the test will serve a useful purpose 
in guidance programs ; until more data are avail- 
able it would seem best to recommend the use 
of the test in experimental selection programs. 

In addition to Form A (for men) there is 
available Form T (for women). This test resem- 
bles Form A but contains 100 items, several of 
them being identical with those in Form A. Data 
concerning the development, validity, and stand- 
ardization of Form T, comparable to those for 
Form A, are not available. 


[677 ] 
Revised Minnesota Paper Formboard Test. Ages 
9 and over; 1934-41; Forms A, B; $1.35 per 25; 
50¢ per specimen set; 20(25) minutes; Rensis Likert 
and William H. Qüasha; Psychological Corporation. 
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Dewey B. Stuit, Professor of Psychology and 
Dean, Student Personnel Services, The State 
University of Iowa, Iowa City, Iowa. The pres- 
ent revision was first published in 1934 and is 
a modification of the paper formboard test pre- 
pared by D. G. Paterson, R. M. Elliott, L. D. 
Anderson, H. A. Toops, and E. Heidbreder at 
the University of Minnesota in 1930. 

The test consists of 64 items of the multiple- 


choice type which measure the examinee’s abil- 
ity to think spatially in two dimensions. Each 
item first presents the parts of a geometrical fig- 
ure, cut up in various ways, followed by five 
“assembled” geometrical figures. The examinee 
is required to select the figure which represents 
the correct combination of the parts. Responses 
are recorded at the top of the page, just above 
the number of each item. The test is easy to 
administer and score; the directions to exami- 
nees are complete; and eight preliminary exer- 
cises give the examinee a good familiarity with 
the nature of the test exercises. 

One of the most recent and welcome features 
of the test is the machine-scored answer sheet. 
Many users who had previously devised their 
own answer sheets will now be able to order 
them directly from the publisher. Norms -for 
the machiné-scored edition are presently lim- 
ited to those based on a population of 548 white 
enlisted men in the U. S. Army. Since the 
machine-scored „edition will be widely used, 
more adequate norms should soon be available. 

The reliability of a single form of the test 
is .85. When both forms are administered, the 
reliability is .92. The manual does not indicate 
the nature of the group to which the test was 
administered to obtain these coefficients. 

The authors indicate that the test is predic- 
tive of (a) ability to master descriptive geom- 
etry and mechanical drawing, (b) success in 
mechanical occupations, and (c) success in 
engineering courses. Various studies are re- 
ported in the literature indicating satisfactory 
experience with the test in the prediction of 
success in these three areas. Estes (29) reports 
a correlation of .27 with grades in descriptive 
geometry and .31 with instructor’s ratings in 
that subject. Shuman (50) reports an average 
correlation of .44 with success in various skilled 
and semiskilled occupations. Ghiselli (30) re- 
ports a correlation of .57 between test scores 
and success of inspector-packers. A correlation 
of .49 between test scores and success in me- 
chanical drawing is reported in the manual. It 
should also be stated that other investigators 
have found the test of less value as a predictive 
index (Barrett, 48; Morgan, 42). 

Several studies have been made showing that 
scores in the test are highly correlated neither 
with intelligence test scores nor with other 
mechanical aptitude tests (Bennett and Cruik- 
shank, 26; Morgan, 42; Traxler, 40; Tuck- 
man, 47). Tinker (46) suggests that speed, 
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level, and power scores be obtained for the test 
and their validity determined. These studies 
emphasize that the Revised Minnesota Paper 
Formboard should not be regarded as a com- 
plete index of mechanical promise but rather as 
measuring one component of mechanical and 
engineering aptitude. Used in that manner the 
test can serve a useful purpose in both guidance 
and personnel work. As for most other tests, 
the usefulness of this instrument will be greatly 
enhanced by the publication of more validity 
coefficients. 

'The test is presently published and distrib- 
uted by Science Research Associates; it con- 
tinues to be distributed by the Psychological 
Corporation. The manuals provided by these 
two organizations are practically identical ex- 
cept that the one published by Science Research 
Associates contains a table describing the major 
types and the conditions of mechanical work 
including ability requirements (academic, me- 
chanical, social, clerical), wages, number em- 
ployed (men and women), largest age group, 
outlook for youth, and educational or training 
requirements. Both manuals contain norms for 
various age, educational and occupational 
groups. Supplementary norms have been pub- 
lished by Baldwin (41) and Hanman (31). 


For a review by Alec Rodger, see 40:1673. 
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*SRA Mechanical Aptitudes. High school and 
adults; 1947; Forms AH, BH; separate answer pads 
must be used; 43¢ per test booklet with answer pad ; 
$1.65 per 25 answer pads ; 45¢ per 25 profile sheets ; 75¢ 
per specimen set; 35(45) minutes; prepared by Rich- 
ardson, Bellows, Henry and Co., Inc. ; Science Research 
Associates. 
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Stenquist Mechanical Aptitude Tests. Grades 6-12; 
1921-22; 2 parts; $1.60 per 25 of either part; 35¢ per 
specimen set, postpaid; John L. Stenquist; World 
Book Co. " 
a) TEST 1. 45(50) minutes. 
b) rest 2. 50(60) minutes. 
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James M. Porter, Jr., Associate Professor of 
Psychology, Rensselaer Polytechnic Institute, 
Troy, New York. Test 1 consists of 95 prob- 
lem pictures, relating to common mechanical 
objects which require the matching of related 
parts. Test 2 consists in part of material similar 
to that employed in Test 1 and in part of cuts 
of machines and machine parts about which the 
testee has to answer questions of a general 
nature—no first-hand experience with the parts 
pictured is presupposed. 

The tests do not measure manipulative abili- 
ties, but performance on Test 1 shows a median 
correlation of .69 with performance on the 
author’s assembling tests; performance on Test 
2 shows a median correlation of .66 with per- 
formance on the same assembling test. 

A median correlation of .67 is reported be- 
tween shop and science teachers’ ranking for 
“general mechanical aptitude” and performance 
on Tests 1 and 2. Correlations with measures 
of general intelligence are reported as ranging 
between .2 and .4. Test 1 has been reported 
to correlate .146 with engineering grades, and 
Test 2 .428 with the same criterion. 

Norms are presented in T scores and percen- 
tile ranks for 4,099 males (age 10-18) in the 
case of Test 1, and for 1,087 males (age 10-15) 
in the case of Test 2. 

No measures of reliability are given. Norms 
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supplied by the author are based entirely on 
school-age boys. It would appear that more re- 
cent tests of mechanical aptitude would yield 
more satisfactory measures of potential ability. 
The principles upon which these tests were con- 
structed have been widely copied. Portions of 
Test 2 probably test mechanical comprehension 
beyond mere information. 


[ 679 ] 

Stenquist Mechanical Assembling Test. Grades 3-6, 
-I2 and adults; 1917-20; individual; (30) minutes; 
ohn L. Stenquist; C. H. Stoelting Co. 

a) SERIES I AND II. Grades 5-12 and adults; 2 forms 
(Series I, II). 

b) sers 111, Grades 3-6. 
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William R. Grove, Psychologist and Director 
of Child Study Service, Phoenix Elementary 
Schools, Phoenix, Arizona; formerly Professor 
of Psychology and Director of the Division of 
Psychological Services, The University of Pitts- 
burgh. The actual equipment developed by Sten- 
quist for this test is now of little but historical 
interest, for the test has been superseded by the 
Minnesota Assembly Test. It is to be noted that 
the latter actually duplicates much of the Sten- 
quist equipment and adopts the methodology 
developed by Stenquist almost intact. In the 
opinion of this reviewer, a more accurate name 
for the more recently developed series would 
have been the Minnesota Revision of the Sten- 
quist Mechanical Assembling Test. 
Stenquist has provided an adequate descrip- 
tion of his test and of the steps by which it was 
developed (1). His ede i summarized the 


research data available to that time and pro- 
vides an adequate manual of instructions and 
norms, 

It is the methodology developed by Sten- 
quist and refined by the Minnesota group that 
must be considered a substantial contribution 
rather than the actual form in which the test 
itself was left at the conclusion of Stenquist's 
research. In this sense, the more recent re- 
search of the Minnesota group has tended 
rather to substantiate and further document the 
basic nature of the contribution made. We omit 
a discussion of the reliability and validity of 
this test since this will be more appropriately 
considered in connection with its revision, the 
Minnesota Assembly Test (see 671). 

The test is of the worksample type. The 
method essentially consists of presenting to the 
subject a series of not too complicated mechan- 
ical contrivances, easily taken apart and put 
together. Although nowhere explicitly stated, 
this reviewer is of the opinion that a further 
requisite is that each mechanical model shall be 
a standard product with which the average 
Subject for whom the test is designed shall 
have had reasonable opportunity to become 
moderately familiar. The models are presented 
to the subject disassembled, and the test con- 
sists of his reassembling them. The evidence 
is unequivocal that this method succeeds in 
measuring with considerable validity one aspect 
of mechanical ability, 

The weaknesses of this method were recog- 
nized and pointed out by Stenquist. The more 
obvious of these are the rather elaborate and 
bulky equipment required and the awkwardness 
of administering the test as a group test. A 
more subtle weakness is the tendency for cer- 
tain of the mechanical models used in the stand- 
ardization to become outmoded and no longer 
readily available. This affects adversely the 
basic validity of such a model as a test item and 
renders obsolete the standardization of which 
it was a part. 


[ 680 ] 
*Survey of Mechanical Insight. Grades 9-16 and 
adults ; 1945; IBM; t form; $2.50 per 25; 25¢ per speci- 
men set, postpaid; ro¢ per copy, machine-scoring edi- 
tion; 2¢ per machine-scorable answer sheet; 25(35) 
minutes; D. R. Miller; California Test Bureau, 


Reign H. Bittner, Director of Personnel Re- 
search, Owens-Illinois Glass Company, Toledo, 
Ohio. The test is designed to measure aptitude 
at the high school and adult levels for solving 


n5] 


the types of mechanical problems involved in 
jobs requiring the operation, maintenance, re- 
pair, or design of various types of machinery. 
It is a thirty-minute test composed of 35 mul- 
tiple-choice problems in the movements of gears, 
levers, and pulleys presented pictorially. The 
publishers state that this is a new approach in 
measuring this type of aptitude but this type of 
problem has been used previously in such tests 
as the Test of Mechanical Comprehension by 
George K. Bennett. 

It is stated that an attempt has been made to 
rule out the effect of previous experience and 
special education on test scores by basing prob- 
lems on machines which the average person 
with mechanical interest would not have seen 
before. Although one study is cited which found 
no significant difference between the scores of 
one group of subjects who had had one year of 
physics and another group who had not had 
the course, it seems doubtful that this objective 
has been achieved. The items are all based on 
principles of mechanics that are general in ap- 
plication and, if learned, should be readily 
applied to these problems. It is wondered how 
much actual training in mechanics was given 
to the group referred to above in the first year 
physics course. 

The reliability of the test, reported in the 
manual as .88 by the split-halves method with 
the Spearman-Brown correction, appears to be 
within satisfactory limits. No data on the valid- 
ity of the test are given in the manual, although 
one section of the manual is headed "Reliability 
and Validity." The publishers have provided 
the reviewer with the supplementary statement 
that the test correlated .47 with efficiency rat- 
ings of postage meter repair workers. With 
such meager data on validity, it is not possible 
to judge whether the test is a useful predictor 
of success in the kinds of jobs for which it is 
designed as a selection instrument. 

Percentile norms based on 250 cases are pro- 
vided for interpreting the raw scores. The 
normative population is described by the pub- 
lisher as “a fair sample of a cross-section of the 
general population which included subjects from 
various mechanical trades, art trades, miscel- 
laneous non-mechanical jobs, mechanical and 
non-mechanical night trade courses, and air- 
craft loftsmen and engineers.” No data are given 
on sex, age, or education of this population. It 
is difficult to see how this small group as de- 
scribed could be representative of the general 
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population. Percentiles derived from such a 
small population are usually subject to consid- 
erable unreliability. It is the opinion of the re- 
viewer that separate norms should be provided 
for males and females since it is usual for the 
two sexes to differ markedly in performance on 
this type of test. 

It is recommended by the publishers that 
subjects be separated into fast-accurate, fast- 
inaccurate, slow-accurate, and slow-inaccurate 
groups by computing for each the per cent of 
items attempted which are answered correctly. 
However, no standards are supplied to aid in 
the interpretation of these scores. These per- 
centages merely give accuracy scores, and to 
be interpreted in terms of “fast” or "slow" one 
must know something about the distribution of 
attempts, which is not supplied. 

The test is apparently easy to administer and 
score, although six of the items may lead to 
difficulties in administration. In five items no 
item “stem” is provided; ie. the problem is 
not stated. In the other item, the stem states as 
a possibility a condition which is impossible. 
The format of the test is good. Drawings are 
clear and reproduction is good. 

Judgment on the usefulness of this test in 
the kind of situations for which it was designed 
must be reserved until more validation data are 
available. It is apparently worthy of trial in 
such situations but should not be accepted with- 
out empirical checks on the relation of the test 
scores to criteria of performance. 


Jay L. Otis, Director, Personnel Research 
Institute and Professor of Psychology, Western 
Reserve University, Cleveland, Ohio. This test is 
offered as a measure of mechanical aptitude. It 
consists of 35 multiple-choice items with three 
alternatives. The items are of the pictorial type 
found in many tests of mechanical comprehen- 
sion which require the examinee to perceive 
how various parts of a mechanism will move 
when a given part is operated. The items are 
arranged in order of difficulty. 

The author reports a reliability of .88, ob- 
tained for 250 cases by the split-halves method. 
Presumably this is the same population upon 
which the norms are based—"250 males and 
female cases selected from various occupations.” 
Tt would seem that for a test of mechanical apti- 
tude especially, the author should present sepa- 
rate norms for males and females. This is not 
done. Nor are various occupations from 
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which the cases were selected described in any 
manner. This makes the norms issued with the 
test practically useless, so that any person de- 
siring to use the test is strongly advised to 
make up his own norms for specific purposes. 

The author states that the test measures apti- 
tude rather than achievement because the fac- 
tors of previous experience and special educa- 
tion have been minimized by basing most of 
the items upon “machines which either are 
archaic or are of special design so that the prob- 
ability of their having been seen by the average 
person with mechanical interest is remote.” In 
support of this claim the author states that no 
significant difference was found between “a 
group” which had completed a first course in 
physics and a comparable group which had not 
had a physics course, 

The Survey of Mechanical Insight may be a 
very useful measure of mechanical aptitude. Un- 
fortunately its degree of usefulness has not yet 
been determined. The author presents no valid- 
ity data whatsoever, and, as indicated above, 
the present norms are of little value. It hardly 
seems fair for the purchaser to have to deter- 
mine the validity of the test himself and to start 
from scratch to accumulate data for some usable 
norms. 


Shailer Peterson, Director of Educational 
Measurements, Council on Dental Education, 
American Dental Association; and Research 
Associate, The University of Chicago; C. hicago, 
Illinois. According to the author, the purpose 
of the test is "to measure aptitude for solving 
the types of mechanical problems involved in 
jobs requiring the operation, maintenance, re- 
pair, or design of various types of machinery." 
Another statement is more specific, indicating 
that the "test measures the ability to predict the 
resultant motion of various parts of a mecha- 
nism when a given part is operated." 

The test contains 35 items which appear to 
have been selected, at least in part, from the 
many systems of levers, gears, pulleys, and 
cams that are often found in advertisements by 
patent concerns. Some are relatively simple 
machines while others are somewhat compli- 
cated and involved. They are arranged in order 
of difficulty, and the author claims that those 
which have been selected have probably not 
been seen before by the average person, thereby 
reducing the effect of previous training and ex- 
perience. Evidence that formal training does 


not affect these test scores is given by the fact 
that no significant difference in performance is 
found between a group having had physics and 
a comparable group not having had physics. 

The author claims that relationships have 
been demonstrated between this test and both 
mechanical interest and aptitude. The reliability 
coefficient obtained by the split-halves method 
and corrected by the Spearman-Brown formula 
is .88. All of these calculations are apparently 
from the relatively small sample of 250 “male 
and female cases selected from various occupa- 
tions.” 

The size of the sample tends to reduce the 
meaning of the percentile norms that are re- 
ported. The single set of norms includes all 2 50 
cases. In an aptitude field such as this, it would 
seem desirable to prepare norms both on groups 
engaged in machine work and those not occu- 
pied in mechanical work. In addition, the per- 
centile norms have not been prepared in as 
usable a fashion as one customarily finds, 

From an inspection of the 35 items, one would 
judge that the test on the whole is very satis- 
factory. All the items are of the multiple-choice 
variety, each having three choices. On a num- 
ber of items the alert student will find that if 
one choice is assumed correct a second must 
also be true, thereby leaving him with only the 
third alternative as long as he knows that only 
one response is to be made for each item. 


[ 681 ] 
*Survey of Object Visualization. Grades 9-16 and 
adults; 1945; 1 form; $1.50 per 25; 256 per specimen 


set, postpaid; Spanish edition available at same prices; 
20(25) minutes; D. R, Miller; California Test Bureau 


Charles M. Harsh, Associate Professor of 
Psychology, The University of Nebraska, Lin- 
coln, Nebraska. Like the Thurstone Surface 
Development Test and its various adaptations, 
this survey is a measure of three-dimensional 
visualization. The subject must indicate which 
of four objects can be formed by folding or 
rolling a flat pattern, such as a tinsmith might 
lay out. Several items in the survey may be 
answered correctly by merely counting sides, 
but the difficult items require accurate form per- 
ception and three-dimensional imagery. The 
manual claims that the ability measured by the 
test is necessary for drafting, engineering, and 
various mechanical and artistic occupations, but 
the evidence for this claim is not indicated. For 
a 25-minute test, the reliability of 9r seems 
satisfactory. Percentile norms are given for an 
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unspecified heterogeneous sample of 250 cases. 
The manual suggests that speed and accuracy 
should be distinguished, but the diagnostic use 
of such information is left entirely to the in- 
genuity of the examiner. Since this is an inter- 
esting and apparently practical test, there is 
need for more evidence as to the validity of the 
test for predicting success in training or occu- 
pations. 

A few minor faults might be remedied. Items 
2, 3, 4, 26, and 35 are disconcerting to capable 
examinees because of perspective illusions or 
inaccuracies of scale. A line is omitted in the 
drawing for Item 16, Response A. Shading 
would decrease the reversible perspective which 
complicates such items as 33 and 34. Since in- 
creasing difficulty is desirable, Items 40, 41, 
and 44 should probably come before Items 29 
and 33. For persons of grade school education, 
the test might be more valid if the figures were 
larger and also shaded to show the third dimen- 
sion. College students are not much troubled 
by the perspective line drawings, but children 
and below-average adults often have great diffi- 
culty interpreting the drawings as three-dimen- 
sional objects. The training factor may be very 
important. 

Hand marking and scoring is simple, but the 
machine-scored answer sheet introduces space 
discrepancies with considerable chance of error. 


Clifford E. Jurgensen, Personnel Director, 
Minneapolis Gas Light Company, Minneapolis, 
Minnesota. This test requires the examinee to 
predict how an object will look when its shape 
and position are changed. Each of the 44 items 
consists of a drawing of a flat pattern together 
with four possible shapes it may take when 
folded on the dotted lines or rolled together. 

The test is practically self-administering. In- 
structions require approximately five minutes 
and include two examples for which the correct 
answer is given and two practice exercises to 
be worked by the examinee. The test proper has 
a time limit of 25 minutes. Scoring is objective, 
and consists of the number correct minus one- 
third the number incorrect. 

Split-half reliability, corrected by the Spear- 
man-Brown formula, is .91 with 266 cases. No 
data are given regarding the nature of the group 
on, which reliability was computed. 

Percentile norms are based on 250 male and 
female cases selected from various occupations. 
No data are given concerning the nature of 


these occupations, nor is any mention made of 
effect on test scores of such factors as education, 
age, or sex. The author recommends that the 
per cent of items attempted which were an- 
swered correctly be considered in addition to 
the total score in order to identity the fast and. 
accurate, fast and inaccurate, slow and accurate, 
and slow and inaccurate. However, no standards 
are given to facilitate interpretation of these 
four types. 

No data on validity are given in the test man- 
ual. Although the test was intended for occupa- 
tions such as draftsman, engineer, machinist, 
maintenance mechanic, commercial and fine 
artist, etc. and appears to be of such a nature 
that it might have some validity for such occu- 
pations, evidence to this effect should be in- 
cluded in the test manual. Lack of validity data 
gives rise to the suspicion that validity was not 
cited because it was unfavorable or that the test 
was published prematurely. Such suspicions do 
not build confidence in the test, author, or 
publisher. 

It is to be hoped that research data will be 
forthcoming on validity, improved standards, 
relationship with other tests, etc. In the mean- 
while the test should be used with extreme cau- 
tion and only by those who are prepared to do 
basic research on the test before attaching any 
meaning to test scores. 


Shailer Peterson, Director of Educational 
Measurements, Council on Dental Education, 
American Dental Association; and Research 
Associate, The University of Chicago; Chicago, 
Illinois, As the catalog announcement of this 
test indicates, “This test measures the ability to 
visualize the shape a flat pattern will take when 
it is assembled to form a three-dimensional 
object.” The test would appear to measure some 
of the same factors measured by the various 
paper formboard tests. The geometric forms 
selected have a wide range of shapes with an 
accompanying wide range of difficulty. The il- 
lustrations themselves are rather small, making 
it difficult for some students and even some 
proctors to distinguish easily the differences be- 
tween certain multiple-choice figures. This size 
factor becomes noticeable in only a few of the 
items. 

This reviewer has used this particular test on 
nearly six thousand cases and finds that the few 
items in which variations in degree of curvature 
are the distinguishing features seem to confuse 


^ with curved ends is related to performance on 
- both the rolled and folded figures in which the 
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students who have relatively little trouble with 


: definite angles and straight folds. It would be 


interesting to know whether performance on 
the difficult items having-the rolled surfaces 


discrimination of slope does not appear to be 
so confusing. 

The test proves interesting to the student, 
which makes it easy to administer. Moreover, 
many of the factors that are measured by the 
test seem obvious to the faculty members who 
have occasion to refer to test data, which in 
turn makes the meaning and importance of the 
test understandable. 

This reviewer has found that there is a higher 
correlation between performance on this test 
and tests measuring manual dexterity than there 
is with tests of mental ability. He has also found 
relatively high correlation between this test and 
grades received by students in professional 
schools in those courses demanding hand coordi- 
nation and finger dexterity. The test publishers 
report that an analysis of the data indicates that 
the factors measured by this test are those nec- 
essary for such jobs as draftsman, engineer, 
loftsman, machinist, and maintenance mechanic. 

The norms supplied by the publishers are very 
inadequate, for they are based upon only “250 
male and female cases selected from various 
occupations." A breakdown of even these 250 
cases into two or three occupational categories 
would be very useful. The reported reliability 
coefficient is .9r computed by the split-halves 
method and corrected by the Spearman-Brown 
formula. 


Patrick Slater, Chief Research Officer at 
the Social Survey and Honorary Psychometric 
Psychologist at St. Thomas's Hospital, London, 
England. The problems presented by this test 
are in the form of drawings. On the left is a 
flat pattern. It can be visualized as rolled or 
folded into the shape of one of four objects 
Shown drawn in perspective on the right. The 
subject must attempt to choose the correct one 
and record his choice. 

The amount of acquired knowledge needed 
to solve the problems is relatively small and 
should not affect variations in the scores of 
people for whom the test is intended to any 
great extent. The variations observed are likely 
to depend mainly on differences in general in- 
telligence and in the ability to recognize shapes 


and imagine the consequences of their manipu- 
lation—i.e., the ability sometimes described as 
spatial judgment or practical ability. The prin- 
ciples involved are sound and well established, 
and the test undoubtedly possesses diagnostic 
value, as claimed, for selecting draftsmen, engi- 
neers, mechanics, commercial artists, etc. For 
each such specific purpose, however, supple- 
mentary selection procedures would also be 
needed. The test would not be (and is not 
claimed to be) suitable for measuring spatial 
judgment among young people. 

Many other tests for the same abilities have 
already been published. In the British Isles the 
most notable are the form relations and space 
perception tests published by the National In- 
stitute of Industrial Psychology. 

In the construction of such tests, attempts to 
represent three-dimensional objects diagram- 
matically give rise to serious technical prob- 
lems. Right angles have to be portrayed by 
acute or obtuse angles according to the demands 
of perspective, and it is therefore difficult to 
indicate which are which unambiguously. It is 
also difficult to indicate whether an object pos- 
sesses sides or angles which are not exhibited 
in the diagram. Still more serious is the risk 
of introducing reversible images. Shading may 
assist in solving these problems, but in my 
opinion no completely satisfactory solution can 
be reached without stereoscopic aids. 

The Survey of Object Visualization leaves 
these problems unsolved and the reliability and 
validity of the scores must be accordingly re- 
duced. Some items could be improved by more 
careful attention to these details, and the order 
of difficulty appears questionable in some cases. 
But these faults are not so grave as to prevent 
the test from serving its intended purposes. 

The norms given are based on 250 adults, 
male and female together. Fuller norms would 
be welcome. 


[682 ] 

*Survey of Space Relations Ability. Grades 9-16 
and adults; 1944; IBM; Form A; $2 per 25; 25 per 
specimen set, postpaid; 8¢ per copy, machine-scoring 
edition ; 2¢ per machine-scorable answer sheet ; Spanish 
edition available at same prices; 15(20) minutes; 
Harry W. Case and Floyd Ruch; California Test Bu- 
reau. 


E. G. Chambers, Assistant Director of Re- 
search in Industrial Psychology, University of 
Cambridge, Cambridge, England. This is an 
interesting variant of the various forms of tests 
for the appreciation of spatial relations. Its basic 
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principle is similar to that of other tests. There 
is a series of geometric figures, each of which 
is subdivided into parts of different shape, and 
the candidate is required in each case to pick 
out the constituent parts from among ten simi- 
larly shaped .drawings. The chief difference 
from other tests lies in its resemblance to blue 
prints—a feature purporting to increase the 
motivation of test candidates. The reliability 
coefficient of .93 is good, but the validity coeffi- 
cients, though satisfactory in magnitude, are 
based on very small groups. Further evidence 
of its validity by the "follow-up" method is de- 
sirable before it can finally be accepted as a 
selective test. 

The test is an easy one, so much so that the 
speed factor enters largely into the score. The 
method of scoring itself is open to criticism, 
since omissions due to lack of time are penal- 
ized exactly equally with errors of perform- 
ance. It would be worth examining the relia- 
bility and validity of the test using as score the 
number of correct answers only, thus avoiding 
the possible overemphasis of the speed factor. 

The subdivision of the geometric figures 
makes the test purely one of recognizing differ- 
ent shapes, which is relatively easy. The con- 
structive element in the appreciation of spatial 
relationships, is entirely omitted, and the test 
might be improved by the inclusion of this fac- 
tor. This could easily be effected by the addi- 
tion (or substitution) of geometric figures 
which are not subdivided. The subject would 
then have the task of mentally building up each 
geometric figure from the appropriate parts. 
This is a harder task and one more in keeping 
with the usual conception of the ability to ap- 
preciate spatial relations. Making the test more 
difficult would tend to diminish the preponder- 
ance of the speed factor. 


Clifford E. Jurgensen, Personnel Director, 
Minneapolis Gas Light Company, Minneapolis, 
Minnesota. Each of 32 items in this test con- 
sists of a design made of parts ranging in num- 
ber from two to seven, and a series of ten dis- 
crete parts. Testees are instructed to select the 
parts which will make the specified design when 
fitted together, and to identify their selection 
by encircling corresponding numbers. Some- 
times parts must be mentally turned over and 
rotated in order to make them fit. Lines are in 
white against a blue background in order to 
resemble a blueprint. This procedure was in- 


tended to increase the acceptability of 
to applicants and employees, and 
are to be commended for their considérati 
a much neglected but highly importati 
in industrial testing. The test was desi 
be useful for mechanical jobs requi? 
ability to perceive rapidly and accurate 
relationships among objects in space, as is im- 
portant in such mechanical jobs as assembly, in- 
spection, drafting, blueprint reading, lofting, 
tool and die making, etc. as well as mechanical 
professions such as architecture, design engi- 
neering, and mechanical engineering. 

The test is practically self-administering. In- 
structions and practice exercises require ap- 
proximately five minutes and the test itself has 
a fifteen-minute time limit. Scoring is completely 
objective. 

The experimental form of the test resulted 
in validity coefficients ranging from .34 to .62 
with five groups, the number of cases in each 
ranging from 17 to 52, the criteria being in- 
structor’s ratings of quality and quantity of pro- 
duction in various shop courses. The test manual 
gives no data on the correlation between course 
grades and subsequent job success, or between 
test scores and job success. 

The experimental form of the test gave a 
reliability coefficient of .93 for the five groups 
(total N of 150) on which validity was deter- 
mined, 

Percentile norms are based on 1,000 adults 
including both sexes aged from 16 to 50, sub- 
jects being "selected to represent a true cross 
section of the adult population available to in- 
dustry." No data are given regarding effect on 
test scores of such factors as education, age or 
sex, although research on other tests in the 
mechanical area (e.g., Bennett's Test of Me- 
chanical Comprehension, Revised Minnesota 
Paper Formboard, etc.) have shown such fac- 
tors to be quite important. No data are given 
on the relationship between this and other tests. 


y 


James M. Porter, Jr., Associate Professor of 
Psychology, Rensselaer Polytechnic Institute, 
Troy, New York. This test lacks adequate 
standardization. The total number of cases in- 
volved in the experimental validation was 150. 
No occupational or vocational norms are sup- 
plied. Its utility in employee selection apparently 
remains to be established. Its typographic make- 
up, white lines on blue background, is likely to 
prove a desirable feature. 
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XTest of Mechanical Comprehension. Grades 9 and 
over; 1940-47; IBM; 1 form; 3 editions; separate an- 
swer sheets must be used ; $3.50 per 25, postpaid ; $1.50 
per 50 machine-scorable answer sheets, postpaid; 35¢ 
per specimen set of any one edition, postpaid ; nontimed 
(30) minutes; George K. Bennett and Dinah E. Fry; 
Psychological Corporation. 

2 FORM AA. Men in Grades 9 and over; 1940-47. 

b) Form nn. Men in grades 13 and over; 1941. 

c) FORM W-1. Women in grades 9 and over; 1942. 
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Charles M. Harsh, Associate Professor of 
Psychology, The University of Nebraska, Lin- 
coln, Nebraska. The wide use of this test since 
the first form appeared in 1940 has made it 
familiar to most testers. The technique of illus- 
trating physical principles, usually by two draw- 
ings, has appealed to many experts, as wit- 
nessed by the adaptations of the test for all of 
the armed forces. Forms AA, BB, and W-1 all 


have good format, clear type, and satisfactory 
drawings. The answer sheets are admirable, 
with arrows for page alignment. Use of the 
paper scoring stencil is a bit awkward, but a 
celluloid key (official or homemade) can easily 
be used. The manuals of directions are models 
of conciseness and honesty, including norms, 
references, and the available data on reliability 
and validity. 

Form AA is suitable for males in high school 
or trade school; Form BB is for male appli- 
cants for engineering schools or jobs; and Form 
W-1 is for women. Norms are provided for 
school grades and for applicants for various 
occupations or training courses. Reliabilities 
from .77 to .84 are reported, with standard 
errors of scores from 3.7 to 4.8 points. The 
repeat reliability of .9o (Form BB) is of dubi- 
ous significance, as it is easy to remember one’s 
answers to most of the questions. 

Despite the widespread use of the test there 
is no conclusive evidence as to its usefulness. 
The many reported correlations with other tests 
are only indirect evidence of validity, and (ex- 
cept for Form BB) it correlates as well with 
verbal tests as with mechanical or spatial tests. 
It is said to correlate from .3 to .6 with success 
in army engineering-type training, but the data 
have not yet been released to permit evaluation 
of their significance. If verbal and numerical 
tests are equally correlated with success in 
training, one will want to know whether the 
mechanical comprehension score improves on 
the prediction possible from the other tests, 
There is little doubt that the test measures com- 
prehension of many mechanical principles, but 
its value for prediction has been questioned on 
the ground that several items involve principles 
or facts which one is unlikely to encounter in 
everyday mechanical experience, outside of a 
physics course. Studies reported in the manual 
show that persons with training in physics get 
Scores 4 or 5 points higher, on the average, 
than persons without such training. The differ- 
ence is large enough to be considered in inter- 
preting a person's score. Candidates for engi- 
neering and technical courses have usually had 
more than average training in general science 
and physics, which may be sufficient to explain 
why their norms are considerably above the 
norms for average persons. But their general 
intelligence test scores also tend to be above 
average, so there is need for clearer evidence as 
to the specific contribution of the Test of Me- 
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chanical Comprehension. The norms for Form 
W-1 suggest that scores for women are more 
related to schooling than to mechanical interest 
or experience. 

The uncertainty as to the test's usefulness 
can only be overcome by more direct studies 
of validity to indicate the conditions under 
which the test is useful for achievement screen- 
ing and the conditions under which it may be 
useful for predicting future attainment. We 
hope such evidence will soon be available from 
the many studies in which the test has been 
used. 


Lloyd G. Humphreys, Supervisor of the 
Testing Division, University of Washington, 
Seattle, Washington. This test is available in 
three forms: Form AA for men who are rela- 
tively unselected mechanically; Form BB, also 
for men, approximately 12 points more diffi- 
cult than Form AA, for candidates to engineer- 
ing schools, etc.; Form W-1 for women (for 
women, it is intermediate in difficulty between 
Forms AA and BB for men). The reviewer has 
pointed out elsewhere (see 670) that the as- 
sumption that mechanical tests which show 
marked male superiority should not be used for 
women needs empirical verification. For many 
occupations either Form AA or Form BB may 
be more valid for women than Form W-1. 

Mechanical principles are pictorially repre- 
sented, The examinee has three choices: result 
A will follow, result B will follow, or either 
(neither) will result. Items for Forms BB and 
W-r1 were selected from a larger group on the 
basis of internal-consistency item analyses. On 
Form BB, the authors report a correlation of 
.97 between scores on the original 76 items and 
the final 60 items. Nothing as meaningless as 
this statistic, involving as it does a self-correla- 
tion of 1.00 for the 60 overlapping items, should 
be placed in a test manual. 

Items for Form AA, on the other hand, 
were selected from a larger group of 75 on the 
basis of item correlations with a composite 
criterion consisting of the scores on the original 
items, the MacQuarrie Test for Mechanical 
Ability, the Detroit Mechanical Aptitudes Ex- 
amination, and the Revised Minnesota Paper 
Formboard. Because of obvious deficiencies of 
this criterion, it is fortunate that relatively few 
items were discarded as a result of the analysis. 
If the test is good, it is the result of the author's 
ingenuity in the construction of the original 
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test items and not of the statistical analysis. 
Reliability is not high by older standards. 
Self-correlations, probably a little low since the 
items are not highly homogeneous, vary from 
77 for Form W-1 to .84 for the other two 
forms. A test-retest correlation for Form BB 
goes as high as .90, but this is undoubtedly spuri- 
ously high because these items are relatively 
easily remembered, Within wide limits, reliabil- 
ity is not critical. The test user is interested in 
validity. - 
The reviewer is using the test quite confi- 
dently in a vocational advisement program, in 
spite of the relative lack of validatiori presented 
in the manual. Experience with similar tests in 
the selection of aircrew indicates that the pres- 
ent test should be exceedingly useful. These 
tests were among the most valid (including 
psycho-motor tests) for pilot and aerial gunner 
selection. The same tests were also related to 
military "occupations" more clearly mechani- 
cal. The validity for pilots is due primarily to 
two factors, equally valid and present in most 
mechanical comprehension tests in about equal 
amounts: a mechanical information or back- 
ground factor, and a visualization factor. These 
factors can be described more fully as follows: 
All tests involving mechanical materials of any 
sort appear on the first of these factors. Infor- 
mation tests are highest on the factor. Tests 
involving comprehension of principles are also 
high. The second factor is defined by certain 
mechanical tests, like the present one, that in- 
volve function. Information tests have zero 
loadings. Also appearing on the visualization 
factor are such tests as the Revised Minnesota 
Paper Formboard, Thurstone's Punched Holes 
and Surface Development, and Part II of the 
Army's Mechanical Aptitude Test. To the ex- 
tent that both of these factors are found in a 


` criterion, the present tests should have good 


validity. 

A problem that needs to be investigated for 
tests of this type is the effect on the function 
measured of training in academic physics. It 
would seem that the right answers could be 
obtained for most items on either of two bases: 
knowledge of theoretical physics or experience * 
with things mechanical, on the farm, in the 
shop, etc. Would these tests have different 
validities (and factor patterns) for two groups 

* That the writer assumes that experience affects the scores 
made on these tests does not constitute an adverse criticism. We 
have reached the point where we ask if the test can be used to 


predict future success, not if it can be assumed to measure an 
innate ability. 


= 


3 differing widely in background along the lines 
“indicated ? go 
Norms on Form AA are varied and useful 
_and generally based on adequate numbers of 
cases. Norms on the other two forms are less 


~~ adequate. 


The reviewer commends the inclusion in these 
x a of rather generous numbers of corre- 
- lations with other tests. Two tests should not 
be given the same name without providing the 
test user with data which permit him to evalu- 
ate the degree of similarity. Different names, 
by the same token, should not be used unless 

. the tests are functionally different. 


George A. Satter, Assistant Professor of Psy- 
- chology, University of Michigan, Ann Arbor, 
Michigan. The three forms of this test are 
designed to measure understanding of a variety 
of mechanical and physical relationships. They 
may be used with high school students (and 
adults with comparable educational attain- 
ments), candidates for engineering schools, 
engineering students, and among women who 
are candidates for technical training or light 
mechanical work. 

The subject matter of the 60 items which 
comprise each of the three forms presents the 
appearance of being essentially the subject mat- 
ter of a high school course in physics which 

' has been sheared of its technical jargon and 
placed in a work-a-day setting. Each item con- 
sists of a picture illustrating some principle of 
mechanics, hydraulics, etc. and a question with 
two or three alternatives. The pictures are 
drawn on an adequately large scale; they are 
clear and, for the most part, free from ambigui- 
ties. 

The items are printed in a 16-page booklet 
and are prefaced with what appears to be an 
adequate set of directions and two sample exer- 
cises. Answers may be marked either on a sepa- 
rate sheet designed for easy hand-scoring or on 
a sheet which can be machine scored. 

There is no time limit on any of the forms 
but the authors suggest that twenty or twenty- 
five minutes is adequate for most people to 
complete the exercises. 

Scores on each of the forms represent the 
number of items answered correctly minus one 

„half the number wrong (each item has three 

alternatives), Hand scoring is facilitated by 

strip keys which accompany the test. 
Percentile norms are provided for a number 
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of different types of groups which vary in size 
from 145 to 2,460. Form AA (the form de- 
signed for high school students and for persons 
of comparable educational attainment) has been 
administered to students at all stages of high 
school training, to engineering school fresh- 
men, and to candidates for WPA mechanical 
courses, policeman, and fireman positions. Form 
BB, a form more difficult than AA, has been 
given to engineering school freshmen and to 
candidates for engineering school, merchant 
marine officers' training, and light mechanical 
work. The form designed especially for women 
(W-1) has been used to measure the abilities 
of freshmen and senior high school students, 
enlisted Waves, candidates for mechanical 
courses, and individuals employed at light me- 
chanical work. It would seem that the test has 
many possibilities for use in industrial selection 
and placement situations; it is regrettable that 
the Psychological Corporation has not been able 
to supply data descriptive of the performance 
of a larger variety of industrial groups. 

Manual materials accompany each of the three 
forms. These materials are pointed and well 
above the average in completeness. They cover 
such items as the rationale underlying the con- 
struction of the test, directions for administer- 
ing and interpreting the results of the test, pro- 
cedures used in construction, norms, and data 
descriptive of the test’s reliability and validity, 

While the quality of the manual materials is 
in general good, some might prefer that the 
test authors had gone into greater detail in pre- 
paring certain sections, Validity in each case 
is dismissed with a sentence or two; groups 
from which standardization data and norms 
were obtained are described with a word or 
two; and procedures used in construction and 
validation are presented in sketchy form. 

The authors recognize the possible usefulness 
of an external criterion, lament their inability 
to obtain a suitableone, and follow the conven- 
tional, more convenient, procedure of using 
total test score as a criterion. In selecting items 
for each of the three forms of the test, the 
authors followed the Kelley-Wood procedure, 
i.e., selected items on the basis of how well they 
discriminated between “high” and “low” scor- 
ing groups. Internal criteria (total scores on 
the test) were employed in the selection of items 
for Forms BB and W-1; and a composite cri- 
terion made up of weighted scores on the 
MacQuarrie Test for Mechanical Ability, the 
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Detroit Mechanical Aptitudes Examination, 
the Revised Minnesota Paper. Formboard, and 
the Test of Mechanical Comprehension was em- 
ployed in defining the “high” and “low” groups 
used in selecting items for Form AA. The de- 
scriptions of the procedures presented in the 
test manuals do not specify the standard used 
in selecting items. One would presume, since 
no statement to the contrary is made, that in 
every case the authors selected 60 items with 
the highest discrimination values from among 
those (from 75 to 92) given a trial adminis- 
tration. 

The results of the validation studies reported 
in the test manuals are sketchy and hardly of 
the sort which would be very helpful to the 
prospective users of the tests. No data, other 
than a group of correlations with other tests, 
are presented for Form W-r. The validity of 
Form BB is dismissed with the following state- 
ment, “Several studies have been made with 
this test as a predictor of success at engineer- 
ing-type occupations with validity coefficients 
from .3 to .6,” and a promise to release, at some 
time in the future, data collected from military 
groups. Coefficients of .5 between average grade 
from technical military courses and scores on 
Form AA and the differences in mean scores 
of those reporting physics training and no 
physics training substantiate the authors’ con- 
tention that the test is useful for predicting 
performance in technical training. 

Scores on the two forms (AA and W-1) 
which are designed for use with wide ability 
ranges correlate from .25 to .54 with paper- 
and-pencil measures of intelligence, with other 
measures of mechanical aptitude, and with 
achievement in physics. The use of multiple 
criteria (such as high correlation with total 
score and low correlation with measures of in- 
telligence) in selecting items for future editions 
of this test might prove effective in reducing this 
overlap. 

For selected groups (enlisted Waves, fresh- 
man engineers, and ninth-grade boys), relia- 
bilities estimated from split-half correlations are 
reported as .77, .80, and .84. 

~ The Test of Mechanical Comprehension, in 
the reviewer’s opinion, should prove to be a 
useful tool especially to those persons engaged 
in educational and vocational guidance. It should 
also find increasing usage in the technical 
school and the industrial employment office. It 
is an attractive test; the items are intrinsically 


interesting; all the forms appear to have been . 
well constructed; and they are easy to give. - 
The range of usefulness of the test will un- 

doubtedly increase as more validity data are . 
made available. -— 


MISCELLANEOUS . 
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*Cancellation Test. Adults; 1946; 1 form; 75¢ per 
25; 80¢ per set of scoring keys; 25¢ per specimen set, 
postpaid; 10(15) minutes; John R. Roberts; Educa- 
tional Test Bureau. j; 

Joseph E. King, Director, Personnel Man- 
agement Consultants, Chicago, Illinois. Cancel- 
lation is an historical item type which factorial 
studies have labeled a measure of perceptual 
speed. Like items of number and word compari- — 
son, picking out the highest number, and other 
“proofreading” types, cancellation items meas- 
ure facility in perceiving detail embedded inr 
irrelevant material or aptitude in recognizing. 
likenesses and differences rapidly. 

The Cancellation Test is at present in ex- 
perimental form. A study of 78 industrial 
workers provides preliminary norms and score 
ranges for six job classifications. No other 
statistical data are reported. The publishers in- 
dicate that the test is available "only to per- 
sonnel directors known to us." 

The Cancellation Test is an example of the 
almost universal fallacy in current test devel- 
opment and publication. A test is considered 
salable if a group of items have been brought 
together (possibly studied for internal con- 
sistency and difficulty), a reliability of .90 has 
been gained, and norms (and possibly validities 
or score ranges for given job classifications) 
are available. There is little concern as to (a) 
what the test measures or (b) whether a simi- 
lar (if not identical) test is currently available. 
The publishers state that "basically there is 
nothing new or novel about a cancellation test ; 
it is one of the oldest types." They do not speak 
of its factorial composition. 

It seems to the writer that psychometrists 
and test publishers have a professional respon- 
sibility which has been infrequently realized. In 
the industrial field alone, about 1,000 tests are 
promoted to the usually nontestwise personnel 
director. These tests measure complex abilities, 


overlap from publisher to publisher, lack busi-...- 


ness and industrial orientation, and are labeled 
with vague titles and even vaguer definitions of 
what is measured and its usefulness for per- 
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sonnel selection and placement. Testing pro- 
grams will be found in more than 5 per cent 
of the business world only when psychologists 
realize that test development is a scientific tool 
requiring specialized techniques and adequate 
research study. 
[ 685 ] 

Diagnostic Interviewer's Guide. Job reped 
1937; $6 per 100; $1 per specimen set including 10 
forms; E. F, Wonderlic and S. N. Stevens; E. F. Won- 
derlic, 750 Grove St., Glencoe, Ill. 

1. Lamp, Di eyes Selecting Employees, 

t. D, DoNALD A. The Psychology of Sel 5 
pis Elbe pp ier To feine E 
lishing Co. Ltd. 243.) (PA 9:4726) ER 
a. HovLAND, Cart Iver, AND WowpznLIC, E. F. “Prediction 

of Industrial Success From a Standardized Interview." J Appl 
Psychol 23:537-46 o "39. * (PA 14:1039) 

Clyde H. Coombs, Assistant Professor of 
Psychology and Chief of the Research Divi- 
sion, Bureau of Psychological Services, Insti- 
tute for Human Adjustment, University of 
Michigan, Ann Arbor, Michigan. The guide 
consists of a series of questions in four cate- 
gories—work history, family history, social his- 
tory, and personal history. An employment in- 
terviewer can use these questions as a guide 
and record the answers to the questions during 
the interview on space provided (frequently 
inadequate). As pointed out in the manual, the 
guide is designed primarily for the selection of 
high caliber personnel. 

Each of the four main sections is followed by 
a series of questions, from five to twelve in 
number. The interviewer, on the basis of his 
recorded answers, is to answer the questions 
listed, which are scored favorably or unfavor- 
ably. A gross score is thus obtained to be used 
as an element in the selection of an employee. 

It appears that the time of the interview 
would be mostly spent in writing down the an- 
swers to the questions asked during the inter- 
view. Also, the scoring procedure is rather 
poorly set up and would be subject to clerical 
error. 

A particular employer might find the guide 
helpful if he had little knowledge of interview- 
ing. In general, however, it would serve best 
as a guide for preparing an interview blank 
specifically designed to meet the needs of a par- 
ticular situation. Much of the information to be 
gathered during the interview could better be 
gathered by a biographical inventory in an ob- 
jective manner and the items eventually vali- 
dated. This would also cut down a lot of writ- 

‘ing by the interviewer during the interview. 
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Douglas H. Fryer, Associate Professor of 
Psychology, New York University, New York, 
New York. The Diagnostic Interviewer s Guide 
was developed for the selection of outside 
representatives for public contact work in the 
Household Finance Corporation and is recom- 
mended as a guide for a final selection interview 
following other screening of applicants. All 
major areas of the individual's life are sur- 
veyed: work history, family history, social his- 
tory, and personal history. The interviewee's 
answers to questions about him in these areas 
are recorded. Following this survey, a number 
of yes-no questions are answered about the 
applicant by the interviewer; there is a total of 
34 questions in the several areas, making avail- 
able a score range of —34 to +34. The ques- 
tions are largely concerned with the applicant's 
qualifications in contact work with people. The 
personnel survey provides the basis for answer- 
ing the questions, but the interviewer is in- 
structed to emphasize in his observation verbal 
facility, personality, attitudes, and appearance. 
No norms are provided for the use of the score 
for selection purposes, but the statement is 
made that "In general a negative score or a 
very low positive score should eliminate the ap- 
plicant from serious consideration." 

Reliability and validity of the Diagnostic In- 
terviewer's Guide, reported elsewhere (2), are 
based on the interviewer's score in marking the 
34 questions about the applicant. A correlation 
of .71 was found between different interview- 
ers' scores on 23 cases interviewed separately. 
A corrected odd-even correlation of .82 is re- 
ported for roo cases. A significant favorable 
difference (Cr — 4.25) is shown between 100 
workers still on the job and roo dismissed 
Workers at the Household Finance Corporation. 
Scores are shown to increase relatively as work- 
ers are more likely to succeed on the job. 

The Diagnostic Interviewer's Guide is a 
promising tool for estimating ability in public 
contact, but very little is known about what it 
accomplishes in personnel selection and nothing 
has been added to this knowledge since 1939. 
For example, its unbelievably high repeat relia- 
bility may be due to interviewers having com- 
mon prerequisites in experience on which the 
answers to yes-no questions are based; the 
answers to the questions based on the survey of 
personal history may duplicate less effectively 
the validity of the experience record when 
scored; and any estimation of the goodness of 
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the interviewee’s contact behavior during the 
interview, while less reliably rated from this 
base than from the experience record, may be 
the only independent contribution of this inter- 
viewing procedure to the over-all measurement 
of the applicant’s qualifications. The Diagnostic 
Interviewer's Guide is in a preliminary stage of 
development as a measuring device; most inter- 
viewing procedures cannot be said to have 
reached this stage. 
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Douglas H. Fryer, Associate Professor of 
Psychology, New York University, New York, 
New York. The Employee Evaluation Manual 
for Interviewers, by Richard A. Fear and Byron 
Jordan is a training manual on the use of the 
Employee Evaluation Form for Interviewers, 
prepared by the same authors. It is said that 
experienced interviewers can adapt their pro- 
cedures to the use of these materials but that 
new and inexperienced interviewers will need 
training by specialists in the field. 

The manual presents the philosophy and tech- 
nique for the interviewing in which the goal is 
to consider the applicant's entire experience and 
personality development in relation to the re- 
quirements of a specific job. The form provides 
the facilities for recording. Its efficient use de- 
pends, it is said, upon complete specifications of 
the job for which the applicant is being inter- 
viewed. Average time of interviewing is about 
22 minutes with wide variations for various levels 
of qualifications. The interviewer rates the appli- 
cant on the blank upon seven factors: (a) previ- 
ous experience, (b) training, (c) manner and 
appearance, (d) sociability, (e) emotional sta- 
bility, (f) maturity, and (g) leadership capacity. 
The first three are rated in accordance with the 
requirements of the job, and the latter four as 
phases of personality development. An over-all 
rating of fitness for the specific job follows. Rat- 
ings are made on a 3-point graphic scale of 
average, above and below average. A number of 
questions are to be marked on the blank under 
each factor, indicating satisfactory or unsatisfac- 
tory qualifications. The final task of the inter- 
viewer is to write an over-all summary statement 
of qualifications for the job for which the inter- 
view was given. The ratings and the marking 
of the questions provide the basis for this verbal 
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summary statement from which interpretation 
is to be made in the hiring. 

This procedure emphasizes the idea that the 
employer is hiring a man and not a machine and 
that he wants to determine how well he will fit 
into his organization before hiring—the old con- 
cept of a "worker-in-his-work unit" by Scott 
and Clothier. Here is fine practical idealism for 
the application of the Gestalt view in personnel 
work. At the higher levels of employment, e.g., 
in hiring a traffic manager, it would seem that 
this procedure could be executed in about two 
hours for an interview; possibly at the lower 
levels, e.g., in hiring a watchman, it can be 
accomplished in twenty-two minutes, or even 
less. It should satisfy personnel men that they 
are doing a good job of interviewing and that 
their judgments are based on adequate informa- 
tion. But it would be wise, before introducing the 
procedure permanently, because of its cost, for 
any businessman to determine for each and any 
type of job whether these judgments correlate 
higher with efficient “worker-in-his-work units" 
than do the “snap” judgments of personnel men 
from application blanks. It might turn out that 
long period cooperation and morale would be 
greatly affected by hiring from this kind of 
interview. Whether a battery of tests or a scored 
history form, taking less time of personnel men 
in operation, would do better or worse, with or 
without this interviewing procedure, is still an- 
other unanswered question. This procedure is 
but an idea, while beautifully worked out and 
with many suggestions for skillful interviewing, 
yet to be evaluated in any way except in opinion. 


C. H. Ruedisili, Associate Dean of the College 
of Letters and Science and Lecturer in Psychol- 
ogy, The University of Wisconsin, Madison, 
Wisconsin. The Employee Evaluation Form for 
Interviewers is "a device for helping inter- 
viewers improve their present methods by con- 
sidering the whole individual in his fitness for a 
job and recording this information for future 
use." It was designed for use in employee selec- 
tion, assignment, and upgrading. The nature and 
purpose of the form, as well as its use and in- 
terpretation, are described in Employee Evalu- 
ation Manual for Interviewers by the same 
authors. 

Based on the assumption that “the best indica- 
tion of what an individual will do in the future is 
what he has done in the past," the interviewer 
rates the applicant as below average, average, or 
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above average on each of seven factors: previous 
experience, training, manner and appearance, 
sociability (teamwork), emotional stability, ma- 
turity, and leadership capacity. Ratings on the 
first three factors are judged as they apply 
directly to the specific job, whereas those for the 
remaining four factors are considered as distinct 
phases of personality apart from any definite 
job requirements. Under each factor are four to 
eight question items to be checked (v) if the 
applicant's responses seem to be favorable or 
crossed (X) if they are unfavorable. In addition, 
brief facts selected from the interview to substan- 
tiate each rating are to be recorded. The inter- 
viewer then writes a summary evaluation (which, 
according to the authors, is the most important 
part of the form) of the applicant, and finally he 
makes an over-all rating for the specific job 
based on all the facts learned. Three sample cases 
are filled out on the forms for illustrative pur- 
poses, and indicate the kinds of facts and sum- 
maries to record. The use of the form in an 
aviation company indicates that the essential in- 
formation needed can be obtained in about 20 
to 30 minutes. 

A functional outline for conducting the inter- 
view is recommended, beginning with the factor 
of work experience, then leading to a discussion 
of school or training experience, and ending with 
personal history. This sequence is most effective 
because it enables the applicant to start on a 
subject that he knows thoroughly and can dis- 
cuss without the danger of emotional blocks. By 
the time the personal history section is reached, 
the interviewer should have effected the neces- 
sary rapport to approach this more intimate 
area. 

The major portion of the manual (which is a 
brief but excellent treatise on interviewing) deals 
with an interpretation of the items covered on 
the evaluation form. Each of the 41 items is dis- 
cussed with suggestions for getting the signifi- 
cant information relating to the individual’s 
background. Helpful hints regarding technique 
and concrete tips to encourage full discussion 
are described. Suggestions are given for prob- 
ing the various areas, with warnings and guide- 
posts to aid the interviewer. The causes of job 

terminations, reasons for dropping out of school, 
factors leading to personality problems—these 
problems, and others to enable the interviewer 
to obtain the significant information in the differ- 
ent areas—are discussed clearly and succinctly. 
The authors readily admit, however, that the 


interpretation of the responses made by the ap- 
plicant has only been outlined. That this de- 
pends greatly on the experience and skill of the 
interviewer is granted ; still, it is the interpreta- 
tion of the information that determines the sum- 
mary evaluation and ratings. 

This form has been used by the reviewer as a 
training tool in a class in the field of personnel 
psychology and proved to be more usable than 
such rating scales as Bingham's Rating Form for 
Use of Interviewers and Oral Examiners. 
Though it is more difficult to administer than 
Wonderlic’s Diagnostic Interviewers Guide 
(since it does not consist of very specific ques- 
tions to be answered by the applicant), the addi- 
tional energy spent is justified. In the hands of a 
competent interviewer the Employee Evaluation 
Form for Interviewers can be an effective tech- 
nique to improve the interview. 


Occupations 22:457 Ap '44. Martin L. Rey- 
mert. This manual, to be used in conjunction 
with a rating scale interview blank, gives in 
compact, logical form the a-b-c's of job inter- 
viewing. * The authors “do not profess to present 
anything profound or, in fact, new about inter- 
viewing,” they have “simply gathered together 
some common-sense principles which have been 
found helpful in determining an applicant's fit- 
ness for a job and organized them in a manner 
suitable for industrial use.” This seems an un- 
derstatement, however, since this brief manual 
might well serve as required reading for new 
interviewers and, no doubt, will also prove very 
helpful as a reminder of pertinent items to sea- 
soned interviewers, * The manual seems to be 
sound psychologically and is undoubtedly based 
on considerable interview experience by the two 
authors. 

Pub Personnel R 5:244-5 O ’44. Kenneth 
Byers. * The authors do not profess to present 
anything new or startling in their manual and 
evaluation form but they have gathered together 
some common sense principles which have been 
found helpful in determining an applicant's fit- 
ness for a job, and have organized these princi- 
ples in a manner believed suitable for industrial 
use. The technique presented is not to be com- 
pared with a clinical interview by a trained 
psychologist but is to be considered as a device 
for helping interviewers improve their present 
methods by considering the whole individual in 
his fitness for a job and recording the informa- 
tion for future use. * The employee evaluation 
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form and this manual are well worth the study 
of employment interviewers both in industry and 
government. They should prove particularly 
helpful in training new interviewers and in clari- 
fying the thinking of, or opening new avenues of 
thought to, the experienced interviewer. Because 
of the employment philosophy presented, as well 
as the points of interest in the employee evalua- 
tion form itself, the manual should be of interest 
to everyone in the public personnel field. 


[ 687 ] 
*How Supervise? Candidates for supervisory train- 
ing; 1943; Forms A, B; $1.75 per 25, postpaid; 35¢ per 
specimen set, postpaid; nontimed (15-40) minutes; 
edited by H. H. Remmers; Quentin W. File; Psycho- 
logical Corporation. 
REFERENCES 

1. Finge, Quentin W. “The Measurement of Supervisory 

Quality in Industry." J Appl Psychol 29:323-37 O '4s. * (PA 


20:884) 
2 Ez, Quentin W. “Are Management's Views of Supervi- 
sion Faulty?” Discussion by Charles S. Slocombe. Personnel J 


24:242-9, discussion 251-4 Ja '46. * (PA 20:1634) 

3. Fire, Quentin W., AND Remmers, H. H. "Studies in 
Bur valuation." J Appl Psychol 30:421-5 O '46. * (PA 
21:572 


4. SARTAIN, A. Q. “Relation Between Scores on Certain 
Standard Tests and Supervisory Success in an Aircraft Fac- 
tory." J Appl Psychol 30:328-32 Ag ’46, * (PA 21:250) G 

5. Stocomne, CmaRLES S. “Appraisal of Mr. File's Study. 
Personnel J 24:251-4 Ja '46. * (PA 20:1657) 

D. Welty Lefever, Professor of Education, 
The University of Southern California, Los 
Angeles, California. The measurement of skill 
and understanding in the area of human relation- 
ships is difficult at best. This test has been care- 
fully and intelligently developed and seems to be 
sufficiently successful for use by trained workers 
under controlled conditions. The manual recom- 
mends the test as a "valuable aid" and not as a 
"cure-all" in the selection and placement of 
supervisors. A rather clear recognition of the 
limitations and precautions essential to a valid 
application of the test characterizes the discus- 
sion and directions contained in the manual. 

The authors include in their listing of uses: 
the selection of candidates for training in super- 
vision, the evaluation of the supervisory training 
received, the study of supervisor attitudes to- 
ward workers and toward the company, and the 
collection of basic data to guide the counseling 
and interviewing of supervisors. The test ap- 
pears to be most valuable in furnishing the bases 
for the discussion of managerial and supervisory 
problems and in measuring the outcomes of 
training programs. This reviewer believes that 
considerable experimentation and careful study 
in a given business, industry, or civil service 
installation would be required before the test 
should be permitted to become a major factor in 
the actual selection of supervisory personnel. 


The answers to the items are based on a mod- 
ern philosophy of personnel management and 
were determined specifically from the replies of 
a group of experts from the Training Within 
Industry program. By keeping in mind the gen- 
eral principles of a democratic social philosophy 
and the simple precepts of mental hygiene, a very 
large proportion of items can be answered cor- 
rectly. ` 

Persons of superior intelligence with recent 
psychological study can make high scores with- 
out special course work or experience in super- 
visory theory or techniques. Fifty graduate stu- 
dents in education were given Form A of the 
test. The median score of the group was 54, 
which, according to the manual, corresponds to 
the 95th percentile on the table of norms based 
on the scores of 577 supervisors. A few items 
were the subject of considerable controversy. If 
these special items had been omitted, the median 
score would have reached the goth percentile. 

The manual accompanying the test is excep- 
tionally well prepared. Much-needed cautions 
are given to guide in administering, scoring, and 
interpreting the test results. Emphasis is placed 
on the important principle that knowledge about 
the best techniques of supervision is necessary 
but by no means sufficient to guarantee effective 
practice. 


Charles I. Mosier, Director of Research, Per- 
sonnel Research Section, The Adjutant General's 
Office, Washington, D. C. The stated objective 
of this test is to measure understanding of super- 
visory principles and, by implication, ability to 
supervise. The authors recommend the test in 
selection of supervisors and selection for super- 
visory training, in appraising training needs and 
training outcomes, as well as for counseling 
supervisors and surveying their attitudes toward 
company policies. 

Items were drawn from training courses and 
textbooks on principles of supervision, and the 
70 items in each form were selected from an 
original 204 items on the basis of internal con- 
sistency rather than for discrimination among 
levels of supervisory competence. Even so, selec- 
tion of the most reliable items has not resulted 
in a satisfactory reliability for a single form. 
Since both forms must be used to get satisfac- 
torily reliable measures, the advantages of sepa- 
rate forms disappear. 

The keyed answers were determined by the 
consensus of training officers and textbook writ- 
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ers. The items are essentially of the true-false 
type. (Although a third response, “?”, is offered, 
marking it has the same effect as omitting the 
item.) All of the items state principles with 
which the candidate is to agree or disagree, or 
which he is to mark as desirable or undesirable. 
For many of these there is insufficient informa- 
tion to make a categorical answer satisfying. 
There are no items which present specific situa- 
tions and call for particular solutions, although 
there is evidence on the validity of such items 
for supervisory success.* The test does, however, 
provide an excellent check list from which such 
items could be constructed. It is unfortunate that 
the more flexible multiple-choice form was not 
used, since this would not only have tended to 
increase reliability for the same number of items 
but also have permitted the presentation of prob- 
lems and alternative solutions. j 

The only evidence of validity offered by the 
authors is curricular validity for a supervisory 
training course. The source of the items, the 
method of keying, and the criterion for selection 
of the final items support, but do not add to, 
the authors’ finding that there is a statistically 
significant difference in test scores before and 
after a training course. None of these however, 
provides evidence as to the validity of the test 
for the prediction of supervisory success, either 
before or after training. Moreover, its validity 
for appraising the effects of training is limited 
to those training outcomes which are reflected 
in verbal statements of what should be done. 
Whether such outcomes alone lead to actual im- 
provement in supervisory practice is another 
question. 

For the more important purpose of predict- 
ing supervisory success, the only evidence of- 
fered by the authors is the postulate: Although 
those who do understand supervisory principles 
may not apply them,. those who do not have 
the knowledge cannot apply it. This is weakened 
by the finding that supervisory and management 
officials disagree with the key (ie. make low 
scores). This finding may mean, as File claims, 
that the sample represents unenlightened man- 
agement or that the "principles of supervision" 
are not as directly related to supervisory success 
as one might suspect. The latter conclusion is 
supported by at least one study (4) in which 
there was no relationship between How Super- 
vise? scores and rated success as a supervisor 

* Bransrorp, T. L.; EL D. C.; anp Manpett, M. M. 


“A Study of the Validity of Writt inistrati 
Personnel.” Abstract. y Psychol E n Tests for eigene 
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(aircraft foreman). Moreover, the general 
soundness of the a priori logic breaks down in 
the case of any who, though skillful supervisors, 
are unable to verbalize the basis of their skill. 
The test does apparently provide an adequate, 
though relatively unreliable, measure of verbal 
intelligence and shows substantial correlation 
with tests of “intelligence.” Whether it con- 
tributes more to the selection of successful super- 
visors is not demonstrated. 

How the test is to be used to select people 
for supervisory training is not clear. Since the 
test measures course content, those with high 
scores are least in need of training; since the 
test is highly correlated with verbal intelligence, 
those with low scores are least able to profit 
from training. Neither the test nor the manual 
resolves this very real and very tough dilemma. 
Certainly the use of other tests in conjunction 
with this one is indicated if this particular objec- 
tive is to be met. 

Certain technical defects in the manual should 
be noted. Reliability coefficients are reported, 
but there is no indication of how they were 
determined nor of the group on which they were 
based. Percentile norms and standard scores 
are given for each form and for both combined. 
These norms are based on a sample of 577 super- 
visors, but the group is not further defined, e.g., 
blue-collar or white-collar, before or after train- 
ing. The two forms are asserted to be equiva- 
lent, but no means or standard deviations are 
given in support of this. These statistics can, of 
course, be roughly inferred from the percentile 
norms, on the assumption that they are equal 
for both forms. Reference to technical papers on 
the construction or validation of the test would 
be helpful. The manual is written for the un- 
sophisticated test user; this should not preclude 
the inclusion of the information needed by the 
trained test technician. 

In summary, the test is limited by its item 
form and has inadequate single-form reliability. 
There is no evidence of its validity except as an 
achievement test of the verbal knowledge ex- 
pected to result from a course on principles of 
supervision. Its reliability is too low to be very 
helpful as a diagnostic test of specific areas of 
training needs, and the evidence of its validity 
for the selection of successful supervisors is thus 
far unsatisfactory. 


C. H. Ruedisili, Associate Dean of the College 
oj Letters and Science, and Lecturer in Psychol- 
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ogy, The University of Wisconsin, Madison, 
Wisconsin. This test, of the typical inventory 
form, is designed to measure a person’s “knowl- 
edge and insight concerning human relations in 
industry.” Each of the two forms consists of 70 
items divided into three areas: two of them, 
Supervisory Practices and Company Policies, 
pertain to specific supervisory practices and 
methods found in industry, and each item is to 
be answered as desirable, uncertain, or undesir- 
able; the third, Supervisor Opinions, deals with 
the human-relations problems of handling work- 
ers, and the subject indicates his agreement, un- 
certainty, or disagreement with the statements. 

Designed to measure supervisors' understand- 
ing of their job, the manual recommends that 
How Supervise? can be of value in selecting 
candidates for and measuring the results of 
supervisory training programs, in upgrading 
employees, in employee attitude studies, and in 
general counseling and interviewing programs. 
Like most manuals published by the Psychologi- 
cal Corporation, this one is well written and 
covers the topics one looks for and, too often 
even at the present time, fails to find in many 
test manuals. These are: description of test, 
validity and reliability, administration, scoring, 
norms, uses of test, precautions of use, and keys. 
The test is easily administered and has no time 
limit. Normally it should be completed in 30 
minutes, but this will vary from 15 to 40 min- 
utes depending on the reading ability of the 
subjects. Scoring the test, using column keys, 
takes from one to two minutes. Norms, for each 
form or for both combined, based on 577 super- 
visors, include both percentiles and standard 
scores. 

Two experimental forms of 102 items each 
(the items were constructed after conferences 
with industrial supervisors and personnel men 


‘and after a careful survey of the literature in 


the field) were developed. “Best” or right an- 
swers were those selected by 37 members of the 
government’s Training Within Industry super- 
visory staff and 8 recognized authorities in the 
industrial relations field. These forms were ad- 
ministered to 750 supervisors in 10 industries. 
The items chosen for the final forms were those 
discriminating between the high-scoring super- 
visors (upper 27 per cent) and the low-scoring 
supervisors (lower 27 per cent). 

File states that two other studies indicated 
the improvement in test scores that results from 
a training program. Supervisor’s scores after 


training were significantly higher than before; 
however, no statistical proof was furnished to 
substantiate this conclusion. Besides, these data 
only mean that there is an increase in knowl- 
edge of supervisory practices; there is no as- 
surance that understanding. goes with knowl- 
edge. The supervisor may know what to do in 
a specific situation and yet not behave in the’ 
approved pattern because of his own emotional 
or personality tendencies. 

The reliability of one form is reported as .76, 
while for the combined scores on both forms 
it is .86. 

How Supervise? is a useful measuring in- 
strument in the field of industrial supervision, 
but it is not the complete answer to all the con- 
fusing questions that arise. Sartain (4), for in- 
stance, found that the experimental edition 
administered to 40 supervisors in an aircraft 
factory correlated —.18 with the criterion of 
success (sum of four ratings). He concluded 
that the test was of little or no predictive value 
for success in supervision in that particular 
plant. 

File and Remmers (3) present additional. 
evidence of validity. The average percentile 
scores for 46 successful supervisors and 14 non- 
supervisors in an office machine manufacturing 
company were 75 and 23; the critical ratio of 
the difference between the proportions above 
and below the soth percentile was 5.8. A sec- 
ond study in a surgical supply manufacturing : 
concern showed a difference in the mean scores . 
for a group of superior and inferior supervisors 
(as rated by the testing supervisor) ; the criti- 
cal ratio of this difference was 4.4. The last evi- 
dence.deals with a group of 16 supervisors in 
a large laundry divided into 5 groups from 
“superior” to those with “most limited super- 
visory responsibilities.” The raw scores for 
these groups were 54.7, 49.7, 44.7, 40, and 32. 
The authors, on the basis of the available evi- 
dence, frankly admit that these studies “do not 
constitute conclusive evidence of universal va- 
lidity of How Supervise? as a supervisory de- 
vice.” Such frankness and honesty is refresh- 
ing and appreciated. They indicate that a more 
comprehensive investigation is now in progress, 
and one is definitely needed ; certainly one should 
not accept validity on the basis of subjective 
evidence and mere reports of valuable results. 

Though the selection value of this test has 
not been definitely determined, it still is a tool 
with interesting possibilities. When used for 
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discussion purposes in a supervisory training 
program, for getting at supervisor's and em- 
ployee's attitudes in this area, or as a teaching 
device in college courses (the reviewer uses it 
in a course in personnel psychology and strongly 
recommends it) How Supervise? can be of real 
, value. 


[ 688 ] 
*Interaction Chronograph. Ages 3 and over; 1944- 
47; $500 per month for rental of the chronograph; 
the rental fee includes “instruction in giving, recording, 
- scoring, and interpreting interviews, assistance in set- 
` ting up job standards, statistical assistance in interpret- 
ing test results, machine maintenance and frequent 
consulting service”; Eliot D. Chapple; E. D. Chapple 
Co., Inc. 
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XMinnesota Occupational Rating Scales and 
Counseling Profile. Grades 9-16 and adults; 1941; 
the Counseling Profile is no longer available; $4 per 
manual; $3 per 300 record forms; earlier editions of 
the scales prepared by Eleanor S. Brussell, Harland N. 
Cisney, J. Spenser Carlson, Gwendolen G. Schneidler, 
and Donald G. Paterson; 1941 edition prepared by 
Donald G. Paterson, Cfayton d'A. Gerken, and Milton 
E. Hahn; Science Research Associates. 
REFERENCES 
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M. H. Elliott, Director of Research, Pub- 
‘lic Schools, Oakland, California. These mate- 
rials are intended to aid the vocational coun- 
selor. Three different items are included in the 
“complete unit.” The Minnesota Occupational 
Rating Scales are contained in a manual of in- 
formation about occupations based upon the 
tating scales published by Bingham (4). Occu- 
pations are rated as to level of ability required 
in the areas of academic ability, mechanical 
ability, social intelligence, clerical ability, musi- 
cal talent, artistic ability. The ratings represent 
the "pooled judgments of vocational psycholo- 
gists.” The usefulness of the information is 


increased by inclusion of several different re- 
classifications such as: occupations requiring 
similar abilities, occupations requiring certain 
levels of particular abilities, etc. The Counsel- 
ing Profile "is a device which facilitates use of 
the Occupational Rating Scales in the coun- 
seling situation." Concentric discs are set to 
indicate the amount o: ability which the coun- 
selee is estimated to possess. There are addi- 
tional settings for the stated occupational aims 
and for vocational interest as measured by tests. 
Presumably the profile will then assist in com- 
paring the counselee's abilities with particular 
occupations or in reconciling stated aims with 
abilities. The Individual Counseling Record is 
a concise one-page form for use with the Pro- 
file and Rating Scales. 

The authors of the manual containing the 
Rating Scales are very cautious in their claims. 
Various possible uses of the materials are de- 
scribed and the limitations are indicated. 

The three items are not of equal value. The 
manual containing the Rating Scales presents 
some useful information about a variety of oc- 
cupations in simplified and readily available 
form. Probably this booklet could be used in 
college classes in vocational guidance techniques 
as an introduction to the field. The information 
presented would also be useful as a digest 
which teachers of occupational information 
classes at the high school level could put into 
the hands of students. The present reviewer 
believes that the main usefulness of the manual 
containing the rating scales is at the introduc- 
tory stage of counselor training and that it is 
of little use as a tool for the trained counselor. 
The danger as the authors point out is that 
"Many persons will feel that the techniques 
are so simple that anyone can use them . . . 
The authors have not invented a short-cut for 
amateurs and incompetents.” 

The Counseling Profile seems to be an in- 
genious but relatively useless "gadget." 

The usefulness of the Counseling Record 
would depend upon the idiosyncrasies and 
preferences of individual vocational counselors. 
It appears to be a concise and adaptable form 
for summarizing vocational information. 


[ 690 ] 
*Personnel Selection and Classification Test. 
Adults; 1942-44; Forms A, B; $3.75 per 25; 50¢ per 
Specimen set, postpaid; (80-90) minutes; Willis W. 
Clark, Ernest W. Tiegs, Louis P. Thorpe, T. W. 
MacQuarrie, and Elizabeth T. Sullivan; California 
Test Bureau, 
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Dorothy C. Adkins, Chief, Test Develop- 
ment Unit, U. S. Civil Service Commission, 
Washington, D. C. This test has the aim of 
meeting “a demand for a relatively short, easily 
administered, and dependable measure of fitness 
for employment in various jobs and occupa- 
tions in business and industry." It is relatively 
short and easily administered, but most readers 
of the manual would fail to be satisfied, on the 
basis of the evidence presented, that the test 
provides a dependable measure of fitness for 
employment. 

The test is composed of several subtests which 
yield seven scores: reading (sets of yes-no 
questions on several paragraphs); arithmetic 
(completion items on various operations) ; 
mental alertness or intelligence, also referred to 
as verbal intelligence (multiple-choice vocabu- 
lary items) ; mechanical (tapping or dotting, 
pursuit, and blocks tests from the MacQuarrie 
Test for Mechanical Ability) ; adaptability- 
sociability (yes-no items of the personality in- 
ventory type); dependability or persistence 
(yes-no items of the personality inventory type) ; 
and oral directions (25 items requiring follow- 
ing of oral instructions). 

The poorest of these tests is the reading test. 
In the first place, over go per cent of the items 
can be answered correctly (aside from errors 
in the scoring key) by a sufficiently “intelli- 
gent" subject without access to the paragraphs 
on which they are supposedly based. In the 
second place, those subjects who do try to base 
their responses on the paragraphs may find 
the directions ambiguous. Many of the items 
contain specific determiners. Some individual 
items are poor in other respects. From the sen- 
tence, “Read all regulations carefully and ob- 
serve them promptly," one is supposed to decide 
that “observe” means “do.” From the state- 
ment that “If everyone cooperates more work 
is done, the business prospers, and workers are 
more secure,” it is supposed to follow that “The 
more business prospers the more secure are the 
workers." 

The arithmetic test appears to be reasonably 
satisfactory, although the scoring is not com- 
pletely objective. 

The vocabulary items, requiring choice from 
among four words of one that means the same 
as the key word, are well constructed, although 
the ability measured by them is probably nar- 
rower than is suggested by "mental alertness or 
intelligence." 


The MacQuarrie test includes both a tapping 
test (requiring the subject to make three pencil 
dots in each of a series of circles) and a dotting 
test (requiring the subject to make one pencil 
dot in each of-a series of smaller circles). One 
form of the Personnel Selection and. Classifica- 
tion Test includes the tapping test and the other 
includes the dotting test. Question is raised as 
to whether the tests are really tests of the same 
function. The United States Employment Serv- 
ice has found Pearson 7’s in the neighborhood 
of .55 between the two tests for five samples 
ranging from 99 to 591. Two tests of the same 
function can, of course, correlate this low be- 
cause of unreliability. Data on the reliability of 
these individual tests, however, are reported in 
neither the manual accompanying the original 
MacQuarrie battery nor the manual for the 
present battery. 

The tapping test presents a scoring problem. 
The scoring instructions call for examining the 
circles to see that approximately (italics mine) 
three dots are made in or near each circle. The 
score is "the number of circles with 3 dots in 
or near them divided by 2.” The instructions 
make no mention of the hardness of lead in the 
pencils to be used. It is very difficult in some 
cases to make out whether a circle contains one, 
two, or three dots, especially if very soft lead is 
used. Dots are often made directly over other 
dots, so that it is difficult to arrive at a score 
reflecting speed of tapping very accurately. 

The pursuit test consists of a maze of curved 
lines, each starting in a numbered square at the 
left and ending in a square at the right. The 
subject is to trace each line by eye and record 
its number in the proper square at the right. 
The drafting is definitely faulty in one place, 
which may mislead the subject by two score 
points, If the score is the number of squares 
correctly numbered, as apparently is intended, 
the scores are not directly proportional to num- 
ber of correct responses. 

The blocks test is satisfactory insofar as can 
be determined by inspection. 

The tests of the personality-inventory variety, 
which are supposed to measure “adaptability- 
sociability” and “dependability-persistence,” do 
not escape the common defect of this type of 
test for use in competitive situations. For almost 
all questions, the socially acceptable response 
is pretty obvious. Examples are “Do you feel 
that people often treat you rather badly?" and 
"Can you be persistent, if necessary nis 
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Inspection of the oral directions test reveals 
no defects. 

The manual presents a Table of Minimum 
Desirable Specifications, indicating for 40 jobs 

"(ranging from baker to aviator to executive) 
the level of performance needed on each of the 
seven parts of the test. Thus, an aviator needs 
high-average (50-8oth percentile) scores on 
reading, arithmetic, mental alertness or intelli- 
gence, and social adaptability, and high (80- 
goth percentile) scores on mechanical dexterity, 
dependability or persistence, and understand- 
ing oral directions. 

The section Notes on Selection, Placement, 
and Adjustment presents references to a num- 
ber of researches on such tests as those in- 
cluded in this battery and a variety of state- 
ments which, although rather carefully phrased, 
may mislead the unwary reader into an impres- 
sion that there has been considerable research 
on the actual tests in question. Such a statement 
as “Practically all professions, executive posi- 
tions, and the higher technical callings make 
great demands in all of the skills and qualifi- 
cations measured by the Personnel Selection 
and Classification Test” is presented without 
evidence. Aside from the lack of data to sub- 
stantiate this statement, surely the ceilings on 
most of the tests are too low for them to be 
maximally useful for the purpose implied. And, 
again, “Many workers in these (inspector, book- 
keeper, caterer, railway conductor, and aviator) 
and similar callings have failed in their posi- 
tions because of a morbid, sarcastic, sensitive, 
antagonistic, suspicious, or lazy attitude.” Surely 
there is an implication here that the test is go- 
ing to detect such attitudes. 

Under the topic Reliability and Validity, we 
find only that “statistical data will be furnished 
to users upon request” and that “the reliability 
coefficients of individual tests vary from .8o to 
94." Data on validity might well replace some 
of the material provided elsewhere in the man- 
ual. One may wonder how the reliability coeffi- 
cients were estimated, particularly for the speed 
tests included in Part 4, and how the compara- 
bility of the two forms was established. One 
may also be curious, when he comes to a table 
of Norms for General Use, as to just what con- 
stitutes a “typical adult population." This re- 
viewer, who admittedly did not request statis- 
tical data, would take the position that such 
information should routinely be included in test 
manuals. A more recently published brochure 


of the California Test Bureau, Tests for Per- 
sonnel Selection and Placement, 1947, includes 
the median percentile ranks of 151 junior retail 
executives and 41 machinist trainees. These 
subjects presumably were persons already em- 
ployed rather than applicants for employment 
in a competitive situation. If this is true, it lim- 
its the value of the results to some extent, par- 
ticularly in the case of the tests of the person- 
ality inventory type. Nevertheless, such data 
for a variety of occupations would be very use- 
ful. From the information to which the reviewer 
had ready access, this test appears to warrant 
revision and research, but the evidence that it 
satisfies the purposes for which it was intended 
is incomplete. 


George K. Bennett, President, The Psycho- 
logical Corporation, New York, New York. 
This is a general purpose test, attempting in 
one booklet and in an over-all time of about 80 
minutes, the measurement of seven quite dis- 
parate aspects of ability and personality. The 
intent is to provide the employment office or 
personnel department with a profile of charac- 
teristics which may be matched to the require- 
ments of a considerable variety of jobs ranging 
from sales clerk to aviator. A table of Mini- 
mum Desirable Specifications in terms of broad 
Score ranges has been included in the manual. 
This table “is based on personnel research at 
points where such research is available, and the 
judgments of experts in personnel work at 
other points.” A single table of norms, “based 
on typical adult population,” is given. (The 
table of norms is reported as “based on ap- 
proximately 2,000 cases, mostly adult males 
employed in industrial and commercial con- 
cerns.”) At the request of the reviewer the pub- 
lisher has furnished him with tables of intercor- 
relations based on 200 adults and reliability 
coefficients (Kuder-Richardson) based on a 
population of 600, half of whom were secondary 
school students and the remainder, industrial 
employees. 

There are several important defects in this 
test. For any test battery leading to a score 
profile, it is well established that reliabilities 
must be high and intercorrelations low. Data 
obtained through correspondence indicate that 
the reliabilities are relatively high (.81 to .95), 
and the intercorrelations are low. However, the 
populations used for reliability and intercorre- 
lations are not the same, and each of the two is 
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apparently different from the population used 
for the norms. The differences in populations 
cast considerable doubt on the adequacy of the 
computed statistics when the tests are used for 
ordinary industrial purposes. Some of the re- 
ported intercorrelations are surprisingly low: 
oral directions versus verbal intelligence, .41; 
oral directions versus arithmetic, .34; verbal 
intelligence versus arithmetic, .45. 

A second major defect is the test content in 
certain sections; mechanical dexterity is meas- 
ured by 534 minutes of performance on three 
sections ef the MacQuarrie Test for Mechanical 
Ability: tapping, pursuit, and blocks, The first 
of these, consuming 30 seconds, does employ 
some motor skill; the remaining 90 per cent of 
time devoted to this would seem to be occupied 
in perception and spatial reasoning. The two 
personality tests consist of 30 “yes-no” ques- 
tions each. A typical “social adaptability” ques- 
tion is “Do you endeavor to meet new people 
in your community?” “Persistence-dependabil- 
ity” questions include “Do people often come to 
you for help?” and “Are you particularly suc- 
cessful at saving money?” In view of the mass 
of evidence indicating that even long and care- 
fully constructed personality tests are ineffective 
as selection instruments, it seems highly im- 
probable that brief lists of transparent questions 
can contribute importantly. The reading, ver- 
bal intelligence, and arithmetic tests are some- 
what brief, but appear to conform to good 
practice. 

The manual is lacking in statistical informa- 
tion and contains wholly inadequate norms 
without differentiation or identification by oc- 
cupation, age, education, or even sex. Valida- 
tion has not been attempted, possibly because of 
the great investment of time and money that 
would be required for a multipurpose test. How- 
ever, educational and occupational group pro- 
files of considerable value could readily have 
been obtained and included. 

The principal objection to this test battery is 
that inadequate statistical procedures and dubi- 
ous face validity make impossible any affirma- 
tive reaction to the usefulness of the tests. 


George A. Ferguson, Assistant Professor 
of Psychology, McGill University, Montreal, 
Canada. This test is designed to explore seven 
areas of ability which are considered of signifi- 
cance in the selection of employees in business 
and industry. 


No information on the sample of the popu- 
lation on which the test was standardized is 
provided by the authors. It is described as a 
typical adult population. Very few data are pre- 
sented which enable an adequate appraisal of 
the reliability or validity of the test in different 
situations. Reliability coefficients of individual 
tests are reported to vary from .80 to .94. 

The test is short, easy to administer, and 
covers a broad field. If the claims of its author 
are substantiated, it should prove a convenient 
instrument in certain types of industrial selec- 
tion where rough and rapid appraisals are re- 
quired. One comment, however, which may be 
warranted is that this test, in its attempt to 
explore a variety of different abilities and per- 
sonality characteristics, appears to explore too 
widely and too superficially. The majority of 
psychologists would agree that in thorough se- 
lection work there is no easy royal road to the 
description of human ability and personality. 
More specifically the capacity of three short sub- 
tests, tapping, pursuit, and block counting, to 
provide an adequate measure of mechanical 
ability is questionable. Further it may be argued 
that many abilities and personality traits of 
marked importance in selection and classifica- 
tion work exist which are not measured by this 
test. 

The conclusion is that where quick appraisals 
are required the test may prove of value. In 
the more thorough types of selection work, 
more comprehensive data than this test pro- 
vides are usually required. 


[ 691 ] 
Rating Form for Use of Interviewers and Oral 
Examiners. Applicants for employment; 1938; IBM ; 
$1 per 25, postpaid; 6¢ per sample copy, postpaid; 
W. V. Bingham; distributed by Psychological Corpora- 
tion. 
REFERENCES 

1, Bincuam, W, V. Oral Examinations in Civil Service Re- 
cruitment: With Special Reference to Experiences in Pennsyl- 
vania. Civil Service Assembly of the United States and Canada, 
Pamphlet No. 13. Chicago, Ill.: the Assembly, February 1939. 
Pp. 30. Paper. $0.50. 

2. BINGHAM, WALTER Van Dyxz, anp Moore, Bruce Victor. 
How to Interview, Third Edition, pp. 112-26. New York: Harper 
& Brothers, 1941. Pp. xi, 263. $3.00. * (PA 16:304) 

Douglas H. Fryer, Associate Professor of 
Psychology, New York University, New York, 
New York. This rating form consists of graphic 
rating scales of five degrees, differently de- 
scribed, for eight qualities—(a) voice and 
speech, (b) appearance, (c) alertness, (d) 
ability to present ideas, (e) judgment, (f) emo- 
tional stability, (g) self-confidence, (h) friend- 


liness—and for an over-all rating of personal 
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fitness for the position. Interviewers are in- 
structed to observe these characteristics during 
the interview and to rate them in relation to 
the requirements of the position, not consider- 
ing the applicant's experience or training. 

The blank was used approximately in its 
present form in Pennsylvania in 1938 for the 
oral examination of 30,000 applicants for state 
civil service positions. Trained boards of 3 to 
5 members interviewed the applicants in from 
20 to 45 minutes, the blanks were scored by 
IBM scoring machines, and relative weights 
were assigned in summing the ratings for the 
different traits. The weights for the traits dif- 
fered for different positions. Internal consistency 
was studied, and the average of the deviations 
Írom the board average for the various traits 
was one-ninth of the scale length of 5 points. 
That the interview measured something differ- 
ent from the written civil service examination 
was indicated by low correlations of .15, —.08, 
and .20 for three different occupations. Validity 
was estimated roughly by a comparison with 
home community opinion of 29 applicants. 

The limitation in rating to what can be ob- 
served of the personality development of the 
applicant during the interview is unique in se- 
lection interviewing procedure. It gives an ac- 
ceptable purpose to an interview, in view of its 
cost, when one considers that most factors upon 
which interviewers judge applicants can be bet- 
ter and more inexpensively measured otherwise. 
Standard conversations would assist in achiev- 
ing this purpose. The possibility of scoring the 
interview is introduced, which, again, is unique 
in such personnel procedures, but the method 
of scoring is left to the ingenuity of the user, 
as is interpretation from the scoring. This pro- 
cedure is undeveloped as a psychological meas- 
ure, but it has the possibility of adaptation to 
various uses where the need is for an estimate 
of ability in public contact or personal relations. 


For a review by Ruth Strang, see 40:1672. 


[ 692 ] 
*Self-Rating Scale for Leadership Qualifications. 
Foremen and supervisors; 1942; 25¢ per copy; E. J. 
Benge; National Foremen's Institute, Inc. 


[ 693 ] 
Survey of Company Morale: Job Satisfaction 
Blank No. 12. Adults; 1933-43; 1 form ; $2.25 per 25, 
postpaid; 35¢ per specimen set, postpaid; nontimed 
(10-20) minutes ; Robert Hoppock ; Psychological Cor- 
poration, 
REFERENCES 

1. Horrock, Rosrrt. “Job Satisfaction of Psychologists.” 

J Appl Psychol 21:300-3 Je '37. * (PA 11:4723) 


William W. Waite, Associate Professor of 
Industrial Engineering, Columbia University, 
New York, New York. This form was prepared 
to assist employers in surveying workers to 
ascertain the degree of job satisfaction present 
in the group. A test of this type is sorely needed 
in industry and business, although many em- 
ployers have failed to realize the urgency of 
the need and continue to rely on haphazard 
opinion and rumor to guide them. The Hoppock 
survey, although copyrighted in 1943, is based 
largely on work done in 1933-35, and the revi- 
sions in the original form are the result of a 
trial of the questions on a sample of only about 
300 persons. The questions in the survey orig- 
inally provided for three degrees of job satis- 
faction, one neutral response, and three degrees 
of dissatisfaction. As a result of the trial, the 
three degrees of satisfaction were combined into 
a single choice and the same was done with the 
degrees of dissatisfaction, so that the present 
form has only three alternatives. 

The questions in the survey are fairly simple 
and almost any person filling out the blank 
should be able to select, without undue hesita- 
tion, one of the three alternatives. However, 
the greatest uncertainty in any such test is the 
honesty with which the questions are answered. 
Hoppock has recognized this fact and has pro- 
vided a final question, appropriately numbered 
64, which reads, “In answering these questions, 
have you told what you honestly believe to be 
the truth in all cases?” It hardly seems prob- 
able that a person who had answered the pre- 
ceding questions falsely would be likely to admit 
the fact. On the other hand, such a question 
appears to this reviewer to throw the veracity 
of all those filling out the form into question 
and to be likely to arouse antagonism among 
persons who would otherwise be cooperative 
and honest. 

There are several questions which appear 
not to be significant, as they may not be related 
to the job and do not provide information on 
the basis of which specific corrective action can 
be taken, particularly as the survey must be 
anonymous. Such questions include “Do you 
often feel just miserable?” and “Have you been 
worried about anything recently ?" 

The "norms" given in the manual accom- 
panying the test appear to be based on replies 
from 309 persons, all in one small community, 
taken in 1933. They can hardly be considered, 
therefore, as typical of postwar attitudes in vari- 
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ously located communities of widely differing 
size, economic status, and degrees of industriali- 
zation. Before this survey or, for that matter, 
any morale survey can be considered in any way 
standardized, a great deal of work must be 
done in validation and the establishment of 
norms. It is to be hoped that some survey of 
this type will be made available shortly in a 
standardized form, as it is essential to the 
smooth and proper functioning of our indus- 
trial life, 
[ 694 ] 

*Test of Practical Judgment. Grades 9-16 and 
adults; 1942; preliminary manual ; $3.75 per 25; 50 per 
specimen set; 35(40) minutes; Alfred J. Cardall; Sci- 
ence Research Associates. 


REFERENCES 
1, CARDALL, ALFRED J., Jr. A Test for Primary Business In- 


terests Based on a Functional Occupational Classification. Un- 
published doctor's thesis, Harvard University, 1941. 

2. Hocanone, EpwINA, AND SMITH, Leo F. “Some Evidence 
on the Validity of the Cardall Test of Practical Judgment.” 
J Appl Psychol 31:54-6 F "47. * (PA 21:2492) 

Glen U. Cleeton, Director, Division of Hu- 
manistic and Social Studies, Carnegie Institute 
of Technology, Pittsburgh, Pennsylvania. The 
purpose of this test is indicated by the compiler 
as being the measurement of "the element of 
practical judgment as it operates in everyday 
business and social situations." This is supple- 
mented by a statement that "practical judgment 
is an important consideration in the selection of 
executives, supervisors, salesmen, those who 
are working in contact occupations and those 
responsible for planning and supervising work." 
It is also suggested that the test might be used 
in schools as a measure of "social maturation." 

It is claimed that the test measures a factor 
which is statistically independent. Evidence of 
low correlation of this test with tests of intelli- 
gence, clerical aptitude, personality, reading, 
and other psychological measuring devices is 
given to substantiate this contention. Reliability 
coefficients of .88 and .86 are reported. 

A validity coefficient of .56 between test 
scores and supervisor's ratings is indicated. It 
is claimed also that miscellaneous applications 
of the test have revealed that scores can be 
used to discriminate between executive and 
clerical personnel and, furthermore, that the 
test "discriminates within the range of execu- 
tive ability itself." Opinions of industrial users 
are offered as supplementary evidence of valid- 
ity. The difficulty of obtaining satisfactory cri- 
teria with which to correlate test results is 
stressed. 

The test is self-administering and is printed 


in such a manner as to permit answers to be 
indicated with a pin or special punch which 
perforates a circle on sealed scoring sheets, The 
time required for administering is approximately 
35 minutes. 

Percentile norms are indicated for high school 
seniors and adults, and means are reported for 
a few industrial groups. 

No references to published studies are in- 
cluded in the manual. 

In the selection of commercial and industrial 
personnel at the administrative and supervisory 
level, this test provides a measure of an impor- 
tant characteristic not measured by other stand- 
ard instruments. 


Howard R. Taylor, Professor of Psychology 
and Head of the Department, University of 
Oregon, Eugene, Oregon. Any objective method 
of distinguishing between “common sense" and 
ability to comprehend abstract verbal material 
such as is required for success with standard 
intelligence tests would be very useful and have 
considerable theoretical importance. Unfortu- 
nately the author seems to have published no 
information about the construction and valida- 
tion of Cardall’s Test of Practical Judgment 
except that given in the preliminary manual. 
The extent to which this test samples habits of 
thought and action unmeasured by other psy- 
chological tests is obviously crucial. Correla- 
tions between the Test of Practical Judgment and 
24 other variables such as intelligence, clerical 
aptitude, arithmetic reasoning, reading compre- 
hension, personality questionnaires and college 
grades are cited in the manual. All are low 
ranging from —.11 to +.15. Ten are negative. 
Neither the standard deviations of the different 
variables nor the number of cases involved in 
the computations is reported. Taken at face 
value, however, such negligible correlations im- 
ply that the test has nothing in common with 
the general intellectual factors which predomi- 
nate in measures of school achievement and of 
clerical and scholastic aptitude. 

Just why 48 questions similar to those of the 
practical judgment subtest in the Army Group 
Examination, Alpha, which has a high correla- 
tion with the other tests of 4 battery constructed 
to measure general intelligence, should be un- 
correlated with tests known to require abstract 
capacity of the same sort is surprising. Such 
terms as nurtures, technicians, prestige, par- 
ticipator, mortality, feigned, etc. appear in the 
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problem situations and various alternatives of the 
Cardall test. One would expect to find some cor- 
relation even in a narrow range of occupational 
ability. Moreover, two of the questions in the 
test occur also in well-known intelligence scales 
—one at the superior adult level in the Revised 
Stanford-Binet Scale, the other in the Wechs- 
ler-Bellevue Intelligence Scale. It is conceivable 
of course that scoring determined by a consen- 
sus of opinion from the realm of “practicality” 
with weightings derived by item analysis for 
first, second, and third choices among four al- 
ternatives could reflect chiefly "attitudes" rather 
than logical reasoning. Inspection of the heavy 
weights assigned to certain answers which seem 
dubious on logical grounds make this explana- 
tion plausible. The reviewer has found a corre- 
lation of .27 between scores on the Cardall test 
and the Ohio State University Psychological 
Examination for a group of 202 college sopho- 
mores. Hogadone and Smith (2) report a 
correlation of .22 between scores on the 1941 
form of the ACE Psychological Examination 
and scores on the Cardall test made by 91 stu- 
dents registered in a three-year cooperative 
course in retailing. These students were em- 
ployed by department stores in western New 
York for alternate four-week periods. Ratings 
for "practical judgment" based on a "behavior 
journal" and made by supervisors in close con- 
tact with the students and their employers cor- 
related only .24 with Test of Practical Judg- 
ment scores. The same ratings correlated .28 
with the American Council on Education Psy- 
chological Examination. Although the difference 
is not statistically significant, the ACE seems to 
be a slightly better test of practical judgment 
than the Cardall which was especially devised 
for measuring it. In so far as the ratings pro- 
vide a suitable criterion of “common sense,” 
neither test would be of much practical value. 
In the manual Cardall reports a reliability 
coefficient of .88 estimated from self-correlation 
in a “homogeneous” group. The only validity 
coefficient cited is a correlation of .56 between 
scores on the test and pooled supervisory rat- 
ings in an accounting department. With inde- 
pendent criteria of occupational success ap- 
parently available, it should not have been 
necessary to use the difference between a high- 
scoring and a low-scoring group on the “even” 
scores of the test as a criterion for determina- 
tion by item analysis of the weights assigned 
to various answers. Whatever value the criterion 


of “internal consistency” may have in theoreti- 
cal problems, the use of occupational success is 
surely preferable for practical purposes. 
Possibly a detailed report of the criteria and 
statistical analyses used in constructing and 
validating the test would remove the doubts 
raised by the independent studies cited above. 
Until that happens its worth as a diagnostic 
instrument should be considered questionable. 


[ 695 ] 
Vocational Aptitude Examination, Type E-A, Re- 
vised 1946, Male high school graduates; 1935-47; 1 
form; 734¢ per copy, 10-99 copies; 50¢ per specimen 
set; part nontimed (65) minutes; Glen U. Cleeton and 
Charles W. Mason; McKnight & McKnight. 


REFERENCES 

1-3. See 40:1679. 

4. CLEETON, GLEN U., AND Mason, CmarLes W, Chap. 6, 
"Tests of Executive Ability," pp. 171-92. In their Executive 
Ability: Its Discovery and Development, [Second Edition]. 
Yellow Springs, Ohio: Antioch Press, 1946. Pp. xv, 540. $4.50. * 
(PA 21:566) 

D. Welty Lefever, Professor of Education, 
The University of Southern California, Los An- 
geles, California. The title of this battery of 
tests conceals rather effectively its major use 
in "spotting potential executive material." The 
examination is designed to be employed as an 
“aid in classifying men" into four general occu- 
pational groups: executive, sales, accounting, 
and technical (scientific or engineering) per- 
sonnel. Such differentiation results largely from 
an analysis of a profile based on individual 
ratings from each of the eight tests forming the 
battery. 

The examination is designed to be used in 
business, industry, and government in the ac- 
tual selection of personnel for employment or 
promotion. It is assumed that all available bio- 
graphical data and sources of appraisal are 
focused on the same problem. A well-planned 
personal history blank constitutes the final page 
of the test booklet. The authors claim greater 
validity when the test is applied to college grad- 
uates, and they assume the minimum of high 
school graduation for all testees. The examina- 
tion appears, however, to be best suited for use 
in guiding college students in formulating voca- 
tional plans with respect to the four occupa- 
tional areas concerned, 

The manual states that the examination pur- 
ports to measure “intellectual aptitudes.” It is 
not easy to reconcile such a statement of lim- 
ited scope with the inclusion of an interest in- 
ventory and a test of “social responsiveness.” 
However, since there is no single summarizing 
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score for the complete battery, these two meas- 
ures may be considered as additional indicators 
for predicting success in the four fields rather 
than as parts of a unitary test of intellectual 
aptitude. 

Interest inventories and adjustment question- 
naires are subject to a certain degree of “fudg- 
ing,” and some allowance must be made for this 
factor if the subject has a personal gain at stake 
in a high score. This reviewer administered Test 
8, Social Responsiveness, to a group of graduate 
students in education, first, on an unidentified, 
“sincere” basis and, second, on the basis of pro- 
ducing a score related to high executive ability. 
Six out of 30 made A or better on the first trial ; 
8 reached that level by conscious effort. The 
average score rose from a very low B rating on 
the first trial to a high B on the second trial. This 
gain is probably not too serious. 

The major portion of the examination appears 
to be a modified form of scholastic aptitude test 
in which the verbal factor receives less than the 
usual emphasis and in which specialized study 
tends to be a handicap rather than an asset. 
Broad acquaintance with many areas of experi- 
ence rather than systematic, academic study is 
required. An estimation test, for example, meas- 
ures the subject’s ability to guess the answers to 
a variety of statistical or numerical questions at 
a high rate of speed. The authors report high 
scores for the sales group and low scores for the 
engineers. Such differentiations indicate consid- 
erable validity for the general design of the 
examination. Among the four vocational areas 
analyzed the highest minimum score for the 
examination as a whole appears to be necessary 
for success as an executive. The evidence in gen- 
eral, as presented by the authors and confirmed 
in the observation of the reviewer, indicates that 
superior intelligence is necessary but not suffi- 
cient for success on the supervisorial and mana- 
gerial levels. 

This reviewer has often wondered why test 
batteries designed for use in business or industry 
do not include more items (such as paragraphs 
to be read, vocabulary to be defined, arithmetical 
problems to be solved, and the like) from mate- 
rials with greater “face validity.” This examina- 
tion would have been improved if more of the 
subject matter possessed a greater apparent 
functional significance for the occupational areas 
concerned. 

More detailed research data on validity should 
have been reported in the test manual or in the 


authors’ volume, Executive Ability: Its Dis- 
covery and Development (4). Rather vague 
statements are made about “high” or “low” 
scores without furnishing the reader with an 
adequate basis for an evaluative judgment. The 
authors specifically deny claim to validity as 
measured by such criteria as earnings, continu- 
ity of employment, or other “conventional stand- 
ards.” The most direct evidence on validity ap- 
pears to be the difference in scores made on the 
several tests by representatives of the occupa- 
tional groups: executives, salesmen, account- 
ants, and technical personnel. Exact quantitative 
statements, however, are lacking. 


Benjamin Shimberg, Research Assistant, 
Division of Educational Reference, Purdue 
University, Lafayette, Indiana. This examina- 
tion is offered as an aid in selecting and promot- 
ing certain types of personnel in industry and 
as a counseling instrument in vocational guid- 
ance, The test battery is designed to evaluate 
rapidly an individual with respect to (a) general 
executive aptitude; (b) sales aptitude; (c) en- 
gineering, research, and technical aptitude ; and 
(d) accounting aptitude. Recommendations for 
the various fields are made on the basis of the 
pattern of scores on the eight subtests which 
comprise the battery. 

The test battery is made up of the following 
subtests: Test r, General Information; Test 2, 
Arithmetical Reasoning; Test 3, Judgment in 
Estimating; Test 4, Symbolic Relationships; 
Test 5, Reading Comprehension ; Test 6, Vocab- 
ulary ; Test 7, Interest; Test 8, Social Respon- 
siveness. Tests 1, 2, 5, and 6, which take 40 
minutes of testing time, are said to constitute an 
adequate measure of general intellectual ability 
(4, p. 176). De 

The reading subtest takes only six minutes 
and consists of five paragraphs with five ques- 
tions on each paragraph. Judgment in Estimat- 
ing requires the examinee to answer 30 questions 
calling for information such as "Estimate the 
average number of navy beans to the bushel.” 
On this test, credit is granted for answers falling 
within a given range, with higher credit for 
answers which come closest to the correct figure. 

The interest test consists of 100 words and 
phrases related to occupations and interests. The 
subject indicates those which he likes and those 
which he dislikes. Social Responsiveness is 
divided into five sections: dominance-submis- 
sion (1o items), independence-dependence (10 
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items), extroversion-introversion (20 items), 
Social responsiveness (20 items), and social 
knowledge (20 items). The number of items 
used in the various sections appears to be too 
few to give a reliable measure of the separate 
traits being investigated. As in most personality 
questionnaires, the questions are in a “yes” or 
“no” form and depend on the truthfulness of the 
applicant for validity. No evidence is presented 
to show that persons seeking employment or 
wishing to be considered for promotion can be 
depended on to reply truthfully on such ques- 
tionnaires. 

The 1946 revision of the examination is essen- 
tially the same as the 1935 edition. Very few 
items have been changed, although the informa- 
tion called for in the estimating subtest has been 
brought up to date. Scoring of the test is per- 
formed manually, with a key which is placed 
alongside the answers written in the test book- 
let. Booklets can be used only once. 

A manual is provided which includes norms 
by which to interpret test scores for'each of the 
occupational fields. It is necessary, however, first 
to convert the raw scores on each subtest into 
percentile equivalents and then to translate these 
to a seven-point scale in order to interpret the 
norms for each group. Two sets of norms are 
given. One is for use with high school gradu- 
ates, college freshmen and college sophomores. 
The other is for use with upper-class college 
students and for college graduates. Persons 
using the test are cautioned by the authors not 
to interpret the test scores in absolute terms, 
but to weigh and judge the results in relation 
to other information about the individual. 

The reliability coefficients reported in the 
manual are said to be based on 850 cases, but 
no description is given of the sample used. The 
reliabilities range between .79 and .86 for the 
several subtests. When the interests and the so- 
cial responsiveness subtests are omitted, and 
the remaining six subtest scores combined into 
a total score, the reliability is reported to be 94. 
The method used to obtain reliabilities is not 
given. 

No evidence is presented for the validity of 
the examination. The references cited in the 
manual are largely based on an investigation by 
one of the authors (Mason) conducted in 1930 

on “The Possibilities of an Objective Executive 
Aptitude Test.” This investigation was merely 
exploratory. It did not utilize criterion groups 
from business and industry, but relied on cri- 
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terion groups made up chiefly of college students 
who had been rated by other students and by 
their instructors as possessing traits supposedly 
associated with sales, executive, and technical- 
research groups. 

The recent book, Executive Ability: Its Dis- 
covery and Development, by Cleeton and Mason 
presents some of the background for the exam- 
ination and the theory of executive ability which 
underlies it. However, the book fails to present 
any data which might be considered as validat- 
ing the instrument. Most of the generalizations 
about the discriminating power of the test seem 
to have been drawn from the findings of the 
original study by Mr. Mason. If the authors have 
information which bears out their claims that 
the test differentiates significantly among the 
various groups, they have failed to give any con- 
crete indication of their results in either the man- 
ual or in their 540-page book. 


For reviews by Harold D. Carter and M. R. 
Trabue of an earlier edition, see 40:1679. 
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*[American Transit Association Tests.] Selection 
of transit Shesattie persctel 1941-46; 4 tests; Glen 
U. Cleeton in collaboration with Merwyn À. Kraft and 
Robert F. Royster; American Transit Association. 

@) STANDARD EXAMINATION FOR TRANSIT EMPLOYEES, 
pu Forms A and AA for street car operations ; 

orms B and BB for bus operations ; 20(30) or 30(40) 
minutes; $5 per 100. 

) PERSONAL REACTION TEST FOR TRANSIT EMPLOYEES. 
1943-46; 1 form (Series A) ; nontimed (30) minutes ; 
$10 per 100. 

C) AMERICAN TRANSIT MOTOR ABILITY TEST. 1946; (45) 
minutes ; prices upon application. 

) PLACEMENT INTERVIEW FOR TRANSIT EMPLOYEES. 
1046; $5 per 100. 


REFERENCES 


1. Watts, J. V. “The Use of the American Transit Motor 
Ability Test in the Selection of Bus and Street Car Operators,” 
P. 340-53. In Proceedings of the Twenty-Sixth Annual Meet- 


ing of the Highway Research Board, December 1946. 


Harold G. Seashore, Director, Test Division, 
The Psychological Corporation, New York, New 
York. The American Transit Association has 
prepared a manual entitled Selection and Em- 
ployment of Transit Operating Personnel. Three 
stages are defined and procedures outlined for 
each: qualifying, testing, and evaluation. This 
review is concerned with the testing stage. Four 
testing units are prescribed: mental ability, 
visual skills, personal reactions (attitude and 
personality), motor ability. Although these tests 
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were developed by a committee of the association 
and the manual contains other names, it appears 
that Glen U. Cleeton is the professional psy- 
chologist responsible for their development. 

MENTAL ABILITY TEST. This test, entitled 
Standard Examination for Transit Employees, 
is an Otis-type test, with many verbal items 
possessing “face validity” since they involve 
transit terms and ideas, There are 60 items. 
Thirty minutes is the recommended time, but 
twenty minutes is permitted. Norms are in four 
categories of score range: normal hiring range, 
poor instruction risk, not likely to be satisfac- 
tory, and rapid learner who may not maintain 
interest. Statistical data are sparse and reference 
is made to an original study by Cleeton which is 
not published. Reliability is said to be about .go. 
Validity was established in the initial project, 
and subsequent checks are said to support the 
initial findings. The test is said to correlate sig- 
nificantly with the Otis Self-Administering Test 
of Mental Ability, but values are not given. To 
secure 100 “qualified operators,” 150 candidates 
in “normal hiring range” were needed, while 280 
candidates in the “unsatisfactory” category had 
to be screened. There are two forms, but four 
sets of identifying letters; A and BB are identi- 
cal, and B and AA are identical except that in A 
and B the words “street cars” are used in one 
item and in BB and AA the word “buses” ap- 
pears. 

VISUAL SKILLS TEST. The manual recommends 
several of the standard visual tests and offers no 
special norms or procedures. 

PERSONAL REACTION TEST FOR TRANSIT EM- 
PLOYEES. This test “measures attitudes and other 
personality characteristics relating primarily to 
the qualities of stability and maturity.” It is 
claimed that this special test is better than those 
commercially available because (a) the time re- 
quirement is shorter, (b) transit language is 
used, and (c) nonpsychologists can interpret the 
scores. 

There are 50 multiple-choice items couched 
primarily in terms of specific transit situations 
or general public contacts. The reviewer encoun- 
tered difficulty with the grammatical structure 
of the items. These are not questions with 
answers. The “stem” is sometimes a question, 
sometimes a part of a sentence which the choices 
will complete, and sometimes a phrase with four 
reactions one might make to the phrase. One of 
these items is reproduced. 


When you have a day in which everything seems 
to go all wrong: 

(1) I never seem to have such days 

(2) Become very upset until it is all over 

(3) Annoys me, but I make the best of it 

(4) I seem to have more than my share of these 

days 

It may well be that in practice such inconsistent 
and sometimes incorrect language makes no dif- 
ference, but the reviewer feels that test items 
should be presented in straightforward, accept- 
able English. 

In 26 of the 50 items of this personality test 
the correct answer is some minor variation of 
these expressions: “does not bother me,” “an- 
noys me mildly,” “rarely disturbs me,” or “pay 
no attention to it.” Such responses are practically 
never included as wrong choices. Even though 
these phrases probably describe attitude and 
personality characteristics that are desirable in 
transit employees, it does seem that more im- 
agination could have been used in getting variety 
and in disguising the best answer. Similarly, one 
would wonder what the semantic difference is 
between these two choices to one question: “pay 
little attention to it,” and “does not disturb me.” 
Both of them, it seems to the reviewer, are “well 
adjusted” answers to the question: “How does 
being observed by the examiner while taking 
this test affect you?” 

Interpretation is in terms of accident poten- 
tialities and public relations. A score of 35 or 
more suggests “an excellent record with respect 
to accidents and public relations.” A score of 25 
or lower is indicative of a poor risk. The man- 
ual’s statement on reliability is remarkable for 
its caution. It stresses the complexity of person- 
ality and the difficulty of its appraisal even with 
thorough clinical study. No data concerning re- 
liability are given, only a statement to the effect 
that persons can misrepresent themselves and 
also actually do change from time to time. With 
respect to validity, the test is called promising. 
The reviewer fails to see how a test with such 
real difficulties can be usable by persons without 
the caution and psychometric sophistication of 
trained psychologists. 

Since the test is specifically interpreted in 
terms of accident proneness, it seems elementary 
that it should not be used until a clear-cut statis- 
tical report is available concerning its ability to 
discriminate among transit operators with vary- 
ing numbers of accidents and that, further, such 
data should be based upon administration of the 
test prior to employment of the operator. The 
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association reports that such data may become 
available soon. 

MOTOR ABILITY TEST. This psychomotor test 
is designed to measure sensori-motor coordina- 
tion, ability to concentrate, ability to react speed- 
ily and accurately. It is also said to measure 
“teachability” and learning capacity. 

The equipment is a simulated driver’s station 
(seat, wheel, clutch, brake, and shift lever), a 
stimulus panel (various lights including a stand- 
ard traffic signal), an examiner’s control panel, 
and an automatic timing and recording appara- 
tus. 

The test activities are too complex for descrip- 
tion here. They begin with simple reaction pat- 
terns involving elementary hand or leg or hand 
and leg movement. Later more complex decision- 
reaction times are involved. 

The published manual of instructions does not 
contain the details of administration and scoring. 
A supplementary manual which comes with the 
equipment provides the details. Norms are ex- 
pressed in terms of four ranges of scores. 

Validity is expressed in terms of the number 
of candidates in several score ranges who must 
be selected to secure 100 qualified operators. In 
one study, 105 high-ranking candidates and 206 
low-ranking candidates were necessary to pro- 
duce 100 successful trainees from each group. 
It is not reported whether the other three tests 
were part of the screening process, If the test is 
as sharply discriminating as this seems to indi- 
cate, it is, first, a remarkably successful use of 
a motor ability test to predict later efficiency and, 
second, one which deserves much more complete 
description in the literature. During World War 
II, the military psychologists apparently did not 
secure such satisfactory predictions by means of 
psychomotor tests. One would also like to know 
to what extent scores on the motor ability test 
correlate with those on the mental ability test. 

This series of tests is designed for use by 
members of the American Transit Association. 
It is the belief of the reviewer that the authors 
of a special purpose test battery are not relieved 
of responsibility for publishing, at least for pro- 
spective users, the statistical information usually 
considered essential to test evaluation. Presum- 
ably, this battery has characteristics beyond face 
validity which commend it. The absence of tech- 
nical reports precludes an adequate appraisal at 
this time. 


Morris S. Viteles, Professor of Psychology, 


University of Pennsylvania, Philadelphia, Penn- 
sylvania. The procedures for the selection and 
employment of transit operating personnel de- 
scribed in the four manuals are gratifyingly com- 
plete and balanced in general scope and disap- 
pointingly inadequate in detail. The 3-stage, 10- 
step program, premised on job analysis, repre- 
sents an orderly approach to the selection of 
qualified personnel which judiciously applies 
what is known about good practices in employee 
placement. Of particular merit is the evaluation 
of each step in the selection program, including 
a delineation in terms readily understandable to 
the layman of the general scope and general lim- 
itations of psychological tests in relation to other 
selection devices. 

In striking contrast are the claims made for 
individual procedures, especially certain of the 
psychological tests, in the absence of adequate 
supporting data. Since the manuals are written 
essentially for the laymen, as aids in selling the 
uniform selection program to the transit indus- 
try, it cannot be expected that detailed statistical 
data will be presented. The facts that some data 
on both reliability and validity are given and that 
reference is made to the need for further experi- 
mentation represent a refreshing note in a series 
of manuals which are frankly geared to a selling 
program. Nevertheless, the individual tests can- 
not be properly recommended with anything like 
the degree of assurance which marks their pres- 
entation in the manuals. 

With respect to every test, there are questions 
which require answers before firm recommenda- 
tions as to its usefulness can be made. For exam- 
ple, it is stated that the Standard Examination 
for Transit Employees, designed to “measure an 
applicant's capacity or ability to absorb instruc- 
tion,” has a reliability of .90, but no validity co- 
efficient is given, although there is some evi- 
dence on validity in terms of the ratio of qualified 
employees to applicants for stated ranges of test 
Scores. 

A validity coefficient of .32 is reported between 
scores on the American Transit Motor Ability 
Test and "individual accident responsibility 
rates," and a coefficient of .39 between scores on 
this test and supervisory ratings, but no state- 
ment is made concerning reliability, although an 
earlier article reports reliabilities ranging from 
76 to .82 on eight series of measures on this 
test. For this test, too, there are data on the 
percentage of men with varying scores qualify- 
ing as acceptable employees. 


Tg MM ————— 
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In the case of neither test is there any indica- 
tion as to how many applicants will have to be 
tested in order to secure a stated number of 
qualified operators for each range of test scores, 
although from the correlations reported, it is ap- 
parent that a very large number of applicants 
would be required to provide a high percentage 
of qualified operators. Moreover, nothing is said 
about the correlation between the two tests al- 
though it is possible that one of the two tests 
may give approximately the same results as both 
in combination. 

In general, there are inadequate data on the 
nature of the experimental populations, a failure 
to define criteria accurately, and similar impor- 
tant omissions. It may be that such data are 
available, but a review of the literature from 
1935 on does not reveal sufficient supplementary 
data to provide the assurance needed for recom- 
mending the tests for use in the widely varying 
operating situations characterizing the transit 
industry. 

In the case of the Personal Reaction Test for 
Transit Employees, specific reservations are 
made as to the limitations of paper-and-pencil 
tests in measuring personality traits and as to the 
use of the test as a guide rather than as a basis 
for final judgment of attitudes. Nevertheless, 
these are almost completely negated by supple- 
mentary statements, made in the complete ab- 
sence of data on reliability and validity, that 
the test has "bearing on accident susceptibility 
and provides information for evaluation of pro- 
spective operators on an important phase of the 
job, public relations, not directly measured by 
other tests." 

The manuals include specific recommenda- 
tions as to the limits on age and other personal 
characteristics and scores on visual and similar 
examinations which should be used in selecting 
transit employees. At no point is there suggested 
the need for the experimental validation of such 
items, in spite of the fact that weighted applica- 
tion scores have shown the marked value of such 
an approach and also in spite of the evidence 
gathered, particularly during World War II, 
that a priori judgments concerning visual and 
similar requirements may be entirely untenable. 

In general, the review of the manuals suggests 
the need for caution in accepting the recom- 
mendations and for viewing the tests as experi- 
mental instruments which require detailed veri- 
fication by companies which may consider their 
use for the selection of operating personnel. 


J. V. Waits, Personnel Psychologist, Capital 
Transit Company, Washington, D. C. This bat- 
tery of selection tests was designed for a specific 
purpose and should be viewed from that perspec- 
tive. The scope of its effective use is probably 
limited to transit, taxicab, and trucking firms. It 
was not the objective of the authors, nor is it 
general practice, to use the tests for individual 
prognosis, guidance, or counseling. The goal is 
to secure information for isolating from the gen- 
eral population of applicants for employment the 
group that will furnish the best risks as em- 
ployees. 

The test battery covers three of the “nine 
essential qualities” of the job specifications for 
street car and bus operators: (a) mental ability, 
(b) sociability, and (c) motor ability. These 
terms need no special definition. 

Validation of the battery as a whole is in the 
very early stages. Most of the work to date has 
been devoted to evaluation of the technical suffi- 
ciency of the individual tests. The number of 
cases and the spread of use are too limited to 
make positive assertions about validity. Prelim- 
inary indications are promising, and extensive 
use of the battery in the transit industry is vir- 
tually assured. 

THE STANDARD EXAMINATION, This is a dis- 
guised intelligence test. It is couched in the lan- 
guage of the industry, and the problems are those 
likely to be encountered in the day-to-day work 
of the operators. The language of the test is well 
chosen, the difficulty level is well adjusted to the 
population, the figures and illustrations are 
clear, the printing and readability are excellent. 
Administration, scoring, and interpretation 
problems are well within the capabilities of the 
normal employment office personnel. Individual 
administration of the test, a necessity for an em- 


ersome than some other self-administering tests. 
Scoring is somewhat slower because of the num- 
ber of pages to be covered, making scoring errors 
slightly more likely. 

Validity as an intelligence test is predicated on 
correlation with the Otis Self-Administering 
Test of Mental Ability. Correlations of .80 to .89 
are reported. Validity as a selective factor of 
successful employees is tentative. Correlation co- 
efficients from .20 to .30 have been found with 
several criteria. Use of the correlation ratio 
would probably increase the validity coefficients 
since the relationship of intelligence test scores 
with most criteria of the industry is curvilinear. 


E ees office, makes it somewhat more cum- 
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Reliability by split-half method ranges from 
.88 to .93 for the several forms and for varying 
populations. For the general use made of the 
test, the reliability appears satisfactory. 

The test is not well adapted for use as a gen- 
eral employment instrument. In particular, we 
would hesitate to use it for clerical and office 
employees, although an experiment by the re- 
viewer with a sample of about 200 indicated 
reliable results. 

THE PERSONAL REACTION TEST. This is a mul- 
tiple-choice-type test. It suffers from many of 
the weaknesses of tests of personality, tempera- 
ment, or attitude. It has the advantage of not 
requiring a highly trained specialist to interpret 
it. The test is in a transitory stage, and results 
are too vague to evaluate the test at this time. 
Reliability coefficients of .79 to .89 are reported. 
Validity is far from established, although some 
encouraging correlations with other personality 
tests have been obtained. 

` THE MOTOR ABILITY TEST. This is a perform- 
ance test which involves reaction time, attention 
qualities, learning ability, ability to follow direc- 
tions, and motor coordination. The test involves 
the execution of given movements of the hands 
and feet with a steering wheel, gear shift lever, 
and two foot pedals in response to visual stimuli 
in the form of light patterns on a signal board 
in conjunction with a stand traffic signal. The 
test is composed of five parts, each of which is 
preceded by a standard practice period. Diffi- 
culty and complexity of the task increase con- 
tinuously as the test progresses. Stimuli are 
administered automatically and responses are 
recorded. d 

Validation of the test is still in the preliminary 
stages. Significant correlations of test scores 
with subsequent accident records have been ob- 
tained. These range from .30 to .45. Biserial 
correlation coefficients of .48 to .63 have been 
obtained between test score and accident record. 
The mean accident record was used as the point 
of dichotomy. Reliability appears to be satisfac- 
tory for the general uses of the test. Split-half 
reliability of .82 to .88 is reported or has been 
obtained in experiments by the author. 

The test is now in use at two companies, and 
some six to ten orders for test apparatus are in 
varying stages of process. Results to date are 
much more satisfactory than one would normally 
expect from the test. The reviewer has recently 
completed a two and a half year study of the test 
and recommended its inclusion in the Capital 


Transit test battery on the basis of its predictive 
value. 
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*Detroit. Retail Selling Inventory. Candidates for 
training in retail selling; 1940; 1 form; $4.50 per 100; 
20¢ per specimen set, postpaid; part nontimed (30) 
minutes ; Barty J. Baker and Paul H. Voelker ; Public 
School Publishing Co. 


Milton E. Hahn, Dean of Students and Pro- 
fessor of Psychology, University of California, 
Los Angeles, California. The reliability coeffi- 
cients reported in the manual for this instrument 
are somewhat below those usually demanded for 
test use with individual subjects. Reliability co- 
efficients are reported for various parts of the 
test as follows: personality, .86; intelligence 
(first half), .76; intelligence (second half), .80; 
checking, .98; and arithmetic, 77. The manual 
recommends the use of the instrument for two 
groups: students in (secondary ?) schools being 
considered for retail sales training and inexperi- 
enced applicants for retail selling jobs in busi- 
ness. Except for the checking test, the reported 
reliabilities appear questionable for the first 
group because most modern schools can admin- 
ister comparable tests and inventories with 
higher reliabilities. The test is defensible as a 
rough screen for the employment office which 
has insufficient professional and technical staff 
for a more complex screen. 

In the light of the data on reliability, the use 
of such extremes as the top 2 to 10 per cent of 
the clerks for the "superior" group and the low- 
est I to 5 per cent for the "inferior" group is 
questionable. This maximizing of differences be- 
tween extremes is not a particularly rigorous 
test of differentiating power. The criterion group 
was composed of 312 retail salesclerks “em- 
ployed in seven different types of retail stores in 
Detroit, Michigan." The criteria of success are 
not stated in the manual, and no multiple regres- 
sion coefficient is supplied. The familiar phe- 
nomenon of regression, or loss of efficiency, when 
such an instrument is used with new groups is 
not discussed in the manual. 

The user of this instrument would have been 
helped somewhat in considering test validity if 
an optimal cut-score or percentage had been 
indicated. On page 11 of the manual the per- 
centages of "superior" and "inferior" group 
members are given by quartiles of (total?) score 
distribution. Without interpolation of percent- 
ages, which is not too satisfactory a procedure 
when the distribution is undefined, the best esti- 
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mate for the optimal cutting score is at the soth 
percentile. By so cutting, 71 per cent of the “su- 
perior" and 34 per cent of the "inferior" group 
would have been retained. 

The manual comes fairly close to being ade- 
quate. However, it implies most of the material 
which should have been included. For example, 
on page 2, it states: “Schools which already have 
a record of their pupils' intelligence and schools 
desiring to use a complete, general intelligence 
test, may simply omit the intelligence sections 
(pages 4 and 5) of this inventory." This implies 
that more reliable and better validated instru- 
ments could have been substituted for the shorter 
sections of the test having doubtful reliability. 
The value of nontest information about individ- 
uals is mentioned but is not sufficiently developed 
for the relatively naive user of the test. 

The authors of the test have carefully ab- 
stained from making any sweeping claims as to 
its value. The general tone of the manual is 
reserved and cautious. With further experimen- 
tation this instrument might be demonstrated to 
be a real aid to the employment manager who 
wishes data of this type to supplement the tradi- 
tional interview, letters of recommendation, and 
school records. If the authors revise their man- 
ual, it is suggested that they will devote more 
space to the problem of test score "ceilings" and 
“basements,” particularly in regard to the sec- 
tions purporting to measure “intelligence.” 

This inventory is a promising approach to 
measuring traits and qualities related to success 
in retail selling ; it is not recommended as a sub- 
stitute for longer, better standardized single in- 
struments available in many school personnel 
programs. Employment managers who are un- 
trained in psychology may find the instrument 
quite useful as a supplement to other techniques. 


Floyd L. Ruch, Professor of Psychology, The 
University of Southern California, Los Angeles, 
California. This test was designed to test general 
ability to sell and contains four subtests : person- 
ality, intelligence, checking, and arithmetic. It is 
immediately apparent that the tests included 
each cover an important factor and are only 
slightly related. 

The subtests of the inventory were originally 
parts of a battery of tests given to 312 sales per- 
sonnel. They were included only after a statisti- 
cal study proved them to be significantly impor- 
tant in distinguishing “good” salesmen from 
“poor” salesmen. Furthermore, each item of the 


La 


subtest on personality was subjected to similar 
statistical analysis before it was included as a 
part of the test. 

For the most part the personality items test 
the personality through the element of “inter- 
ests,” and only a few are of the variety where 
a “correct” answer to get a job as a salesman is 
obvious. The intelligence section tests vocabu- 
lary and word analogies and gives an index of 
intelligence or something similar to Thurstone's 
verbal factor. The authors of the test point out 
that a full intelligence test may be included rather 
than this section if the tester so desires, and the 
standards are given for the interpretation of the 
inventory under this condition. The checking 
section of the test is a speed test in discerning 
whether numbers, letters or words are alike or 
different. It is a test of perceptual speed. The 
arithmetic section is not verbal, but contains 
sheer computation problems in addition, sub- 
traction, multiplication, and division. 

The intercorrelations of the tests run from .60 
(between intelligence and arithmetic) to .05 (be- 
tween personality and arithmetic). 

The reliability coefficients were computed for 
each section by either the split-halves technique 
or the test-retest technique, depending upon the 
nature of the test; they run from .76 to .98. The 
reliability coefficients were established on a dif- 
ferent group from the standardization group but 
the number of common cases is not given. It 
would seem that the reliability of the test, or the 
extent to which it yields the same results each 
time it is given, is quite acceptable. 

The validity of the test, or the degree to which 
it selects acceptable sales personnel, was estab- 
lished by dividing the test scores by quartiles 
and giving the percentage of "inferior" sales- 
clerks and "superior" salesclerks in each quarter 
of the test scores. The criterion for "inferior" or 
"superior" rating of the sales personnel was a 
purely objective one, the sales production rec- 
ord of each clerk. It was found that 26 per cent 
of the 209 superior salesclerks received a test 
score in the highest quarter as against 4 per cent 
of the 107 inferior salesclerks; while in the low- 
est quarter of test scores there were only 6 per 
cent superior clerks as against 25 per cent in- 
ferior clerks. It is not clearly stated whether 
this group of salesclerks is the same group as that 
on which the original standards were obtained ; 
but if it is, much shrinkage may be expected 
when the test is applied to a new group. 

The authors also give hints as to selection 
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when a total score may be passing although cer- 
tain subtest scores are very low. Complete per- 
centile tables are given for each subtest as well 
as the total test with and without intelligence. 

A review of the test reveals that it seems to be 
worthy of the attention of persons in the business 
field desiring to select salesclerks as well as of 
persons in the vocational guidance field. How- 
ever, more research is needed. 


Ed Res B 21:28 Ja 14°42. William J. Jones. 
* The test is very practicable, takes about 30 
minutes to administer. A well-prepared Exam- 
iner’s Manual contains suggestions for giving the 
test, tables of percentile ranks, as well as aids 
in interpreting the results. 

J Bus Ed 17:41 Ap '42. This test does not 
pretend to be a basis for final prognosis of mate- 
rials for success in selling occupations, but it 
will serve as one major aid in the selection of 
students for distributive occupational courses. * 


[ 698 ] 

XEngineering and Physical Science Aptitude 
Test. Grades 11 and over; 1943; IBM; 1 form; 
separate answer sheets must be used; $4.00 per 25, 
postpaid; 50¢ per manual and hand-scoring keys, post- 
paid; Sof per manual and machine-scoring keys, 
postpaid; 6o¢ per specimen set, postpaid; 72(80-90) 
minutes; Bruce V. Moore, C. J. Lapp, and Charles H. 
Griffin ; manual by Henry Borow; Psychological Cor- 
poration, 


REFERENCES 


1. GnrrriN, Cartes H., anp Borow, Henry. “An Engineer- 
ing and Physical Science Aptitude Test." J Appl Psychol 28: 
376-87 O '44. * (PA 19:543) 


Norman Frederiksen, Associate Professor of 
Psychology, Princeton University; and Re- 
search Associate, College Entrance Examination 
Board ; Princeton, New Jersey. This test was de- 
veloped for the purpose of predicting success in 
engineering and science courses given by the 
Pennsylvania State College as part of a defense 
training program. The test is made up of six 
parts: Part 1, Mathematics (25 items) ; Part 2, 
Formulation (ro items) ; Part 3, Physical Sci- 
ence Comprehension (45 items) ; Part 4, Arith- 
metic Reasoning (10 items); Part 5, Verbal 
Comprehension (43 items); and Part 6, Me- 
chanical Comprehension (22 items). Five-choice 
items are used in Parts 1, 2, 4, and 5; true-false 
items are used in Part 3; and three-choice items 
are used in Part 6. 

The Engineering and Physical Science Apti- 
tude Test was compiled from parts of previously 
developed tests. The selection of materials was 
based on their correlation with scores on the 


Cooperative Physics Test jor College Students: 
Mechanics, Form 1936 B, which was adminis- 
tered at the end of a course in physics. The first 
three parts were taken from the revised Jowa 
Physics Aptitude Test. Most of the arithmetic 
reasoning items were chosen from Moore's Test 
of Arithmetic Reasoning on the basis of an 
item analysis, using scores on the Cooperative 
mechanics test as the criterion. The verbal com- 
prehension section is composed of the most 
technical terms in the vocabulary section of the 
Moore-Nell Examination for Admission which 
is used at the Pennsylvania State College. These 
items were used on the strength of the fact that 
many highly verbal items in the Otis Self- 
Administering Test of Mental Ability were 
found to have high correlations with the cri- 
terion; the actual items selected had not been 
previously tried out. The mechanical compre- 
hension items were selected from Bennett's Test 
of Mechanical Comprehension on the basis of 
an item analysis, using the Cooperative mechan- 
ics test as the criterion. 

The reliability of the total test is satisfactory, 
but the reliabilities of some of the part scores are 
too low to justify their use in individual place- 
ment or guidance. The split-half reliability of the 
total test, as reported in the manual, is .96, and 
the reliability coefficients for the part scores 
range from .68 to .93. The least reliable parts 
are Formulation (.68), Arithmetic Reasoning 
(.75), and Mechanical Comprehension (.82). 
These coefficients are based on a sample of 201 
men and women students. 

The intercorrelations of the parts are all below 
-55 except those between Mathematics and For- 
mulation (.61) and between Physical Science 
Comprehension and Mechanical Comprehension 
(.59). The low reliability of certain of the sub- 
tests, together with their relatively high correla- 
tion with other subtests, suggests that they might 
be eliminated without harming the total test. The 
median intercorrelation of the parts is .46. 

Validity coefficients are presented in the man- 
ual, based on 188 students who took a course in 
introductory engineering subjects. This course 
consisted of mathematics, chemistry, physics, 
drafting, and manufacturing processes. The total 
score on the Engineering and Physical Science 
Aptitude Test correlated with the average of the 
grades in all subjects to the extent of .73. The 
total score also had substantial correlations with 
mathematics (.59), chemistry (.66), and phy- 
sics (.72). When the parts of the test were 
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weighted by the multiple correlation technique, 
the correlation with average grade increased 
only to .76. Since the standard deviations of 
grades in the subjects are not presented, we have 
no way of knowing the extent to which variation 
in validity coefficients from subject to subject 
were influenced by the amount of scatter in as- 
signed grades. No description of the group used 
in this validity study is given except that the 
students were Pennsylvania high school gradu- 
ates, mostly males. As the authors rightly state, 
the Engineering and Physical Science Aptitude 
Test should be validated in terms of the situa- 
tion in which it is to be employed. 

Percentile norms for total test scores are pre- 
sented for both men and women. A considerable 
range of talent is apparently present in the stand- 
ardization groups, since 15 per cent of the group 
had not completed high school and a substantial 
proportion of the high school graduates had had 
some college training. The user.of this test will 
probably find it necessary to prepare norms 
based on his local group. 


Robert M. W. Travers, Associate Professor 
of Education, University of Michigan, Ann 
Arbor, Michigan. This test is mainly a com- 
posite of previously published tests and is 
vaguely described in the manual as "an ap- 
titude test for training in work of an engi- 
neering and physical science nature." The vague 
nature of the suggested uses of the test makes 
the reviewer wonder whether the materials 
were gathered together after a careful analy- 
sis had been made of a well-defined selection 
program. It seems that the material was first 
put together as part of a program for testing 
the individuals entering a miscellaneous group 
of courses designated as Engineering, Science, 
and Management Defense Training, which were 
organized in the early part of the war and which 
provided formal instruction for more than 
50,000 adults. The initial criterion of validity of 
the material was its effectiveness in predicting at- 
tainment in physics training. The test was found 
to be a fairly valid predictor of grades in intro- 
ductory engineering subjects given in the Penn- 
sylvania State College in the summer of 1942. 
The total test scores yielded a correlation of .73 
with the average grade for the entire course of 
introductory engineering subjects, and the low- 
est correlation of .35 was with drafting. 

Since several engineering schools have dis- 
covered that some form of space manipulation 
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test predicts successfully those aspects of engi- 
neering training which require the visualization 
of objects in three-dimensional space, it is regret- 
table that the test did not include a measure of 
this aptitude. The percentile norms for the test 
were established on the basis of a group which 
has many unusual characteristics. Consequently, 
it is almost impossible to derive any satisfac- 
tion from the use of these norms. However, one 
is encouraged to note in the final paragraph of 
the manual that further investigations are in 
progress and that these investigations will supply » 
more adequate data on the validity of the test. 


[ 699 ] 


*George Washington University Series of Nürs- 
ing Tests. Prospective nurses; 1931-44; 1¢ per profile 
chart; 40 per specimen set, postpaid; (180) minutes ; 
Thelma Hunt; Center for Psychological Service, 
George Washington University. 

d) APTITUDE TEST FOR NURSING. [031—40; Forms I, 2; 
$10 per 100; Form 1: 45(55) miritites; Form 2: 70(80) 
minutes; F. A. Moss is co-author of Form 1. 

b) READING COMPREHENSION TEST FOR PROSPECTIVE 
NURsES. 1940; Form 1; $5 per 100; 30(35) minutes. 

C) ARITHMETIC TEST FOR PROSPECTIVE NURSES. 1940; 
Form 1; $5 per 100; 20(25) minutes. 

d) GENERAL SCIENCE TEST FOR PROSPECTIVE NURSES. 
1944; Form 1; $7 per 100; 30(34) minutes. 

e) INTEREST-PREFERENCE TEST FOR PROSPECTIVE NURSES. 
1944; Form 1; $7 per 100; nontimed (30) minutes. 


REFERENCES 
1. RuixEHART, Jesse Batizy. "An Attempt to Predict the 
ew P SEG Ne the PDA E eR TERM of Tests." 
chol 17:277:93 Je '33- 25 
a Meit RUTE ATHERTON. An "Évaluation of Criteria for 
the Selection of Students in the School oj Nursing e the Uni. 
versity of Minnesota. Unpublished doctor's thesis, iversity of 
Minnesota, 1937. (Summaries of Ph.D. Theses . . + » 1939, PP- 


126-31. 
. Wirrrawsox, E. G.; Srover, R. D.; anp Fiss, C. B. “The 
Selecon of Student Nurses.” J Appl Psychol 22:119-31 Ap 
*38. * (PA 13:2688) " $. = 
4. KRAFT, Nourse. The Relationship Between Some Criteria 
Commonly Used in Selecting Nurse Applicants and Success in 
the School of Nursing With a Special Attempt at Validating the 
Moss-Hunt Aptitude Test, Unpublished master's thesis, Univer- 


sity of Buffalo, 1940. 

s. Dovcráss, Hart R., AND MERRILL, ,RuTH A.; ASSISTED BY 
Constance M. McCurriovom. “Prediction of Success in the 
School of Nursing," pp. PE In University of Minnesota 
Studies in Predicting Scholastic Achievement, Part Two. 
Minneapolis, Minn.: University of Minnesota Press, 1942. Pp. 
v, 75. Paper. $1.00. * (PA 17:1342) - 

6. Berc, Irwin Avoust, “A Study of Success and Failure 
Among Student Nurses." J Appl Psychol 31:389-96 Ag '47. " 
(PA 22:2384) 
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Probst Service Report. Adults; 1931-46; $125 per 
scoring and rating device and 1000 service report forms 
as selected ; 20-99 forms, se each; $1 per set of sample 
forms; J. B. Probst; Probst Rating System. 

a) EDUCATIONAL FORM. “For appraising the service 
value of teachers, principals, supervisors and others en- 
aged in the work of education” ; 1931. 

b) FIRE DEPARTMENT FORM. “For appraising the service 
value of fire department personnel—to be used only for 
those engaged in fire fighting work” ; 1932-45. 

C) GENERAL FORM. “For appraising the service value of 
employees, supervisors or any other personnel not cov- 
ered by the specialized forms" ; 1932-45. 

d) LABOR Form. “For appraising the service of un- 
skilled and semi-skilled labor” ; 1932. 
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€) PERSONAL FITNESS FORM. Also called Probst Per- 
sonal Fitness Report; 1936-45. a 

f) POLICE Form. "For appraising the service value of 
police personnel, guards, watchmen, and similar work- 
ers”; 1032-45. > . 
g) PROFESSIONAL FORM. “For appraising the service 
value of professional personnel—library, social service, 
medical, engineering, and others”; 1937-46. ‘i 
h) SKILLED LABOR FORM. “For appraising the service 
value of workers in the skilled trades and other similar 
employments” ; 1932-45. 


REFERENCES 

1. Prosst, J. B. Service Ratings. A joint publication of the 
Bureau of Public Personnel Administration and the Civil Serv- 
ice Assembly of the United States and Canada. Chicago, Tl.: 
Peete of Public Personnel Administration, 1931. Pp. 94. Out 
of print. 

H "Tons; Joun B. Measuring and Rating Employee Value. 
New IST. Ronald Press Co., 1947. Pp. xi, 166. $5.00. * (PA 
21:2434, 


[ 701 ] 
*Purdue Test for Electricians. Grades 9-16 and 
adults; 1942; preliminary manual; $3 per 25; 50¢ per 
specimen set; 25(30) minutes; C. W. aldwell, H. R. 
Goppert, H. G. McComb, and W. B. Hill; manual by 
Joseph Tiffin; Science Research Associates. 


John W. French, Research Associate, Col- 
lege Entrance Examination Board, Princeton, 
New Jersey. This test is completely objective 
and is easy to administer, requiring only thirty 
minutes, Its reliability of .91 for a single form 
is high for a test of this length. The test can be 
scored by machine or self-scored. Self-scoring 
takes less than a minute per test, but the test 
booklets can be used only once for self-scoring. 

No validity studies have been made. The 
validity is ‘said by the manual to rest on the 
thoroughness of the sampling of electricity and 
electrical operations in standard textbooks and 
the fact that particular industrial concepts 
were incorporated in the construction of the 
test. The sampling of the fundamentals of elec- 
tricity is good, but almost the whole test is de- 
voted to the type of material that an alert high 
school student can learn from a beginning course 
in physics. Any user of the test should realize 
that the test measures little more than the 
groundwork in electrical theory. This may well 
be adequate for most industrial purposes, since 
a person well grounded in theory can usually 
be expected to learn fairly rapidly on the job 
to use an electrician's equipment! and to per- 
form an electrician's duties. If it is desired to 
measure the practical knowledge which an elec- 
trician needs, a good proportion of the test, 
rather than just a few scattered items, should 

be devoted to the use and care of electricians’ 
tools, methods of splicing wire, safety measures 
for wiring, standard home electrical installa- 
tions, trouble shooting, the care of storage bat- 


teries, and other problems which confront the 
electrician on the job. 


[ 702 ] 
*Purdue Test for Machinists and Machine Oper- 
ators. Grades 9-16 and adults; 1942; preliminary man- 
ual; $5 per 25; 75€ per specimen set; 50(55) minutes; 
H. F. Owen, C. C. Stevason, H .G. McComb, and C. D. 
Hume; manual by Joseph Tifün; Science Research 
Associates. 


William W. Waite, Associate Professor of 
Industrial Engineering, Columbia University, 
New York, New York. This test is designed to 
serve several purposes: (a) to aid industry and 
vocational schools in determining the amount 
of machine shop knowledge possessed by appli- 
cants or students; (b) to serve as a basic con- 
sideration for promotion or transfer or in the 
formation of a “utility unit”; (c) to serve as a 
terminal achievement examination in trade 
schools and industrial training programs. 

The total score on the test reflects the testee’s 
general achievement in machine shop practice, 
while separate subscores may be derived for 
the operation of the lathe, planer and shaper, 
grinder and milling machine, and for general 
bench work. 

The reported reliability of the test is based 
on results obtained with 200 vocational high 
school students, each of whom had had at least 
720 hours of practical instruction in machine 
shop. Coefficients of reliability were .96 for total 
score and ranged from .80 to .9o for the sub- 
scores. 

Validity is still being established, although 
it is reported that a number of industrial firms 
have used the test with good results. 

Use of the test as a selective medium in in- 
dustry would provide a reasonably satisfactory 
substitute for a worksample test, if it were not 
considered feasible to administer the latter. No 
test of this type can indicate the extent to 
which an applicant possesses the various quali- 
ties of physical dexterity essential to each type 
of work. 

For purposes of job placement, promotion, 
transfer, etc. this test provides an objective 
confirmation of the subjective opinions of em- 
ployment interviewers, foremen, and others. It 
should prove equally valuable in trade, voca- 
tional, and apprentice schools as a final exami- 
nation. 

The format of the test booklet, incorporating 
as it does the automatic scoring feature, makes 
it possible to administer the test with a mini- 
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mum of equipment and specialized personnel. 
The typographic work on the cuts illustrating 
the various tools and machine parts is not par- 
ticularly good, however, and might, in border- 
line cases, impair the value of the test. - 


[ 703 ] 
XSales Questionnaire. Experienced sales applicants ; 
1936-39; $3 per introductory set including 10 tests, 
postpaid; 4¢ per extra test, postpaid ; introductory sets 
may be ordered on approval; nontimed (10-20) min- 
utes; Eugene J. Benge; Management Service Co. 


REFERENCES 
1. Bence, EuczNE J. "There's à Science to Selecting Sales- 


men," Adv & Sell 37:82+ F '44. 

Robert G. Bernreuter, Professor of Psychol- 
ogy, The Pennsylvania State College, State 
College, Pennsylvania; and Technical Director, 
The Klein Institute for Aptitude Testing, Inc., 
New York, New York. This is one of the few 
tests available for measuring “sales background.” 
It is a multiple-choice 25-item omnibus test 
which includes questions concerning judgment 
in sales situations, meanings of technical terms 
in advertising and business practice, relations 
between salesman and home office, and loca- 
tions of manufacturing plants. The items have 
face validity—they are concerned with facts that 
experienced salesmen might reasonably be ex- 
pected to know. The questionnaire is easily 
administered. No time limit is set, and most 
men will finish it in fifteen minutes. 

The only norms are in terms of letter grades 
from A to F. Separate norms for different types 
of selling apparently are not available, although 
the author states that for lower sales jobs it is 
often a mistake to hire A men, for they will not 
be satisfied. 

When administered by the reviewer to a 
group of 67 college men, 20 of whom had had 
sales experience, a reliability of .44 was ob- 
tained by the split-half, Spearman—Brown tech- 
nique. It is possible that somewhat higher values 
would be obtained on sales applicants. The 
average score for the men with no experience 
was 15.6; the average for the men with experi- 
ence was 16.7. Both of these are in the range of 
D scores. However, there was a wider spread 
of scores for the inexperienced men; 28 per 
cent of them got “E” scores (poor), whereas 
only 1o per cent of the experienced men fell in 
that group. None of the 67 men earned an A 
score (excellent). 

[ 704 ] 


*Steward Selection System. Sales applicants ; 1940— 
46; revision of Composite Inventory and Examination 


& 


(40:1651) ; 1 form; 3 parts; $12.25 per 25 sets of the 3 
parts; $L per manual; 25 per simplified instructions 
and key; $1.75 per specimen set; Verne Steward; 
Verne Steward & Associates, P. O. Box 226, South 
Gate, Calif. 

a) PERSONAL INVENTORY OF BASIC FACTORS, THIRD EDI- 
TION. $7.50 per 25; nontimed in part (60-80) minutes. 
b) PERSONAL INVENTORY OF BACKGROUND FACTORS, THIRD 
EDITION. $3.50 per 25. 

C) GUIDE TO HIRING DECISION. $1.25 per 25. 


REFERENCES 

1-3. See 40:1651, 

4. STEWARD, Vernz, The Use and Value of Special Tests in 
TIT Re REA Rapa doctor's thesis, 

5. Steward, VERNE, Analysis of ales Personnel Problems. 
Los Angeles, Calif.: Verne Steward & Associates, 1943. Pp. iv, 
128. $4.00. * 

Milton E. Hahn, Dean of Students and Pro- 
fessor of Psychology, University of California, 
Los Angeles, California. The Personal Inven- 
tory of Basic Factors is an 8-page test booklet 
containing four subtests. The subtests are: (a) 
a personality inventory of 75 items “adapted 
from Bernreuter Personality Inventory"; (5) 
a selected series of 10 items from subtest 1, for 
each of which the respondent writes a short de- 
scriptive paragraph if his answer to an item was 
"yes"; (c) a 55-item test, Mental Ability, 
“from Otis Self-Administering Tests of Men- 
tal Ability”; and (d) a 44-item inventory, Vo- 
cational Interests. 

The Personal Inventory of Background Fac- 
tors is an 8-page booklet in which the respond- 
ent enters information of various kinds about 
himself and his past educational and vocational 
experiences. This booklet has many points of 
similarity to the Aptitude Index for Life In- 
surance Salesmen developed by the Life In- 
surance Sales Research Bureau of Hartford, 
Connecticut. 

There is almost no evidence of the usual 
statistical treatment of data for either instru- 
ment. Some quite nebulous evidence related to 
the selection of life insurance salesmen is pre- 
sented. The manual cites no bibliography of re- 
search efforts. On page 4 of the manual some 
data are supplied in Illustration 5, Validation 
of the Steward Appraisal System in Life Insur- 
ance. The number of cases is not given; nor 
are there clear clues to the experimental design 
followed. 

The Personal Inventory of Basic Factors is 
objectively scored with the exception of sub- 
test 2 which is highly subjective in scoring. In 
the absence of supporting research, this re- 
viewer is understandably suspicious of a 165- 
item battery which claims to do so much. Had 
the battery been standardized for a particular 
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type of salesman in a quite common working 
environment, the case for satisfactory validity 
and reliability might have been somewhat 
stronger. Until adequate research evidence is 
supplied, one must remain in doubt as to the 
value of the battery "in building sales organi- 
zations." 

The reviewer is alarmed to think that one 
would attempt to select a technical salesman 
with a degree in engineering with the same 50- 
item intelligence test that he would use for the 
man who sells bread to a small retail store. 

The manual contains some sound advice for 
sales managers. Unfortunately, this sound ad- 

. vice is too often obscured by fuzzy writing and 
generalizations which are so sweeping as to be 
almost meaningless. The best of industrial psy- 
chologists cannot come close to what Steward 
claims in some sections of the manual. 

The Personal Inventory of Background Fac- 
tors also falls heir to the weaknesses of gen- 
eralization from the known relationship to suc- 
cess in a restricted occupational environment 
for salesmen who are homogeneous in many re- 
spects to situations which are complex, un- 
known, and staffed by a group which may be 
extremely heterogeneous relative to a number 
of seemingly important variables. The similarity 
to the Aptitude Index for Life Insurance Sales- 
men has been noted above. It is dangerous to 
assume that materials which have been helpful 
in the selection of one highly specialized type 
of salesman will work equally well with quite 
dissimilar kinds of salesmen. 

The reviewer is disturbed at the attempt to 
"package" a Ph.D. degree for sales managers 
who do not have the training and experience to 
utilize the materials wisely. Even our highly 
trained and successful industrial and personnel 
psychologists tread cautiously with many of the 
concepts advocated in the manual. Perhaps the 
best defense for marketing these instruments is 
that they present an advance over present anti- 
quated methods of selecting salesmen found in 
so many business organizations. For many who 
will use the inventories, this may be true; but 

for many others there is the danger that prog- 
ress toward a more scientific approach to “man 
analysis" will be damaged. The reviewer is con- 
vinced that the sales manager who desires to 
use complex psychological tools and techniques 
will save time and money, and secure better 
results, if he will employ a reputable psycho- 


logical consultant who is competent to tailor a 
program to his needs. 

In summary, this battery is an interesting 
experiment which does not as yet have suffi- 
cient.exposure to published research by disin- 
terested parties to permit its recommendation 
for use by psychologically naive businessmen. 


Floyd L. Ruch, Professor of Psychology, The 
University of Southern California, Los Angeles, 
California. The system consists of three basic 
parts:(a) Personal Inventory of Basic Fac- 
tors; (b) Personal Inventory of Background 
Factors; (c) Guide to Hiring Decision. 

The Personal Inventory of Basic Factors is 
an 8-page booklet containing 75 items adapted 
from Bernreuter’s Personality Inventory; 10 
trait illustrations in which testee reports actual 
behavior in a specific social situation; 55 items 
adapted from the Otis Self-Administering 
Tests of Mental Ability; and 44 vocational in- 
terest items. 

The Personal Inventory of Background Fac- 
tors is an 8-page booklet containing: 1 page of 
questions on physical condition; 1 page on edu- 
cational background; 2 pages on experience 
background covering employment and military 
service; I page on financial status ; ¥% page on 
memberships and activities; 14 page on family 
status ; 4 page on interest in product and oppor- 
tunity; and 77 page on references. 

The Guide to Hiring Decision is a 2-page 
leaf designed as an aid in recording and sum- 
marizing the information regarding the appli- 
cant gathered by the tests and procedures out- 
lined above as well as from other sources. A 
total of 24 characteristics are rated from test 
and other data on a four-point scale to provide 
an over-all rating of quality. The author sug- 
gests that an applicant's final rating be marked 
unqualified, borderline, acceptable, or superior 
according to the number of such ratings he 
receives on the various factors, Thus, the key to 
the system is the method by which the four 
ratings are obtained for the 24 items. 

In examining this procedure it is found that 
it is largely without objectivity. The first 7 of 
the 24 factors are determined from the Personal 
Inventory of Basic Factors. The rating given 
each is determined from a Table of Values, which 
gives the scores obtained on the tests and the 
appropriate rating depending upon whether the 
employer is using high, medium, or low em- 
ployment standards on the particular factor. In 
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turn, this employment standard is determined 
by using another table in which the author sug- 
gests the employment standard to be used on 
each factor for various types of selling posi- 
tions. The data on which this table has been 
established are not given. Further subjectivity 
in this section of the system occurs in the grad- 
ing of the third factor, Trait Illustrations. Here 
the appraiser must evaluate the illustrations 
given by the applicant as OK or not OK. No 
standards are given by the author for making 
this determination, yet the final rating on this 
factor depends upon the OK's received. 

The last 17 of the 24 factors are determined 
from the Personal Inventory of Background 
Factors, supplemented by interview and inves- 
tigation. Which of the four ratings to give to 
these factors is entirely up to the employer, as 
the author of the system states: “It is to be re- 
membered that specific instructions for setting 
employment standards cannot be provided, and 
that the employer must assume the responsi- 
bility for getting the facts and for interpreting 
them according to the requirements of the work 
in question." As a result, the employer who is 
using this system must determine himself 
whether the age of 40 years is unqualified, bor- 
derline, acceptable or superior. 

The validity of the system as a whole is in- 
dicated by the excellence of the results reported 
for the 1934-35 system. The number of sales- 
men used in the 1934-35 study is not given, 
nor is the important fact of whether the ratings 
of these salesmen were made by the author or 
someone applying the system blindly with only 
the aid of the published instructions. Corre- 
sponding figures for the new system are not 
given. 

In summary, the success of the Steward Se- 
lection System depends largely upon the judg- 
ment of the employer in determining what 
things are important or unimportant for his 
particular company. No statistical studies are 
presented to substantiate such guidance in these 
decisions as is given by the author. The best 
description of the system would be that it is a 
series of tests and rating sheets to break up an 
interview, enabling closer scrutiny by the em- 
ployer than is usually the case. 

Whatever surface validity these materials 
may possess, the statistically trained reviewer 
is frustrated by the absence of the traditionally 
expected validation statistics. 


r1 


[ 705 ] 
Test for Ability to Sell; George Washington Uni- 
versity Series. Grades 7-16 and adults; 1929; Form 1; 
$9 per 100, postpaid; 15¢ per specimen set, postpaid; 
35(40) minutes; F. A. Moss, Herbert Wyle, William 
Loman, and William Middleton; Center for Psycho- 
logical Service. 


Floyd L. Ruch, Professor of Psychology, 
The University of Southern California, Los 
Angeles, California. This test was designed to 
test ability to sell, regardless of product, and 
contains six subtests including tests on social 
consciousness, arithmetical ability, memory, and 
ability to follow directions. The two subtests 
designed to measure the personality and the 
social consciousness factors in selling are very 
short and are composed of the type of item 
which has been proved invalid for selecting po- 
tential employees because the “correct” answer 
is obvious. There is no provision in the scoring 
for the factor of untruthful responses. On the 
surface, the memory tests and the following 
directions test would seem to have some valid- 
ity. The arithmetical ability test is very short 
and its selective value could be questioned. 

The standardization group comprised 100 
persons engaged in retail selling in various de- 
partments. The criterion was a composite of 
objective ratings (amount of sales, number of 
errors, amount of merchandise returned) and 
subjective ratings of estimates of the examinee’s 
ability by three. supervisors. 

The reliability was determined by the test- 
retest technique, resulting in a coefficient of .91. 
The validity of this coefficient is questioned be- 
cause the nature of certain parts of the test 
(the personality and the social consciousness 
sections) should logically give almost perfect 
retest scores. Tests of this nature should be 
tested for reliability by the split-halves method 
(only possible if tests are longer than the ones 
included here) or by alternate equal forms (only 
one form available for this test), Furthermore, 
in the memory test it is essential to test relia- 
bility by means of alternate forms in order to 
exclude the “learning” factor. 

The validity coefficient, or the degree to 
which the test selects good salesmen, was .54 
on an N of roo. It was found that in some 
cases the "better" salesmen in one department 
received lower test scores than the "poorer" 
salesmen in another department. Obviously, 
some departments require lesser sales ability 
than others, assuming that the test selects sales- 
men. In view of this, the standards given for 


705. _ ‘THE THIRD MENTAL ME MENTS YEARBOOK [ 750 


- sales personnel would be difficult to interpret for predicting whether a student will be a suc- 
"unless the firm using the test made their own cessful salesman. 
EX yalidation studies. Standards are also given for On the whole, the test has not had the benefit 
- high school.and college students, but these of rigorous statistical treatment and the results 
standards are based on test scores rather than are not so clear as those for a similar test, the 
performance. Therefore, they are rather useless Detroit Retail Selling Inventory. 
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Books and Reviews 
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The following roughly classified index has been prepared to assist readers to locate books on a 
particular subject. In addition to using this index, readers are urged to skim over titles and 
excerpts in search of works which otherwise may be overlooked. 


GENERAL 


Bibliographies, 788, 937, 1054-8, 1221-3 

Child study, 721, 743, 769, 799, 845, 871, 898-904, 916, 
934, 962, 1005, 1070, 1194 

Clinical psychology, 998, 1091, 1118, 1126 

Conference reports, 724, 1030 

Critical reviews, 788 

Growth, 776, 837-8, 851, 898-904, 1050-3, 1143, 1154, 
1156, 1172 

Guidance, 828, 1059-60, 1148, 1200, 1247 

Miscellaneous, 840, 881, 891, 920, 930, 983, 1159, 1208 

Philosophy, 768, 778, 887, 894, 911, 1063, 1111, 1158, 
1184, 1107, 1205 

Proceedings, 718-20, 723, 950-3, 1046-9, 1108, 1218 

Sample questions, 1211 

Self testing, 759, 1210, 1230, 1241 

Testing programs: city, 884; STATE, 748, 849-50, 
1160-1, 1247-9; ELEMENTARY, 850, 884, 1161; 
SECONDARY, 748, 849, 884, 1159-60, 1247-9; COLLEGE, 
794, 798, 883, 1076, 1157, 1162, 1248 

Textbooks and handbooks: GENERAL, 912, 1219; 
EDUCATION, 781, 828, 914-5, 1000, 1017, 1026-7, 
1063-4, 1072, 1008, III2, II3I-2, 1135, 1200-1; 
PSYCHOLOGY, 785, 824, 1043, 1102-6, 1129, 1236-7 

W orkbooks, 843, 913, 1113, 1130 


ACHIEVEMENT 


General, 752, 1063, 1098, 1150, 1201 

Subjects: ARITHMETIC, 1146; BOTANY, 943; BUSINESS 
EDUCATION, 825, 933, 1083; ENGLISH, 747, 850, 938, 
960, 1076; FOREIGN LANGUAGES, 798; HEALTH AND 
PHYSICAL EDUCATION, 773-4, 789, 805-6, 821-4, 959, 
976, 1003-4, 1147, 1149; HISTORY, 734-6, 1114; 
HOME ECONOMICS, 777, 782; MUSIC, 775, 897, 1013, 
1140, 1228-9, 1252; PHYSICS, 070; READING, 711, 815, 
890, 805, 1066, 1160-1, 1169, 1199, 1204; READING 
READINESS, 1067, 1124; SCIENCE, 944 ; STUDY SKILLS, 
968, 1039 


CHARACTER & PERSONALITY 


Bibliographies, 1202 

Chirology, 1165, 1243 

General, 746, 796, 873, 942, 1042, 1202 

Graphology, 053-4, 986, 1015, 1025, 1101, 1181-2 
Lie detection, 949, 1018 

Miscellaneous, 713, 742, 784, 797, 926, 1090, 1093-4 


vie 


Projective: GENERAL, 1144; MISCELLANEOUS, II07, 
1242, 1244-5; PAINTING, 717, 868, 1045 ; RORSCHACH, 
749-51, 770-1, 880, 924-5, 036. 963, 973-5, 985, 1016, 
1020-2, 1041, 1119, 1127, 1254; THEMATIC, 803, 1192 

Research reporis, 738, 754, 848, 860, 889, 908, 923, 
920, 946, 979, 997, 1001, 1024, 1040, 1073, 1084, 1123, 
1153, 1164, 1195-6, 1207, 1238, 1253 

Sociometry, 780, 927, 956-7, 1033-7, 1074-5, 1141 


INTELLIGENCE 


Group tesis, 1193, 1220 

Individual tests, 740, 795, 824, 844, 899-904, 909, 969, 
980, 1006, 1012, 1023, 1068, 1088, 1214, 1224, 1231-3, 
1234 

ee 764, 786, 790, 964, 995, 1062, 1089, 1168, 
1185- 

Practice tests, 706-7, 709-10 

Research reports, 772, 802, 907, 921, 941, 1007, 1009, 
TOII, 1085, 1087, 1110, 1125, 1235 


MISCELLANEOUS * 


Army Air Forces Aviation Psychology Program, 
765, 793, 816, 818, 836, 839, 847, 877, 905, 917, 939, 
967, 984, 1029, 1187, 1239 

Blind, 931-2, 1122 

Civil service, 801, 211-2 

College Entrance Examination Board, 808-14, 1076 

Criminology, 1151 

Educational Records Bureau, 852-67 

Factor analysis, 999, 1180, 1188-9 

Graduate Record Examination, 708, 1215-7 

Handicapped, 947 

Hearing, 800, 892, 1069 

Human Engineering Laboratory, 737, 739, 744-5, 753; 
760, 767, 846, 876, 879, 886, 896, 022, 945, 958, 971-2, 
987-93, 1044, 1065, 1077-81, 1096-7, 1099, 1115-7, 
1120, 1145, 1203 

Item analysis, 834 

Marks and grades, 1121, 1246, 1251 

National Teacher Examinations, 1136-9 

Observational methods, 919, 955 

Oral tests, 801 

Personal documents, 716, 910, 1183, 1211 

Preparation for examinations, 885, 940, 1008 

Record and report forms, 715, 741, 776, 833, 878, 982, 
1135, 1170, 1198, 1213 
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Reliability, 804, 875, 928, 952 

Safety education, 712, 1155 
Socio-economic status, 1152 

Sophomore testing program, 819-20 
Vision, 733, 761-3, 996, 1028, 1086, 1174-6 


VOCATIONS 

Bibliographies, 756-8 

Clerical, 994 

RUE 783, 791, 832, 966, 1019, 1109, 1136-9, 1142, 
1206- 

D ADIE: 965 

Executive, 807, 918 

Foremen, 730, 906, 918, 977, 1250 

General, 1030-1, 1038, 1060, 1190-1 

Industrial psychology, 1032, 1190-1 

Industrial testing practices, 725-32, 829-30, 841, 994, 
1031, 1038, 1060, 1133-4, 1178 


Interests, 787, 792, 826, 981, 1177, 1240 

Interviewing, 766, 827, 830, 842, 874, 893, 1061, 1082, 
1209 

Law, 965, 1092 

Medicine, 965, 1215-6, 1226 

Merit rating, 755, 906, 918, 977, 1014, 1060, 1095, 1163, 
1167, 1211-2, 1227, 1250 

Miscellaneous, 835, 1071, 1171, 1173 

Nursing, 948, 1010 

Professional aptitudes, 965 

Salesmen, 731, 831, 1100, 1128 

Scholastic aptitude, 708, 817, 870, 872, 888, 961, 1002, 
1179, 1215, 1217, 1242 

Trade tests, 1225 

Transit, 978 

Vocational guidance, 714, 779, 882, 1166 


BOOKS AND REVIEWS 


[ 706 ] 
*Air Corps Practice Tests. New York: Current Book 
Co., 1942. Pp. 128. Paper, lithotyped. $1.50. * 


[707 ] 
*Full-Length Practice Examinations for U.S. 
Army-Navy Classification Tests: Especially Appli- 
cable for Those Desiring a Grade High Enough to 
Enable Them to Enter Officer Candidate School. 
Milwaukee, Wis.: Pergande Publishing Co., 1943. Pp. 
99. Lithotyped. $1.65. * 


[ 708 ] 
*The Graduate Record Examination: A Memoran- 
dum on the General Character and Purpose of the 
Examination Including a Summary of Initial Studies 
of Its Validity. New York: Carnegie Foundation for 
the Advancement of Teaching, 1941. Pp. iii, 38. Paper. 
Out of print. * (PA 15:3589) 


^ [ 709 ] 
XThe Inductee's Mental Test. New York: Arco 
Publishing Co., 1942. Pp. 31. Paper. $0.25. * 


[ 710 ] 
XPractice for the Soldier's I.Q. Test. Cincinnati, 
Ohio: Practice Publishing Co., 1942. Pp. 65. Paper. 


$1.00. * 

[711] 
XAnrorr, E. CarLToN, Relationship Between Varia- 
tions in Silent Reading Ability and Mental Abil- 
ity. Philadelphia, Pa.: University of Pennsylvania, 
1943. Pp. x, 122. Paper. Out of print. * 


[712 ] 
XABERCROMBIE, SrAxLEY A. Measuring the Results 
of Driver Instruction. An Abstract prepared by 
Charles C. Hawkins. New York: Center for Safety 
Education, Division of General Education, New York 
University, 1941. Pp. 32. Paper. Out of print.* (PA 
16:2022) 

[ 713 ] 


_ *Asramson, Hanorp A.; BRODMAN, Kreve; HARRIS, 
 Hamotp J.; KitLINGER, Grorce G.; MITTELMANN, 

x Beta; PIOTROWSKI, ZYGMUNT A.; Rapaport, DAVID; 
SCHAFER, Roy; SCHEERER, Martin; WECHSLER, DAVID; 


Wemer, ArTHUR; Worrr, HaroLp G.; WLADKOWSKY, 
EDITH ; AND ZUBIN, JosepH. Non-Projective Person- 
ality Tests. Annals of the New York Academy of 
Sciences, Vol. 46, Art. 7. New York: the Academy, 
July 30, 1946. Pp. 531-678. Paper. $1.75 postpaid. * (PA 
20:4651, 4668, 4674, 4677, 4678, 4680, 4684, 4687, 4688, 
4808, 4913) 
[ 714 ] 


XALLEN, E. ParRICIA, AND Smitu, Percival. The 
Value of Vocational Tests as Aids to Choice of 
Employment. Birmingham, England: City of Birm- 
ingham Education Committee, 1940. Pp. 45. 1s. (PA 
15:3978) 

Brit J Ed Psychol 11:65 F "41. F. M. A. 
A first report of this interesting piece of work 
was published eight years ago. * children were 
tested during their last term at school. The 
head teacher, the teacher who had administered 
the tests, the juvenile employment officer, and 
the investigators then met to consider the re- 
sults of the tests, plus other relevant data, and 
decided what guidance should be given. In 820 
cases a ‘follow-up’ was carried out over two 
years, and for four years in 271 cases. The 
present report deals with the results gained 
from four elementary schools—two girls’ and 
two boys’. The remaining fifteen schools are to 
be dealt with in succeeding reports. * Control 
groups were used. These were not given the 
tests nor were temperament estimates compiled, 
but the children were advised as well as pos- 
sible on the other available information. * Lists 
of tests are given and samples of the rating 
scale for personal qualities and school subjects, 
the medical report form, and the vocational 
guidance report form for summary. It is a pity 
that no ‘cases’ have been included as examples. 
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It is found that children who were given voca- 
tional guidance and who followed the advice 
given were decidedly better placed in industry 
than those who did not follow the advice. In 
addition they were better placed than children 
of the control groups which were advised with- 
out the vocational tests. * The value of this 
particular work would seem to be more than 
mere corroboration of previous findings. It lies 
partly in the additional specific facts which 
emerged, viz., first posts entered, intermediate 
posts held, last posts held, transference, dura- 
tion of posts, children's and employers' esti- 
mates-of suitability of posts. It is also valuable 
as an illustration of the way in which different 
branches and departments can work together in 
a matter of this kind. Most important of all, 
perhaps, is the fact that 820 children got the 
best available advice at the outset of their 
careers and for two or four years had further 
advice when required. The value of such work 
to industry, to the city, and to the whole com- 
munity it is impossible to assess. The value to 
many of the individuals concerned could hardly 
be exaggerated. * 


[715 ] 
X*ALLEN, WeNvELL C. Cumulative Pupil Records: 
A Plan for Staff Study and Improvement of 
Cumulative Pupil Records in Secondary Schools. 
New York: Bureau of Publications, Teachers College, 
Columbia University, 1943. Pp. ix, 69. Paper. $1.25. * 


Ed Adm & Sup 30:376-7 S '44. A. S. Ed- 
wards. * a brief book upon one of the most 
important aspects of the work of an educator; 
namely, counseling * Many details of the prob- 
lems are briefly but wisely discussed. * a valu- 
able contribution * For the counselor who is 
already expert, it will be elementary but per- 
haps useful; for the teachers who are working 
without a proper understanding of this impor- 
tant work, and for those who are beginning, it 
should give much needed assistance. * The book 
should help to the making of the best records 
and to their use. One advantage of the volume 
is its brevity. There is no index and it is not 
needed. The appendix is a valuable addition to 
the main text. 

Ed Res B 23:244-5 D 13 '44. N. G. Faw- 
cett. One of the greatest weaknesses of public 
education has been the general tendency on the 
part of teachers to stress mastery of subject- 
matter rather than child development. This little 
book suggests one of the finest techniques avail- 
able for the basic in-service training of teachers. 


A study of the general suggestions made in the 
publication for the building of a functional 
cumulative record system should prove valu- 
able to school administrators, teachers, and those 
in the field of teacher training. * 

Occupations 22:515 My "44. Rebecca C. 
Tansil. This practical monograph....should find 
a place on the bookshelves of all persons inter- 
ested in vocational guidance. No words are 
wasted but there are so many valuable sugges- 
tions about records and record improvement 
that a reader finds himself re-reading the pages 
so as not to miss any of the pertinent material. 
* Wisely it is pointed out that administrators 
and counselors should not attempt to install a — 
record system without the assistance of the reg- 
ular teaching staff. By this joint study of prob- 
lems, improvement in the use of record data 
usually follows and teachers develop a coopera- 
tive attitude that would never exist if the rec- 
ords were superimposed upon them. * Every 
classroom teacher and counselor would gain 
much by reading this excellent monograph on 
cumulative records. 

Sch R 52:252-3 Ap '44. Gustav J, Froehlich. 
* presents a plan of action designed to enable 
schools to develop systems of pupil records 
which are both a practical aid to teachers, coun- 
selors, and administrators and a useful tool to 
research workers * The basic concept of this 
monograph is a good one. The relatively scanty 
treatment given to important issues may be 
justified by saying that the report is not in- 
tended to serve as a textbook, a scholarly 
treatise, or an exhaustive account. It does, as - 
its author indicates in the Preface, present any 
number of definite criteria and procedures. It 
contains representative illustrations of accom- 
plishments at various schools. Both the sugges- 
tions and the references to accomplishments at 
other schools should be helpful to the principal 
or the superintendent who directs an all-faculty 
project on cumulative pupil records. The re- 
viewer is disturbed, however, by the general 
tone of the monograph. Frequently, detailed 
procedures are dogmatically proposed, with 
little or no indication of possible alternatives or 
of situations in which the recommended pro- 
cedures may be impractical. References to symp- 
toms are more evident than references to causes. 
In other words, Allen does not make it clear 
that a conscientious adherence to the procedures 
outlined by him will not always yield the an- 
ticipated results. ^ 
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[ 716 ] 


*Arrronr, Gogpox. W. The Use of Personal Docu- 
ments in Psychological Science. Prepared for the 
Committee on Appraisal of Research. Social Science 
Research Council, Bulletin 49. New York: the Coun- 
cil, 1942. Pp. xix, 210. Paper. $1.50. * (PA 16:2314) 


Am J Orthopsychiatry 13:368 Ap’43. Joseph 
Lander. This meticulous study of an important 
source of psychologic insight demonstrates anew 
the far-flung roots from which psychiatry de- 
rives its strength and its varied approaches to 
the study of man and his emotional problems. 
The author defines a personal document as “any 
self-revealing record that intentionally or unin- 
tentionally yields information regarding the 
structure, dynamics and functioning of the au- 
thor’s mental life," more particularly diaries, 
autobiographies, questionnaires, and recordings 
of dreams or interviews. After tracing the his- 
torical development of the use of this material 
by psychologists, psychiatrists and others, All- 
port examines critically the conscious and un- 
conscious motivations which may lead not merely 
to the production of such writings, but also to 
errors in their interpretation. A major empha- 
sis deals with the problem of establishing the 
validity of the interpretations to be drawn from 
these documents. Another important emphasis 
deals with the problem of differentiating sharply 
between that knowledge which is of value in 
understanding any given individual, and that 
which can be generalized for application to a 
larger group. There is a healthy repetitious in- 
sistence that the pluralistic determinism of hu- 
man behavior necessitates tlie exercise of the 
greatest caution in forming such generalizations, 
no matter how accurately the data may fit any 
given case or group of cases. Reading is not 
made easier by the use of terms so highly eso- 
teric that they were completely unfamiliar to 
the first five psychologists whom the reviewer 
approached for aid. Finally a Greek scholar 
helped. One might also note that the general 
tone of the volume is slightly lightened by a 
fairly extensive series of laboriously sarcastic 
references to psychoanalysis, for which body of 
knowledge Allport is clearly able to restrain 
his enthusiasm. The book will be of primary 
interest to the philosopher of psychology rather 
than to the clinical psychiatrist. 

Am Sociol R 7:435-7 Je "42. Robert E. 
Park. * This volume impresses me as a par- 
ticularly lucid survey, based on a wide knowl- 
edge of the literature at home and abroad, of 


the logical problems involved in the use of per- 
sonal documents—autobiographies, diaries and 
letters, life histories and literary creations—for 
the understanding of human nature and society. 
However, the author in emphasizing the diff- 
culties of the use of personal documents for the 
purposes of a nomothetical science has not con- 
ceived it as part of his task to indicate how 
these same documents could be used by idio- 
graphic sciences—for example, cultural an- 
thropology and psychoanalysis—which are not 
sciences at all in the more austere sense of sta- 
tistical and behavioristic psychology. * 

J Abn & Social Psychol 38:115-6 Ja "43. 
Ernest W. Burgess. * The chief original con- 
tribution of the critique is Allport’s analysis and 
defense of idiographic procedure, namely, famil- 
iarity as intimate as possible with a single life, 
“with significant total processes of the sort re- 
vealed in consecutive and complete life docu- 
ments.” * Allport points out that the unique 
characteristics of a single life cannot be gen- 
eralized to other cases and makes a brilliant 
argument for the feasibility of laws for a single 
case which make possible understanding, pre- 
diction, and control for that case alone. In his 
enthusiasm for the science of the single case 
Allport gives a plausible theoretical argument 
for the superiority of idiographic versus either 
nomothetic or actuarial prediction of the be- 
havior of the individual. He grants, however, 
that the final decision will be determined by 
research. To the reviewer....there seems some- 
thing sterile and perhaps futile in disputation 
upon the rival claims of the idiographic, no- 
mothetic, and actuarial methods of prediction. 
The time is close at hand when the distinctive 
merit of each method will be recognized ‘and 
they will be employed in combination to secure 
their fullest value for understanding, predicting, 
and controlling behavior. This critique, compre- 
hensive in treatment, lively in style, and stimu- 
lating in its statement of issues, is bound to 
have a profound effect upon the future of the 
use of the personal document and the case-study 
method. It marks the close of the epoch of the 
wide but rather uncritical employment of the 
personal document, ushered in by the publication 
of The Polish Peasant in Europe and America, 
by W. I. Thomas and Florian Znaniecki; it 
points the way to crucial research upon the 
nomothetic and the idiographic procedures in 
the analysis of the personal document. 

J General Psychol 30:277-83 Ap ’44. Peter 
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Hampton. * the psychology of the future will 
be an eclectic psychology, and as such cannot 
afford to pass over lightly the fruits of labor 
accumulated in the works of a Shakespeare, 
Tolstoy, or Victor Hugo. The method of inter- 
nal consistency, for all its weaknesses—princi- 
pally those of validation—works, and as All- 
port so aptly shows in his The Use of Personal 
Documents in Psychological Science, the method 
deserves a chance to be tried out by the psycholo- 
gist. The real failing of the psychologist today 
is his inability or rather hesitancy in using his 
own mind in interpreting personality. He knows 
that personality is a consistent mental structure. 
He knows it, but cannot prove it. To be sure, 
he tries his best to unravel the uniqueness of 
personality by losing himself in statistical corre- 
lations. But when he is through he comes out 
with a skeleton sans the human being to whom 
the skeleton belongs. He has a summation of 
individual traits, and when he puts them to- 
gether he manages to arrive at a personality—a 
personality without a soul. * Obviously, accord- 
ing to Allport, we need something more than a 
horizontal approach to personality. We need a 
vertical approach as well. The method of cor- 
respondence of measures is concerned with con- 
sistency only in so far as it appears on a hori- 
zontal plane or dimension of behavior. It is 
always the consistency pertaining to the surface 
activities of large groups of people, and never 
the consistency that is found in the vertical 
dimension of behavior within a single individ- 
ual. It is no wonder, therefore, that the psy- 
chologist who defines consistency entirely in 
terms of correspondence and not in terms of 
congruence, obtains a personality that is one- 
sided, specific, and lacking in vitality. Even if 
statistical methods of dealing with behavior 
were perfect, they would still fail to give us the 
information desired in some cases, for some- 
times statistical measures cannot be applied. * 
if “a document hangs together and represents a 
structured configuration of human life and har- 
bors no impossible contradictions, it should get 
the benefit of at least a prima facie validity.” 
This, we should remember, is the only test ap- 
plied to literature and biography. It is the only 
test by means of which we interpret such per- 
sonalities as a Hamlet, an Anna Karenina, a 
Benvenuto Cellini, or even a Bernard Shaw. 
While self-confrontation is not a sufficient 
method for science, it has its place and its im- 
portant part to play. By all means let us use 


correlation and other statistical checks when 
looking for validity; but at the same time, let 
us not forget that there are other criteria of 
validity which can stand us in good stead, namely, 
tests of internal consistency—self-confronta- 
tion—, tests of plausibility—agreement with 
reader's experience—, tests of known reliability 
of the writer, and other tests of a non-quanti- 
tative independent nature. * Very often the 
search for the correspondence of measures to 
establish some preconceived dimension of unity 
fails. If that is the case, one should look deeper 
for the underlying cardinal trait or radix of the 
personality. According to Wertheimer, who first 
proposed the word radix to designate the cardi- 
nal trait in a man's personality, the radix should 
never be looked for on the “activity level,” for 
it is always to be found at “the root of activi- 
ties.” No matter how inconsistent activities 
may seem to be, as long as they spring from 
the same root they are consistent. * The best 
indicators of a non-quantitative nature of the 
validity of personal documents are, then, inter- 
nal consistency or self-confrontation, general 
honesty and credibility of report, and plausi- 
bility. In practice the validity of case studies is 
seldom checked. The assumption is that, given 
an accumulation of incidents, the one true pat- 
tern of unity will emerge by sheer virtue of the 
systematic relevance of one incident to another. 
The case study is tested only by its internal 
intelligibility, or in the words of Sorokin, “by 
its logico-meaningful criterion of integration,” 
that is, by its self-consistency. This, in the opin- 
ion of Allport, is the method which more and 
more psychologists are beginning to accept as 
the soundest of all methods. Allport admits that 
there is still a great need for exploration in this 
field. The problem of the future is to perfect 
methods whereby the congruence in a person- 
ality can be undeniably established. We need 
criteria that will tell us whether a pattern that 
is held up to be congruent is really so, or 
whether it is being read into the case. In short, 
we need criteria for determining the validity of 
case studies. There is no doubt in the review- 
er’s mind that Allport’s The Use of Personal 
Documents in Psychological Science is one of 
the outstanding works in Psychology of the 
year. 

J Social Psychol 18:423-8 N ’43. Mary C. 
Van Tuyl. * All who handle personal docu- 
ments of any kind and are faced with the prob- 
lems and the potentialities of their analysis will 
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be interested in this report. Although it con- 
cerns itself only with autobiographical material 
the discussion and many of the conclusions are 
equally pertinent to case histories and other 
accounts written in the third person. The ap- 
praisal moreover ought to be at least challeng- 
ing to a much larger group, of psychologists 
particularly, for the extremists of whom no 
good can possibly come out of a personal state- 
ment of any sort, and for others, more mod- 
erate, who are reluctant to admit that any relia- 
bility and validity can be established for the 
data of the personal history, or for the methods 
of handling and the interpretation drawn. * 
The first section is devoted mainly to a sum- 
mary of his examination and evaluation of ex- 
isting studies of personal life history material. 
* The first section closes with a chapter devoted 
to the distinction between nomothetic and idio- 
graphic bias in psychological and social sci- 
ence. * Applied to socio-psychological data a 
nomothetic study means essentially one in which 
generalizations are secured from large num- 
bers of individuals and where the single in- 
dividual, in Dr. Allport’s phraseology, “is made 
the helpless tale of the statistical kite" * A 
purely idiographic study on the other hand is 
devoted to the scrutiny and analysis of the 
single case. Dr. Allport has long been the cham- 
pion of the individual and the single case, both 
of which paradoxically need a partisan in psy- 
chology. In this work he takes the additional 
step of trying to validate the material secured 
when the individual is allowed to freely say his 
own say. Professor Allport has made it clear 
here and elsewhere that there is a place for 
the nomothetic approach in psychology, but 
that place he believes is not in the study of per- 
sonality and individuality. * he is sure that the 
nomothetic treatment of counting noses in this 
or that single aspect of many human lives is 
fruitless so far as knowledge of individuals is 
concerned. In this work he says (p. 62): 
“Nomothetic studies inevitably take but one or, 
at the most, a very few aspects from concrete 
lives for the construction of generalizations. No 
two lives can be compared in every detail, and 
the more lives that are taken the fewer the 
common aspects become, until only a slender 
framework of universal laws of human nature 
results. By contrast, strictly idiographic studies 
maintain the richness and many-sidedness of 
personality and stay far away from the dangers 
of merely glittering generality (p. 62).” With 


most of this the reviewer would agree. But is it 
a fact that the more lives that are taken the 
fewer the common aspects become? Is it not 
the patterning of these aspects, rather than the 
aspects or single events themselves which is so 
different that the individual stands out as some- 
thing absolutely and richly unique? Might it 
not be shown that many single aspects or events 
are similar enough to be put in categories—not 
a priori lists but classes dictated by the material 
of the history—and so recorded on tabulation 
sheets or Hollerith cards? The sheet or card 
would then represent the individual as ade- 
quately as the analyzing instrument permitted ; 
the finer the instrument the closer the approxi- 
mation to the source material. The pattern of 
one sheet or card, however, would never exactly 
duplicate the pattern of another. Under such 
conditions of research one might come close to 
holding both the nomothetic and idiographic 
necessities in view, and in the end perhaps 
achieve the "truly inductive use of documents" 
which Dr. Allport is justified in saying (p. 142) 
is so rarely found. That is, generalizations 
would actually arise, would have an opportunity 
to emerge, out of the different contexts of the 
personal histories under analysis. The docu- 
ments would then not “come home to roost,” as 
Dr. Allport says (p. 168) on the “psychologist’s 
own mental perches.” The psychologist might 
even be “compelled to draw such and such in- 
ferences and interpretations, never before 
dreamed of by him" (p. 166). But in this re- 
port it is not a combination of nomothetic and 
idiographic uses of personal history material 
which Dr. Allport is urging, but the use of the 
single case. He is stressing the point that “up 
to the present time too much research has been 
done by the single investigator scrutinizing 
many cases, and not enough by many investi- 
gators scrutinizing the same case.” His defense 
for the purely idiographic approach rests mainly 
on the following grounds: 1. Acquaintance with 
the particular concrete individual in its natural 
complexity is basic and obligatory to psychologi- 
cal science (p. 56). 2. The “course of catisa- 
tion” in the concrete individual is (a) personal 
and not universal and (b) “need not be based 
upon frequency of occurrence in multitudes of 
cases, but may apply to one-time happenings” 
(P. 57). 3. The application of knowledge is 
always to the single case (p. 58). 4. From the 
use of the single case as well as in the best of 
other scientific procedures, the aims of science, 
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i.e, understanding, prediction, and control, can 
be accomplished “beyond the levels achieved by 
unaided common sense" (p. 59). The scienti- 
fically minded reader is obliged to accept this 
defense of the use of the individual case because 
the single case is the indispensable datum for 
science. The individual case does, as Professor 
Allport says (p. 150) "stand at the gateway 
and terminus of generalized knowledge." But 
this figure is justified only when it assumes the 
area of the mass of individuals with which the 
single case is (at least implicitly) brought into 
relation. That is, the single individual person- 
ality has his uniqueness thrust upon him only 
because his characteristics are shown up (at 
least implicitly) against the characteristics of 
other men. * Part Two....describes the various 
kinds of personal documents that have been 
used for analysis, all the way from the compre- 
hensive literary autobiography to certain pro- 
jective techniques. This section is fully adequate 
and interesting, but the reader who handles 
personal history materials will probably read it 
last, because he is in haste to get to the fulfill- 
ment of the promise to measure the value of his 
kind of experimental material. Part Three in 
fact is the most important part of the report, 
and to this reviewer the most important discus- 
sion is the one which centers around the inter- 
pretations which an analyst gives to his mate- 
rials. This matter is discussed under the term 
“conceptualization,” which is defined as “any 
theorizing imposed upon a raw narrative" (p. 
164). From his own examination of studies 
Professor Allport concludes (p. 142) even as 
Blumer had done earlier in his critique of 
Thomas and Znaniecki's interpretations in The 
Polish Peasant that for the most part docu- 
ments take their meaning and intelligibility from 
the comments and interpretations of the experi- 
menters. * This is an exceedingly important 
matter for the analyst of personal documents to 
ponder upon. Is he being led by the data, or is 
he leading, or even pushing the data round, to 
conform to certain presuppositions, or even to 
certain hunches which have arisen along the 
way? * Like most workers who handle personal 
documents Professor Allport is so certain of 
the worth of carefully secured first-person-his- 
tory material that his report is an apologetic 
throughout. * Nevertheless a good case does 
arise out of this exposition of the strengths and 
weaknesses of personal document materials and 
what is even more important a challenge for 
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new methods of handling, and for “bold and 

radical experimentation” (p. 190) in the criti- 

cal use of documents. Dr. Allport is not looking 

for the most fruitful advances to arise in the 

direction of “attaining acceptable statistical 

standards” but rather from the direction of 

“augmenting the efficiency and meaningfulness 

of the single document” (p. 18). His strong 

concern is for the development of prediction 

applied to the single case instead of a mass of 
cases. * The report is provocative, which is all 

to the good, and it is particularly helpful to 

those who have gone to the debatable length ` 
suggested by Dr. Allport (p. 151), “If we may 

assume that the concrete and the general are of 
equal importance in the production of psycho- 
logical understanding, it follows that case mate- 
rials (including personal documents) should 
claim half of the psychologist’s time and at- 
tention.” 

Q R Biol 17:384 D '42. * excellent mono- 
graph * The case both for and against the use 
of personal documents is strongly argued in an 
attempt to reach a common sense, middle-of- 
the-road attitude which will be conducive to 
critical, yet careful and sympathetic work. In 
conclusion, the author makes a plea for more 
bold and radical experimentation with personal 
documents; lists the pitfalls of which the work- 
ers in the field must continuously be aware; and 
suggests strong counter measures against the 
theorists who hold that personal documents are 
of little or no value in psychological research, 
The work is well documented * 

Social Service R 16:556-7 S '42. David M. 
Levy. * a thorough and careful study of the 
psychological value of personal documents— 
especially the diary—and a critical review of a 
rather voluminous literature on the subject * 
To the psychiatrist, the diary, letter, composi- 
tion, or any of the other fantasy productions of 
his patient are accepted as part and parcel of 
clinical study. They are treated with the same 
type of critique that is related to the products 
of a direct psychiatric examination. Hence the 
problem of their incompleteness, distortions, 
omissions, and the like is familiar. The psychi- 
atrist, unlike the psychologist, judging by All- 
port’s publication, does not question their value, 
They are the very substance of psychodynamics, 
That arguments must be used to prove their 
validity, since they are intimate revelations of 
the personality, would seem surprising to him. 

Sociol & Social Res 26:482 My-Je '42. 
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E(mory) S. B(ogardus). * Two chapters of 
special importance are those which present the 
“case against personal documents” and the case 
for them. Equally significant is the discussion 
on the problem of interpreting personal docu- 
ments or of generalizing from them. In this 
connection the author suggests the necessity of 
having a full mind, a breadth of knowledge, and 
also that each document be interpreted by sev- 
eral different research persons. Throughout, 
the author has maintained a reasonable degree 
of objectivity, although evidently favorable from 
the start toward personal documents as scien- 
tific material. The subject has been extensively 
and thoroughly canvassed, and the idiographic 
method tested and not found wanting. 


[717] 
X*ArscHULERg, Rose H., AND Hartwick, LA Berta 
Wess. Painting and Personality: A Study of 
Young Children, Volumes I and II. Chicago, Ill. : 
University of Chicago Press, 1947. Pp. xi, 263; vi, 265- 
590. $10.00 per set. * (PA 21:2311) 


[718 ] 
XAMrnICAN Counci on Epucation. Ninth Educa- 
tional Conference, New York City, October 31 
and November 1, 1940: Under the Auspices of the 
Educational Records Bureau, the ooperative 
Test Service, the Committee on Measurement and 
Guidance of the American Council on Education, 
and the Commission on the Relation of School 
and College of the Progressive Education Asso- 
ciation. Educational Record, Vol. 22, Supplement No. 
14. Washington, D. C.: the Council, January 1941. Pp. 
148. Paper. $0.50. * 
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XAMERICAN Councit on Epucation. Tenth Educa- 
tional Conference, New York City, October 30 
and 31, 1941: Under the Auspices of the Educa- 
tional Records Bureau, the Cooperative Test 
Service, the Committee on Measurement and 
Guidance of the American Council on Education, 
and the Commission on the Relation of School and 
oleo of the Progressive Education Associa- 
tion. Educational Record, Vol. 23, Supplement No. 15. 
Washington, D. C.: the Council, January 1942. Pp. 
156 Paper. $0.50. * 
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*Amertcan Council on Epucatton. Eleventh Edu- 
cational Conference, New York City, October 31, 
1946: Under the Auspices of the Educational Rec- 
ords Bureau, the Cooperative Test Service, the 
Committee on Measurement and Guidance of the 
American Council on Education, and the Commis- 
sion on the Relation of School and College of the 
Progressive Education Association. Educational 
Record, Vol 28, Supplement No. 16 Washington, 
D. C.: the Council, January 1947. Pp. 183. Paper. 


$0.50. * 
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XAMERICAN COUNCIL on EDUCATION, COMMISSION ON 
TEACHER Epucation, Division on CHILD DEVELOP- 
MENT AND PERSONNEL, DANIEL A. Prescott Heap 
Helping Teachers Understand Children. Wash- 
ington, D. C.: the Council, 1945. Pp. xv, 468. $3.50. 
(PA 20:312) 


Am J Sociol 51:354-5 Ja '46. Earl S. John- 
son. * The problem of understanding children 
was approached by way of anecdotal records, * 
It is interesting to see how the teachers' anec- 
dotal records of these "problem children" 
changed from condemnation and judgment- 
making to objective descriptions of significant 
behavior. These changes were, obviously, the 
result of the teachers’ growing ability to take a 
scientific attitude toward their subjects. These 
changes can also be described in terms of the 
teachers’ learning to find patterns, discover 
cause-and-effect sequences, and generally gain 
insight where they had formerly been disposed 
simply to “order and forbid.” The method is 
not new. It is that which Healey and Bronner 
employed in the study of delinquent children. 
It has been used by Mayo, Whitehead, Roeth- 
lisberger, and others in studies of morale among 
industrial workers. The basic assumption in all 
these attempts to understand “what makes 
Sammy run" is that the whole person is in- 
volved, and that behavior is caused. It is no dis- 
paragement of this well-disciplined study to 
observe that truly great teachers have under- 
stood much of what is involved in the cause 
and cure of “problem children." But they knew 
it privately and as an artist knows it. Through 
this and other similar studies the processes 
which induce deviant behavior have, through 
scientific study, become public knowledge. But 
as this knowledge has become public, where it 
was once private and "intuitive," the tendency 
has arisen to set up experts in both study and 
"treatment." In the reviewer's opinion the task 
of understanding students, whether they be chil- 
dren at the elementary level or adults at the 
college and graduate level, is one which cannot 
be entirely bureaucratized. Nor does this study 
suggest that. In fact, it appears to imply that 
teaching and understanding go hand in hand. 
This implication is the book's chief contribu- 
tion to the "understanding of children." If we 
take it seriously, we shall be somewhat less dis- 
posed to separate, at any educational level, 
teaching and "advising." 

El Sch J 46:115-6 O "45. Helen Nichols. * 
One of the most helpful chapters in the present 
volume is that on learning how to describe be- 
havior—difficulty which many teachers expe- 
rience in their first attempts at child study. 
When a teacher understands this procedure, she 
can apply the psychology that she has learned 
and will welcome research materials. * This re- 
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port stimulates the reader to follow through the 
chapters as an eyewitness and to observe the 
gradual growth of the teacher as she gains new 
concepts and insights and increased sensitivity 
to children’s motives. This growth, in turn, en- 
ables her to understand complex combinations 
of factors. These understandings bring about 
fundamental changes in the teacher’s way of 
thinking about children and in her own work, 
because she knows the developmental tasks com- 
mon to all, she sees individual differences, and 
she recognizes that behavior is “caused.” A fre- 
quent criticism of many books is that they give 
a theoretical presentation which is not close 
enough to the situation in the classroom. Such 
criticism cannot be directed at this report. The 
first ten chapters try “to make it possible for 
the reader to look directly at children through 
the eyes of the teachers in the study program” 
(p. 401). Chapter xii is of great value to teach- 
ers and administrators interested in launching 
a child-study program, for it tells explicitly how 
such a program was planned and managed in 
one school system. This report is highly mean- 
ingful, enriched as it is with a wealth of illus- 
trative materials. Teachers who read it will feel 
that they have shared in many similar situa- 
tions. It gives a technique by which teachers, 
working either as individuals or in groups, can 
learn to understand children better and to solve 
their own classrooin problems. This unusually 
practical book will be welcomed, not only by 
teachers and supervisory officers in the schools, 
but also by parents and others who seek to un- 
derstand children and by all those who wish to 
help others understand children. 

Sch & Soc 63:266-8 Ap 13 '46. Helen R. 
Becker. * Chapters II to VIII, inclusive,....are 
replete with evidence that illustrates (1) the 
professional growth made by the teachers as 
their skill increased in gathering and organizing 
objective, accurate data about individual chil- 
dren; (2) how they acquired background infor- 
mation about the several aspects of human de- 
velopment and behavior; (3) how they learned 
to relate to scientific principles the pertinent 
facts about individuals; and (4) how they ar- 
rived at sound conclusions on ways to help girls 
and boys toward wholesome development. Chap- 
ters IX and X, taking into vivid account the pro- 
cedures followed by one of the teachers in study- 
ing the child society and the social interaction 
of the children in the classroom, conclude the 
specific description of the various activities in 


this child-study program. * Chapter XI...is 
devoted to the evaluation of the study by the 
participating teachers and administrators. * Of 
the two concluding chapters, Chapter XII, 
“Conducting a Program of Child Study,” is 
particularly significant for those interested in 
improving outworn practices in the education 
of teachers, in that it describes how the pro- 
gram was launched on a relatively small scale 
and was gradually extended to include, volun- 
tarily, all of the professional personnel in this 
school system.* 

Sch R 54:58-9 Ja '46. Thomas M. Carter. 
* In the Preface Daniel A. Prescott tells us that 
the book is not about children but about teach- 
ers. Such it may be, but anyone who reads the 
book thoughtfully will gain a better understand- 
ing of, and a greater respect for, children try- 
ing to cope with the problems of growing up. * 
If any part of the book can be considered dis- 
appointing, it would be chapter v, in which the 
content does not fulfil the promise contained in 
the title, “Learning Some Explanatory Princi- 
ples.” No explanatory principles are set forth in 
the chapter. Instead, the teachers are merely told 
that the findings from a number of sciences make 
possible generalizations about how children 
grow and why they behave in certain ways. 
This is hardly more than making a Ulysses bow 
to science. The reader may properly raise the 
question whether or not biological principles, 
for instance, apply without modification to hu- 
man education. There are those who contend 
that certain modifications are required and that 
it is the function of experts and teachers in the 
field of educational psychology to show how 
such modifications are to be made. It may be 
that we have here an illustration of a weakness 
that characterizes the present child-study pro- 
gram and perhaps also the child-study move- 
ment led by G. Stanley Hall a half-century ago. 
In chapter xi the teachers set forth their own 
evaluation of the study. The testimonies given 
dazzle with such glow that the authors are led 
to warn the reader that, while the picture pre- 
sented may sound too good to be true, no single 
teacher reported all the desirable results and in 
no school did all the teachers profit equally. * 
This book belongs on the “must” reading list of 
all in-service teachers who desire a better under- 
standing of children and whowish for themselves 
a happier and more successful professional ex- 
perience. It could also be used to advantage as a 
textbook in preservice teacher training if the 
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anecdotal records presented in the volume were 
supplemented by readings and lectures that 
would make available to the students the ex- 
perimentally derived facts by means of which 
adequate principles to explain the anecdotally 
described behavior could be developed. 


[ 722 ] 
*AmERICAN COUNCIL on EDUCATION, COMMITTEE ON 
MEASUREMENT AND GUIDANCE. Conference of Test- 
ing Leaders in Conjunction With the Ninth Edu- 
cational Conference, New York City, November 
2, 1940. Washington, D. C.: the Council, [1941]. Pp. 
38. Paper, mimeographed. Out of print. * 

[723 ] 
XAMERICAN COUNCIL on EDUCATION, COMMITTEE ON 
MEASUREMENT AND GUIDANCE. Invitational Confer- 
ence on Testing Problems in Conjunction With 
the Tenth Educational Conference, New York 
City, November 1, 1941. Washington, D. C.: the 
Council, [1942]. Pp. 73. Paper, mimeographed. Out of 
print. * 

[724 ] 
X*AMERICAN COUNCIL on EDUCATION, COMMITTEE ON 
MEASUREMENT AND GUIDANCE, T. R. MCCONNELL, 
CnamMAN. New Directions for Measurement and 
Guidance: A Symposium Sponsored by the Com- 
mittee on Measurement and Guidance. American 
Council on Education Studies, Vol. 8; Series I, Re- 
ports of Committees and Conferences, No. 20. Wash- 
ington, D. C.: the Council, August 1944. Pp. vii, 103. 
Paper. $0.80. * (PA 19:1046) 

Clearing House 20:125 O '45. P(hilip) W. 
L. C(ox). Until and unless educators have in 
mind reasonably clear goals for selection and 
guidance for youths, there is inevitable ob- 
scurity regarding what traits to measure and 
how to judge them and wherein any guidance 
of any individual by another person can wisely 
be attempted. In this pamphlet...a number of 
very able men explored limited aspects of the 
problem in which the Cooperative Test Service 
has interested itself—college admission, individ- 
ual evaluation, and tests for accreditation, for 
personnel work in high school and in college, 
for teacher selection, and for coordinated re- 
gional programs in high schools. * The par- 
ticipants in all cases accepted the stereotypes of 
institutional education, and, by implication, the 
static conception of guidance of the object, to 
be forwarded like an express parcel by a “guide.” 
The reviewer feels....that despite the intelligence 
and sincerity of the participants in the sym- 
posium, the unconscious assumptions that to 
date smother the measurement and guidance 
“experts” stultify education. 

Occupations 23:429-30 Ap '45. H. B. Mc- 
Daniel. Again the American Council has issued 
a monograph so timely that it should be read 


by all progressive high school and college coun- 
selors and administrators. * Originally proposed 
as an indication to test makers of needed de- 
velopments in appraisal instruments, this sym- 
posium serves also as a review of the avail- 
ability and uses of standard tests. The counselor 
and administrator will find in it many ideas for 
evaluation and improvement of his own ap- 
praisal program. 


[725 ] 
XAMERICAN MANAGEMENT ASSOCIATION. Advances 
in Methods of Personnel Evaluation. Personnel 
Series, No. 107. New York: the Association, 1947. Pp. 
50. Paper. $1.00. * 


[ 726 ] 
XAMERICAN MANAGEMENT ASSOCIATION. Advances in 
Personnel Administration. Personnel Series, No. 
93. New York: the Association, 1945. Pp. 30. Paper. 
$1.00, * 

[ 727] 
AMERICAN MANAGEMENT Association. Measuring 
Results of Personnel Functions. Personnel Series, 
No. 111. New York: the Association, 1947. Pp. 31. 
Paper. $0.50. * 

[ 728 ] 
XAMERICAN MANAGEMENT Assoctation. Psycholog- 
ical Aids in the Selection of Workers. Personnel 
Series, No. 5o. New York: the Association, 1941. Pp. 
43. Paper. Out of print.* 

[ 729 ] 
XAMERICAN MANAGEMENT Assoctation. Rating and 
Training Executives and Employees. Personnel 
Series, No. roo. New York: the Association, 1947. Pp. 
43. Paper. $0.75.* 

[ 730 ] 
XAMERICAN MANAGEMENT Association. Selection 
and Development of Foremen and Workers. Pro- 
duction Series, No. 127. New York: the Association, 
1940. Pp. 43. Paper. Out of print. * 

[ 731 ] 
XAMERICAN MANAGEMENT ASSOCIATION. The Value 
of Psychology in Selecting Salesmen. Marketing 
Series, No. 45. New York: the Association, 1941. Pp. 
35. Paper. Out of print. * 

[ 732 ] 
XAMERICAN MANAGEMENT Assoctation. Values of 

sychology in Industrial Management. Personnel 


Series, No. 43. New York: the Association, 1940. Pp. 
36. Paper. $0.50. * 


Pub Personnel R 2:86 Ja '41. Henry J. Mc- 
Farland. * “A Test Program in an Industrial 
Company" is the second paper. In it, Harold C. 
Taylor of the Western Electric Co. does some 
straight thinking and talking about tests, their 
predictive value, the need for speeding up the 
"manufacturing interval" Civil service exami- 
ners could get here some sound arguments, 
some introspective stimuli and a good healthy 
"kick in the pants." The low incidence of test- 
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ing programs in industry is rather cleverly ex- 
plained. Edward N. Hay, Personnel Officer of 
the Pennsylvania Company, discusses in the 
third article some of his experiences with a test- 
ing program. It is interesting reading, but famil- 
iar to all who have struggled with test pro- 
grams. 

[ 733 ] 
X*AMERICAN  OPTOMETRIC ASSOCIATION, COUNCIL ON 
EDUCATION AND PROFESSIONAL GurpANce. Manual of 
Ocular Tests: A Compilation and Compendium 
of Ocular Tests Which Are in Use by the Armed 
Forces of the United States of America. Minneap- 
olis, Minn.: the Association, 518 Wilmac Building, 
1942. * (PA 16:4292) 

[ 734] 
Anverson, Howard R, AND Lrnpgutst, E. F.; RE- 
visen BY Harry D. Berc. Selected Test Items in 
American History, Revised Edition. National 
Council for the Social Studies, Bulletin No. 6, Re- 
vised Edition. Washington, D. C.: the Council, Na- 
tional Education Association, April 1940. Pp. 90. Paper. 
(PA 15:3133) For latest edition, see 735. 


Harvard Ed R 11:253 Mr ’41. J. Wayne 
Wrightstone. * The authors indicate clearly 
that these test items do not deal with such ob- 
jectives as study skills, attitudes, critical think- 
ing, and qualities of character or personality. 
These test items are valid for appraising mas- 
tery of information and concepts in a course in 
American history. The conscientious teacher of 
American history will find this reservoir of test 
items of great assistance. It will aid him in plan- 
ning better and more comprehensive examina- 
tions fitted to his local course of study. It is 
expected that he will select appropriate test 
items from this collection and will construct 
additional items when no appropriate exercises 
are available. The test items are arranged con- 
veniently in a chronological sequence under 
various topics * The authors state that the test 
items vary in difficulty, but they have failed to 
indicate the relative difficulty of the items as dis- 
covered in past usage. Such an index would 
provide added guidance to the classroom teacher 
in selecting the items to be used in a test. ’Addi- 
tional compilations of test items such as these 
and their continuous revision is one of the most 
important projects sponsored by the National 
Council for the Social Studies. 


[ 735 ] 

*Anperson, Howarp R., AND Lrnpguist, E. F.; RE- 
visen By Harry D. Berc. Selected Test Items in 
American History, Second Revised Edition. Na- 
tional Council for the Social Studies, Bulletin No. 6, 
Second Revised Edition. Washington, D. C.: the Coun- 
cil, National Education Association, September 1947. 
Pp. ii, 113. Paper. $1.00. * 
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*Anperson, HowARD R., AND Linguist, E, F.; RE- 
VisED By FnrpERICK H, Stutz. Selected Test Items 
in World History, Revised Edition. National 
Council for the Social Studies, Bulletin No. 9, Revised 
Edition. Washington, D. C.: the Council, National 
Hapus Association, January 1947. Pp. 95. Paper. 
$075. 


[737 ] 
*AxpznsoN, Puy tus M. Construction of the Norms 
for the Free Association Test Form 35 FK. Hu- 
man Engineering Laboratory, Inc., Technical Report 
No. 204. Boston, Mass.: the Laboratory, December 
1945. Pp. 3. Paper, mimeographed. * 


[ 738 ] 
*AnMrTAGE, Stewart G. An Analysis of Certain 
Psychological Tests Used for the Evaluation of 
Brain Injury, American Psychological Association, 
Psychological Monographs, Vol. 60, No. 1, Whole No, 
277. Washington, D. C.: the Association, Inc., 1946. 
Pp. iii, 48. Paper. $1.25, * 


[ 739 ] 
*AnNoLp, Berry Lou Brown. A Study of Work- 
sample 268 as a Preliminary Step Toward the 
Development of New Age Norms. Human En- 
gineering Laboratory, Inc., Technical Report No. 165. 
Boston, Mass.: the Laboratory, April 1945. Pp. 31. 
Paper, mimeographed. * 


[ 740 ] 
X*AmrHUR, Grace. A Point Scale of Performance 
Tests: Vol. I, Clinical Manual, Second Edition. 
New York: Commonwealth Fund, 1943. Pp. xi, 64. 
$1.50. * (PA 18:1539) For related reviews, see 36 :Br9, 
38 :B304, 40:1379, and 40:B830. 


[ 741] 
KASSOCIATION For CuttpHoop Epucation. Records 
and Reports: Trends in NE and U; "Them. 
Bulletin of the Association for Childhood Education. 
Washington, D, C.: the Association, 1942. Pp. 32 
Paper. $0.35. * 

[742] 
*Bancock, Harrier. Time and the Mind: Personal 
Tempo—The Key to Normal and Pathological 
Mental Conditions. Cambridge, Mass.: Sci-Art Pub- 
lishers, 1941. Pp. 304. $3.75. * (PA 15:3651) For re- 
views, see 72. 

[ 743 ] 
Bary, Epona W.; Laron, Anita D.; AND BisHOP, 
ErizaBETE L, Studying Children in School, Second 
Edition. The first edition is entitled Outline for 
Study of Children in Schools, New York: McGraw- 
Hill Book Co., Inc., 1939. Pp. vii, 182. $2.25. * (PA 14: 
517, title only) 


Am J Psychol 54:472 Jl '41. Claire C. Dim- 
mick. The authors have planned a book to aid 
adults in studying the children under their su- 
pervision. They point out the need for such 
studies and emphasize the fact that a child must 
be studied as a whole. To simplify the collec- 
tion of data “a selected list of items on which 
information can be ordinarily obtained without 
too much trouble or expense and for which the 
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significance in a child's life can be more or less 
certainly pointed out" is given. The list is de- 
tailed and suggestions are made as to the inter- 
pretation and significance of the data collected. 
* The material..is carefully worked out and 
should be of practical value in training teachers 
and prospective teachers in studying children 
—a function which too many teachers perform 
poorly or not at all. 
For additional reviews, see 40:B831. 


[ 744 ] 
*Barrp, SartLvANN. Description of Pegs Work- 
sample 438 Form A. Human Engineering Labora- 
tory, Inc., Technical Report No. 269. Boston, Mass.: 
the Laboratory, February 1946. Pp. 11. Paper, mime- 
ographed. * 


[ 745 ] 
XBarp, SALLYANN. A Study of Visual Imagination 
Ore ple 307, Forms AA and AB Leading to 
Form AC. Human Engineering Repeat Inc., 
Technical Report No. 134, Boston, Mass. : the Labora- 
tory, December 1944. Pp. iv, 16. Paper, mimeo- 
graphed. * 


[ 746 ] 
Baker, Harry J. AND TRAPHAGEN, VinGINIA. The 
Diagnosis and Treatment of Behavior-Problem 
Children. New York: Macmillan Co., 1935. Pp. xiv, 
393. $3.85. * (PA 10:2743) For reviews, see 33, 36 :B24, 
38:B306, and 40:B835. 


[747 ] 
Barrar, Parr Boswoop. Teaching and Testing 
English. London: University of London Press, Ltd., 
1939. Pp. xi, 167. 6s, * 


Brit J Ed Psychol 11:234 N "41. A(Ibert) 
E. C(hapman). The author has described this 
book as half old and half new. He has retained 
in revised form the most valuable parts of his 
Teaching the Mother Tongue, and has added 
chapters on speech training, voice production, 
Sir Philip Hartog's method of teaching com- 
position and a section of four chapters on test- 
ing English. This new book should prove most 
stimulating to teachers of English in junior and 
senior schools, as the author expresses wittily 
and wisely the results of his observations and 
experiments on methods of teaching and test- 
ing English. Teachers would probably have 
found the book even more helpful if instead of 
the greater part of the interesting historical 
accounts of critics who attempted to defend a 
standard of English the author had given them 
a fuller account of his views on the teaching of 
literature. Few teachers will agree that ‘ample 
suggestions for varying the type of exercise 
will be found in Part IT, Chapter II (p. 77), 
for this chapter describes only four types of 
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tests in English and although pupils may like 
answering them their value in teaching English 
will depend on the manner in which they are 
used by teachers. Devices for testing English 
may be valueless as methods of teaching Eng- 
lish as the author himself emphasizes on p. 161. 
Students in training should certainly read this 
book, which should also be in the teachers’ li- 
brary of every school in which pupils between 
the ages of five and sixteen are taught English. 

J Ed (London) 72:389 Ag '40. Dr. Ballard 
explains in his Preface that this book is based 
upon an older work, Teaching the Mother 
Tongue, written by him eighteen years ago, and 
now out of print. The new book (for so in effect 
it is) has been built round the old by a thor- 
ough-going addition and subtraction. The vigor- 
ous attack upon the wrong kind of grammar has 
gone, and its place is taken by a well-documented 
survey of modern methods of testing English. 
An older generation of readers may well regret 
the suppression of those lively chapters from 
the earlier book, and it is to be hoped that Dr. 
Ballard has not been too optimistic in judging 
—as he does—that the danger of wrong-headed- 
ness in the teaching of grammar has entirely 
disappeared. Although this new book of Dr. 
Ballard’s can hardly rank with his Thought and 
Language, yet it contains much of interest and 
value. * written throughout with his usual ease 
and lucidity. He is penetrating, persuasive, and 
never dull.* 

For an additional review, see 40:B837. 


[ 748 ] 
XBarsox, WirLiAM H, High School Scholastic Con- 
tests in South Dakota: A Report Covering the 
Results of the Program During a Ten-Year Period 
From 1935 to 1944, Vermillion, S.D.: School of Edu- 
cation, University of South Dakota, 1945. Pp. 51. 
Paper. $0.50. * 


[ 749 ] 

BECK, SAMUEL J. Personality Structure in Schizo- 
phrenia: A Rorschach Investigation in 81 Pa- 
tients and 64 Controls. Preface by C. Macfie Camp- 
hell. Nervous and Mental Disease Monograph No, 63. 
New York: Nervous and Mental Disease Monographs, 
1938. Pp. 88. Out of print. * (PA 13:277) For reviews, 
see 74 and 40:B841. 


[ 750 ] 
*Becx, Samust J. Rorschach's Test: I, Basic Pro- 
cesses. Foreword by Willard L. Valentine. New York: 
Grune & Stratton, Inc., 1944. Pp. xiii, 223. $3.50. * (PA 
18:2837) For reviews, see 75 and 77. 


[ 751 ] 
*Becx, SAMUEL J. Rorschach’s Test: II, A Variety 
of Personality Pictures. Foreword by Roy R. Grin- 
ker. New York: Grune & Stratton, Inc., 1945. Pp. xii, 
402. $5.00. * (PA 20:1150) For reviews, see 76 and 77. 
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[ 752 ] 
*Bzzns, F. S.; BoATRIGHT, J. A.; AND CnzsrER, H. M. 
Some Principles of Examining, With Aids for 
Consulting Examiners. Athens, Ga.: Georgia Uni- 
versity Press, January 1942. Pp. i, 45. Paper, lithotyped. 
Out of print. * (PA 17:674) 


[ 753 ] 
X*BrNpER, EvELYN. A Study of Abstract I 
tion Worksample 329 and Revision From 
A to Form B. Human Engineering Laboratory, Inc., 
Technical Report No. 155. Boston, Mass. : the Labora- 
tory, June 1945. Pp. 14. Paper, mimeographed. * 


[ 754 ] 
Benner, LAunerTA. A Visual Motor Gestalt Test 
and Its Clinical Use. American Orthopsychiatric As- 
sociation, Research Monographs, No. 3. New York: 
the Association, 1938. d xi, 176. $3.50. * (PA 13:1720) 
For reviews, see 109 and 40:B843. 


[ 755 ] 
*BENGE, EUGENE J. ee Evaluation and Merit Rat- 
ing: A Manual of Procedures. Deep River, Conn.: 
National Foremen's Institute, Inc, 1941. Pp. 100. 
Loose-leaf. $7.50. * 
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[ 756 ] 

*Benyamin, Hazer C. Employment Tests in In- 
dustry and Business: A Selected, Annotated Bibli- 
ography. Princeton University, Department of Eco- 
nomics and Social Institutions, Industrial Relations 
Section, Bibliographic Series, No. 67. Princeton, N. J.: 
the Section, April 1942. Pp. vii, 46. Paper. * (PA 16: 
3735) For latest edition, see 757. 


Am J Psychol 56:152-3 Ja *43. T. A. Ryan. 
* The bibliography is designed “to aid war in- 
dustries and agencies in canvassing the possi- 
bilities of employment tests” according to the 
foreword by Director J. D. Brown. As such it 
contains a sampling not only of technical re- 
ports from the literature designed to be read by 
the psychologist, but also of articles written 
without technical detail and aimed at the per- 
sonnel administrator. It appears to be a repre- 
sentative selection of both types of report. 

Occupational Psychol 16:203 O '42. * Al- 
though the material described is predominantly 
American, as are the addresses of publishers 
and periodicals appended, English readers will 
find it thoughtfully and concisely laid out and 
conveniently indexed. * 

Occupations 22:152-3 N '43. Louise E. 
Gettys. * In the booklet on Employment Tests 
in Industry and Business, the Director of the 
Industrial Relations Section draws attention to 
the growing use of employment tests in the se- 
lection of personnel. The bibliography has been 
compiled mainly for this purpose, and is sub- 
divided accordingly under : General Discussions, 
Testing Principles and Procedures, Material 
Concerning Specific Types of Tests, and Re- 
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ports of Company Experiences and of Research 
in the Use of Tests. 


{ 757 ] 
*Benyamin, Hazer C. Employment Tests in Indus- 
try and Business: A Selected, Annotated Bibliog- 
raphy, 1945. Princeton University, Department of 
Economics and Social Institutions, Industrial Rela- 
tions Section, Bibliographic Series, No. 67, Revised. 
Princeton, N. J.: the Section, April 1945. Pp. vii, 46. 
Paper. $0.50. * (PA 19:3106) 

[ 758 ] 
*Bennett, Grorce K., AND CRUIKSHANK, Ruru M. 
A Summary of Manual and Mechanical Ability 
Conperation, toga Dy. Go, Paper, Hehotped, $070 
pr SEA eae A Die A 

J Ed Psychol 34:255-6 Ap '43. Forrest H. 
Kirkpatrick. School psychologists will find this 
compilation of information about tests of man- 
ual and mechanical abilities very helpful. It is 
important, however, always to sound the cau- 
tion that most of the tests reported in this re- 
view are more useful for vocational selection 
than vocational guidance purposes. Indeed, when 
we remember that it is the probable ability of 
a person which we desire to forecast in voca- 
tional guidance, we realize that an accurate 
measure of present abilities has serious limita- 
tions. This may be especially true in manual 
dexterity which is subject to psychological pa- 
rameters which may be considerably different 
in the adolescent boy or girl than in the adult 
man or woman. * Each test treated in full is 
clearly and concisely set out * will be helpful 
to teachers and administrators who are counsel- 
ling students, and to "research workers inter- 
ested in analysis of human abilities" * 

Occupational Psychol 16:203-4 O ’42. * care- 
ful and valuable compilation * 

Occupations 22:152-3 N '43. Louise E. 
Gettys. Vocational counselors will find the Ben- 
nett-Cruikshank compilation of information 
about tests of manual and mechanical abilities 
helpful. It is important, however, to sound the 
caution that most of the tests reported in this 
review are more useful for vocational selection 
rather than for vocational guidance. When we 
remember that it is the probable ability of a 
person which we desire to forecast in vocational 
guidance, we realize that an accurate measure 
of present abilities has serious limitations. * 


[ 759 ] 
* BERNARD, WILLIAM, AND LEOPOLD, JuLes. Test Your- 
self! A T'andhnok of Self-Analysis Based on Mod- 
ern Psychology, Second Edition. New York: Hell- 
man, Williams & Co., 1947. Pp. 94. $2.00. * 
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[ 760 ] 
X*BraNsrEIN, EstHEr. Knowledge of Geometric 
Figures. Human Engineering Laboratory, Inc., Tech- 
nical Report No. 185. Boston, Mass.: the Laboratory, 
July 1945. Pp. 5. Paper, mimeographed. * 

[ 761 ] 
Betts, EMMETT ALBERT; WITH THE ASSISTANCE OF 
AnrRUR W. Ayers. Data on Visual Sensation and 
Perception Tests: Part II, Visual Efficiency. 
Meadville, Pa. : Keystone View Co., 1940. Pp. 66. Paper, 
lithotyped. $0.25. * (PA 15:78) 


Ed Abstracts 5:276 S ' 40. William W. Brick- 
man. * The chief conclusion (there are nineteen 
in all) derived from the data is that the Betts 
tests and the Snellen Chart Test have sufficient 
reliability for purposes of screening out (de- 
tecting) individuals with questionable visual 
efficiency. * This report should be read in con- 
nection with Part I * Had Professor Betts re- 
peated some of the information describing his 
tests, instead of referring to Part I (which was 
unavailable to the present abstractor), his latest 
publication would have been more comprehen- 
sible and more interesting to the non-specialist 
reader. Of value are the directions for adminis- 
tering the Snellen Chart Test and the exception- 
ally thorough bibliography. The long list of 
limitations of the study shows care on the au- 
thor’s part to avoid claims of unlimited validity. 


[ 762.] 
XBzrrs, EMMETT A., AND Ayers, AmrHUR W. Data 
on Visual Sensation and Perception Tests: Part 
III, Stereopsis. Meadville, Pa.: Keystone View Co., 

1940. Pp. 37. Paper, lithotyped. $0.25. * (PA 15:707) 

[ 763 ] 
*Berrs, EMMETT À., AND Ayers, AnrHUR W, Data on 
Visual Sensation and PO PHon Tests: Part IV, 
Ametropia. Meadville, Pa. : Keystone View Co., 1941. 

Pp. 69. Paper, lithotyped. $0.25. * (PA 15 24943) 

[ 764 ] 
XBiESHEUVEL, S. African Intelligence. Johannes- 
burg, South Africa: South African Institute of Race 
Relations, 1943. Pp. viii, 225. 7s. 6d. * (PA 19:3192) 
Brit J Psychol 35:49 Ja ’45. The author’s 
Primary purpose in writing this book was to 
controvert the conclusions of Dr. L. Fick, 
stated in a monograph, The Educability of the 
South African Native, that “....the available 
objective data point to a marked inferiority on 
the part of the Native in comparison with Euro- 
peans, This inferiority, occurring in certain 
tests in which learning or environmental condi- 
tions are equalized for the Native and European 
groups, does not appear to be of a temporary 
nature....the inferiority of the Native in educa- 
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bility, as shown by measurement of their actual 
achievement in education, limits considerably 
the proportion of Natives who can benefit by 
education of the ordinary type beyond the rudi- 
mentary.” * Biesheuvel makes a very thorough 
analysis of all the conditions that must be con- 
trolled or allowed for before such a comparison 
of test scores is valid. The influences on test 
performance of poor home environment and 
upbringing, inferior school teaching, malnutri- 
tion, lack of interest and unsuitability of test 
material, are so much greater and more wide- 
spread for native than for white children that 
at present it is quite impossible to select com- 
parable test groups, or even to establish a basis 
for comparing them. Dr. Biesheuvel’s study is 
a fairly damning piece of evidence against any 
attempt to prove a fundamental inferiority in 
the African’s potential intelligence as long as 
his place in society remains such as it is in the 
Union of South Africa, But the study has wider 
implications of importance to psychologists, with 
regard to the whole question of the comparative 
intelligence testing of groups differing widely in 
culture and social status, environmental condi- 
tions, and fundamental interests. Even if the 
environmental conditions could be controlled or 
equated, as might be possible in other parts of 
Africa, there remains little doubt that the con- 
tent and type of performance demanded by in- 
telligence tests, even of the performance type, 
make little appeal to the African; such intelli- 
gence as he possesses does not, and possibly 
cannot, express itself along those lines. It yet 
remains to be seen whether it is possible to find 
any test situations in which the intelligences of 
both the African and the European can function 
readily enough to be really comparable. More- 
over, it is just as important that this difficulty 
should be faced in testing different social groups 
with different customs and interests, even when 
they are of the same race and nationality. Dr. 
Biesheuvel is to be congratulated on his clear 
and detailed discussion of these problems. 


[ 765 ] 
XBrjov, Sipney W., Eprron. The Psychological Pro- 
gram in AAF Convalescent Hospitals. Army Air 
Forces Aviation Psychology Program Research Re- 
ports, Report No. 15. Washington, D. C.: Government 
Printing Office, 1947. Pp. viii, 256. $1.25. * 


[ 766 ] 
XBINGHAM, WALTER VAN DYKE, AND Moore, Bruce 
Victor. How to Interview, Third Edition. New 
York: Harper & Brothers, 1941. Pp. xi, 263. $3.00. * 
(PA 16:304) 
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Adv Mgmt 7:48 Ja-Mr’42. Mathias Lynaugh. 
* a readable and instructional text which draws 
heavily upon the more recent studies in applied 
psychology * Nine of fifteen chapters are par- 
ticularly useful in the business and industrial 
field, for employment interviewers, personnel 
men and others who use the interview method 
even with considerable experience and training. 
Included in this section is a significant chapter on 
the interview as applied to employer-employee 
relations. 

J Ed Res 35:462-4 F '42. Carter V. Good. 
* The appendices and the long annotated bibli- 
ography of earlier editions have been omitted, 
with a comprehensive bibliography covering 
1934-41 to be issued separately by the junior 
author. In view of the extensive use of the 
volume as a handbook, a number of the chap- 
ters might well have included pertinent topical 
headings, as is done effectively in certain other 
chapters. * 

Occupations 20:236-7 D ’41. Joseph V. 
Hanna. In this....revision of this serviceable 
volume a considerable number of minór and a 
more limited number of major changes have 
been made. * The minor changes have to do 
with the order, combination, and revision of 
chapter headings, deletions, and the like. * The 
major changes consist of the omission of the 
appendices and of the annotated bibliography, 
of the inclusion after each chapter of a selected 
list of references, and especially the addition of 
two chapters dealing with oral examinations. 
The omission of Appendix I, which has to do 
with the vocational interview, and of Appendix 
II, which presents an outline for the psychiatric 
examination of a child, is to be regretted. The 
ready availability of even more such illustrative 
materials would enhance the value of such a 
volume as a handbook for counselors. Since the 
reviewer appreciates the dilemma of the authors 
in their efforts to conserve space, this observa- 
tion is not offered as criticism. Counselors and 
students of counseling will in general find in 
the selected list of references which follows each 
chapter a more serviceable arrangement of read- 
ings than the more detailed annotated bibliog- 
raphy of the preceding edition. The one change 
which in the reviewer's opinion fully justifies 
the authors in presenting the new revision is 
the addition of the two chapters on oral exami- 
nations, One of these added chapters deals with 
the oral examination in Civil Service agencies ; 
the other, with techniques and methods used in 
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training interviewers and field investigators. * 
While the methods [in the first of these chap- 
ters]..apply to Civil Service agencies, they 
would seem to be just as applicable to any other 
field which is willing to exercise the same de- 
gree of care in developing and standardizing 
careful examination procedures. The second 
chapter fills a need which is of long standing, 
but which has not been felt by those who regard 
the rating scale or other hastily prepared de- 
vices for the appraisal of personal qualities as 
fool-proof instruments. Well-trained personnel 
is rightly regarded by the authors as indispen- 
sable to any kind of machinery for administer- 
ing oral examinations. In presenting these new 
materials the authors bring their authoritative 
and helpful text into line with the ever-increas- 
ing tendency toward a more careful and more 
objective appraisal of the human personality. 
The added materials measure well up to the 
standards which characterize the book as a 
whole. The new revision merits fully the favor- 
able reception accorded the earlier editions. 

Personnel J 20:264 Ja '42. W. H. Sellander. 
* The beginner will find many helpful sugges- 
tions on learning how to interview; and the 
experienced interviewer will discover references 
to new techniques in other fields that will be- 
come very useful in perfecting his own inter- 
viewing. * 

Social Service R 17:238 Je '43. Mary Houk. 
* Social workers will not find the chapter de- 
voted to their own profession a substitute for 
reading the rich material by case workers which 
has appeared in the past few years. The book 
is valuable for its elucidation of techniques used 
in related fields. The chapters on employment 
interviews and the oral examinations in civil 
service agencies are particularly rewarding, as 
this is an area in which Dr. Bingham has had 
extensive experience. 

For reviews of the second edition, see 
38:B314. 

[ 767 ] 
XBirrEL, Jane. Standard Error on Worksample No. 
Three. Human Engineering Laboratory, Inc., Techni- 
cal Report No. 367. Boston, Mass.: the Laboratory, 
February 1947. Pp. 27. Paper, mimeographed. * 
[ 768 ] 

Buss, Esma F., Jr. Standardized Tests and Edu- 
cational Practice. Brooklyn, N. Y.: the Author, P.O. 
X3 d 1940. Pp. 29. Paper, lithotyped. $0.75. * (PA 
15:482 


Ed Abstracts 5:339 N ’40. Henry D. Rins- 
land. * The writer is more familiar with the 
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presentation of an argument than with the legal 
control of the teaching profession. He shows 
only a layman's knowledge of testing. He for- 
gets that teachers are licensed under statutory 
law, just as physicians are, and therefore already 
have the right to use the instruments of their 
profession—all of them. 

J Am Med Assn 115:1913 N 30 '40. This 
pamphlet seems to express a deep-rooted and 
intense, but rather vague, resentment and dis- 
satisfaction with the existing methods by which 
intelligence, aptitude and other psychologic tests 
are employed in the schools. An ordinance is 
proposed by means of which the purchase, 
preparation, administration and interpretation 
of such tests could supposedly be regulated, al- 
though the pamphlet also condemns what it 
calls political regulation while recommending 
an ordinance to be adopted by local political 
bodies, plus federal regulation. The text of the 
ordinance and the involved cumbersome style in 
which the pamphlet is written make it extremely 
difficult to interpret. There is, in fact, no assur- 
ance that the central idea, if any, contained in 
this pamphlet is not so deeply buried in words 
that it is not totally incomprehensible. 

Loyola Ed Digest 16:11 Ja '41. Austin G. 
Schmidt. 'The author believes, and not of course 
without some reason, that standardized tests 
are sometimes purchased because of exagger- 
ated claims made by publishers, that tests are 
sometimes used unwisely by administrators and 
teachers, and that results are sometimes divulged 
when they should be kept secret. He proposes 
an ordinance setting up the conditions under 
which tests may be purchased and used by pub- 
lic schools. The preliminary arguments by which 
the-author builds up his case are obscured by 
a style which sets sonorousness far above sim- 
plicity and directness. 


[ 769 ] 
XBros, Perer. Adolescent Personality: A Study of 
Individual Behavior. Prepared for the Commission 
on Secondary School Curriculum. New York: Apple- 
ton-Century-Crofts, Inc., 1941. Pp. xiii, 517. $3.00. (PA 
15:1996) 

Am J Orthopsychiatry 11:788-9 O "41. Ira 
S. Wile. The life histories are set forth with 
too ready acceptance of the validity of the tests 
employed, although no effort was made to set 
up a uniform procedure for the studies. It is 
unnecessary to criticize the volume in extenso, 
but clearly typical normal adolescents are not 
represented by a r4-11/12 year old girl (IQ 
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129) in the roth grade, and another, 18-2/12 
years of age, in college for two years. While 
there is much of significance in the book, the 
author reveals a definite lack of background of 
experience. He evidences a marked tendency to 
read meanings into statements in terms of a 
preconceived scheme of development of the psy- 
chological organization. This is true for both 
the material data gained by direct observation 
and those that are an outgrowth of self-expres- 
sion. The reviewer has a definite sense of the 
omission of material by purposeful selection. In 
the discussion of Betty and Paul, after referring 
to the variables “which allow access to the writ- 
er's personality, with the aid of interpretive 
techniques," it is stated of the interviews, “The 
worker followed no pre-ordained plan. The free 
conversation interview was designed, on the 
whole, to explore adolescent responses to vari- 
ous life situations, in order to gain insight into 
the characteristics of individual behavior ob- 
served and recorded by the school.” While this 
was supposed to be spontaneously offered, the 
text reveals how carefully the adolescent was 
led by the interviewer. This volume will un- 
doubtedly receive considerable praise because it 
is issued by the Commission on Secondary 
School Curriculum of the Progressive Educa- 
tion Association, but it scarcely serves as an 
authoritative and practical guide for those re- 
sponsible for solving problems of adolescents. 
The emphasis upon adolescent reaction as a 
whole person in a total situation is sound, but 
there is danger in relying upon indirect revela- 
tion of the total situation from isolated re- 
sponses even though the inferences are based 
upon “interpretive techniques competently em- 
ployed." 

Psychoanalytic Q 11:576-8 O '42. Margaret 
Mahler-Schoenberger. * of value to anyone 
dealing with personality development. Because 
of the primary objective of the book, its presen- 
tation of theory will not satisfy the psycho- 
analyst. Its treatment of case material, however, 
can leave no doubt that the book has been pre- 
pared by a person thoroughly familiar with psy- 
choanalytic concepts. * The cases of *Mary' and 
'Joe' which comprise Part Four contain only 
interview material. They show in an impressive 
way how Mary and Joe passed through the up- 
heaval of puberty to emerge as fairly well- 
adjusted people, without receiving active help 
from the outside. Throughout the interviews 
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the worker made no attempt to deal directly 
with these two young people except by listening 
to them. Their developmental history is recon- 
structed solely on the basis of the interview 
material and the transference situation. The 
material demonstrates-how the average worker, 
remaining within the limits of his competency, 
can guide and clarify troubled adolescents sim- 
ply by listening to them and making himself 
available to them. But it is deplorable that the 
worker’s faulty handling of the interviews is 
not clarified by the author. A situation of nega- 
tive countertransference is pointed out in the 
case of ‘Joe.’ But the author does not discuss 
this situation in any detail, nor does he criticize 
the interview handling in the case of ‘Mary,’ 
and her worker’s conspicuous blind spots, espe- 
cially regarding sexual material. The author 
unfortunately misses an excellent opportunity 
in this section to educate the educator through 
a demonstration of the pitfalls which lie in the 
path of an unwary interviewer. Throughout the 
case discussion emphasis is placed upon the reac- 
tivation of infantile problems during adoles- 
cence, and the educator is made aware of the 
contributions he can make to help his adoles- 
cent students in the struggle of puberty. The 
presentation of concrete case material with 
cumulative interpretations is particularly well 
suited for demonstrating the continuity of de- 
velopment and the symptomatic character of 
behavior. This unique way of formulating cer- 
tain aspects of psychodynamics may provide 
educators with insights which could never have 
been conveyed as effectively through a purely 
theoretical discussion so long as there exists a 
lack of a common conceptual language between 
psychoanalysts and the bulk of workers in the 
present educational system. As mentioned above, 
this adaptation to the conceptual language of 
educators, a far reaching compromise at the 
expense of psychoanalytic psychology, seems a 
compromise which may lead the psychoanalyst 
to criticize Part Three, A Theory of Adolescent 
Development. * 

Psychoanalytic R 31:214-5 Ap '44. A. N. 
F(oxe). * The author contributes to an active, 
idealistic, and forward looking view in this vol- 
ume; it should be welcomed in educational 
circles. 

Q R Biol 16:381-2 S '41. * A sound, factual, 
thought-provoking contribution to the literature 
on adolescence, 


[ 770 ] 
*Bocuner, RUTH, AND HALPERN, Frorence. The 
Clinical Application of the Rorschach Test. Intro- 
duction by Karl M. Bowman. New York: Grune & 
Stratton, Inc., 1942. Pp. x, 217. * (PA 16:2315) For 
latest edition, see 771. For reviews, see 78. 

[77 ] 
*BocHNeR, RUTH, AND HarPERN, Frorence. The 
Clinical Application of the Rorschach Test, Sec- 
ond Edition. Introduction by Karl M. Bowman. New 
York: Grune & Stratton, Inc., 1945. Pp. xi, 331. $4.00. * 
(PA 20:472) For reviews, see 79. 

[772] 
XBorres, Mary Marjorie. The Basis of Pertinence: 
A Study of the Test Performance of Aments, De- 
ments and Normal Children of the Same Mental 
Age. Archives of Psychology, No. 212. Washington, 
D. C.: American Psychological Association, Inc., Suis 
1937. Pp. 51. Paper. $1.00. * (PA 11:5549) 

[773 ] 
XBooxwarrER, Karu W. Further Studies of Indiana 
University Motor Fitness Index. Bulletin of the 
School of Education, Indiana University, Vol. 19, No. 
5. Bloomington, Ind.: Indiana University Bookstore, 
September 1943. Pp. 44. Paper. $0.50. * (PA 18 :1022) 

[ 774 ] 
*Booxwatter, KARL W., AND BOOKWALTER, CAROLYN 
W. A Measure of Motor Fitness for College Men. 
Bulletin of the School of Education, Indiana Uni- 
versity, Vol. 19, No. 2. Bloomington, Ind.: Indiana 
University Bookstore, March 1943. Pp. 22. Paper. 
$0.50. * (PA 17:2652) 

[ 775 ] 
*Bower, Lies B. A Factor Analysis of Music 
Tests. Washington, D. C.: Catholic University of 
America Press, 1945. Pp. 45. Paper. $1.00. * 

[ 776 ] 
XBowrzv, AcATHA H. A Study of Factors Influenc- 
ing the General De end of the Child During 
Pre-School Years by Means of Record Forms. 
British Journal of Psychology Monograph Supple- 
ments, Vol. 8, No. 25. London: Cambridge University 
Press, 1940. Pp. xi, 48. Paper. 125. 6d. * 


Brit J Ed Psychol 13:53 F '43. Ll. W (ynn) 
J(ones). This research....[has as its] basis a 
record form for use with children aged two to 
four years eleven months. A scoring method 
was devised for measuring motor development, 
language development, emotional development, 
and social adjustment. A rating was given to 
economic status, health standard, and play be- 
havior. The I.Q. was found in most cases on 
the Merrill-Palmer Scale. Dr. Bowley’s ex- 
perience in nursery school work and in child 
guidance has served her well. The treatment of 
social-emotional development deserves particu- 
lar attention as norms here are difficult to es- 
tablish. In the absence of norms the clinician 
without adequate training in psychology is apt 
to draw conclusions in terms of his own idols, 
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Emotional difficulties occur in most children, 
but, as Valentine has pointed out, the total situ- 
ation to the child at the particular time must be 
considered. A minor point is that the reviewer 
is not sure after reading pages 36, 42 and 62 
whether the maximum score on the social scale 
is twenty-one or twenty. * The account of the 
effect of nursery school training on I.Q. refers 
to American investigations carried out prior to 
the appearance of the Merrill-Terman Scale. It 
may therefore be pertinent to supplement it by 
mention of Miss Amma’s work in the review- 
er’s laboratory in 1939: The Merrill-Terman 
tests for which nursery training seems beneficial 
were absurdities, giving differences and simi- 
larities, defining abstract words, comprehension, 
finding rhymes, and Minkus completion. 

J Ed (London) 77:254-6 My ’45. C. 
Birchenough. * appears most opportunely * 
Miss Bowley has rendered a great service in 
devising an observational record form showing 
the general development of the pre-school child 
which cannot fail to deepen understanding, 
quicken the imagination and direct the obser- 
vation of students and practising teachers alike. 
She insists on the necessity of studying the 
whole development of the child, and the record 
is devised to bring out the interdependence of 
all aspects of growth, physical, intellectual, and 
social emotional. * The broad principle has been 
to compose rating scales of development such as 
the teachers can readily deal with. These are 
supplemented by the use of the Merrill Palmer 
Scale or the Terman-Merrill Revision for esti- 
mating mental growth. * The qualitative results 
have to be taken with caution because of the 
small number of children investigated —r18 in 
all. * can be warmly commended to the atten- 
tion of all who are engaged in work with young 
children. 


[77 ] 
*Boyn, Georce Rosert. The Construction of an In- 
strument for Measuring Attitudes Toward Desir- 
able Food Practices. Bulletin of the Bureau of 
School Service, College of Education, University of 
Kentucky, Vol. 16, No. t. Lexington, Ky.: the Bureau, 
September 1943. Pp. 85. Paper. $0.50. * (PA 18:526) 


{ 778 ] 
*BnznETON, J. L. The Case for Examinations: An 
Account of Their Place in Education With Some 
Proposals for Their Reform. London: Cambridge 
University Press, 1944. Pp. viii, 226. 8s. 6d. * (New 
York: Macmillan Co. $2.25.) s 


J Ed (London) 76:2546 My "44. C. 
Birchenough. * The widely different views 


which are held regarding the place of examina- 
tions are at bottom the expression of differing 
philosophies, of differing degrees of insight into 
the educational process, and of varying histori- 
cal background. As Assistant Secretary to the 
University of Cambridge Local Examination 
Syndicate, Mr. Brereton speaks with the expert 
knowledge of one closely in touch with the con- 
duct of secondary-school examinations. Any- 
thing he says regarding the technique of exam- 
ining is likely to be listened to attentively and 
with profit. * When a certain exuberance leads 
him to proclaim examinations as the panacea 
for the ills of *modern mass education", the 
reader is led to reflect that something of this 
sort of thing has been said many times before. 
Patterson, writing in the sixties of last century, 
pointed out that "the marvellous effects of the 
examination system inaugurated in 1800 (at 
Oxford) in awakening life in the place" led to 
a belief "that examinations could accomplish 
everything. We now know better the limits of 
their power". Mr. Brereton, however, has some 
reforms to suggest. * Teachers are to play a far 
greater part than hitherto; and, as school ex- 
aminations are involved, the present system of 
free choice is to give place to a system of re- 
gional grouping so that each examining body 
will deal only with the schools in its geographical 
region. On these matters Mr. Brereton has a 
good deal to say that is interesting, though he 
is unlikely to carry any general conviction. For, 
though he is writing for the new world after 
the war, his direct knowledge of school prob- 
lems is limited to orthodox secondary schools 
(grammar schools). He has no personal ac- 
quaintance with the problems peculiar to the 
great fields of elementary and further educa- 
tion. He feels, however, that the administrative 
principles which seem to apply to the more in- 
telligent children in orthodox secondary schools, 
Some 12 per cent of the child population, are 
applicable to the great mass of children. * Mr. 
Brereton envisages an examination machine 
composed of 23 as against the present 8 execu- 
tive bodies which will cater for 350,000 chil- 
dren a year, or more than so per cent of the 
whole child population of the r 5-16 age group, 
by means of an examination conducted on re- 
formed School Certificate lines, If this is not an 
excessive use of examinations and the suggested 
imposition of a new tyranny, it is difficult to 
imagine what is. Yet it is the logical outcome 
of Mr. Brereton’s initial premises and of his 
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philosophy of education. Under his leadership 
we should be back again to the position of a 
hundred years ago in university work, when 
Whewell deprecated the tendency in Cambridge 
to the preponderance of "indirect teaching", 
the effect of which was to conduct "education 
almost entirely by means of examinations and 
to consider lectures given in college as useful 
only in proportion as they prepared the student 
for successes in examinations. The love of 
knowledge and the love of distinction with the 
fear of disgrace are the two mainsprings of all 
education and it does not appear wise or safe 
to try to dispense with either of them". Mr. 
Brereton is scathing in his criticism of the want 
of success of the Norwood Committee in deal- 
ing with the question of school examinations. Tt 
cannot be said that he has made any real con- 
tribution himself to the solution of this most 
difficult question. 

J Higher Ed 16:453-4 N '45. Vera French. 
* Firsthand experience over a period of sixteen 
years with the organization of the Cambridge 
Scholarship Examinations has well prepared 
Mr. Brereton for the task he undertakes in this 
book. Abundant illustrative material, aptly in- 
serted, and a clarity of expression make his a 
convincing case for examinations. 


[ 779 ] 
*Broaptey, Marcaret E. Square Pegs in Square 
Holes. Garden City, N. Y.: Doubleday & Co., Inc., 
1943. Pp. ix, 211. $2.50. * (PA 18:1859) 

[ 780 ] 
*BRONFENBRENNER, Unie. The Measurement of So- 
ciometric Status, Structure, and Development. 
Sociometry Monographs, No. 6. Reprinted from So- 
ciometry, Vols. 6 and 7, 1943-44. New York: Beacon 
House, 1045. Pp. 80. Paper. $225. * (PA 18:2851-2; 
I9 :461 ; 20 :224, title only) 

[ 781 ] 
Broom, M. E. Educational Measurements in the 
Elementary School. New York: McGraw-Hill Book 
Co., Inc., 1939. Pp. x, 345. $3.25. (London: McGraw- 
Hill Publishing Co., Ltd. 205.) (PA 14:1056) 


Childh Ed 17:138 N '40. Alice Temple. * 
should serve well the needs of teachers and stu- 
dents in training. 

J Ed (London) 73:510-2 N "4r. C. M. 
Fleming. * a most lucid and scholarly descrip- 
tion of the origins and the present position of 
standardized testing * The material described 
is almost exclusively American, but the method 
of its description is excellent, and the book may 
be commended as a most useful discussion of the 
contribution made by the test movement to suc- 


cessful evaluation in the elementary school. 
Nation's Sch 25:100 My '40. * A large area 
of modern development in this field has been 
omitted, which certainly decreases its scientific 
value. 
For additional reviews, see 40:B850. 


[782] 


*Brown, Ciara M. Evaluation and Investigation 
in Home Economics. New York: Appleton-Century- 
Crofts, Inc., 1941. Pp. xviii, 461. $3.50. * (PA 16:4164) 


Ed Adm & Sup 28:313-4 Ap '42. Elizabeth 
Stevenson. Comprehensive yet detailed; scien- 
tific yet practical; stated in terms of home eco- 
nomics education, yet dealing with basic educa- 
tional principles, this book should be of value 
to teachers and administrators in amy area of 
education who are interested in measuring the 
attainment of educational goals and improving 
instructional procedures through the use of bet- 
ter evaluation techniques. * Interesting problem 
illustrations are given showing how to measure 
the factual knowledges, understandings, tastes, 
appreciations and attitudes acquired, and the 
ability to apply what has been learned. * The 
investigation and elementary research tech- 
niques are meticulously set forth. They should 
be of practical use to the classroom teacher in 
improving the quality of the measurement de- 
vices he prepares, and in evaluating his total 
educational program. They should stimulate 
graduate research in home economics education 
procedures. They could serve as an excellent 
guide to those interested in conducting research 
in the general education curriculum and in the 
teaching-learning processes in the elementary 
and secondary schools as well as in the college. 
The book is an important contribution to home 
economics education, leaders of which long have 
striven to obtain more accurate evidence that 
real learnings take place through the work- 
study experiences which are based on, or grow 
out of, the everyday living of boys and girls. 
The most helpful guides for the busy classroom 
teacher seem to this reviewer to be the sample 
problem situations and measurement devices 
and the direct, lucid explanations underlying 
theory. 

Sch R 50:467-9 Je’42. Frances Swineford. * 
organized under the two main heads....““Evalu- 
ation” and “Investigation.” The "Evaluation" 
section, constituting the major portion of the 
book....is replete with a wide variety of detailed 
examples of tests, score cards, rating scales, and 
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other devices for use at various levels of learn- 
ing. In fact, the competent discussion of the 
purposes and interpretations of evaluation and 
the presentation of a generous number of well- 
selected illustrative examples comprise the 
book's chief contribution. Although all the sam- 
ple tests are concerned with a specific field, their 
very specificity serves to clarify the more gen- 
eral portion of the discussion so that the vol- 
ume becomes a valuable aid to any student of 
measurement. The second portion of the book, 
entitled "Investigation," deals with research 
procedures. In her attempt to touch on all as- 
pects of investigation, the author found it neces- 
sary to include certain topics without the com- 
prehensive treatment which characterizes the 
section on evaluation. * Topics which are intro- 
duced with no attempt to give the student a 
working knowledge through these pages alone 
include analysis of variance and covariance, bi- 
serial r, partial and multiple correlation, and 
regression. It is not the reviewer's intention to 
suggest that these topics should be expanded. 
There is a tendency, however, to oversimplify 
some of the more elementary topics, which might 
be covered more fully. For example, the com- 
mon definition for "standard score" has been 
omitted in favor of a transmuted score having 
positive, integral values. This latter score should 
not have been called a “‘standard score,” but it 
could have been derived from the standard- 
score formula. The statistical portion is the 
weak spot of the book. Not only are some of the 
topics inadequately treated, but several errors 
are apparent. The following, among others, 
might be mentioned. Distinction should be made 
between “percentile” (a score value) and “per- 
centile rank” (a percentage). The word “rank” 
should not have appeared in the sentence: “The 
25th percentile rank coincides with Q and the 
75th percentile rank with Qs" (p. 319). For- 
mula I on page 321 defines the median score 
as N/2 instead of as the middlemost score in 
the distribution. The paragraph on the stand- 
ard error of a correlation coefficient gives the 
usual formula, SE, = (1 — r*)/./N which 
is said to be applicable “only when a relatively 
low degree of relationship exists” (pp. 362-63). 
The alternative formula, SE, = 1/ VN — tis 
then presented, the implication being that the 
latter form would apply in case the correlation 
is high or in case N is smaller than 500. Obvi- 
ously such a procedure would yield for a high 


correlation a standard error larger than that 
obtained for a low correlation with the same 
number of cases! It should have been made 
clear that the second formula is used only to 
test the null hypothesis that the true correlation 
is zero. The author’s style is clear and easily 
read. The material is well organized, ample sub- 
divisions being used to help the reader keep the 
general outline in mind. Important ideas are 
italicized and thus are easily located. The “Eval- 
uation” section is especially recommended not 
only as a textbook useful in home-economics 
education but also as a reference volume for 
teachers in any field. 


[ 783 ] 
*Brown, Epwin J. AND O'RrmLv, Virctnta M. A 
Self-Rating Scale for the Elementary Teacher. 
Kansas State Teachers College of Emporia, Bulletin 
of Information, Vol. 24, No. 6; Studies in Education, 
No. 28. Emporia, Kan.: the College, June 1944. Pp. 36. 
Paper. $0.20.* 

[ 784 ] 
*Brown, Max Duane. Variability as a Function 
of Ability and Its Relation to Personality and 
Interests. Archives of Psychology, No. 262, Washing- 
ton, D. C.: American Psychological Association, Inc., 
July 1941. Pp. 45. Paper. $0.80. * (PA 16:493) 

[ 785 ] 
Brown, WILLIAM, AND THoMsoN, Goprrey H., The 
Essentials of Mental Measurement, Fourth Edi- 
tion. London: Cambridge University Press, 1940. Pp. 
x, s 215. * (New York: Macmillan Co. $6.50.) (PA 
15:4. 

Am J Psychol 54:619 O "41. Albert K. Kurtz. 

* the fourth edition is the same as the third, 
with the addition of four new chapters, which 
are reprints of recent papers by each of the 
authors. A number of topics of current interest 
are not covered; e.g. there is no reference to 
the work of R. A. Fisher and others on small 
samples or the analysis of variance, and only 
four pages are devoted to multiple correlation. 
On the other hand, the recent work on factor 
analysis is covered in considerable detail, this 
being the main subject of all five of the chap- 
ters which have been added since 1921. The 
first four chapters deal with psychophysical 
problems, and statistical concepts are introduced 
only as they are needed in understanding, de- 
scribing, or analyzing the data. Consequently, 
those Statistical methods which are immediately 
applicable to the data are. sometimes unduly 
stressed at the expense of other methods which 
are applicable to other psychological problems. 
The next four chapters give a good explanation 
of the meaning of correlation and point out cer- 
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tain pitfalls in interpretation. In common with 
many statistics texts, the computational methods 
illustrated are not always the best for practical 
use (either with or without a calculating ma- 
chine). * Chapters IX through XV, covering 
theories of mental ability from Spearman's The- 
ory of Two Factors through modern factor 
analysis, deal with controversial material and, 
in the reviewer's opinion, vary considerably in 
their quality. Chapter XIII concludes that “the 
main purpose of our research has been achieved ; 
namely, to establish Professor Spearman’s The- 
ory of Two Factors on an adequate statistical 
basis.” But some psychologists may object to 
this conclusion, for it was arrived at after (1) 
devising “tests that, approximately, should fit 
a theoretical criterion (that of zero ‘tetrads’),” 
(2) omitting two tests, (3) partialling out the 
v-factor from six verbal tests, (4) partialling 
out the specificality from the two Code tests, 
(5) partialling out the specificality from two 
of the perceptual tests, (6) discarding tetrads 
which involve re.1s (as 7 of the 23 largest tetrads 
did), and (7) omitting one test which “can be 
considered to account for nearly all other tetrad 
differences greater than 0.1000.” Chapters XI 
and XV, also on factor theory, are sincere and 
competent attempts to examine controversial 
theories in an unbiased and strictly objective 
manner. On the whole, the book is a scholarly 
presentation of the field of mental measurement. 
It contains only a small number of minor errors, 
and it is written in an interesting manner. The 
large amount of original material it contains 
will make it valuable not only as a text for stu- 
dents but also as a reference manual for re- 
search workers in psychophysics, psychological 
statistics, and factor theory. 


[ 786 ] 
*Bnucr, MvmrLE. Factors Affecting Intelligence 
Test Performance of Whites and Negroes in the 
Rural South. Archives of Psychology, No. 252. Wash- 
ington, D. C.: American Psychological Association, 
Inc., July 1940. Pp. 99. Paper. $1.50. * (PA 15:4501) 


[ 787 ] 
*BuncEMEISTER, Bessie B. The Permanence of In- 
terests of Women College Students: A Study in 
Personality Development. Archives of Psychology, 
No. 255. Washington, D. C.: American Psychological 
Association, Inc., July 1940. Pp. 59. Paper. $1.00. * (PA 
15 :3042) 

[ 788 ] 
*Buros, Oscar Krisen, Eprron. The Nineteen Forty 
Mental Measurements Yearbook. Highland Park, 
N. J.: The Gryphon Press, 1941. Pp. xv, 674. $700. * 
(PA 15:1992) 


Am J Orthopsychiatry 12:372 Ap '42. 
L(awson) G. L(owrey). This important vol- 
ume is made up of 503 frankly critical Reviews 
of tests contributed by 250 of our ablest psy- 
chologists and subject specialists. By compari- 
son with earlier editions, reviewers have more 
space for appraisals, and various viewpoints re- 
garding particular tests are better represented. 
In addition to critical appraisals of tests, nearly 
170 pages are devoted to reviews of books hav- 
ing to do with tests, in which many journals 
are quoted. There are indices of periodicals 
cited, of titles and of names, all of which con- 
tribute greatly to utilization of the volume. We 
have previously called attention to this indis- 
pensable encyclopedia, of which four previous 
volumes have been issued since 1933. This is 
the most comprehensive of the series and it is 
difficult to know how better one could get at a 
comprehensive survey of so many tests. At a 
time when tests roll off the assembly line with 
such rapidity, a work of this sort is invaluable. 
It is an essential part of every clinic library. 

Am J Psychiatry 98:915 My ’42. Elizabeth 
Gifford. * A description of one section of the _ 
book “Character and Personality” will serve to 
indicate the type of service available in all fields 
of standardized tests. Herein are listed every 
current test of character, personality and ad- 
justment, most of them of the questionnaire type, 
together with references and usually two or 
more reviews. The reviewers are chosen to 
represent all useful viewpoints and include users 
of the test, authors of similar tests, and experts 
on test construction, as well as those critical of 
the value of all questionnaire tests. The reviews 
describe the test and appraise it in terms of re- 
liability, validity and usefulness. The most fre- 
quent criticism, apart from those levelled at all 
self-evaluating questionnaires, is of the inade- 
quacy of validation. This should result in a 
raising of the standards of test construction and 
in a more critical attitude to the usefulness of 
tests. All test users will appreciate the work... 
Buros and his collaborators are doing. 

Am J Psychol 55:287-9 Ap ’42. Roger M. 
Bellows. Current and widely used tests and test 
literature are appraised in a way that makes 
this volume useful not only as an aid in select- 
ing tests for specific purposes but as a clearing 
house of opinions of test technicians and test 
users. The yearbook is also a catalog of tests, test 
authors, test publishers, and sources of informa- 
tion about them. * The largest section of the 
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book, comprising tests and test reviews, includes 
tests of English (40 pages), fine arts (7 pages), 
foreign languages (34 pages), intelligence (70 
pages), mathematics (46 pages), miscellaneous 
tests and also a discussion of several compu- 
tational and scoring devices (44 pages), tests 
in the field of [social] studies (24 pages), 
and tests pertaining to vocations (39 pages). 
The 250 ‘cooperating test reviewers are either 
experts in test construction and validation or 
in particular subject matter fields, or are teach- 
ers. In some instances the test technicians re- 
port detailed or analytical studies or present 
data from their files pertaining to the test being 
treated so that their reviews amount to valuable 
research reports. The review by Lorge of the 
Detroit Clerical Aptitudes Examination may be 
cited (p. 433). He presents a factor analysis 
of the reported intercorrelations showing that 
41 per cent of the variance of the examination 
scores may be attributed to a factor common to 
intelligence tests. The review by Crawford of 
the Preference Record (p. 477) illustrates an 
instance in which the reviewer had an oppor- 
+ tunity to experiment with the instrument being 
appraised. If, in obtaining reviewers for the pro- 
posed subsequent yearbooks, the editor should 
find it possible to locate more reviewers who 
would be able and willing actually to determine 
validity and dependability of the tests by meth- 
ods of controlled observation the value of the 
reviews would be greatly increased. The subject 
matter specialists have offered comments and 
critical remarks that in many instances should 
point the way to tests and test manuals designed 
with a view to more adequate instructions, sim- 
pler administration procedure, improved evalu- 
ative results, and more practical suggestions for 
interpretation of obtained scores. In several 
cases the teachers also have made suitable and 
appropriate comments in their reviews but in 
others they appear to have been hypercritical. 
* This yearbook, in addition to its utility as a 
compendium, should prove valuable in focusing 
the attention of the increasing numbers of test- 
users upon the many inconsistencies in the test- 
ing field, particularly the lack of uniformity in 
assumptions that are basic to test evaluation. 
Am Sch Board J 102:85 Mr "41. * a monu- 
mental undertaking * It is difficult to review 
so encyclopedic a work, which derives its chief 
utility from its factual dependability and in- 
clusiveness. In some 674 large, double-column 
pages, the book provides accurate information 


and a critical review of new and revised edi- 
tions of tests which have appeared in the last 
two or three years. * Complete indexes of titles, 
names, etc., make the volume valuable for quick 
review. 

Austral J Psychol & Philos 19:89-94 Ap ' 41. 
C. A. Gibb. * the fifth publication in this excel- 
lent series produced by Professor Buros. And 
yet one hesitates so to designate it because in 
six years since the original publication in the 
series its growth and extension have been al- 
most incomprehensible. * An extraordinary 
expansion of the programme of the previous 
volume has taken place. And that in spite of 
the fact that Buros has failed to get financial 
assistance for the continuation and expansion 
of this work. * Readers familiar with the 1938 
volume will notice the omission here of its sec- 
tion “Research and Statistical Methodology 
Books.” The Editor promises, however, that 
this omitted section will be published as a sepa 
rate volume early this year. Thus the 1940 Year- 
book has really become two volumes. With such 
an expanded coverage one would expect great 
changes in the format of the volume, and there 
is no disappointment here. The 1940 Yearbook 
is delightfully printed and bound. It is set in 
larger type on a much larger page which affords 
greater ease of reading. One should point out, 
however, that this is not a book which anyone 
will want to read from cover to cover. It is 
essentially a reference work. It is just that fact 
which makes two opposed viewpoints possible. 
One might assume that since the Yearbook is a 
reference book the increased size of the volume, 
and, in fact, its spread to two volumes are of no 
consequence. On the other hand one might 
argue that the smaller print of the 1938 Year- 
book was adequate for this type of work and 
that it would, therefore, have been preferable 
to retain the small type and keep all the mate- 
rial between the one set of boards. Buros evi- 
dently takes the former view. The reviewer in- 
clines to the latter, and would have preferred 
the more closely set but handier volume. On the 
other hand Buros announces (p. xx) his inten- 
tion of providing handy and cheap information 
in each of the several fields covered by the 
“Yearbook.” A new series of monographs is 
being planned. Each such publication will pre- 
sent a comprehensive bibliography of all tests 
in the chosen limited field (together with re- 
views of those tests) which have appeared in 
previous Yearbooks; and, in some cases, new 


7713] BOOKS & REVIEWS: BUROS 


reviews will be included. It is proposed that 
such separates in each field should be issued at 
about six-yearly intervals, more or less fre- 
quently according to the test material published. 
It is obvious that these monographs cannot be 
so up to date as the “Yearbooks,” but they will 
afford teachers and others an opportunity to 
have always handy, reliable information con- 
cerning the more important tests. The first of 
these separates is to be devoted to tests of Eng- 
lish and reading and will be published in late 
1941 or early 1942. While this additional serv- 
ice is admirable it does not compensate for the 
increased cumbersomeness of the "1940 Year- 
book." One cannot but feel that as this expan- 
sion continues the book will be consulted less. 
Further, the expansion has been, in some in- 
stances, unwarranted. The more detailed bibli- 
ography given for each test is very welcome. 
But there must be some doubt as to the wisdom 
of extending test reviews, especially since re- 
views do occasionally overlap. The “Books and 
Reviews" section of the “1940 Yearbook" occu- 
pies only about one-third of the volume. It 
might, however, have occupied still less. An 
actual case or two may make this criticism 
clear. “Baker and Traphagen—The Diagnosis 
and Treatment of Behaviour-Problem Children” 
was given three columns of reviews from twelve 
journals in the “1938 Yearbook." In the present 
volume it is allotted another three columns for 
reviews from an additional four journals. Ad- 
mittedly one of these reviews sees the book from 
a new point of view, the others, however, do 
not. Again three of these additional reviews 
were published too late for inclusion in the 
1938 Yearbook, the other was not. It is difficult 
to conceive this additional three columns re- 
turning proportionate benefit to the Yearbook 
reader. This is no isolated example. Similar 
doubt arises concerning additional space given 
to at least half a dozen other books. One other 
facet of this same critical point concerns the 
large amount of space allotted to a few of the 
more outstanding books. Freeman's “Mental 
Tests” in its revised edition is represented here 
by eight reyiews occupying six columns. More 
severe pruning of reviews might well have done 
this book justice in half the space. A similar 
criticism applies to the presentation of fifteen 
reviews of “The Psychology of Early Growth” 
by Gesell and Thompson; and of the four pages 
of reviews of McCall’s “Measurement” ; and of 
the eleven columns given to “Who Shall Sur- 
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vive” by Moreno; and there are many others. 
To put it bluntly Buros shows signs of being 
carried away by his organisation, His desire to 
be impartial at all costs has led him to the high 
cost of abstracting every available review ar- 
ticle of measurement books. It would be mis- 
leading if this review concluded upon an even 
slightly disparaging note. The reviewer has no 
intention of creating an impression of abnormal 
gigantism, The “1940 Yearbook” is more valu- 
able than its predecessors have been; and that 
is no mean praise. Its list of co-operating spe- 
cialists is longer, it is more up to date at pub- 
lication, it is more complete in coverage, it has 
sampled more journals for reviews, Australian 
readers will be interested to realise that among 
new journals covered is “The Medical Journal 
of Australia,” while The Australasian Journal 
of Psychology and Philosophy has been cov- 
ered since the inception of Buros’ review ab- 
stracts. Buros has, however, gone further than 
that. He has arranged with the Australian Coun- 
cil of Educational Research and with Psycholo- 
gists and Educationists in this country for a 
more complete coverage of the Australian con- 
tribution ; and "The 1942 Mental Measurements ` 
Yearbook” is to have several Australians among 
its ever-growing list of co-operating reviewers. 
One can only wish Professor Buros and his 
venture every success and again offer congratu- 
lations upon the continued publication of this 
“Co-operative Service for Test Users.” 

B. C. Teach 20:387 Ap ’4t. C. B. C. This 
year, those who have been subscribing to the 
Mental Measurement Series will feel as if they 
had ordered a Pocket Oxford Dictionary and 
had been forwarded Webster’s Unabridged. 
The volume is three times as thick and six 
times as heavy, much larger print and heavier 
paper having been used. The tests listed do 
not seem to be more numerous, but an effort 
has been made to present all possible informa- 
tion on each one, usually by means of several 
reviews from various critics. In many cases 
these might have been combined, particularly 
when factual information about the contents, 
time, reliability, or methods of standardization 
are concerned. Most of the reviews, however, 
seem to be very honestly delivered. If a test is 
full of defects, the reviewers say so, and the 
numerous pages of this book are not cluttered 
up with what sometimes amount to mere ad- 
vertisements. By reading the yearbook a pur- 
chaser should be able to form a very good opin- 
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ion of any tests he may be thinking of buying 
or using. * it is not necessary to consult a 
dozen catalogues before you select a test which 
you believe will suit your requirements, and 
much of the guesswork commonly associated 
with the selection of tests has been removed. 
Nevertheless you still must use your own judg- 
ment to a certain extent; the editor has made 
no move toward ranking, contrasting or evalu- 
ating tests which are classified under the same 
heading. The preface contains a welcome an- 
nouncement. Monographs on tests in single 
subjects, containing descriptions and reviews 
of tests, are soon to be published. * the section 
"Books and Reviews" will be more interesting 
to the research worker or professor of educa- 
tion than to the classroom teacher. The indices 
of titles and names have been made more con- 
venient for reference * Canadians will be pleased 
to observe that some sixteen or more reviewers 
now are Canadian or British, and that many 
reviews of British tests are included. The 
U.B.C. staff will notice with regret that Dr. 
David H. Russell, one of the reviewers, is still 
listed as an assistant professor of the University 
of Saskatchewan. 

B Nat Assn Sec Sch Prin 25:135-6 Mr "41. 
* outstanding volume on a new test-appraisal 
service. 

Bookbinding & Book Prod 34:56 Jl ^41. 
David M. Glixon. A neat job: half a million 
words running to nearly 700 pages of double- 
column type, all very businesslike, no space 
wasted—but with every facility given to make 
consultation simple. * while there is no wasted 
space, there is also no crowding to cramp the 
teacher's attention. The sober title page in Cas- 
lon, framed by spaced parallel rules, has dignity 
without being forbidding—a few words in italics 
do the trick. And the cover design is magnifi- 
cent for a book of this type. In its bright red 
buckram and its black panels with gold rules 
and type, it bears the stamp of authority with 
the advantage of attractiveness. 

Booklist 37:401 My I '41. * a specialized, 
but important, reference work * 

Brit J Ed Psychol 11:237 N "41. This sub- 
stantial volume provides a useful reference book 
on many aspects of mental measurement, an 
exhaustive examination being made of recent 
publications on tests and examinations. * With 
the earlier volumes it should prove a valuable 
book of reference. 

Brit J Psychol 32:177-8 O "41. Many read- 


ers of this Journal must have found occasion to 
consult the Nineteen Thirty-Eight Mental 
Measurements Yearbook, an enterprise in pub- 
lication which deserved, and apparently, achieved 
wide publicity and as wide appreciation. Now 
comes the Nineteen Forty issue, larger, more 
ambitious, and definitely better than the earlier 
volume. The amount of devoted labour which 
Prof. Buros must have put into this publication 
must have been prodigious. * a very excellent 
collection of published critical studies of the 
material is reproduced. The reader is therefore 
in a very much better position than if he merely 
had a summary, and nothing else, for deciding 
whether the tests or discussions dealt with give 
him what he may be trying to find. No reputa- 
ble psychological library ought to be without 
this book, and there can be few teachers of psy- 
cholegy who will not be glad to be able to direct 
their more intelligent students to its pages. 
More than that, the worker in almost any ap- 
plied field of psychology, who is himself search- 
ing for tests of practical value, will find here a 
mine of most valuable information. It is clearly 
one of the first essentials, in any volume of this 
kind, that the arrangement of sections should 
be lucid and systematic and the indexing very 
good. It is difficult, in this case, to see how these 
matters could be done better than they are. It 
is probably inevitable that there should be some 
omissions, but the present reviewer who has 
reasonably good opportunities for becoming ac- 
quainted with current work in at least some of 
the sections covered by the publication, has 
never looked into these sections without finding 
much to learn. It is very much to be hoped that 
the publication will continue for many years to 
come. 

Bus Ed World 22:75 S '41. Jessie Graham. 
* The showing in business education is a meager 
one when compared with the rich offerings in 
other fields. * As all the reviews of one book 
appear together, certain characteristics of spe- 
cific reviews stand out plainly. Reviews that are 
superficial, overcritical, or properly informative 
are easily identified when placed side by side 
with others on the same subjects. This is an ex- 
cellent source of information about tests and 
books on testing. The book might have even 
greater usefulness if the following were in- 
cluded: 1. Short introductions to the sections, 
summarizing trends in measurement in the spe- 
cial fields. 2. A separate index for books re- 
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viewed. 3. A bibliography at the end of each 
section. * 

Calif J Sec Ed 16:293 My '41. Harold W. 
Leuenberger. * a monumental project in cooper- 
ative work in the field of tests and measure- 
ments * Buros...is to be congratulated both for 
the growth of the work and for his plans for 
future yearbooks and monographs. * The year- 
book will, without question, become an invalu- 
able aid to anyone engaged in measurement 
work, for no individual can hope to keep his 
knowledge apace with developments in the field 
by his own efforts alone. Any worker in the 
field, whether he employ one or ten thousand 
tests a year, cannot help but be grateful to 
Buros and his associates for the work they have 
done and made available in this and previous 
publications. From a purely personal stand- 
point, this reviewer would desire that the re- 
views of a particular test be limited, wherever 
possible, to those from individuals who actually 
have used the instruments in testing work. “On 
the job" use often shows flaws, or merits, to 
testing instruments which may be overlooked 
when contact with the test is limited to exami- 
nation of the test and its accompanying stand- 
ardization data. 

Char & Pers 9:248-9 Mr ’41. Edward J. 
Stainbrook. * Buros of Rutgers University 
here presents another comprehensive and ex- 
cellently edited yearbook of psychological and 
educational tests. This 700-page compendium 
of critical reviews of the available standardized 
tests for the evaluation of general and specific 
educational achievement, intelligence, character 
and personality, and vocational aptitude and in- 
terest, is in itself a physical index of the pro- 
gressively increasing appreciation and enthusi- 
astic acceptance of this book by users of mental 
tests. * The present work, like the 1938 volume, 
gives its major space to the tests and their re- 
views, even omitting the usual section on re- 
search and statistical books in order to make 
more room for the test reviewer's criticism. 
This omitted section is planned as a separate 
volume for the early spring of 1941. The num- 
ber of reviewers has been increased to 250, and 
includes classroom teachers, city school research 
workers, clinical psychologists, curriculum spe- 
cialists, guidance counselors, personnel workers, 
psychologists, and test technicians. * Another 
commendable change is to be seen in the dis- 
carding of the 40- to 100-word evaluations used 
in part in The 1938 Yearbook in favor of longer, 
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more representative reviews. Certain deficien- 
cies in the 1938 reviews of non-pencil-and-paper 
tests are remedied in the present work. Addi- 
tionally, a more thorough and detailed compila- 
tion of references dealing with construction, 
uses, limitations, and validation of tests has been 
achieved so that in many instances there are to 
be found extraordinarily comprehensive bibli- 
ographies of great general utility. The Ror- 
schach test, for example, is listed with 147 
references to the pertinent literature. * Happy 
improvements in typography, format, and pub- 
lication lag have been made in this new edition. 
This book is the mental test Baedeker. Any- 
one who contemplates the use of mental meas- 
urements for any purpose whatever will find 
this yearbook indispensable. Professor Buros's 
promise of, biennial volumes assures a continu- 
ously supplemented encyclopedia of tests and 
measurements. 

Child Develop Abstracts 15:230 O "41. * As 
is bound to be the case in compendia by nu- 
merous contributors, there is unevenness of 
workmanship and in some cases inadequacy of 
treatment. The Rorschach method, which inci- 
dentally is not mentioned in the index, is repre- 
sented by a list' of 147 bibliographic references 
but no description of the test or its use nor 
appraisal of its value is given. 

Clearing House 16:188 N '41. E(arl) R. 
G(abler). * most excellent contribution to the 
field of measurement * In the critical evalua- 
tion of each test the reader will find information 
that cannot be obtained from any other source. 
The good points and bad points of each test 
are clearly set forth. The book and review sec- 
tions will be found valuable to those looking for 
books bearing upon a particular area. This vol- 
ume represents a very painstaking piece of 
work. It speaks with authority and competence, 
and meets a felt need of those who are faced 
with selecting tests for a particular purpose. 

Ed Abstracts 6:60 F ’41. D. G. Ryans. * 
This yearbook represents a huge expenditure 
of time and energy on the part of the reviewers 
and the editor alike. The result appears to be 
a valuable source book in tests and measure- 
ments and a potentially useful critical index of 
tests for those persons concerned with the se- 
léction and procurement of testing instruments. 

Ed Forum 5:488-9 My ’41. The 1938 year- 
book was reviewed in these columns, Contrasted 
with it the present volume is truly monumental. 
* There is a useful index of periodicals which 
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have contained reviews, with full references, in- 
cluding the name of the editor and the list of 
references reprinted in the volume. * Bec»u- 
it was impossible to enlist the support of philan- 
thropic agencies and no subsidy has been forth- 
coming from other sources, a private, non-profit 
organization, has published the yearbook and 
will undertake to publish future editions. De- 
spite some difficulty in securing the cooperation 
of publishers and of securing competent re- 
views, the present volume is inclusive and covers 
the general field adequately. In reviewing the 
former yearbook, it was stated that the editor 
deserved the appreciation and congratulations 
of those working in the testing field. Even more 
is this true in connection with the present vol- 
ume. The critical and incisive reviews should 
do much to compel better test-construction and 
to assist the user in making his selections from 
the more than 1500 now available. Since mil- 
lions of tests are now used annually in the pub- 
lic schools of America, it is of great importance 
that the best be employed. The volume is a 
useful reference work for all makers and users 
of tests, 

El Paso Sch Standard 9:36 O "4r. M. E. 
Broom. * The reviews of tests are not uniform 
in kind or in length. Some instruments, as the 
Joel and Joel Parents Rating Scale (to cite the 
first one the reviewer encountered on opening 
the book at random), are given mention only. 
Other tests are reviewed extensively by more 
than one person. Perhaps subsequent volumes 
might be helped if the editor would establish 
definite criteria for test reviews. The 1940 
Mental Measurements Yearbook is a valuable 
reference book, and it is a volume that should 
be available to educators generally, and particu- 
larly to persons interested in research and guid- 
ance. 

El Sch J 41:495 Mr '41. Nelson B. Henry. 
[An editorial] * These authoritative reviews 
will enable test-users to select instruments 
adapted to the purposes of their measuring pro- 
grams. * also a handy reference work for spe- 
cialists in test construction * conveniently in- 
dexed * 

Fed Probation 5:44-5 O-D '41. David Segel. 
* Buros had made a tremendous effort to obtain 
reviews of tests by competent persons and in 
the main it is believed he has been successful. 
* of great value to persons selecting tests, for 

various education and guidance purposes and 
has an indirect value through the influence it 


brings to bear upon persons working in the test 
construction area. Because of the general excel- 
lence of the work one hesitates in such a brief 
review to call attention to any particular defi- 
ciencies. However, this reviewer has a feeling 
that the tests on specific subjects—and these 
comprise the bulk of the tests—have had a more 
unfavorable series of reviews than is warranted. 
A large number of these tests were criticized in 
one of the three following ways: (1) that the 
tests called for information rather than reason- 
ing on the subject; (2) that the tests did not 
cover the outcomes desired on the subjects 
which they were supposed to cover; and (3) 
that the tests were faulty in construction. The 
last criticism is always a legitimate one and the 
yearbook has served a very useful purpose in 
pointing .out deficiencies in construction. The 
other two criticisms are to an extent justified 
but they are also to some extent unjustified. 
These criticisms are justified in the fact that the 
tests are actually restricted as to scope but are 
not necessarily justified considered from the 
standpoint of the test constructor. He has often 
as not gone as far as he is able using the tech- 
niques available to set up a paper and pencil 
test on the subjects involved. The criticism 
against these individual tests is really a criticism 
of the testing movement as a whole. The very 
fact that so many tests were criticized in an 
identical manner shows that the fault does not 
lie with individual test constructors. However, 
the yearbook serves a useful purpose in bring- 
ing to light these general criticisms of the test- 
ing movement. It would be still more useful if 
these same reviewers would also describe how 
tests should be constructed in order to avoid 
the errors they so eloquently point out. The 
bringing together of excerpts of reviews in one 
Place makes reference much easier. This is 4 
valuable feature of the yearbook. 

J Am Assn Col Reg 16:347-8 Ap "41. J. 
Conrad Seegers. * [The] growth in size [of the 
volumes in this series] has been accompanied 
by equivalent growth in usefulness and signifi- 
cance. This issue is truly a co-operative enter- 
prise. * Manifestly, this is a very useful book. 
The bibliographical aids and lists of tests alone 
would justify it. However, it must be used with 
discrimination. The editor has attempted to se- 
cure reviews from qualified persons, and cer- 
tainly the list of reviewers includes many names 
which demand respect. But one must remember 
that. these reviews are opinions and that many 
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of these opinions rest simply upon analyses. No 
other method was possible, but the fact is un- 
altered. Occasionally one finds disagreement in 
reviews of a given test. Moreover,....Buros at- 
tempted deliberately to secure reviews "from 
persons representing a wide variety of view- 
points” and fields of activity. Absolute agree- 
ment should not be expected or desired, in such 
circumstances. However, the very nature of the 
book demands that one use it as a source book 
of opinion concerning tests and their uses, 
rather than as factual material describing them. 
Probably catholicity of opinion would have been 
approached more closely if more elementary 
and secondary school teachers or principals had 
been requested to express opinions. Universi- 
ties and colleges provide most of the reviewers, 
and when one does find a public school person 
in the list he is usually in an administrative 
office, not a teacher. We should be told more 
about tests by the ultimate consumers. Some 
day, perhaps, we shall consider tests useful or 
useless according to their proved diagnostic or 
predictive value, not simply because of theoreti- 
cal considerations. This is not to be construed 
as a severely adverse criticism of this volume, 
but it is a statement of its limitation. In fact, it 
is a limitation inherent in much of the work 
in measurements. We are frequently too prone 
to believe that high reliability guarantees use- 
fulness, just as we too frequently assume that 
an expression of opinion about a test is a state- 
ment of fact, if that opinion emanates from a 
respected source. However, if one remembers 
these limitations, and uses this carefully pre- 
pared and excellently printed volume accord- 
ingly, he should find it an almost invaluable 
reference work upon which the editor is to be 
congratulated and for which he deserves thanks 
and praise. 

J Am Med Assn 117:1659 N 8 *41. * There 
is little that is original in the volume, for it 
consists of an alphabetical listing of (1) intelli- 
gence, achievement and other educational and 
psychologic tests and (2) books which have to 
do with testing and the examination of the 
individual. * Each test is followed by rather 
extensive quotations from reviews which have 
appeared elsewhere. The reviews from THE 
Journat and others having to do with medical, 
psychologic and educational matters are carefully 
culled in order to give as good evaluation as 
possible of the test materials. Psychologists 
wishing to learn about any test are, by use of 


this guide, in rather a good position to deter- 
mine which one would be useful for a special 
purpose, provided they bring to bear on the 
problem the special training which the psychol- 
ogist must have in evaluating his test equip- 
ment. A physician might find himself at a dis- 
advantage in using the book because of the 
knowledge of psychologic literature, personali- 
ties and technics for which it calls. There is an 
extensive introduction, which is interesting be- 
cause it evaluates the opinions of reviewers, 
editors, test publishers and others relative to 
the validity of the reviews published in previous 
numbers. * There are extensive indexes cover- 
ing both subjects and names; all in all, to those. 
interested in mental testing, this is a most use- 
ful and comprehensive work. i 

J Consult Psychol 6:55-6 Ja-F '42. Warren 
G. Findley, When the 1938 Mental Measure- 
ments Yearbook was published, it was hailed 
almost unanimously by specialists in the field 
of mental testing as a monumental compilation 
of test criticisms that would prove of inestima- 
ble value to persons ranging from the beginner 
in the use of standardized tests to the specialist 
in test development. That publication has cer- 
tainly earned its place in the libraries of test 
specialists. And now one can announce that the 
1940 Yearbook is in virtually all respects a 
bigger and better publication, which should 
more adequately fulfill the promise of the earlier 
publication to be generally useful to the great 
body of test users. The 1940 Yearbook contains 
the critical opinions of 250 reviewers, compared 
with 133 who contributed to the 1938 issue. 
This group includes the great majority of the 
contributors to the previous volume and an 
added number of similarly competent critics. 
The number of test reviews has been more than 
doubled— 503 original reviews cover more than 
200 tests—and the reviewers have been allowed 
700 words or more in which to express judg- 
ment on a test. Furthermore, whereas the 1938 
Yearbook devoted 55 per cent of its space to 
books on testing, the 1940 Yearbook gives over 
7o per cent of its space to criticism of tests. 
Most significant, the 1940 Yearbook includes 
critical reviews of old tests currently in wide 
use as well as reviews of new tests. It thereby 
presents in convenient juxtaposition comments 
on the related tests to which a beginner, or 
one who has relied overlong on a single test or 
battery, may be referred to make a choice. A 
new feature is inclusion of a bibliography for 
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each test listed, such bibliographies running to 
as many as 134 titles in the case of the Revised 
Stanford-Binet Scale and 147 titles in connec- 
tion with the Rorschach Method. The 17940 
Mental Measurements Yearbook will be valu- 
able to members of all Sections of the A.A.A.P.: 
Clinical, Consulting, Educational, and Indus- 
trial. It should be noted that the term "mental" 
is given a general meaning and includes tests of 
intelligence, aptitude, achievement, personality, 
attitude, interest, etc. Discussion of individual 
and group intelligence tests will interest all. 
Clinical and consulting psychologists will wel- 
come the bibliography on the Rorschach Method 
but can only hope that the 1942 Yearbook will 
"contain critical reviews of it. Business and in- 
dustrial psychologists will be especially inter- 
ested in the 40 pages devoted to vocational tests 
and inventories. Educational tests are heavily 
represented, as is to be expected in view of 
their preponderance among test offerings. The 
classification of these educational tests under 
broad subject-matter headings will aid educa- 
tional psychologists and schoolmen in consider- 
ing related tests. The reviews of books on 
measurement continue to have the virtues of 
such reviews in earlier yearbooks: complete 
Coverage of current literature of the field with 
fine editing of review excerpts. The section on 
research and statistical methodology books, 
which occupied approximately one hundred 
pages in the 1938 Yearbook and which was 
reprinted separately, has been omitted from this 
yearbook and is published separately as The 
Second Yearbook of Research and Statistical 
Methodology. The format of the r940 Year- 
book is exceptionally good. The typography is 
of a high grade: Pages seven and one-half 
inches by ten and qne-half inches printed in 
double columns are in accord with best recom- 
mendations for speed, accuracy and ease of 
reading ; ten-point type throughout justifies the 
space it takes by the ease of reading it affords; 
very clean type makes footnotes highly legible. 
The volume gives every evidence of careful 
proof-reading. The organization and indexing 
of material, of supreme importance in such a 
reference work, are logical and clear, The Men- 
tal Measurements Yearbook series, now unfold- 
ing, holds a unique place in the literature of 
social measurement. The practice of having 
several similarly competent professional per- 
sons offer judgment on the same instrument for 
joint publication has merit which commends its 


extension to other areas. Not only is the reader 
enabled to compare authorities, but the authori- 
ties themselves are called upon to make careful 
contributions by the threat of comparison, “for 
with what judgment ye judge, ye shall be 
judged.” One may look forward to future bien- 
nial issues of the Mental Measurements Year- 
book with satisfaction at the prospect of illumi- 
nating and up-to-date critiques of the new 
instruments of measurement and evaluation now 
crowding forward, especially in the educational 
field. As a footnote the reviewer ventures the 
suggestion that the 1940 Yearbook offers to 
teachers of tests and measurements in our 
training institutions course material comparable 
to selected readings in other subjects. It pro- 
vides the basis for organizing a course about 
specific testing instruments as distinguished 
from the lecture-exposition approach. 

J Ed (London) 73:322 Jl ’41. Research 
workers engaged in mental measurement may 
well be deeply indebted to Prof. Buros for this 
excellent bibliography and review of recently 
published test material. Its scope is wider than 
its title suggests. * The task of keeping abreast 
of the times in this fast-developing field is in- 
deed a difficult one. The size of the book indi- 
cates the great wealth and variety of the mate- 
tial, and it is evidence, too, of the thorough, 
complete, and comprehensive nature of the in- 
formation supplied. The classification of the 
tests, the index and page headings are designed 
to simplify reference, and the number system 
employed facilitates cross-reference to previous 
Yearbooks. The book is indispensable as a guide 
to the mental tester. This is its primary func- 
tion. At the same time it tends to restrict the 
publication of tests which have been insuffi- 
ciently standardized. The imposing list of 250 
co-operative reviewers includes the names of 
many well-known authorities and famous pio- 
neers in mental measurement. The reviews are, 
in the main, frank opinions expressed by expert 
test-technicians, psychologists, and subject- 
matter specialists. The information supplied is 
of greatest value when more than one review 
is given for each test. A fair evaluation of a test 
may then be made, and the reviews themselves 
often gain an intrinsic importance when con- 
troversial issues are considered. The reviews of 
The Experimental Edition of Thurstone’s Tests 
for Primary Mental Abilities are, for example, 
of great interest. It is difficult to overestimate 
the value of this volume to research depart- 
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ments. Its use to teachers in England is, how- 
ever, somewhat limited, for most of the more 
adequately reviewed tests are American publi- 
cations, while many of the tests which are read- 
ily accessible are either merely listed or are 
accompanied by only one review. The fact re- 
mains that the editor has rendered a great serv- 
ice to the mental tester. The preparation of the 
volume has obviously entailed a tremendous 
amount of careful and painstaking work, and 
one has only praise and the highest admiration 
for the editor and the efficient way in which he 
has accomplished a difficult task. 

J Ed Res 35:50-1 S ’41. Carter V. Good. 
The decade of the nineteen thirties is charac- 
terized by the development of major guides to 
the literature and the printed materials of edu- 
cation and by large appraisal of evaluative 
projects. Certain of these guides are more than 
an index to or a summary of the literature, in 
that they attempt an appraisal and synthesis of 
the studies reviewed; for example, the Ency- 
clopedia of Educational Research and the Re- 
view of Educational Research. Among the major 
evaluative projects are the Cooperative Study 
of Secondary School Standards and the Eight- 
year Study on the Relation of School and Col- 
lege of the Progressive Education Association. 
The 1940 Mental Measurements Yearbook 
represents both of these movements of the past 
decade. * It is granted that evaluation of a test 
by one or more reviewers, however competent, 
is not an infallible appraisal. Even though sub- 
jective elements may have entered into the 
analyses prepared by some of the specialists, 
the effect of the series of yearbooks has been to 
render both authors and publishers more cau- 
tious in the preparation and distribution of 
standard tests, an area of activity virtually un- 
challenged by critical appraisal before the ap- 
pearance of the measurement yearbooks. One 
must express admiration for the discriminating 
editorial judgment, courage, and perseverance 
essential to the success of so large a venture, 
with its problems of cooperative authorship, 
editing, financial support, and the emotionalized 
attitudes of some test authors and publishers. 

J Mental Sci 88:144 Ja ’42. G. W. T. H. 
Fleming. This large volume is crammed with 
useful information on mental testing of all kinds, 
achievement, character and personality, intelli- 
gence, vocations, together with chapters on test- 
ing in mathematics, science, languages, social 
studies, reading, and fine arts. There is a sepa- 
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rate section quoting reviews of psychological 
books from the various psychological journals. 
This is a most useful section of the book, for 
here we have gathered together really sound 
opinions (in many cases giving the author of 
the opinion) on a large variety of books. We 
can easily find out what a variety of different 
schools of thought think of any particular book. 
This excellent book should be in the possession 
of everyone dealing with mental testing in any 
shape or form. 

J Nerv & Mental Dis 93:697-8 My '41. * a 
work of marked usefulness * the editor is to be 
congratulated on his continued efforts to keep 
abreast of the intrepid expansion in this A 
abstract and specialized approach to huma: 
problems. 

Jun Col J 11:477 Ap ’41. * Buros’ modest 
little pamphlet of 44 pages, Educational, Psy- 
chological, and. Personality Tests of 1933 and 
1934, published in 1935 has grown into this 
comprehensive reference work of almost 700 
large, well-printed pages. Two hundred and 
fifty psychologists, subject-matter specialists, 
teachers, and test technicians have cooperated 
by contributing frankly critical reviews of stand- 
ard tests. Their outspoken, penetrating evalua- 
tions will help test users to make more discrimi- 
nating selections from among the hundreds of 
tests which are flooding the market. 

Librarian 30:180 My ’41. There are few 
books of more importance to the practical psy- 
chologist than the Mental Measurements Year- 
book. * It is difficult in the space of a short 
review of this character to give any comprehen- 
sive idea of the value or scope of the work, at 
least to be of value to librarians. They will have 
to accept the statement that if the library is suf- 
ficiently important they owe it to their readers 
to make it available to them. The index acts as 
an open sesame to the body of the work, The 
records and bibliographies noticed and reviewed 
are not confined to America only, but cover 
England and other countries throughout the 
world. * an elaborate bibliography * 

Loyola Ed Digest 16:10 My "41. Austin G. 
Schmidt. The reviewer, having had the pleasure 
of cooperating in a humble way with....Buros 
when the plan of publishing this yearbook on 
measurement was initiated, feels something akin 
to family pride in seeing the work grow each 
year into something more substantial, more in- 
clusive, and more helpful to the profession. * 
The material is well classified and well indexed. 
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From every point of view the yearbook is an 
invaluable contribution. The one limitation from 
which the yearbook suffers is an inevitable 
one. Useful though it may be to have at hand 
information concerning the title, purpose, and 
internal organization of a new test, the one 
thing that is of paramount importance is to be 
able to form an estimate of its validity. But there 
is no quick way of determining the validity of 
a test, particularly of a test that attempts to 
measure the subtler mental functions. Intelli- 
gence tests were highly developed years before 
any serious study was made of the measurement 
of personality traits, yet even today we are un- 
certain how inclusively and validly our present 
tests of intelligence achieve their purpose. There 
are probably not a hundred men alive who are 
thoroughly competent to criticize the technics 
and statistical procedures of such workers as 
Spearman, Thorndike, Kelley, and Thurstone. 
Scarcely have we settled back into some sort of 
satisfaction with the instruments we are using 
when some new research study is published 
casting doubt upon their validity. The chief re- 
sponsibility of the experts who collaborated 
with Dr. Buros was to criticize from the stand- 
point of validity the tests submitted to them. To 
do this satisfactorily would require research ex- 
tending over months or even years. Some of the 
reviewers did indeed undertake time-consuming 
studies of the tests they were to examine, as 
when Irving Lorge made a Hotelling compo- 
nent analysis of the intercorrelations of the 
Detroit Clerical Aptitudes Examination ( p. 
433). In other cases verdicts rendered are prob- 
ably no better than subjective opinions. In say- 
ing this we do not mean to imply that any of 
the reviews are valueless. The reviewers knew 
their fields, and to read their contributions is 
almost equivalent to taking a seminar in the 
latest developments in measurements. But it re- 
mains true that the great task challenging the 
makers and users of tests is that of devising 
better means of determining validity. 

Math Gazette 25:200 Jl *41. C. M. Fleming. 
For all who are interested in measurement— 
whether of intelligence, aptitude, or achieve- 
ment—this book is well worth purchasing. The 
series of yearbooks of which it is the latest issue 
originated in a realisation in 1934 of the amaz- 
ing number of tests being published each year 
and the paucity of information available about 
most of them. Material too often has been widely 
advertised by publishers for authors who have 


omitted to indicate the size or distribution of 
the population tested and the validity and reli- 
ability of the scores obtained. This yearbook, 
with its comparable predecessor of 1938, should 


do much to secure the suppression of such in- - 


adequately prepared compilations. * Cross- 
references to other reviewers are most compe- 
tent, and details of fact have been most accu- 
rately checked. (A detailed index of subjects 
would have increased still further the accessi- 
bility of the information supplied.) The useful- 
ness of the book as a whole is very great, and 
its personal interest is considerable to readers 
with some knowledge of the publications of the 
contributory reviewers. The sections on mathe- 
matics are of particular interest, especially to 
those who are sometimes tempted to minimise 
the contribution which standardised testing has 
made to the measurement of aptitude or achieve- 
ment in geometry, trigonometry and algebra. 

Med Digest 10:43 F ’42. This is an out- 
standing example of what can be done in the 
way of bringing together within the scope of 
one Book a large volume of useful information 
of various types of educational, psychological 
and personality tests. From a small beginning 
«this edition has reached an enormous size... 
which has been only possible by the genius of 
the author and his 250 specialist co-operating 
reviewers. The work contains a mine of infor- 
mation; especially the Books and reviews sec- 
tion has been most carefully and comprehen- 
sively prepared. Being a notable contribution 
on the subject, the Book should find a certain 
place in libraries of all psychological and re- 
search centres, 

Med Times 69:238 My "41. Frederick L. 
Patry. This vade mecum of information con- 
cerning current psychological tests is indispen- 
sable to all those working in the field of applied 
psychology. * a “must” book for those inter- 
ested in this field. The book is well documented 
with pertinent references and indices of titles, 
publication and periodical directory. Various 
size type is utilized which makes for ready 
reference and clarification. The format is at- 
tractive and the construction sufficiently sturdy 
to withstand the heavy use which this type of 
publication deserves. 

Munic Ref Lib Notes 27:38 My r4 ’4r. * 
The Yearbook has already proved itself to be 
indispensable to the psychologist, the personnel 
expert, [and] the educator as a reliable guide 
to the quantities of tests currently appearing 
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about which little information is available in 
any other publication. The list of contributing 
reviewers includes names of the outstanding 
writers in the various fields of mental measure- 
ment and gives immediate indication of the high 
quality of the work. Extensive bibliographies 
supply additional references to literature on the 
subjects of the various tests. * The entire work 
gives evidence of the painstaking care taken by 
Mr. Buros and his helpers for accuracy and 
consistency in assembling and editing this di- 
gest. The librarian will appreciate the inclusion 
of a more complete table of contents with gen- 
eral subject headings which has eliminated re- 
course to the classification key used in the 1938 
edition. 

Nat Math Mag 15:264-6 F '41. Kenneth L. 
Heaton. * an attractively bound volume * This 
Vearbook seems superior to other volumes in 
the series because it has a greater number of 
reviews, because an effort has been made to 
have these reviews written by critics who differ 
in their opinions of each test, and because the 
reviews are longer and give a more complete 
analysis than was possible in previous editions. 
The volume gives evidence of the same thor- 
oughness and painstaking efforts which have 
characterized the work of....Buros in the past. 
It is no less severe in its criticism of tests than 
was the last edition which caused much con- 
demnation to be directed toward the editor. 
We are pleased to observe this adherence to 
frank criticism; for it is the one characteristic 
which would seem most necessary if the volume 
is to serve its declared purpose. In the selec- 
tion of reviewers, every effort has apparently 
been made to avoid bias,—either bias due to 
personal animosity or to strong personal friend- 
ship toward the authors of tests and publica- 
tions. * Buros has made an important contribu- 
tion to professional literature. 

Occupations 19:624 My "41. H(arry) D. 
K(itson). * Particular attention is paid....to the 
reports concerning reliability and validity of 
the tests listed. The frankness with which the 
reviewers point out weaknesses in this respect 
is refreshing. * By presenting these frank and 
penetrating appraisals every two years, com- 
piler Buros will in time, it is hoped, achieve his 
aims of impelling “authors and publishers to 
place fewer but better tests on the market"; 
and impressing "test users with the desirability 
of suspecting all standard tests—even though 
prepared by well-known authorities—unaccom- 


panied by detailed data on their construction, 
validation, use and limitations." 

Personnel Adm 3:12 Ja'41. | 

Prison World 7:31 Ja-F *45. ~ 

Psychiatry 4:633-4 N ^41. E(rnest) E. 
H(adley). * [In The 1938 Yearbook| Buros 
asked for criticisms looking toward the improve- 
ment of the next yearbook. The Nineteen Forty 
Mental Measurements Y earbook is an outstand- 
ing example of how one may profit from such 
criticism. The statement was made in Psychi- 
atry (1939) 2:142 that the 1938 yearbook was 
"comprehensive in scope and unbiased in treat- 
ment." This statement can now be repeated in 
the light of an increase in size, larger type, ex- 
cerpts from an additional number of periodicals, 
almost double the number of coóperating test 
reviews, longer test reviews, a larger number 
of books in closely related fields and a host of 
other improvements that should be seen to be 
believed. * There is an excellent periodical 
directory and index, a publishers directory and 
index, and index of titles and an index of 
names. A prodigious amount of labor—one 
might well say a labor of love—has gone. into 
the making of a fine book, providing an un- 
usually valuable service. Editor Buros is to be 
congratulated as are all who participated with 
him in its compilation—indeed as all are to be 
congratulated who have the good sense to use 
The Mental Measurements Yearbook. It is pro- 
foundly to be hoped that this work may be con- 
tinued. 

Psychoanalytic Q 11:437-8 Jl '42. Michael 
B. Dunn. The 1938 Yearbook....was reviewed 
in this QuamrERLY by this reviewer. At that 
time the essential orientation of the reviewer 
was towards a psychology that had meaning and 
usefulness in a normal world, a world without 
the cataclysmic aspects of total war. Conse- 
quently the use of tests was minimized and their 
limited predictive value emphasized. In total 
war with its ‘overnight’ need to mobilize mil- 
lions of service men and industrial workers, 
this testing tool suddenly acquires tremendous 
magnitude. The individual personality, with its 
subtle and yet most dynamic meaning, must of 
necessity be sidetracked in a mobilization of the 
masses. It was this same need in the World 
War of 1917 that called forth the resources of 
the psychologist, and the Army Alpha test was 
horn. After the war, testing acquired an impe- 
tus and the psychologist seeking a more direct 
relationship to human behavior, because of his 
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previous absorption with the sterility of the 
‘memory drum laboratory’ technique, now felt 
he had thesanswer. So everything was tested 
with the almost daily introduction of more and 
more tests. We suffered from an overdose of 
tests but fortunately, with the revolt stimulated 
by the gestalt school and psychoanalysis, the 
significance of the total personality began to 
slowly return to the psychologists. Now with 
the new World War and the need for and use 
of tests, we are apt to suffer again, especially 
after the war. It is, however, books like [this]... 
that can help us in demanding good standardi- 
zation and validation with a healthy awareness 
of the limitation of the specific tool. * Of con- 
siderable usefulness are the reviews of over 
three hundred books and monographs in the 
psychological field. A directory of journals, 
periodicals, names of publishers, all well in- 
dexed, makes for a comprehensive and timely 
bibliographical usefulness for all those inter- 
ested in mental tests and measurements. The 
1940 Yearbook is a great improvement on the 
1938 Yearbook and is especially more helpful 
to those interested in the total personality. The 
addition of the thorough handling of the Ror- 
schach test with its complete bibliography is 
indicative of this improvement. The book is a 
necessary addition to the library of every tester. 
Psychol B 38:764-7 O "41. J. B. Stroud. * 
Quite frequently, reviews that have appeared in 
psychological and educational journals are re- 
produced, in whole or in part, in addition to 
those written especially for the volume. The 
editor was quite liberal in space allowances. Re- 
views of from 400 to 500 words are common; 
some are considerably longer. The reviews are 
the work of 250 individuals representing vari- 
ous areas of specialization. Approximately 75 
of these are professors of education, adminis- 
trators, and elementary and secondary teachers; 
the various fields of psychology— general, clini- 
cal, child, educational—are represented by a 
like number of reviewers; approximately 40 
represent the various academic fields, exclusive 
of professors of English, mathematics, and the 
like, who are included in the education group; 
some 35 or 40 of the reviewers are statisticians, 
test specialists, directors of bureaus of measure- 
ments, and so on. Among the reviewers are 16 
from foreign countries. The selection of review- 
ers for the specific assignments seems to have 
been especially appropriate. For example, H. E. 
Garrett, T. L. Kelley, C. Spearman, Godfrey 
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Thomson, and R. C. Tryon appraise the Tests 
for Primary Mental Abilities; Nancy Bayley, 
B. M. Castner, Florence Goodenough, and 
Florence Teagarden, the Merrill-Palmer Scale 
of Mental Tests. * Lists of references frequently 
follow the titles and descriptive matter of the 
tests. Some of the references treat research con- 
ducted in the interest of validation and use of 
the tests in question; others are more general 
in character and may be said to supply infor- 
mation about tests of the kind under treatment, 
With respect to some of the best-known and 
most widely used tests the list of references is 
quite long; for example, for the Revised Stan- 
ford-Binet Scale 134 titles are given. * Of all 
the features of the volume the book reviews ap- 
pear to hold the least promise of usefulness. 
The volume purports to be up-to-date as of Oc- 
tober, 1940. It gives fairly comprehensive cov- 
erage of the tests published since the appear- 
ance of The 1938 Yearbook. An attempt is 
made to include a representative sampling of 
tests published prior to 1938. A good many 
tests reviewed in previous yearbooks are re- 
viewed anew. Although it was probably not 
feasible even to list by title all of the older tests, 
the writer feels that a place should have been 
made for the historic Army Alpha and for some 
other well-known tests, even at the expense of 
deleting some of the 147 references to articles 
on the Rorschach test. The editor tells us that 
the tests omitted will find their way into subse- 
quent yearbooks. Something could be said for 
the reviewing of the best of the older tests 
rather than the reviewing of a cross-section of 
them. Perhaps it is the editor's thought that the 
practice of having all of the tests reviewed ulti- 
mately, and ideally several times, in various 
Yearbooks will serve the purpose of selection. 
This is not too promising an outcome, since the 
various tests will be evaluated by persons hav- 
ing different standards and different philoso- 
phies. When A appraises test m and B test n, 
the result is a comparison of the reviewers as 
well as a comparison of the tests. * A subject 
index would have added somewhat to the con- 
venience of the user of the volume. One inter- 
ested in performance tests or tests for the deaf, 
for example, must look through the entire sec- 
tion on intelligence tests in order to round them 
up. A work sò ambitious and so well conceived 
will surely find several kinds of usefulness. The 
most obvious kind, and the one most frequently 
claimed for it in the preface and introduction, is 
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its service to test users in the selection of tests. 
Indeed, the work is styled a codperative service 
to test users. This objective will be realized in 
a comparatively small measure if the test user 
consults the volume only to ascertain what the 
reviewers have to say about the particular test 
that he may contemplate using. It should prove 
to be more serviceable to those who peruse all 
the reviews of tests of a given variety with the 
view to gaining a perspective of the tests of 
that variety, as appraised by test experts, edu- 
cators, and academicians. A given test may then 
be evaluated by comparison with other avail- 
able tests, as reviewed. The test user should 
evaluate each review in the light of the kind 
of information the reviewer is best qualified to 
give, consulting the test expert on matters of 
sampling, construction, compilation of norms, 
etc., and the subject-matter specialists—profes- 
sors and classroom teachers—for information 
on the adequacy with which a test covers the 
recognized content of the courses for which it 
is designed. The writer ventures to predict that 
the volume will be of greater service to gradu- 
ate students of the various branches of mental 
measurement and to the test makers themselves 
than to those who purchase and administer 
tests. Any very useful statement about the qual- 
ity of the reviews seems to be out of the ques- 
tion, since it would require careful scrutiny of 
the tests themselves—to say nothing of its be- 
ing presumptuous. Some who peruse the volume 
may feel—and justifiably so—that a consider- 
able proportion of the reviews are hypercritical 
as well as "frankly critical”—a phrase which the 
editor uses as often as there is the slightest 
occasion to do so. On the other hand, it is un- 
likely that anyone, unless he be an avowed 
enemy of mental measurement, will criticize 
the volume for its lack of critical quality. The 
“reviewer’s candid opinion of a test’s merits 
and limitations” was asked for and apparently, 
received. Whether or not reviews a bit more 
descriptive and a bit less opinionated would 
have been generally more serviceable is a ques- 
tion that may well arise. At any rate, there is 
a minimum of description. 

Pub Personnel R 2:155-6 Ap '41. Thelma 
Hunt. * The Nineteen Forty Mental Measure- 
ments Yearbook continues in an excellent way 
the invaluable service to test constructors and 
test users initiated by Buros’ earlier edition. 
The book should stand as a constant guide to 
all users of published tests, especially those who 


themselves have not time to evaluate by per- 
sonal searching of the literature and personal 
experimentation all of the tests they use. It 
should afford a measure of comparative evalua- 
tion of the several tests from which one may 
make his selection in many problems for which 
tests must be chosen. Furthermore, Buros’ vol- 
umes have the possibility of another great con- 
tribution to the field of tests and measurements 
—that of reducing the number of unreliable and 
unvalidated tests on the market, and of deter- 
ring uncritical authors and publishers from 
rushing poor and unevaluated tests into publi- 
cation. Two hundred and fifty reviewers have 
evaluated the tests. Among these are many of 
the most prominent workers in the field of tests 
and measurements, It might be said that some 
of the reviewers are relatively unknown in the 
field of tests, and that their competence in judg- 
ing the tests is questionable. Most of these less 
widely known reviewers, however, will be found 
to be rather constant users of tests, and Buros 
is to be commended for securing an adequate 
representation of test administrators as well as 
test technicians and constructors in his group 
of reviewers. The critical reviews of the tests 
vary greatly. They, of course, reflect the stand- 
point from which the reviewer considers the 
test—as constructor, as one versed in theoreti- 
cal aspects of measurement, or as user apply- 
ing the test in the solving of practical problems. 
Some of the reviews represent painstaking and 
detailed analyses of the tests, others are nearer 
general descriptions of the tests. However, 
since the more important tests usually have two 
or more reviewers, altogether the reader is 
likely to get adequate consideration and repre- 
sentative viewpoints with regard to the tests. 
The book is well arranged, well classified, and 
adequately cross-referenced. Its mechanical 
make-up is such as to favor easy reading and 
ready location of any desired portion of the con- 
tents. To the public personnel officer or ad- 
ministrator, the information about and reviews 
of tests under the classifications of Intelligence 
Tests and Vocational Tests will probably be 
most useful. These are the types of published 
tests for which he is most likely to have need. 
The personnel administrators might even ask 
that the section on Vocational Tests be ex- 
panded further. He might also ask that more 
public personnel administrators be included 
among the reviewers of those tests now being and 
likely to be used by him. To the personnel ad- 
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ministrator the Yearbook can be recommended 
as an excellent reference book for his library. 
It will be found useful whether he wishes to 
check up on published tests he is using, to 
select from available tests instruments for a new 
testing program, or to familiarize himself with 
developments and opinion in the field prepara- 
tory to construction of his own tests. Buros is 
to be commended for undertaking this second 
edition of his Yearbook, even in the face of dif- 
ficulties of financial support. It is to be hoped 
that he will find it possible to make his year- 
book a biennial one. 

Q R Biol 16:382 S '41. This book repre- 
sents a very complete and successful attempt 
by 250 of the leading psychologists, teachers, 
and test technicians to review the huge numbers 
of tests available for measuring various human 
qualities and abilities. * [the excerpts from book 
reviews] are instructive and very interesting as 
the opinions are often sharply conflicting * 
there is an excellent bibliography concerning 
the Rorschach test but nothing concerning the 
test itself. One also misses reference to Mur- 
ray's thematic perception test, a test which is 
coming more and more into clinical use. 

Sch 29:678 Mr '41. J(ohn) A. L(ong). The 
series of which this yearbook is the newest 
member was initiated in 1935 with a modest 
paper-bound volume entitled Educational, Psy- 
chological, and Personality Tests of 1933 and 
1934. Succeeding volumes became progressively 
more ambitious, until we now have a mental 
measurements yearbook of encyclopedic propor- 
tions. The pages measure 774 by 10% inches 
and are double-columned: the material would 
probably occupy 2000 pages of conventional 
size and arrangement. * This reviewer is of 
the opinion that the belief expressed in the an- 
nouncement by the publishers that this volume 
"will prove to be one of the most significant 
works published in the fields of education and 
psychology in recent years" is amply justified. 
Buros is performing a valuable service which 
is nowhere else provided; anyone who has 
much to do with testing in education should 
consider this book an essential part of his equip- 
ment, and he should keep it in a handy place, 
for he will be referring to it constantly. 

Sch Executive 60:31 Ap '41. N. L. Engel- 
hardt, Jr. * a critical and analytical catalogue 
of standard educational tests * a volume of con- 
siderable value to all who are frequently con- 
fronted with problems of test selection. The 


analyses, written by experts, are in the main 
clear, concise, and reasonably comprehensive. 
Strengths and weaknesses of tests are frequently 
pointed out with much frankness and impar- 
tiality. In addition to these reviews there are 
frequent references to reviews in other publi- 
cations to which one may refer for further in- 
formation and guidance. Test users will find 
that the use of this volume as a reference book 
will aid in locating, evaluating, and comparing 
tests. Test makers will also find this volume of 
much help, partly because of a rather marked 
critical attitude on the part of reviewers. Many 
of the analyses can be summed up in the words 
of one reviewer who states: "If one has to use 
a standard test, I see considerable merit in this 
one, but I believe it should be revamped.” This 
volume should offer workers in the field desir- 
able encouragement for test development, revi- 
sion, and thorough analysis. The editor and his 
collaborators have done an enormous task well. 
The results should appeal to a wide group of 
educators, 

Sch R 49:314-5 Ap '41. Nelson B. Henry. 
* the repeated preparation of this work over a 
period of years has resulted in several changes 
that have enhanced the value of the publication. 
Perhaps the most important changes are those 
which involve the altered logical grouping of 
the test material and better reviews by both the 
earlier and the new reviewers. * A limited 
analysis of these reviewers will aid the reader 
in judging the merits of the book. About 40 per 
cent of the reviewers hold professional positions 
of one kind or another in education in colleges 
or universities. Another 20 per cent are pro- 
fessional psychologists or professors of psychol- 
ogy. About 25 per cent are professors or in- 
structors in content subjects, and the remaining 
15 per cent are largely administrators. Approxi- 
mately 45 per cent of these specialists work at 
miscellaneous small colleges and universities. 
Another to per cent work in specialized insti- 
tutes other than teaching institutions. Five per 
cent are connected with foreign institutions, 
chiefly in Britain. The larger universities most 
adequately represented are Chicago, Ohio, Min- 
nesota, Columbia, Stanford, California, Iowa, 
and Yale, somewhat in that order. It is obvious, 
however, that the average proportion of men 
even in these frequently represented universi- 
ties is about 5 per cent for each institution. A 
rather large proportion of the space is devoted 
to a series of useful indexes. These are a peri- 
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odical directory, a publishers’ directory, an index 
of titles, and an index of names. Consequently 
the volume provides a handy reference book for 
the test-user. It is not without significance that 
this work is published by a co-operative, non- 
profit service for test-users. The editor is en- 
titled to much commendation for his work in 
supervising the cataloguing of tests, selecting 
and counseling the reviewers, and planning the 
convenient and effective presentation of this 
valuable reference material. 

Teach Col J 13:23-4 S "41. E. L. Abell. * 
This volume should be in every school office and 
in every library always available to those who 
are interested in tests. * 

Teach Col J (Manila) 3:59-61 JI-S "4r. 
Pedro T. Orata. The year 1941 will long be 
remembered in the annals of educational re- 
search in the United States, first for the publi- 
cation of the Nineteen Forty Mental Measure- 
ments Yearbook and second for the appearance 
for the first time in educational history of an 
Encyclopedia of Educational Research. These 
two volumes supplement each other, the Year- 
book making available to research workers 
everywhere descriptions and critical evaluations 
of one of the major instruments of educational 
research, namely, educational tests and scales; 
and the Encyclopedia indicating the present 
status and, in some instances, the trend of edu- 
cational research in America. * The immensity 
of the content of the Yearbook is evidenced not 
only by its size, but more particularly by the 
variety of interests represented by the "two 
hundred and fifty psychologists, subject-matter 
specialists, teachers, and test technicians" who 
have cooperated in making this important vol- 
ume by contributing frankly and without equivo- 
cation critical reviews of more than five hun- 
dred standard tests and scales. Space will not 
permit a complete analysis of the test reviews, 
and the book reviews have been completely left 
out. Detailed and quite frank, thorough analysis 
was made of 159 of the test reviews under four 
of the main headings: Achievement Batteries. 
Character and Personality, Social Studies, and 
Vocations. Each of the 159 test reviews was 
read three times, in order, first, to determine 
the general attitude of the reviewer toward the 
test being reviewed; second, to analyze the re- 
view itself and classify it under the specific 
categories indicated below ; and third, to check 
the tabulation for errors of fact and interpreta- 
tion. For the purpose of this review the test re- 


views may be classified under four headings, as 
follows: 1. Emphasizes functional or "true" 
validity, criticizes test for success or failure to 
measure ultimate educational objectives; gives 
specific instances in which the test succeeds or 
fails in measuring worthwhile results of in- 
struction ; and in general subordinates the tech- 
niques of test construction and mechanics of 
administration, scoring, and tabulation of scores 
to the higher values that the test and testing 
should engender in the pupils and in those who 
use it. 2. Takes considerable space in describ- 
ing the scope and use of the test, criticizes the 
test mainly for success or failure to meet the 
requirements of statistical validity and reliability, 
evaluates it on the basis of commonly accepted 
techniques of test construction, and in general 
assumes functional validity or subordinates it to 
formal content and make-up of the test. 3. In- 
dicates in general the scope and use of the test 
and evaluates it mainly from the point of view 
of its success or failure to meet the mechanical 
requirements of efficiency in scoring, administra- 
tion, and tabulation of test results. Puts a pre- 
mium on the mechanical aspects of the test to the 
neglect of other and more basic considerations. 
4. Comments on the test in general and in vague 
terms ; obviously superficial, uncritical, and lack- 
ing in value as a basis for evaluating or selecting 
tests. * assuming that the foregoing analysis has 
some validity, it seems encouraging to note that 
as great a percentage of the test reviews as do 
emphasize "functional validity" more than any 
other feature of testing and that as few test re- 
viewers as were superficial in their discussion or 
put a premium on the mechanics of giving and 
scoring the tests and tabulating the test results, 
It would be enlightening to compare the quality 
of the test reviews in the present Yearbook with 
that of the test reviews in preceding ones. It 
would seem proper to recommend at this time 
that in future editions the three elements of | 
functional validity of the tests; techniques of 

construction; and mechanics of administration, 

scoring, and tabulation be stressed for each re- 

view. Unless the test is a valid measure of the 

ultimate objectives of instruction no amount of 

refinement in the technique of construction and 

administration would render the test good. On 

the other hand, a perfectly valid test of educa- 

tional attainment may become worse than use- 

less as a research instrument or as an instru- 
ment to measure educational progress unless 
constructed with due regard to the require- 
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ments of scientific test making. It is hoped that 
as a result of such critical reviews as are in- 
cluded in the Yearbook future tests will be better 
and that premature publication of "experi- 
mental tests" will be prevented. As an instru- 
ment to improve educational research, the Year- 
book cannot be over-rated. A similar yearbook 
dealing with techniques of research should be 
made available to research workers in education, 
psychology, and social work. Until and unless 
educational research becomes critical of its own 
instruments and methods of procedure it will 
fall very far short of attaining its cherished 
ambition to become a member of the scientific 
fraternity. 

Teaching 14:94-5 D ’41. This issue of the 
Mental Measurements Year Book is more com- 
prehensive than any of its predecessors, and it 
contains a mass of useful information concern- 
ing recent developments which experts can 
hardly afford to ignore. * The editor, for whose 
industry and enthusiasm we have the highest 
admiration, has enlisted the services of no less 
than two hundred and fifty reviewers and has 
quoted excerpts from nearly two hundred scien- 
tific journals, in which new books on testing 
were originally reviewed. All the tests of which 
details and criticisms are given are published 
in English and the great majority come from 
America; of the reviewers only sixteen repre- 
sent English-speaking countries other than the 
U.S.A. This number would have been increased 
but for the war, but it includes such well-known 
English psychologists as Spearman, Thomson 
and Wynn Jones. * Details of each test....are 
carefully listed and then follow the critical re- 
marks of one or more experts; these are in- 
tended to enable the reader to form a fair esti- 
mate of the reliability and usefulness of each 
test (or book). In cases where there are two 
or three, or even more reviews, written by per- 

. Sons of acknowledged competence and repre- 
senting different points of view, there should 
be little difficulty in making a discriminating 
selection of those tests which will best meet 
one’s requirements. Several of them have been 
examined and evaluated with great thorough- 
ness by persons whose scholarship and freedom 
from bias are above suspicion; and those who 
are tempted to make use of any standardized 
test should first endeavour to satisfy themselves 
that its construction and validation have been 
approved by persons who are well fitted to 
judge. Reviewers have been given a very free 


hand, and they seldom fail to stress the limita- 
tions (if any) of the tests which they examine. 
Indeed, some of the criticisms are so severe that 
one is tempted to wonder why certain tests 
wêre thought worthy of mention in an authori- 
tative work of this kind. In recent years the 
literature of testing has grown enormously, and 
even the keenest student of this subject must 
make a selection of the books which he should 
consult. It is true that many of the books which 
are dealt with in this volume are not readily 
available in India, but research workers and 
other students of educational psychology will 
be able to discover from the entries under 
‘Books and Reviews’ what volumes are likely 
to prove helpful to them. We heartily congratu- 
late Professor Buros on the thoroughness with 
which he has carried out a painstaking task and 
we recommend the inclusion of this valuable 
bibliographical work in all University and 
Training College libraries. We may add that in 
the future this ‘Year Book’ will appear biennially 
and that the next issue will be for 1942. 
Wilson Lib B 16:252-3 N41. Louise Shores. 
This fifth in a series of critical test bibliogra- 
phies dating back to 1933 is a tribute to the 
intelligence, bibliographic sense, subject knowl- 
edge and, above all, courage, of the editor in 
the face of a series of obstacles that would try 
even the hardiest of scholars. Librarians and 
especially officers of American foundations are 
urged to read the very human and humorous 
account of an editor’s tribulations in the intro- 
duction and to join this reviewer in encouraging 
him to continue an undertaking of such un- 
questioned high merit and usefulness to libraries 
and to scholarship. * The reviews have been 
prepared especially for the Yearbook by some 
250 specialists who were invited to write criti- 
cally in the most courageous meaning of that 
word. In fact, so critical are some of these re- 
views that the publishers’ angry and sometimes 
threatening letters excerpted in the introduction 
only serve to strengthen the user’s faith in the 
impartiality and vigor of this tool. The section 
“Books and Reviews” will especially appeal to 
librarians. * The present volume does not re- 
place the four preceding publications * most 
school, college, and public libraries will find an 
immediate and frequent use for the 1940 Men- 
tal Measurement Yearbook * Mental Measure- 
ments Yearbook is recommended to libraries as 
a basic reference tool and to the philanthropic 
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foundations of America as a research project 
worthy of generous aid. 

For reviews of earlier volumes in this se- 
ries, see 36:B45, 36:B46, 38:B325, 38:B326, 
40:B856, 40:B857, and 40:B858. 


[ 789 ] 
X*BunPre, Royat HunpLEsTON, Seven Quickly Ad- 
ministered Tests of Physical Capacity: And Their 
Use in Detecting Physical Incapacity for Motor 
Activity in Men and Boys. Columbia University, 
Teachers College, Contributions to Education, No. 
818. Jesse Feiring Williams, faculty sponsor. New 
York: Bureau of Publications, the College, 1940. Pp. 
vii, 151. $1.85. * (PA 15:2197) 


Ed Abstracts 6:80 Mr '41. Karl W. Book- 
walter. * 'The criteria for the selection of items 
and organization of the test are exemplary of 
good logic but are such as differentiate too 
crudely for practical value in some instances. 
The test has possibilities, however. 

Ed Res B 21:213-4 O 14 '42. Charles C. 
Cowell. * 'This study is an important contribu- 
tion in that it analyzes all past research with 
physical-capacity tests and produces a new one 
which should receive many trials because of its 
ease of administration. On the other hand, since 
physical capacity is obviously the function of a 
large number of variables, one can hardly ex- 
pect to find the answer with any too simple 
device. 

J Health & Phys Ed 12:288 Ap '41. * a 
worth-while contribution * 


[ 790 ] 
*Bunr, Cvnir. Intelligence and Fertility: The Ef- 
fect of the Differential Birthrate on Inborn Men- 
tal Characteristics. Eugenics Society, Occasional 
Papers on Eugenics, No. 2. London: Hamish Hamilton 
Medical Books, 1946. Pp. 43. Paper. 2s. * (PA 21 :4364) 


[ 791 ] 
*CanrER, GERALD CLAYTON. Student Personalities as 
Instructors See Them. Edited by H. H. Remmers. 
Purdue University, Division of Educational Refer- 
ence, Studies in Higher Education, [No.] 54. Lafayette, 
Ind.: the Division, November 1945. Pp. 43. Paper. 
$075. * (PA 20:3881) 
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XCanrER, Hanorp D. Vocational Interests and Job 
Orientation: A Ten-Year Review. Applied Psy- 
chology Monographs of the American Association for 
Applied Psychology, No. 2. Stanford, Calif. : Stanford 
University Press, May 1944. Pp. 85. Paper, litho- 
typed. * (London: Oxford University Press. 9s. 6d.) 
(PA 18:3616) For reviews, see 650. 


[ 793 ] 
Carter, Launor F., Eprror. Psychological Re- 
search on Navigator Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
ort No. 10, Washington, D. C.: Government Printing 
ffice, 1947. Pp. ix, 186. $1.25. * (PA 21:3708) 


J Consult Psychol 11:223 JAg ’47. * This 
well-printed and copiously illustrated report 
gives results of direct value to aviation, and 
suggests techniques that can be applied to other 
problems of selection and training. 


[ 794 ] 
*Carter, W. R. The Missouri College Aptitude 
Testing Program. University of Missouri Bulletin, 
Vol. 47, No. 24; Education Series, No. 40. Columbia, 
Mo.: the University, August 20, 1946. Pp. 7a. Paper. 
Gratis. * (PA 21:3260) 

[ 795 ] 
X*CarrELL, Psycue. The Measurement of Intelli- 
gence of Infants and Young Children. New York: 
Psychological Corporation, 1940. Pp. 274. $3.10, post- 
paid. * (PA 15:2397) For reviews, see 282. 

[ 796 ] 
*CarrELL, Raymonp B. Description and Measure- 
ment of Personality. Yonkers, N. Y.: World Book 
Co., 1946. Pp. xx, 602. $4.00. * (London: George G. 
Harrap & Co., Ltd., 1947. 15s.) (PA 21:501) 


Brit J Ed Psychol 17:118-9 Je’47. G(odfrey) 
T(homson), This is an almost unreadable book, 
exasperating in its verbosity, in its hair-split- 
ting, and in its clumsy terminology, like "eri- 
vironmental mould trait.” Yet it is perhaps 
important. Let me say what Professor Cattell 
has done. He has taken an enormous number 
of names of personality traits, boiled them down 
to thirty-five, and extracted twelve oblique or 
correlated personality factors from these. There 
are several parts to the book. He first discusses 
in great detail the nature of traits and their 
classification, distinguishing for example, sur- 
face traits and source traits, dynamic traits, 
ability traits and temperament traits, and so on; 
and in passing, with the aid of a “covariation 
chart” on page 97, he expresses all kinds of 
possible and impossible correlations between 
tests, between persons, and (what is new) be- 
tween “occasions,” totally disregarding, how- 
ever, the difficulty of metric or units. Indeed, 
throughout his book he lets no difficulties daunt 
him, if indeed he sees them. And that is perhaps 
a virtue in him. He is a Romantiker, and no 
Klassiker, he fairly bubbles with ideas without 
caring much whether they are practicable or 
even possible. Most of them are not. Secondly, 
he catalogues and classifies all the personality 
trait names, starting with Allport and Odbert’s 
list of 4,500 terms, which he reduces to about 
160 to form “a kind of Basic English for the 
complete description of personality” (p. 217). 
He adds a few more from other sources, mak- 
ing 171, which he then lists in a sort of vocabu- 
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lary on his pages 219-232. He then collates the 
results of fourteen researches, two of them by 
himself (pp. 242-3) and identifies the “princi- 
pal nuclear and phenomenal clusters and sec- 
tors" of the personality sphere, namely 130 phe- 
nomenal clusters, leading to the formation of 50 
nuclear clusters, falling into 20 personality sec- 
tors, but with the majority falling in the first 
ten sectors. “The roundness of these numbers" 
he hastily adds, “is purely fortuitous.” Pages 
246 to 267 form a chart of these. * Thirdly he 
describes, very inadequately, an experiment on 
208 men, divided into thirteen groups of six- 
teen men each (eight of them squads of sol- 
diers), the correlations found in each group be- 
ing averaged via Fisher's 2 function. Two raters 
from each group independently rated their fel- 


lows on the 35 “established clusters” (pp. . 


295-9) to which the original 171 basic traits 
have by this time been reduced. “The product 
moment correlation matrix thus worked out,” 
he says, “is shown in the Appendix Table 2” 
(p. 301). But there are no appendices at all in 
the book which has reached me! Nor is the 
table of correlations given in the paper to which 
he refers us in his next sentence (Amer. J. 
Psychol. 1945, 58, 69-90), where, he says, “the 
details of this particular centroid analysis have 
been sufficiently set out.” Indeed they have not. 
In that article neither data nor raw correlation 
coefficients are given, nor any particulars what- 
ever which might enable a reader to check the 
work. The table of loadings of the unrotated 
orthogonal centroid factors is not given, only 
a table of the “factor loadings after rotation for 
oblique simple structure”; and then a footnote 
calls this table “the factor-pattern above” and 
says that the factor structure is not set out, 
which leaves one in grave doubt what exactly 
the table is. All this is very wrong. However, 
taking his results thus entirely on trust, we 
learn that the analysis yielded eleven, twelve, 
or thirteen factors (the doubt arising from dis- 
agreement among various criteria for deciding 
when to stop), and that these were rotated into 
an oblique simple structure by Thurstone’s most 
recent methods or rather, into two acceptable 
simple structures, one (a) of twelve and an- 
other (B) of seven factors, of which (a) was 
preferred. These factors he describes on pp. 
313-337, towards the end of Chapter 9. In Chap- 
ter ro he makes a somewhat similar classifica- 
tion of data obtained by many workers by means 
of self-inventories (the above data having been 


ratings), and in Chapter 11 of data from objec- 
tive test measurements; but does not himself 
carry out any factorial analysis. The ‘factors’ 
listed in these two chapters are in some cases 
compared or partially identified with the above 
named twelve principal source-traits. To these 
he returns in Chapter 12, the last in the book, 
where he gives a fuller interpretation of them 
and confidently calls them “the established 
primary traits.” In his Table 28 he lists the 
clusters the ratings of which are to be pooled 
to estimate each factor; for example, factor D 
named Hypersensitive, infantile, sthenic emo- 
tionality versus phlegmatic frustration tolerance 
is to be estimated by pooling four clusters with 
equally long names, which for brevity I may 
here call the egotistic self-effacing, the hostile- 
kindly, the restless-calm, and the infantile- 


mature bipolar qualities. Cattell’s pages cer- 


tainly supply plenty of adjectives, and running 
my eye over his Table 17, I choose, as descrip- 
tive of his book, "restless, assertive, energetic, 
spirited, imaginative." It badly needs disciplin- 
ing, checking, and pruning. But I think it can- 
not be neglected. At the least, it is a mine of 
information and references. 

Clearing House 21:377-8 F '47. E(arl) R. 
G(abler). * The major emphasis....is given to 
the identification and measurement of person- 
ality traits. The distinction on the part of the 
author between "source" traits and surface 
traits is significant, particularly in regard to the 
effect it will have upon future personality stud- 
ies. In the identification of the personality traits 
listed the author made use of questionnaires, 
ratings, tests, and clinical procedure. This book 
is a necessity for advanced students in the field 
of personality study and for students who are 
working with the clinical and evaluative aspects 
of psychology. Guidance counselors, personnel 
experts, and psychiatrists will also find the book 
helpful in their work. 

Ed Res B 26:108-9 Ap 16 47. Ralph M. 
Stogdill. * This work is characterized by the 
careful definition of the different modalities of 
personality and the listing from various types 
of studies of the basic variables which are de- 
scriptive of these modalities. The logic behind 
the method of the study is fully and clearly de- 
veloped. A knowledge of factor analysis is 
essential to an understanding of the technical 
problems relating to methodology but is not 
required in understanding the clinical implica- 
tions of the findings. The final chapter will per- 
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haps be most satisfying to the clinician who is 
seeking to deepen his insight into the intercor- 
relation of personality traits. New light is re- 
vealed on a number of obscure points and ap- 
parent contradictions which have concerned 
clinicians for many years. A thorough study of 
the chapters on principles and methods is cer- 
tainly prerequisite to the planning of any new 
research in the measurement of personality 
traits. 

J Consult Psychol 11:153 My-Je '47. * The 
omission of projective methods as a source of 
data is perhaps suprising, but it is clearly the 
author's choice and not his oversight. Of the 
Rorschach he says, “It remains a mixture of ill- 
defined intentions, analogous to a patent medi- 
cine, devoid of clear-cut theoretical bases that 
would permit refinement through research." * 

Occupational Psychol 21:102-4 Ap '47. H. J. 
Eysenck. It is one of the strange vagaries of 
psychological history that experimental and 
statistical work on non-cognitive personality 
traits has lagged far behind work on intelli- 
gence, memory, visuo-spatial ability, and the 
other group-factors subsumed under the general 
cognitive factor. This is all the more remark- 
able as the foundations for such a study of the 
orectic parts of personality were well and truly 
laid, at the time of the 1914-1918 war, by 
Webb and Burt, in two independent factorial 
researches. Very occasional papers were pub- 
lished in the twenty years that followed, in the 
main by students of Professor Burt, but the 
output was negligible compared with the stream 
of competent and large-scale work on tests of 
intelligence. One of the main reasons for the 
difference in status between cognitive and orec- 
tic studies appears to have been the fact that 
workers in the former field had succeeded in 
isolating the dimensions along which measure- 
ment can proceed, while workers in the latter 
field had failed to do so. And while the success 
of the “intelligence tester” was largely due to 
his use of the statistical methods of factorial 
analysis, the failure of the “temperament tester” 
was largely due to his neglect of this technique. 
Consequently a large-scale attempt to make use 
of factorial techniques in the field of orexis was 
surely overdue, and Professor Cattell, in this 
book, presents the results of his own efforts in 
this direction. At the same time, he tries to 
integrate his own results with such relevant. 
work as has been done by others on self-ratings, 
behaviour-ratings, objective test measurements 


and clinical assessments. A book comprising 
over six hundred pages cannot easily be sum- 
marized, but it may be permissible to indicate 
first of all, those points on which it marks an 
important advance in the study of the subject; 
then, to put forward a few points, on which the 
reviewer does not entirely agree with Professor 
Cattell. In the first place, Cattell puts forward 
twelve main “source traits", derived from his 
own large-scale factorial work, which in his 
view embrace the major part of the "personality 
sphere"; in other words, he considers that per- 
sonality can be described fairly completely by 
reference to these twelve traits. He reinforces 
his argument powerfully by showing that a 
great deal of past work fits in very neatly with 
his scheme, and suggests numbers of objective 
tests which might be used for the measurement 
of these twelve major traits. This attempt to 
bring order into the present chaos is one which 
certainly marks a great advance. Cattell puts 
forward a clear-cut scheme; the details of this 
scheme may be right or wrong, but at least it is 
definite enough to be tested by other workers 
in the field, and to be improved where such im- 
provement may be called for. The second major 
contribution of this work is Cattell's suggested 
improvement on Thurstone's principle of rota- 
tion of factors. Cattell calls his own method that 
of "proportional profiles" or "simultaneous 
simple structure", and essentially it is an exten- 
sion of Thurstone's "simple structure" criterion 
of rotation to cover, not one matrix of intercorre- , 
lations, but rather to cover a whole set of ma- 
trices. The principle is too complicated to be de- 
scribed in detail, but while not yet advanced 
enough statistically to be used routinely in prac- 
tice, the reviewer has no doubt that there we 
have a novel idea which may contribute greatly 
towards the improvement of factorial methods. 
Thirdly, Cattell’s discussion of the nature of 
factors, and their relation to source traits, sur- 
face traits, clusters, and other concepts should 
do much to reduce the bewilderment with which 
many non-statistical readers look upon these 
products of the statistical psychologist’s fertile 
brain. He establishes fairly and squarely the 
need for the use of these methods, and shows 
clearly in what way their use may be considered 
to advance psychological knowledge. Fourthly, 
this book is a mine of information of such work 
as has been done in this field. Here it all ap- 
pears, neatly arranged, catalogued and dock- 
eted; this feature of the book alone will save 
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the research student countless hours of looking 
up esoteric references. Among those features of 
the book which the reviewer considers subject 
to criticism is the method of factorization 
adopted. Cattell uses the Thurstonian principle 
of rotation. The reviewer is doubtful of the 
possibility of deriving as many as twelve statis- 
tically significant factors from correlations based 
on two hundred subjects, as is permissible 
under Thurstone's system ; he, therefore, pre- 
fers Cattell's "beta solution", which results in 
only seven factors, to his "alpha solution", 
which results in twelve. However, this must be 
a matter of opinion; only the scientific fruitful- 
ness of either solution can really decide as to 
their value. The second criticism which might 
be put forward concerns the reliance which Cat- 
tell places on ratings. His analysis is based on 
the intercorrelations of ratings, made by vari- 
ous judges, of people well known to them. The 
danger cannot be gainsaid that correlations 
found between ratings may be due as much to 
presuppositions regarding the probable connec- 
tions existing between traits in the minds of the 
judges, as to actual connections between the 
traits in the behaviour of the subjects. While 
Cattell has taken precautions against this dan- 
ger, and while the support from other investi- 
gations quoted by him certainly reinforces his 
argument, the possibility remains that his fac- 
tors are adulterated to some extent by this ex- 
traneous factor. This deficiency has not escaped 
Cattell's attention, of course, and at present he 
is engaged on a large-scale programme of test- 
ing in which he hopes to find objective, experi- 
mental evidence for the personality factors put 
forward by him. The results of that research, 
if they confirm in general his present findings, 
would go far to allay the doubts expressed in 
this review, and would establish even more defi- 
nitely the accuracy of the over-all map which 
Cattell has drawn for us of the orectic field. In 
summary, it may be said that in spite of certain 
possible criticisms this new book of Cattell's 
deserves the most careful study by all those in- 
terested in the description and measurement of 
human personality. As with most far-reaching 
contributions, it would be hazardous indeed to 
prophesy its precise place in the history of the 
Subject; all one can safely say is that future 
research will undoubtedly be significantly influ- 
enced by it. Only the verdict of such research 
can determine the correctness or incorrectness 
of Cattell's main contentions. 
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CHAPPLE, ELIOT D.; WITH THE COLLABORATION OF CON- 
RAD M. ArensBerc. Measuring Human Relations: 
An Introduction to the Study of the Interaction 
of Individuals. Genetic Psychology Monographs, Vol. 
22, No. 1. Provincetown, Mass.: Journal Press, Febru- 
ary 1940. Pp. 147. Paper. $3.00. * (PA 14:3069) 


Rural Sociol 5:274-5 Je '40. Carle C. Zim- 
merman. * attempts to determine elementary 
units of human behavior and then to apply 
quantitative methods to measure functional in- 
terdependence. Illustrations from the public 
schools, from families, factories, and political 
organizations seek to find unit relations and 
then to deal with these unit relations as sys- 
tems, subsystems, and tangent subsystems. The 
elements are reduced to symbols for algebraic 
analysis. To these quantitative analyses qualita- 
tive considerations are taken up under the title, 
"context of situation." However, as the senior 
author points out, the method emphasizes em- 
pirical positivism to a much greater extent than 
meaning. The reviewer thinks all social science 
methods using careful observation may reach 
worthy objectives if applied thoughtfully. It is a. - 
question as to how meticulous and "algebraic" 
a method can be and still not deaden the proc- 
esses of acquiring new Gestalts. * 
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XCnEvpLEUR, Freperic D. Placement Tests in For- 
eign Languages at the University of Wisconsin: 
A Forward Step in Education, 1930-1943. Bulletin 
of the University of Wisconsin, Serial No. 2686, Gen- 
eral Series No. 2470. Madison, Wis.: Bureau of Guid- 
ance and Records, the University, November 1943. Pp. 
39. Paper. Out of print. * 


J Higher Ed 15:385-6 O '44. James B. 
Tharp. * The report describes thirteen years of 
placement testing in vocabulary, reading, and 
grammar of French, German, Spanish, and 
Latin. Statistical tables show the extent to 
which high-school graduates offering one or 
more foreign-language entrance units were ad- 
vanced to courses one or more semesters be- 
yond the normal placement, were retained at 
normal placement, or were demoted one or more 
semester-levels. * Some of the data reveal con- 
clusively the advantage to students—and to the 
institution—of placing a student in the course 
appropriate to his measured attainment regard- 
less of the time-serving certificate of the high- 
School units. Not only did 98.4 per cent of the 
1,061 students who were advanced one or more 
semester-levels make marks of A, B, or C, but 
the number of 4’s earned actually increased in 


791 ] BOOKS & REVIEWS: CHAPPLE 


proportion to the number of semesters skipped 
* |t may be noted that the basis of success of 
the experiment is given in terms of the higher 
percentage of "satisfactory" marks (4, B, or 
C) earned by the correctly placed students than 
those earned by the students whose placements 
could not be adjusted due to conflict of sched- 
ules or other factors. Since marks are often 
highly unreliable when awarded by many dif- 
ferent instructors on differing criteria of evalu- 
ation, some attention should have been given in 
the report if provisions were used (but not re- 
ported) to increase the reliability of marks. It is 
the understanding of the reviewer that language 
achievement at the University of Wisconsin is 
measured by objective tests, either those of es- 
tablished standardization or those constructed at 
the University of Wisconsin, and it is presumed 
that marks are largely determined by these test 
results. There is evidence in this direction in 
the data reported on the use of American 
Council on Education German Reading Tests. 
It must be further noted that the placement 
testing did not take into account the oral-aural 
skills. So long as high-school preparation 
stresses reading, vocabulary, and grammer, the 
tests used in the programs are valid for place- 
ment purposes. Recently, the influence of war 
conditions and the intensive oral objectives of 
the Army Specialized Training Program have 
brought oral-aural skills into importance. Al- 
though no objective test of oral fluency has ever 
been published (although the Testing Division 
of the ASTP was about ready to produce ex- 
perimental semi-objective measurements before 
the program was abandoned), there are in print 
one or two standardized aural tests, mostly in 
French, One test exists in alternate forms in 
French, German, and Spanish. It is possible 
that future placements will need to include a 
test, at least of aural attainment. * 
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*CnrrrewpeN, Gertrupe E. An Experimental Study 
in Measuring and Modifying Assertive Behavicr 
in Young Children. Monographs of the Society for 
Research in Child Development, Vol. 7, No. 1, erial 
No. 31. Washington, D. C.: the Society, National Re- 
search Council, 1942. Pp. v, 87. Paper, lithotyped. 
$1.00. * (PA 17:992) 


Ed Res B 22:141 My '43. Francis N. Max- 
field. 4 

Q R Biol 18:197-8 Je '43. * The essential 
contribution of the study to the field of social 
behavior is the fact that children can be taught 


800 


life, ie. they do not have to depend entirely 
upon their limited social equipment, and the 
trial and error method in discovering what is 
acceptable and what is not acceptable in human 
society, Although the number of subjects used 
for the study was not large, the statistical pro- 
cedures in handling the data are sound, and the 
interpretations are well advised. The study is 
documented by a list of some 30 publications, 
and is supplied with numerous tables, charts, 
and photographs. A table of contents and an 
appendix of definitions and directions for re- 
cording and scoring tests in social behavior are 
provided. 
[ 800 ] 

*Ctocco, ANTONIO, AND PALMER, Carrout E, The 
Hearing of School Children: A Statistical Study 
of Audiometric and Clinical Records. Monograplis 
of the Society for Research in Child Development, Vol. 
6, No. 3, Serial No, 29. Washington, D. C.: the Society, 
National Research Council, 1941. Pp. v, 77. Paper, 
lithotyped. $1.00. * (PA 16:1651) 

J Nerv & Mental Dis 95:777-8 Je '42. * this 
report aims to make a critical evaluation of 
some of the methods used in the study of chil- 
dren's deafness; to ascertain the degree of hear- 
ing impairment and with it the implication of 
clinical and other factors; and to formulate 
steps basing a program for the prevention of 
deafness * Indicating the extent of the inves- 
tigation from which the material has been 
drawn are the examinations, in 1931, of nearly 
14,000 children with a Western Electric 4-A 
(phonograph) audiometer and the results of 
hearing tests administered to over 1,400 chil- 
dren with a Western Electric 2-A ( pure-tone) 
audiometer. Of the last group, one-half were 
examined twice and over a third have been 
tested three times since the beginning of the 
investigation. Certain of the findings of these 
inquiries have been published previously but 
are again presented here in the setting of this 
report, which offers a cogent synthesis of the 
whole mass of information thus accumulated. hg 
The conclusions show that for the purpose of 
identifying children needing special therapeutic 
and educational care because of defective hearing 
the phonograph audiometer is not wholly satis- 
factory. While more objective and rapid in re- 
sults as compared with tuning forks or Galton’s 
whistle it does not identify cases having mild 
auditory impairment; and, owing to character- 
istics aimed at obtaining responses to the spoken 
voice, cases involving high tone loss and the 
tones of the range of speech are missed. More 
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extensive investigation of incipient defects 
among school children would contribute con- 
cretely to the prevention of deafness if certain 
objectives are given special attention: a more 
adequate means for the detection of auditory 
defects of slight degree ; better etiological knowl- 
edge of middle ear infection and of its epidemi- 
ology ; and more thorough study of pathological 
changes in structures of the ear and its adja- 
cent organs. 

Q R Biol 17:189 Je '42. A report of a de- 
tailed summary of the results of an investigation 
which the U. S. Public Health Service has con- 
ducted on the hearing of Washington, D. C. 
school children over a 5-year period, beginning 
in 1931. * Very nearly 14,000 children were 
examined with a Western Electric 4-A (phono- 
graph) audiometer and over 1,400 children 
received a hearing test with a Western Electric 
2-A (pure-tone) audiometer. About one half 
of the children in the latter group were exam- 
ined twice, and over a third received three tests. 
Although the phonograph audiometer is satis- 
factory under laboratory conditions the results 
of the tests in the school environment seem to 
indicate that effort should be directed towards 
improving the instrument and methods of test- 
ing before the most satisfactory results can be 
obtained if a program aimed at the prevention 
of deafness in children is to be developed in the 
schools. During the 5-year period “fully 30 per 
cent of the ears with only a slight high tone 
loss and 25 per cent of ears with slight loss of 
tones of the conversational range deteriorated 
sufficiently to acquire a marked degree of the 
respective impairment.” The rate of develop- 
ment of decreased perception for the tones of 
speech is greater in the younger than in the 
older children. In this type of impairment, in al- 
most one half of the ears there was also present 
an overt and marked alteration of the otoscopic 
appearance of the tympanic membrane. The 
study points to the importance of therapeutic 
care as well as the prevention of middle ear 
infection. Tables and graphs exhibiting the re- 
sults of the tests, form blanks, and a bibliog- 
raphy of 66 titles are included in the report. 
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X*CiviL SERVICE ASSEMBLY oF THE UNITED STATES AND 
CANADA, COMMITTEE ox ORAL Tests IN Pustic PER- 
SONNEL SELECTION, SAMUEL H. Onpwav, JR, CHAIR- 
MAN. Oral Tests in Public Personnel Selection: 
A Report Submitted to the Civil Service Assem- 
bly. Chicago, Ill.: the Assembly, 1943. Pp. xviii, 174. 
$3.50. * (PA 17:4245) 


J Appl Psychol 28:181-3 Ap '44. Arthur ` 


Burton. * The amount of space allotted to the 
discussion of appeal and review procedure is 
disproportionate. Whereas two chapters or 20 
pages are devoted to such a discussion, the 
general evaluation and validity of the interview 
is treated in one chapter of seven pages. While 
the problem of oral test review by courts and 
administrative bodies is an omnipresent one and 
a true examining problem, it should be remem- 
bered that as the validity of the oral test is in- 
creased by research and objectivity, the prob- 
lem of appeals diminishes in importance. Of 
more serious import are the tendencies toward 
a “legalistic” validation of the oral test. While 
it is certainly true that evidence which is “mate- 
rial,” “relevant,” and “reliable” is the most 
valid kind of evidence upon which to determine 
ratings, logic or precedent, as in a court of law, 
cannot serve as criteria of what is "material," 
“relevant,” and “reliable.” This may only be 
established by correlation of specific oral test 
functions with job criteria. A statistically valid 
oral test will pari passu be based upon “mate- 
rial," “relevant,” and "reliable" evidence. Docu- 
mentation and reference to existing studies on 
the oral test are meager. For example, in the 
general discussion of the validity and reliability 
of the interview seven citations are made which, 
with two exceptions, were all published a decade 
or more ago. The contemporary industrial liter- 
ature on the employment interview is satisfied 
by a citation to Viteles Industrial Psychology 
and no reference is made to such recent innova- 
tions as the “stress” interview, or Rogers’ 
Phonographic methods as a technique for train- 
ing interviewers. The assumption in the book 
is unfounded that the oral test is valid until 
Proven otherwise because studies demonstrating 
the lack of validity of the interview are based 
upon fallible criteria of job success. No meas- 
urement technique is objectively valid until 
demonstrated as such by scientific method. Lack 
of definitive techniques on the part of a disci- 
pline does not in any sense constitute scientific 
validation. Specifically commendable is the ex- 
pressed and reiterated need for research on 
many points, e.g., the weighting of rating fac- 
tors, the insistence upon a thorough under- 
standing of the job before rating is attempted, 
and as a whole, the chapter on the training of 
interviewers. If evaluation of the report reveals 
that it falls short of being a definitive statement 
of the oral test, then one must hasten to point 
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out that in an area where even fundamentals are 
nebulous and conflicting, the clarification of con- 
cepts and techniques even by approximation is 
most welcome. If this report is instrumental in 
stimulating public personnel administrators to 
review the oral test practices in their jurisdic- 
tion, it is a job well done. 


[ 802 ] 
*CLARK, Mamie Pnipps. Changes in Primary Men- 
tal Abilities With Age. Archives of Psychology, No. 
291. Washington, D. C.: American Psychological As- 
sociation, Inc., May 1944. Pp. 30. Paper. $0.80. * (PA 
19:341) 

[ 803 ] 
*Cranx, Ruru MriLrBURN. A Method of Adminis- 
tering and Evaluating the Thematic Apperception 
Test in Group Situations. Genetic Psychology Mono- 
graphs, Vol. 30, No. 1. Provincetown, Mass.: Journal 
Press, August 1944. Pp. 55. Paper. $4.00. * (PA 18: 
3777) For reviews, see 104. 

[ 804 ] 
Crank, E. R. Predictable Accuracy in Examina- 
tions. British Journal of Psychology Monograph Sup- 
plements, Vol. 8, No. 24. London: Cambridge Uni- 
Yun Press, 1940. Pp. xi, 48. Paper 5s. * (PA 15: 
1491 


Brit J Ed Psychol 10:250-1 N ’4o. Ll. 
W (ynn) J(ones). * the treatment of the topic 
is sufficiently new and weighty to justify the 
inclusion of the monograph in this important 
series. It is well to inform readers that the 
author is not concerned with the validity of the 
verdict given by an examination but with the 
degree of consistency of that verdict. A scheme 
is put forward which would enable the exami- 
ner to increase the consistency of his examina- 
tions and thus reduce the number of errors. * 
This problem is worthy of study by all exami- 
ners, and Mr. Clarke's mathematical insight has 
enabled him to make a substantial contribution 
towards its solution. The author finds the two 
usual statistics, namely, the difficulty of a test 
and the ability of the testee, insufficient, and he 
introduces a new statistic which he calls the 
‘coefficient of ubiquity.’ * It is only when he 
begins to interpret these data that one may 
question whether his psychological acumen 18 
quite on a par with his mathematical insight. * 
The author's procedure, which he defends, is to 
discuss deductions from his mathematical meth- 
ods which appear later in the book. The reader 
may sometimes find this rather trying * thought- 
provoking....book. It is worthy of serious study. 
Moreover, readers possessing sufficient knowl- 
edge will also be able to appreciate the mathe- 
matical portions of the monograph. 


J Ed (London) 72:394 Ag '40. * an impor- 
tant addition to the literature dealing with the 
statistical interpretation of examination and 
test results. It introduces a new variable not 
hitherto considered in this field, viz. the varia- 
bility of performance displayed by an individual 
within one subject, as distinct from variability 
from subject to subject. The author shows that 
the importance of this “ubiquity” (as he calls 
this new variable) is so great as to throw con- 
siderable doubt on the validity of combining 
test marks and on using tests for the purposes 
of factor analysis. 

Math Gazette 24:364-6 D '40. Frank San- 
don. This monograph by the Headmaster of 
Aireborough Grammar School is based on his 
experience of, e.g., the West Riding Secondary 
School special place examination, and reviews 
the working of examinations in general. Mr. 
Clarke is fortunate, even among headmasters, 
for the statistical material that he seems to have 
to hand, and he is to be congratulated on his 
attempts to deal with the problems that he at- 
tacks. The compilation of the volume has obvi- 
ously entailed a great amount of thought and 
investigation and it contains some valuable 
ideas. Reference is made to three other theses 
of the author, two on the Interpretation of Age- 
Frequency Data, and one entitled The Nature 
and Extent of Subjective Examination Hazards, 
but no information is given where and how they 
were published: a number of passages in the 
present work might be clarified if these were to 
hand. There are few other references to other 
workers, and his originality leads him to uncon- 
ventionalities in statistical symbolism, as when 
we read (p. 34) 


p= Per.{ END. (s) = 


and in terminology, as in his use of "regres- 
sion” and “contingency.” The author tells us 
that he has sacrificed his tables of statistics 
and has adopted a certain crudity of mathe- 
matics in order that his ideas might be pre- 
sented to a wider circle of readers. One reader, 
at least, feels that what is good in the trea- 
tise has been far too drastically smothered, 
with the result that the argument is difficult to 
follow or to justify. But it appears to be on the 
following lines. (1) Ability of magnitude 5 in 
a candidate will lead him to pass a test of diffi- 
culty 3 and to fail a test of difficulty 7. Give 
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unit score to a pass and zero to a fail. In any 
examination the standard deviation of the abili- 
ties (e.g. 5, . . .) of the various candidates is 
denoted by the author by s and of the difficul- 
ties (e.g. 3, 7, .. .) by v. It is not clear how 
these are determined. Surely the absolute meas- 
ures of the abilities and difficulties are un- 
known and unknowable, for they are observed 
only by their manifestations and not directly. 
It seems likely from the author’s claim that “for 
the first time coherent use is made of scores 
(marks per boy) and frequencies (marks per 
test)” that he estimates the abilities and diffi- 
culties by totting up the passes in each case: he 
speaks in one place of the totals as “correspond- 
ing” to the abilities, etc. He gives certain results 
indicating that v/s is usually of the order of 
from 1 to 3. It appears that this is the ratio of 
the S.D.’s of what has been called by Walker, 
in his papers in the British Journal of Psychol- 
ogy, the answer pattern and the score pattern. 
The values of s and of v that he gives are esti- 
mated by a wrong formula, using 7 in place of 
(n — 1) in the denominator. (2) A boy of 
average ability 7 may have “ubiquity” 2. By 
“ubiquity” is meant a fluctuation in ability of 
any one definite kind: apparently this is what 
Thouless has called Function Fluctuation and 
what was, in a survey of the Necessary Imper- 
fections of an Examination in the British Jour- 
nal of Educational Psychology, called the In- 
stability of the Candidate. The author measures 
“ubiquity” by the S.D. of the abilities exhibited. 
He calls ita coefficient: it is not, being of the same 
dimensions as the ability measure itself. He com- 
putes it by the faulty formula for the S.D. And 
it surely must be unknown, just as the ability it- 
self is. The author claims that he establishes the 
fact that the individual is an individual to a meas- 
urable extent, and seems to think that the factor 
analysts deny this, and are unaware of this in- 
stability. (3) Mr. Clarke then assumes that 
“ubiquity” is normally distributed and thus con- 
cludes, using the equivalent of Sheppard’s 
Tables, that, if the boy of paragraph (2) tackles 
100 questions of difficulty 6, his probable score 
is 69: if the questions were of difficulty 9, his 
probable score would be 16. This assumption 
seems unlikely, There is a maximum that we 
cannot exceed, and for various reasons seldom 
reach: the effect of this is probably that the 
distribution of the measures of our manifested 
results will be skewed, with a mode near the 
maximum obtainable. (4) The thesis goes on 


to consider an actual examination in which can- 
didates of varying abilities and “‘ubiquities” take 
a number of tests (by a test the writer seems to 
mean a question, carrying one mark for a pass, 
to be marked right or wrong) of varying diffi- 
culties. He assumes a distribution of abilities 
and values of s/u and v/u. He then gets a table 
indicating that some 40 will be wrongly chosen 
and 40 wrongly omitted when 300 are to be 


selected from 1000. This seems an extravagant . 


claim, for it is equivalent to assuming a correla- 
tion of examination with criterion of the order 
of .95 (see a paper in the Annals of Eugenics 
on Selection by a Nearly Perfect Examination 
and the B.J.E.P. paper already referred to), a 
value much too high for anything found in our 
reading or experience. (5) To deal with the 
misfits, Mr. Clarke suggests that we give an- 
other test to the borderline candidates. He 
gives a formula to determine the number of 
misfits still remaining, but the assumption so 
far made, together with those now necessary 
(not all, we submit, allowed for), seem to make 
this of little value. It would, however, probably 
be of considerable practicable value, and his 
method of dealing with the borderline School 
Certificate candidate by using the school order 
of merit here is probably helpful, though surely 
no headmaster would guarantee the soundness 
of his ranking for his average candidates even 
after the suggested 400 or 500 school tests. (6) 
In the case of the Special Place Examination, 
the suggestion is made that the type of the 
examination should be changed, and that “In- 
telligence” type of question should be replaced 
by general scholastic questions—geography, his- 
tory, English, arithmetic, general knowledge. 
This would seem to provide more scope for the 
crammer and would work to the disadvantage 
of the candidate from the little school, etc. It 
does not seem to follow from any of the argu- 
ments of the thesis itself. (7) The writer pro- 
ceeds to deal with some effects of age. We are 
unable to understand some of his notation, and 
the argument here is very difficult to follow. 
But the author claims that he obtains a com- 
plete solution. His conclusion is that in number 
tests ability develops at a slightly increasing 
rate and variability increases rapidly, so that in 
the period from (school age) 614 to 11 it in- 
creases by five times. On the other hand, vo- 
cabulary test shows a more moderate develop- 
ment function and a variability which actually 
decreases slightly after 634 years, and does not 
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begin to advance beyond its value there until 
after 12. The obscurity of the argument is not 
helped by the statement that the value of the 
specific ability of a boy who can pass a test of 
definite quantitative value must be greater than 
the value of the test (which is directly opposed 
to the whole argument about “ubiquity”) by an 
assumption of uniform (? rectangular) distribu- 
tion of chances, by a numerical illustration of 
the connection between answer-patterns cooked 
to be linearly related, and by a casual rejection 
of values of v as insignificant by ruling out 
some frequencies. The whole monograph is an 
indication of the unfortunate state of research in 
education at present. There are isolated private 
workers attacking various problems, as Mr. 
Clarke has done, and a few centres of research 
—the Institute of Education, Moray House 
(whose leader, a sound worker on the same 
West Riding examinations as Mr. Clarke, is 
Thomson, and not Thompson, as spelt by Mr. 
Clarke), and so on. But there is practically no 
opportunity for teams of investigators, using 
reliable material and sound methods, to attack 
one by one the outstanding problems, and to 
get their results published and widely known. 
We are grateful to the British Psychological 
Society for presenting this monograph in their 
series, and to Mr. Clarke for his approach to 
the problems referred to, and for some sugges- 
tive ideas. But we feel that his presentation of 
his work is very unsatisfactory, in the absent 
material, the sketchy analysis, the dogmatic 
conclusions. It is a great pity that the whole 
could not be done by workers who know what 
has already been done in the field surveyed and 
who have suitable technique to hand to deal 
with the data from some of the original points 
of view seen by Mr. Clarke. 
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*Crarxe, H. Hae cay ication of Meas- 
urement to Health and Physical Education. New 
York: Prentice-Hall, Inc., 1945. Pp. xvi, 415. Trade 
edition, $5.00; textbook edition, $3.75. * (PA 19:1910) 

Ed Res B 25:80 Mr 1346. A. S. Daniels. * 
a timely and valuable textbook to a field facing 
immediate and important advances as an out- 
come of wartime experiences. It differs from 
other books in the field largely because it em- 
ploys the functional approach to measurement. 
Testing procedures are considered in relation 
to the specific physical-education objective they 
are designed to meet. * The book is recom- 
mended for its fresh approach; its coverage of 


wartime physical-fitness tests used by schools, 
colleges, and the several branches of the military 
service; its avoidance of highly technical treat- 
ment of measurement procedures; and the ex- 
cellence of its suggestions regarding inaugurat- 
ing and conducting measurement programs— 
the application of measurement. * represents a 
distinct advance toward intelligent use of test- 
ing procedures in health and physical educa- 
tion * 
[ 806 ] 

XCrankr, H. Harrison. The Application of Meas- 
urement to Physical Education Programs in 
Secondary Schools. Syracuse, N. Y.: Syracuse Uni- 


versity Bookstore, 1940. Pp. vii, 159. Paper, mimeo- 
graphed. * For revised edition, sce 805. 


Ed 61:250 D "4o. F(rederick) R(and) 
R(ogers). * should be in the library of every 
school administrator as well as on the desk, for 
ready reference, of every physical educator in 
the land. For it treats of a basic educational 
technique in a highly practical and informa- 
tive way. But its chief claim to immediate atten- 
tion is that Dr. Clarke has woven into his ex- 
position programs for the actual use of tests in 
real school situations—tests which are highly 
valid measures of physical fitness, —which itself 
is the most-needed quality of life for success, or 
even for survival, especially in war-time. Here, 
then, is no mere theoretical treatise in vaccuó 
—designed to be studied and conveniently for- 
gotten. Every chapter challenges the reader to 
act. It is persuasive, too, and not so much be- 
cause of the author’s apparent fervor, as be- 
cause his facts are so logically marshalled and 
so simply presented. Another virtue of this new 
book resides in the valorous way in which Dr. 
Clarke has critically analyzed the various tests 
best known to physical education theorists. * 
Clarke has, wherever possible, revealed the re- 
liability and validity, the integrity and quality 
of norms, the time for administration, and the 
various practical uses of each test he considers. 
This is of the greatest aid to the reader not yet 
trained in statistical procedures, who must, per- 
force, rely on the advice of someone for selec- 
tion of tests. Again, Dr. Clarke contributes an- 
other feature: he gives the physical fitness ob- 
jective “considerable prominence” * There is 
an excellent group of informative tables, a few 
figures, complete instructions for giving PFI 
tests, valuable suggestions on the interpretation 
and follow-up of PFI problem cases, and an 
excellent Bibliography. 
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: [ 807 ] 
KCLEETON, GLEN U., AND Mason, CHARLES W. Execu- 
tive Ability: Its Discovery and Development, 
praed Edition]. Yellow Springs, Ohio: Antioch 
ress, 1946. Pp. xv, 540. $4.50. * (PA 21:566) 


J Appl Psychol 31:459-60 Ag '47. Harold E. 
Burtt. The aim of the book is quite compre- 
hensive—to define executive functions and give 
procedures for discovery and development of 
executive ability. This is a large order, espe- 
cially where so much of the available material 
is of an a priori rather than empirical character. 
The authors get about as much out of this mate- 
rial as could be done at this juncture. * About 
every conceivable problem in industrial psy- 
chology is mentioned somewhere in the book * 
Naturally most of these must be touched upon 
or hinted at rather superficially. However, 
there are plenty of references so that the reader 
can obtain more details in the appropriate 
sources. Psychologists, presumably, will be 
most interested in the portions dealing with 
selection and training of executives. These sec- 
tions describe such tests as are available, like- 
wise rating scales. They give a fair amount of 
detail as to the techniques of constructing, ad- 
ministering, and validating tests. There is some 
consideration of selection of traits constituting 
executive ability by using the pooled judgment 
of experts. This results in a pretty good list of 
executive traits. For this and other reasons the 
book would be worth while reading prior to 
undertaking research in this field. The authors 
Stress emotions among executive qualifications 
more than usually is done but extend the term 
to include many aspects of personality. There 
are quite a few self-rating exercises which the 
reader is encouraged to try such as interest 
tests and personality tests, with keys in the ap- 
pendix. These would be all right for use in a 
college class with appropriate interpretation by 
the instructor. It is questionable practice to put 
them in the hands of the unguided reader who 
might attempt to evaluate himself and attach 
too much significance to them. The authors 
wisely stress the importance of supervision as 
one phase of executive function and give a good 
outline of supervisors' problems. The consider- 
ation of incentives and the training of super- 
visors is all too brief. It is interesting to note 
that the authors have caught ori to the new case 
study method which is proving successful in 
supervisory training. There is a sensible series 
of suggestions in the general field of industrial 


relations. In the spirit of the times the authors 
conclude their discussion with "The Executive 
in a Democracy." They stress the importance 
of leadership in this connection and give a list 
of suggested readings on social problems which 
the executive might well investigate. The re- 
viewer is not sure how valuable the work will 
be to an actual executive or a prospective ex- 
ecutive—probably not as much as the authors 
would hope. To the industrial psychologist or 
the student in this field, however, it is worth 
while to have a lot of this material brought to- 
gether with adequate references. It would be 
helpful to anyone who is going through the 
literature with a view to further research in this 
field. One cannot help being impressed by the 
paucity of empirical material and the need for 
such further research. 

Psychol B 44:192-3 Mr ’47. William O. 
Jenkins. * Emphasis is placed on problems of 
selection and training and the role of executives 
in labor relations. The treatment appears to be 
strongest in the last area and less strong when 
the discussion turns to scientific principles of 
behavior. Typical of the latter case are the ac- 
ceptance and use of terms such as instinct, will, 
and mental capacity without a clear statement 
of operations defining them. The major short- 
coming of this volume is that sweeping generali- 
zations, presumably based on research findings, 
are reported without the necessary information 
to evaluate the instruments, procedures, statis- 
tical tools, and data. A few other criticisms may 
be listed. The presentation of selection proce- 
dures (roughly 125 pages) omits the critical 
concept of cross validation. Over-emphasis on 
rating scales leads to the ignoring of more 
objective measures of proficiency. The authors 
believe that there is no communality among the 
traits necessary for success in different execu- 
tive positions; this position implies a research 
program for each executive job, certainly an 
impractical task. It is the opinion of the re- 
viewer that the attempt to treat this important 
applied field is admirable and that this book 
may be useful in presenting elementary notions 
for the non-psychologist working in this area. 
It is felt, however, that the book falls short of 
its stated purpose of summarizing the best avail- 
able data concerning executive ability. 

Scientific Mo 64:354-5 Ap "47. Dael Wolfle. 
Cleeton and Mason have re-examined the ex- 
ecutive's place in industry and society to take 
account of new information and in the light of 
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changes in governmental and industrial policies 
since the first edition of this book was pub- 
lished. Thirty percent of the references are of 
1940 or later and 54 percent are of 1935 or 
later. Yet the basic philosophy remains un- 
changed, and the orienting introduction of the 
first edition is reprinted in the current one. The 
executive is analyzed from several standpoints, 
but always he is viewed as an important, so- 
cially conscious member of a democracy in 
which he must show broad vision and mature 
understanding of the interrelated interests of 
capital, labor, and consumer in order that all 
three may share equitably in the wealth pro- 
duced by industry. The hard-driving exploiter 
of labor and the consumer is not the executive 
for whom or about whom Cleeton and Mason 
have written. * The psychologist who is not an 
executive is likely to be disturbed by the over- 
simplification of some of the difficulties of meas- 
uring personality traits and predicting their 
roles in the successful discharge of executive 
functions, * In spite of such oversimplifications, 
the volume should be of interest both for the 
information it contains and for the problems it 
discusses which need further analysis and study. 
The executive who is not a psychologist will 
find in the volume both elementary discussions 
of psychological processes and suggestions for 
their practical employment. Study and appli- 
cation of the principles discussed in Executive 
Ability should make him a better executive. 
For a review of the first edition, see 40:B864. 


[ 808 ] 
*CorrzcE Entrance Examination Boarn, Fortieth 
Annual Report of the Executive Secretary, 1940. 
New York: the Board, 1940. Pp. x, 90. Paper. $0.25. * 


[ 809 ] 
*Coutece Entrance Examination Boar. Forty- 
First Annual Report of the Executive Secretary, 
1941. New York: the Board, 1941. Pp. x, 98. Paper. 
$0.25. * 

[ 810 ] 
X*CoLtecE ENTRANCE EXAMINATION BOARD. Forty- 
Second Annual Report of the Executive Secre- 
tary, 1942. New York: the Board, 1942. Pp. x, 64. 
Paper. $0.25. * 

[ 811 ] 
KCoLLEGE ENTRANCE EXAMINATION BOARD. Forty- 
Third Annual Report of the Executive Secretary, 
1943. New York: the Board, 1943. Pp. xi, 57- Paper. 
$0.25. * 

[ 812 ] 
*CortEcE ENTRANCE EXAMINATION BOARD. Forty- 
Fourth Annual Report of the Executive Secre- 
tary, 1944. New York: the Board, 1944. Pp. xi, 62. 
Paper. $0.25. * 


[ 813 ] 
*CottEGE Entrance Examination Boarp. Forty- 
Fifth Annual Report of the Executive Seco A 
1945. New York: the Board, 1945. Pp. xi, 66. Paper. 
$025. * 

[814 ] 
XCoLLEGE ENTRANCE Examination Board. Forty- 
Sixth Annual Report of the Executive Secretary, 
1946. New York: the Board, 1946. Pp. xii, 105. Paper. 
$025. * 

[ 815 ] 
XCOoNANT, Marcaret M. The Construction of a 
Diagnostic Reading Test for Senior High School 
Students and College Freshmen. Columbia Uni- 
versity, Teachers College, Contributions to Education, 
No. 861. Ruth Strang, faculty sponsor. New York: 
Bureau of Publications, the College, 1942. Pp. ix, 156. 
Out of print. * (PA 17:2164) 


H Sch J 27:43-4 Ja-F '44. A. M. Jordan. * 
While some of the passages of the test are too 
easy for the senior high school and the essay 
type of questions are difficult to score it does 
furnish an interesting attempt to test and 
analyze the more complex reading processes. 

J Ed Psychol 34:380-1 S "43. Miles A 
Tinker. * The development of this test and the 
analysis of results reveal careful planning and 
critical insight. More questions are raised than 
are answered. Admittedly, additional research 
must be completed before the test is suitable for 
individual diagnosis. The case-study technique 
suggested for establishing a valid criterion of 
reading status appears promising. The author, 
however, is probably over-optimistic about hav- 
ing established validity of the test by definition. 
Furthermore, with the material and procedures 
employed, she is not justified in even the tenta- 
tive' conclusions concerning ‘reading ability’ 
versus ‘reading abilities.” 


[816 ] 
*Coox, Stuart W., EDITOR. Psychological Research 
on Radar Observer Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
port No. 12. Washington, D. C.: Government Printing 
Office, 1947. Pp. x, 340. $1.75. * (PA 22:4139) 

[ 817 ] 
XCnAwronp, ALBERT B., AyD BURNHAM, Paur S, Fore- 
casting College Achievement: A Survey of Apti- 
tude Tests for Higher Education: Part I, General 
Considerations in the "Measurement of Academic 
Promise. New Haven, Conn. : Yale University Press, 
1946. Pp. xxi, 291. ive * (London: Oxford Uni- 
versity Press. 255.) A 20:4331) 


Brit J Psychol: 37:166-7 My '47. A. W. 
H(eim). * Perhaps the most valuable function 
performed by Drs. Crawford and Burnham is 
their description of the difficulties encountered, 
though often unrecognized, by the vocational 
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and educational psychologist, and their stringent 
criticisms of many of the methods which he 
uses and abuses. The book has also a more posi- 
tive aspect * The chapter on ‘Unitary Traits 
and Primary Abilities’ contains a fair but criti- 
cal exposition of the work of the factorial 
analysts, with special reference to Thurstone’s 
centroid method. * The book contains several 
rather muddled discussions on the relative mer- 
its of restricted answer and essay form ques- 
tions. Drs. Crawford and Burnham rightly 
point out that whilst the former are, in certain 
senses, more objective than the latter, and more 
appropriate if a great number of items are to 
be covered, they are a less suitable instrument 
for probing deeply into the subject's knowledge. 
The authors do not draw any clear distinction 
between the various types of restricted answer. 
"Their final verdict should, however, please both 
the "subject matter experts who have a limited 
grasp of testing methods" and the "testing ex- 
perts who have a limited grasp of subject mat- 
ter”; they advocate that for practical purposes 
the two methods should be combined. In the 
chapter entitled ‘A Sample Aptitude Battery’, 
the Yale Battery of “prognostic and differential 
tests” is explained. * a comparison is made be- 
tween profiles (group and individual) obtained 
from these seven tests and success in college 
career. A good measure of agreement is claimed 
_ to exist between these profiles, properly inter- 
preted, and success in one of the three groups 
of faculties: “liberal arts study”, “pure science 
and mathematics” and “applied science such as 
engineering”. The agreement is in general quite 
impressive, but examination of the examples of 
the tests, given in the Appendix, suggests that 
some of them may well not be testing what they 
are intended to test. The “Artificial Language” 
test, for instance, appears prima facie to assess 
elementary reasoning plus a high degree of ac- 
curacy, rather than the complex qualities which 
go to make up a gifted linguist. Inspection of 
the correlation table on p. 161 confirms this sus- 
picion; “Artificial Language” correlates more 
highly with Physics grades, for example, than 
with grades of English and History. * 

Ed Adm & Sup 32:510-2 N '46. James S. 
Kinder. * Part I, as now published, can hardly 
be called a manual for counselors, rather, it is 
a comprehensive summary and critical review 
of present and past attempts to measure scien- 
tifically 'readiness-to-learn. Two chapters of 
Part I give an excellent summary of underlying 


assumptions and considerations in testing, and 
a quick review of the basic statistical procedures 
used in interpreting educational and mental 
measurements. A chapter entitled “Unitary 
Traits and Primary Abilities” is theoretical and 
of value only to those who are engaged in test 
construction. Two other chapters—one entitled 
“Differential Aptitude and General Intelligence 
Traits” and the other “Achievement Testing” 
—are provocative and of value to all persons 
interested in testing. A careful review is made 
of state, regional, and national test programs * 
One of the most fruitful chapters describes the 
development and composition of the Yale Apti- 
tude Battery for Freshmen. * Parts II and III 
in this series will be awaited eagerly by those 
persons concerned with college testing. * 

J Am Assn Col Reg 22:206-7 Ja "47. John 
Dale Russell. * The authors are eminently 
qualified to deal with the subject * The authors 
perform a valuable service, in discussing uni- 
tary traits and primary mental abilities, by 
showing how the Spearman-Holzinger theory 
and the Thurstone theory, which at one time 
seemed violently in conflict with respect to the 
nature of mental abilities, now seem to be ap- 
proaching a reconciliation. Particularly valuable 
also is the extensive bibliography of references 
and the appendix containing illustrative mate- 
rial from the Yale Educational Aptitude Battery. 
Although the authors indicate in the Preface 
that high school and college counsellors com- 
prise the audience for which they are writing, 
the treatment of most topics would lead one to 
suspect that the book is intended more for test 
technicians than for counsellors. Detailed dis- 
cussions are presented concerning questions 
over which technical controversy has raged; 
many of these questions are far beyond the 
range of problems faced by typical high school 
or college counsellors. An example is the exten- 
sive discussion (pages 56-8) of the “split-half” 
reliability coefficient as compared with the 
Kuder-Richardson formula. This reviewer's 
chief criticism of the book is that it is difficult 
to read. The organization of topics is not always 
easy to follow. The system of type-faces used in 
headings, sub-headings, and sub-sub-headings 
is unusual, and is not so arranged as to indi- 
cate quickly to the reader the relative subordi- 
nation of the various sections. The vocabulary 
level is about that of a Ph.D. who has special- 
ized in statistics and test construction, and the 
sentences are frequently long and involved. In 
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defense of the heavy style it may be contended 
that the authors have attempted to produce a 
reference work rather than a selí-teaching text 
or a' volume for entertainment during a winter 
evening by the fireside. * will undoubtedly prove 
very useful as a reference source for those who 
give tests or who deal with the results of tests 
given to high-school seniors and college stu- 
dents. For anyone engaged in student counsel- 
ling who wishes to know something more than 
the mere mechanics of giving tests and tabulat- 
ing results, for the kind of technician who 
wishes to have, in addition to mere procedural 
skill, an understanding of what he is doing, this 
volume will be extremely valuable. Perhaps it 
will be of greatest service to those who have 
had some technical training in statistics and in 
test procedures, and who wish to be brought 
into touch with the most recent theory and prac- 
tice in these fields. 

J Higher Ed 18:276 My ’47. Valerie C. 
Wickhem. * Much useful material has been 
brought together in this study. There is gen- 
eral information concerning studies that have 
been carried on in the field of testing and much 
illuminating detail about certain aspects of the 
problems involved. This material and the excel- 
lent bibliography that is included offer the 
reader a challenge to investigate for himself in 
more detail the real significance of studies that 
are being carried on. In recognizing the need 
for more widespread understanding of tests 
and their uses, and in assembling so much perti- 
nent data on the subject, the authors have per- 
formed a service. * a factor which has signifi- 
cant bearing on the problem of educational 
measurement does not receive full considera- 
tion. While the efficiency of tests in predicting 
success or failure in college receives careful 
attention, not enough is said about the differ- 
ences that exist between institutions in educa- 
tional aims and methods of approach to subject- 
matter, and the effect of these differences on test 
construction, This subject may receive fuller 
treatment in the studies that are in progress. 

Nerv Child 6:12r Ja '47. * The book is a 
purely technical one without the psychological 
or psychotherapeutic background which we con- 
sider equally essential in approaching such a 
responsible problem as deciding the future of 
youth desiring to strive in academic fields. 

Occupations 25:246 Ja '47. Herbert A. Car- 
roll. The problem of the selection of young men 
and women for college training has plagued 


admissions officers almost since the first college 
opened its doors. The authors of this book have 
not solved it, nor do they claim to have done 
so. They have, however, evaluated a consider- 
able amount of material on achievement and 
aptitude testing. * Chapters III and IV contain 
rather extensive discussions of intelligence and 
achievement tests * There is little in this mate- 
rial that is new or encouraging. The reader 
meets once more such familiar correlations as 
0.45 and 0.50 between intelligence, as measured 
by the instruments discussed, and college 
achievement, If the authors presented this mate- 
rial in order to demonstrate that more adequate 
prognostic techniques are needed, they have 
done well. * Forecasting College Achievement 
represents an advance in the field of measure- 
ment as it relates to selection and to educational 
and vocational counseling. * Tests, however, no 
matter how well constructed, do not provide a 
complete solution to the problem of forecasting 
academic or vocational success. The emotional 
component is as important as mental aptitude, 
yet it is almost completely ignored by Crawford 
and Burnham. Possibly it will be considered by 
them in Part III of the series. At least, “per- 
sonality measures” are mentioned in the jacket 
reference to this volume. 

Personnel Adm 9:46-7 N ’46. Louis M. 
Heil. A comprehensive review of the procedures 
and methods employed in aptitude testing at the 
college level is timely. Colleges are now 
swamped with students and they are therefore 
looking for the best possible techniques to pre- 
dict success for the students admitted, Further- 
more, many schools and organizations have, 
during the past fifteen years, made various ap- 
proaches to the problem of aptitude testing and 
an over-all summary of their successes and fail- 
ures is past due. Crawford and Burnham, pio- 
neer workers in this field, have had extensive 
experience not only at Yale University but in 
connection with other testing agencies with the 
problem of predicting success in college. They 
should be in a better position perhaps than al- 
most anyone else working in the field to present 
a critical and comprehensive survey. * The idea 
that aptitudes are revealed by different people 
in different ways is refreshing. The distinction 
and similarity between aptitude and achieve- 
ment testing as pointed out by the authors is 
helpful not only in sharpening the problem of 
aptitude testing, but also in enabling the non- 
professional" to discriminate between the vari- 
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ous forms of “objective type” tests. * the au- 
thors have given noteworthy attention to the 
idea of test reliability as a function of “who 
takes the test and under what conditions,” a 
point frequently overlooked. It appears, how- 
ever, at one point....that an overconcern is shown 
for the split half method of determining test 
reliability. At that point the authors’ analysis of 
the relative merits of the split half and the 
Kuder-Richardson methods for estimating test 
reliabilities does not seem to be at the same 
level of objectivity as the remainder of the 
chapter. Richardson seems to have gotten under 
their skins! * Specific data are presented which 
should be of assistance in helping the critical 
reader to draw his own conclusions concerning 
the relative values of most of the better known 
aptitude tests in those colleges and universities 
where they have been used. The authors' con- 
clusions concerning the effectiveness of aptitude 
testing, assuming early college work as the cri- 
terion, are such that the non-professional in the 
field may readily obtain the impression that 
aptitude testing is, up to the present time, 
quite successful in predicting college success. 
One reason for such an impression is that the 
authors have uniformly expressed relations 
between the aptitude tests and early college 
grades in terms of correlation coefficients. An- 
other reason is that the authors imply at sev- 
eral points in Chapters III and IV that the 
lack of higher correlation is due to the unrelia- 
bility of college instructors’ marks. The only 
evidence presented supporting this contention is 
a low term-to-term correlation between student 
marks. Such a relatively low correlation might 
exist even though the reliability of college in- 
Structors' marks was very high for either term. 
The authors do not point out that, even in view 
of maximum correlations cited between apti- 
tude scores and college grades, less than half of 
the total variance of college success is accotinted 
for by aptitude test prediction. The problem of 
what factors are likely to account for such a 
discrepancy between college work and aptitude 
test scores may be planned for presentation in 
Volumes II and III of the series. It would 
seem, however, that such a fundamental point 
should be noted in the specific discussion of the 
results of aptitude testing. * It is difficult to 
understand why the reference to certain testing 
devices developed by Tyler and his associates 
in the Eight Year Experiment is included in the 
chapter on primary abilities. Furthermore, it is 


entitely possible that the statement “it is un- 
fortunate that objective data in its behalf 
(Evaluation Study of the Eight Year Experi- 
ment) are as yet insufficient for presentation” 
can be challenged from the standpoint of what 
constitutes objective data. In this same connec- 
tion it is also curious that other tests are much 
more fully described for which there are little if 
any data available in published form. A com- 
prehensive review of aptitude testing at the col- 
lege level should....also reveal how the “aptitude 
tester” proceeds with his work. How, for exam- 
ple, does he determine the kinds of questions to 
be included in an aptitude test? Does he work 
actively with college instructors to determine 
what they expect of their students? In short, 
how does he proceed to make an analysis of the 
kinds of behavior which represent his basic 
hypotheses for aptitude test construction? An- 
swers to these questions would be helpful to 
schools and colleges who are struggling with 
their individual problems of prediction. Further- 
more, answers to such questions would reveal 
more clearly why certain kinds of predicting de- 
vices seem to be more successful than others. It 
is to be hoped that some part of Volumes II and 
III will take up such questions. Considered as 
a whole the volume presents the past and exist- 
ing philosophy of aptitude testing quite well. It 
presents a large quantity of specific informa- 
tion which should be helpful to other workers in 
the field. 

Sch & Soc 65:46-7 Ja 18 '47. Arthur E. 
Traxler. * unusually timely * The treatment of 
the topics discussed is scholarly, meticulous, and 
thorough. Available research in the field is used 
effectively, and pertinent quotations from nu- 
merous measurement specialists are employed 
to support and clarify the authors’ own views. 
Occasionally a point is labored beyond what 
seems necessary to make it clear to the average 
reader, but for the most part the discussion is 
well presented. The authors are frank and out- 
spoken in their criticism of theories, procedures, 
and conclusions which seem to them unsound. 
Probably few test specialists would agree with 
all the opinions and critical comments set forth 
in the book, but Crawford and Burnham have 
rendered a distinct service in defining the issues 
and courageously presenting their own view- 
points. Much of the critical discussion is di- 
rected toward factor analysis as an approach to 
aptitude measurement and in particular toward 


"the Tests of Primary Mental Abilities con- 
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structed by the Thurstones. The Thurstone tests 
and the Yale Educational Aptitude Tests are 
the two most extensive test batteries for the 
differential measurement of academic promise 
thus far made available. As Crawford and Burn- 
ham make clear in their discussion, there are 
two fundamental differences between these bat- 
teries. In the first place, the Thurstone tests 
were set up on the basis of an internal analysis 
of intercorrelations among a variety of tests in 
an effort to isolate and measure certain primary 
traits, whereas the Yale tests were constructed 
with a view to their practical utility in predict- 
ing outside critera of school and college success. 
In the second place, the primary mental-abilities 
battery is highly speeded, while the Yale battery 
is more nearly a series of power tests in differ- 
ent areas. Although. much more research will be 
required before the relative merits and limita- 
tions of the two batteries can be stated with con- 
fidence, Crawford and Burnham have done a 
thorough, if not wholly disinterested, job of 
presenting and discussing the present status of 
the problem. Although the Crawford-Burnham 
books are to be concerned primarily with apti- 
tude tests, it is to be hoped that somewhere in 
the series the authors will find a place for 
discussion of the possibilities inherent in 
elementary- and secondary-school cumulative 
records of both aptitude and achievement for 
use in forecasting college achievement. It is be- 
lieved that their otherwise admirably thorough 
program of investigation in this important area 
will be strengthened if they inquire into what 
schools can do to assemble information that 
will make possible the prediction of student 
success at the college level some years before 
the students reach the point of entrance to col- 
lege. 


[ 818 ] 


*CRAwronp, MEREDITH P.; SOLLENBERGER, RICHARD 
T.; Warp, Lewis B.; Brown, CLARENCE W.; AND 
GurseLLI, Epwin E., EDITORS. Psychological Re- 
search on Operational Training in the Continental 
Air Forces. Army Air Forces Aviation Psychology 
Program Research Reports, Report No. 16. Washing- 
ton, D. C.: Government Printing Office, 1947. Pp. vii, 
367. $1.50. * (PA 22:4140) 


[ 819 ] 

*Crissy, Witttam J. E., AND PEDERSEN, Rut A. 
The War-Time Role of the College Sophomore: 
A Report on the 1942 Sophomore Testing Pro- 
gram. Cooperative Test Service Publications in Meas- 
urement and Guidance, Series 3, Vol. 2, No. 3. New 
York: Cooperative Test Service, July 1942. Pp. 40. 
Paper. $0.10. * (PA 16:5005) 


[ 820 ] 
*Crissy, WiLLIAM J. E, AND Ryans, Davin G. The 
College Sophomore Appraises His Curriculum: 
A Report of the 1941 Sophomore Testing Pro- 
gram. Cooperative Test Service Publications in Meas- 
urement and Guidance, Series 3, Vol. 2, No. 2. New 
ine Cooperative Test Service, January 1942. Pp. 39. 

aper. $0.10. * 
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*CunrroN, Tuomas Kirk. Physical Fitness Work- 

book: A Manual of Conditioning Exercises and 

Standards, Tests, and Rating Scales for Evaluat- 

ESSET Fitness, Third Edition. St. Louis, Mo. : 

E K Mosby Co., 1947. Pp. xi, 64. Paper, spiral binding. 
50. 


[ 822 ] 
KCURETON, THOMAS KIRK, JR. ; ASSISTED BY FREDERICK 
W. Kascn, Jonn Brown, AND W. G. Moss. Physical 
Fitness Appraisal and Guidance. St. Louis, Mo.: 
C. V. Mosby Co., 1947. Pp. 566. $6.00. * 

[ 823 ] 
*CurETON, THOMAS Kirk; HurrmMan, WARREN J.; 
Weser, Lye; Krems, Ramon W.; AND LATHAM, 
DarreLL E. Endurance of Young Men: Analysis of 
Endurance Exercises and Methods of Evaluating 
Motor Fitness. Monographs of the Society for Re- 
search in Child Development, Vol. 10, No. 1, Serial No. 
40. Washington, D. C. : the Society, National Research 
Council, 1945. Pp. xxiii, 284. Paper, lithotyped. $2.50. * 
(PA 20:407) 

Am J Psychol 59:720-1 O '46. * an "attempt 
to analyse statistically a group of exercises 
which involve every principal muscle group of 
the body" (p. iii). Though it goes well beyond 
works of like import, it may be an exaggeration 
to claim that it is “the first statistical approach 
to kinesiology of the human body." The main 
interest of the study lies in the quantitative de- 
termination of muscular power; its sources, its 
coordination, and its conservation at a top level. 
* Discussion of the psychological factors in the 
production of body-power does not go much 
beyond ‘motivation,’ ‘drives,’ and ‘will power’; 
though a reference upon this subject is made to 
a work by Brozek. * 

[ 824 ] 
*Currs, Norma E. How to Use Psychological Serv- 
ice in the Schools: A Guide to the Techalans and 
Interpretation of the Individual Psycho. logical 
Examination for the Examiner, Principal, and 
Teacher. Connecticut State Department of Education, 
Division of Instruction, Bulletin 29. Hartford, Conn.: 
Division of Administration, the Department, May 
1943. Pp. 111. Paper. Out of print. * 

[ 825 ] 
*DameE, J. FRANK; BRINKMAN, ALBERT R.; AND 
Weaver, WILBUR E. Prognosis, Guidance, and 
Placement in Business Education. Cincinnati, Ohio d 
South-Western Publishing Co., 1944. Pp. v, 216. $2.00. 
(PA 19:765) 

J Bus Ed 20:37 S '44. * brings together in 
more detail than can be found elsewhere the 
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information that is available in this field. The 
bibliographies are excellent and the forms for 
follow-up, guidance, reports to and from parents 
and business men, and for prognosis will be 
particularly helpful to those who may wish sim- 
ilar studies in these fields. 


[ 826 ] 
*Danrev, Joun G. Clinical Aspects and Interpreta- 
tion of the Strong Vocational Interest Blank. 
New York: Psychological Corporation, 1941. Pp. 72. 
cm $1.20, postpaid. * (PA 16:328) For reviews, see 


[ 827 ] 
X*Danrev, Jonn G. The Interview in Counseling: 
An Outline of Interviewing Procedure for Use of 
Community Advisory Centers. United States De- 
partment of Labor, Retraining and Reemployment Ad- 
ministration. Washington, D. C. : Government Printing 
Office, 1946. Pp. v, 25. Paper. $0.10. * 


Occupations 25:66-8 O "46. C. L. Miller. * 
sketches in lay terms the process and procedures 
essential in the counseling services, and thus 
aids personnel engaged in community advisory 
services * Several significant features....include 
the emphasis on counseling as a process and the 
recognition that the success of counseling de- 
pends not only on the knowledge of the tech- 
niques, but also on skill and familiarity with 
the facilities and resources in the community 
that can be utilized to facilitate adjustment. * 
The need for an outline such as this is recog- 
nized; the effectiveness of its use is contingent 
upon the professional alertness of counseling 
personnel, the recognition that the lists of rules 
for interviewing and the do's and do not's are 
suggestive rather than inclusive, and the realiza- 
tion that skill in counseling comes with more 
training and experience. 


[ 828 
*DanrEv, Jonn G. Testing s Counseling in the 
High-School Guidance Program. Chicago, Ill. : Sci- 
ence Research Associates, 1943. Pp. 222. $2.05. * (PA 
17 :2879) 

J Appl Psychol 27:546-8 D '43. Donald E. 
Super. * Darley's new text...covers familiar 
topics and material...in a manner which should 
make it appeal to the high school teacher start- 
ing out in vocational guidance. While not a 
book for the specialist, it is one with which 
specialists should be familiar, for it presents the 
methods of testing and test interpretation in a 
context and manner which help to make them 
seem valuable and practical to the non-psycho- 
logically trained persons who now carry the 
burden of such vocational guidance as is avail- 


able to most high school students. The book is 
at once a text on testing and a text on counsel- 
ing procedures, but it does not deal in great 
detail with either topic. It would make a good 
text for a first course in testing or counseling 
for non-psychologists, following a basic course 
in the principles of vocational guidance. How- 
ever, many of Darley's statements and his un- 
derlying philosophy do much to misrepresent 
the objectives and methods of guidance and 
counseling, in line with two fallacies revealed in 
a number of the Minnesota publications. The 
first Minnesota fallacy arises, as this reviewer 
sees it, from the very excellence of much of the 
work done there in the development of tests and 
of the use of objective techniques: too great an 
emphasis is put on the role of tests. The second 
fallacy stems from this, for the test interpreter 
becomes a judge and maker of decisions for the 
student, rather than a counselor or teacher who 
helps the student to develop insight and make 
his own decisions. Thus, whereas Darley's title 
is "Testing and Counseling," what he appears 
to mean is "Diagnosis and Counseling," as in- 
dicated by the statement that this book is writ- 
ten for those “who want to understand what to 
expect of counseling." The implication is that 
diagnosis is based so largely on testing that 
other methods, briefly discussed, are of minor 
importance. This reviewer would be one of the 
last to decry the use of tests, but it would be 
unfortunate were they to be assumed to be the 
essence of counseling, as is so often done by 
teachers and laymen. This is especially so in 
view of the second fallacy, namely the assign- 
ment of the role of decision maker to the coun- 
selor. If testing is the essence of counseling 
and the counselor alone understands testing, the 
counselor, in his superior wisdom, must decide 
what is wise and what is unwise. This fits in 
with a common misconception of guidance, in- 
stead of revealing the counseling process as one 
in which a knowledgeful person assists a less- 
informed individual to acquire needed insight 
and to make appropriate decisions. Darley's un- 
democratic philosophy is illustrated by a para- 
graph on p. 169: "In many respects, the 
interview seems somewhat similar to a sales 
situation, since the counselor attempts to sell the 
student certain ideas about himself, certain 
plans of action, or certain desirable changes in 
attitudes. Persuasion and logic will facilitate 
and hasten the sale of such ideas by a ‘coun- 
selor.” But will the student be any better 
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equipped to solve problems himself, or any hap- 
pier with his new adjustment, assuming he 
brings it about? Fundamental though these 
philosophical defects are, they do not affect the 
other excellent features of Darley's text. * 
Chapter II consists of a brief but interesting 
discussion of what one should know about stu- 
dents, how these factors operate and why they 
are important—it is a good interest-arouser and 
is not limited to test data. Chapter III deals 
with statistics in an easily understood and 
breezy manner; it is on the whole admirable 
for beginners, and certainly necessary at present 
in such a text, but one wishes for the day when 
all would-be students of testing will be expected 
to know elementary statistics. The crucial topics 
of selecting and using specific tests are covered 
in Chapter IV in about forty pages. Darley 
judiciously selects two or three tests in each 
area; unfortunately, they are too briefly dis- 
cussed. * It is to be regretted that psychologists 
who have done as much test construction and 
have stressed the clinical approach as much as 
the Minnesota group have not written at greater 
length on the clinical interpretation of specific 
tests—as Darley himself has elsewhere on 
Strong's Interest Blank. The discussions of in- 
terest and personality testing in general are very 
good, with judicious caution urged in the latter 
area, each receiving about five pages; intelli- 
gence, achievement, and aptitude are less well 
treated both quantitatively and qualitatively. * 
The counseling process is discussed in Chapter 
VII, in which the Minnesota viewpoint on ac- 
tive therapy is expressed. The interview is 
appropriately called a learning situation. Learn- 
ing may be sought by lecture or recitation (both 
dismissed as ineffective), discussion (approved 
but not discussed at sufficient length), or ex- 
planation of data followed by discussion. The 
last-named technique is called the best. While 
this is true in many short-term vocational coun- 
seling cases seen by clinic counselors, it is prob- 
ably not true of others, and this reviewer 
believes there are better techniques for use in a 
school situation where time is available for 
long-term counseling. These involve discussion 
of a question-asking type which stimulates the 
student to seek information, some of which is 
obtained by means of tests and some by means 
of other experiences, and which also leads him 
to suggest and evaluate interpretations, thus 
helping him to develop techniques of self- 
guidance rather than habits of depending on 


others for specific advice. The last chapter is an 
admirable account of the development of a 
community, county, and state-wide counseling 
program of a type happily becoming increas- 
ingly common, Publication of such accounts is 
a help both as stimulation and as guide for 
other communities. Despite its defects of philos- 
ophy, brevity of treatment of specific tests and 
of counseling techniques, and relative neglect of 
non-testing sources of diagnostic data, this is a 
useful book. It will be valuable, when properly 
supplemented, in elementary testing courses and 
in introductory counseling courses in which it is 
necessary to survey testing techniques. 

J Appl Psychol 28:179-80 Ap ’44. John G. 
Darley. “A Reply to Dr. Donald E. Super.” * 
Dr. Super happily finds no cause for disagree- 
ment in regard to the factual material presented, 
but he apparently disagrees strongly with my 
emphases and the viewpoint I have tried to 
present. This is true to the extent of stating that 
guidance and counseling are misrepresented, “in 
line with two fallacies revealed in a number of 
the Minnesota publications.” The “fallacies” for 
which the Minnesota studies are thus stigma- 
tized turn out to be no more than differences of 
opinion regarding the extent of the role of tests 
in clinical work and differences of opinion re- 
garding the role to be played by the counselor. 
It is misleading to have such differences of 
opinion distorted to the implied level of a factual 
or theoretical fallacy. This makes it appear that 
Dr. Super has evidence at least of the presumed 
fallacious nature of these emphases, if not evi- 
dence of the superiority of other methods. * The 
excellent series of studies by Williamson and 
Bordin have demonstrated that the techniques 
developed at Minnesota produce improved ad- 
justment among students; the critics of the 
Minnesota writings have maintained a thunder- 
ing silence regarding these studies, while they 
persist in their denunciation of an allegedly 
autocratic and undemocratic role imputed to the 
counselor. The reviewer even resorts to the 
classic critical technique of quoting out of con- 
text to buttress his point in this regard. The 
exponents of the transfer of learning theory in 
personal problem solving have yet to demon- 
strate their hypotheses; furthermore the ex- 
treme categories of autocratic and non-autocratic 
counseling are rhetorical constructs rather than 
demonstrable types, in view of individual differ- 
ences among good counselors. It may be also 
that my attempt to write a self-contained intro- 
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duction to counseling for non-psychologists was 
unwise. But in view of the fact that most high- 
school counselors are non-psychologists and will 
continue to be non-psychologists, psychologists 
have the option of translating their methods for 
this large group or of ignoring the group in the 
field. The exigencies of translation and the fre- 
quent misunderstandings of test techniques in 
this group as seen from the author's experience 
combine to determine the emphases of the book. 
The experience of others might well produce 
different emphases. With the exception of his 
synonymous use of fallacies and differences of 
opinion, Dr. Super has written a careful and 
thoughtful review. It is to be hoped that both 
the critics and exponents of various forms of 
counseling may someday combine their efforts 
to design crucial experiments on the processes 
of therapy, and thus turn their analytical powers 
to the interpretation of research findings instead 
of philosophic debates involving alleged types of 
professional workers. 

Occupations 22:153 N ’43. Edwin W. Davis. 
"This book treats testing in relation to counseling 
as it should be done to receive its proper em- 
phasis and perspective. The writer used it as 
one of the three texts in a university course in 
Tests and Measurements in Vocational Guid- 
ance last summer and would use it again. Many 
of the students who were preparing to become 
counselors considered it their best source of a 
clear introduction to elements of measurement. 
The volume is especially good for introducing 
students with little or no statistical or testing 
background to the elements of testing and the 
use of tests in counseling. * On page 130 in 
listing the testing companies to which it is well 
to write for catalogs, the author might also have 
mentioned the World Book Company, the Pub- 
lic School Publishing Company, the Teachers 
College Bureau of Publications, and C. A. Greg- 
ory. A fine choice of book both for a student 
taking a course in testing or in guidance tech- 
niques and for a person undertaking this work, 
without having taken courses in testing or in 
the general principles of guidance. 

Psychometrika 11:189-90 S '46. Welty Le- 
fever. * The treatment throughout is nontech- 
nical, simple, and functional. The style is lively 
and informal. It is possible, however, that the 
author may have underestimated the capabilities 
of teacher-counselors. A carefully worded sim- 
plified introduction to each topic might well 
have been followed by a more critical considera- 


tion of some of the significant issues and prob- 
lems related to each area of discussion. The 
book deals with only one phase of personnel 
work, individual counseling. This important 
function is given rather fuil consideration on a 
descriptive and practical level. A sizeable sec- 
tion is devoted to the use of objective tests in 
counseling. The basic statistical procedures are 
explained in an exceptionally clear, common- 
sense manner with a minimum stress on mathe- 
matical formulas and processes. For example, 
student achievement is diagnosed by noting the 
position of each individual on a correlation table 
for which the variables are scholastic ability and 
the mastery of curriculum content. The problem 
cases of the under-achievers and over-achievers 
are located for study and counseling. The im- 
portance of sampling is rather well presented. A. 
praiseworthy feature of the discussion is the oc- 
casional comment on the shortcomings of meas- 
uring devices available to counselors. * The 
relatively low predictive efficiency of most cor- 
relation coefficients is realistically presented. 
The problem of test validity is stated quite 
frankly. In spite of the book's more than usual 
honesty concerning the limitations of tests,....a 
more complete treatment of test reliability is 
needed. Actual examples of student scores 
should have been furnished with accompanying 
sampling-error ranges. The fact that every 
measurement must be interpreted as a zone 
rather than a point should have been stressed. 
The high degree of validity required for indi- 
vidual prediction should have been contrasted 
with that sufficient for group selection or place- 
ment as in hiring workers for an industrial 
plant. The selection of suitable tests is treated in 
a practical manner with a few specific examples 
briefly summarized. Here again an even greater 
realism in picturing the strengths and weak- 
nesses of actual tests might well have been em- 
ployed. Obviously such a viewpoint is debatable. 
Few counselors, however, possess a balanced 
and informed understanding of the real worth 
of existing measuring devices. Many modern 
educators would, no doubt, question the empha- 
sis on published standardized tests as opposed to 
teacher-made objective achievement tests. Any 
considerable dependence on published tests: in 
content fields in high school (in contrast to 
specific skill areas) reflects a rather conserva- 
tive curriculum policy, since such dependence 
tends to standardize both subject-matter and 
teaching approach. A much needed word of 
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caution concerning the interpretation of person- 
ality tests by teacher-counselors appears. This 
word could with justice have been elaborated 
into several paragraphs. Too many amateur 
psychologists show an unwarranted optimism 
in passing judgment on personality quirks and 
difficulties from group-type, paper-and-pencil 
personality measures. The last third of the book 
contains a useful analysis of the most frequent 
types of student problems. The common-sense 
approach reflects much mature experience in 
counseling on the part of the author. A rather 
complete chapter outlines some of the elements 
important to the development of skill in inter- 
viewing. Another current controversy is sug- 
gested by this discussion. An influential group 
of clinical workers and counselors will accuse 
the author of presenting a counselor-centered 
rather than a client-centered approach to the 
student. The relative merits of different coun- 
seling techniques under varying circumstances 
and with different objectives in mind should be 
clearly explained to the’ beginning guidance 
worker. There is undoubtedly an appropriate 
place for an informational interview as well as 
for the so-called nondirective approach. The 
final chapter encourages the administrators and 
teachers of smaller high schools in rural areas 
and less populous towns by describing the evo- 
lution of a successful guidance program in 
North Dakota. These concrete experiences ap- 
pear to justify the author’s enthusiasm and gen- 
eral optimism regarding a counseling program 
for the average high school in America. 
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Occupations 24:188 D '45. Roy N. Ander- 
son. * the result of a study of more than 1,000 


sales organizations which have used psycho- 
logical tests and other techniques for selecting 
salesmen * The data presented in Part I are 
supposed to be a description of how various 
companies use selection tests. But much of the 
material deals with the use of the application 
form and an interview guide. Part II gives 
opinions and results of various companies. 
There were 18,030 sales executives contacted 
and 2 per cent of this group reported that they 
now use tests and 6 per cent reported having 
used, but discontinued tests. The experiences of 
both groups are reported. A section is also de- 
voted to the opinions of three psychologists. * 
The summary: of the report is: (1) Aptitude 
testing for salesmen is still in the controversial 
stage. (2) It has proved valuable for screening 
purposes to the great majority of sales execu- 
tives who have used it. (3) Until some better 
method. of selection has been perfected, and 
proved in use, aptitude tests will appeal to many 
scientific-minded sales executives as a valuable 
tool to be used in conjunction with other meth- 
ods for selecting sales personnel. * 
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*Davenrort, Kenneth. An Investigation Into 
Pupil Rating of Certain Teaching Practices. Fore- 
word by Ida B. Kelley and Keith Perkins, Edited by 
H. H. Remmers. Purdue University, Division of Edu- 
cational Reference, Studies in Higher Education, [No.] 
49; Further Studies in Attitudes, Series 7. Lafayette, 
Ind.: the Division, January 1944. Pp. 61. Paper. $0.75. * 
(PA 18:3282) 
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Royal Stationary Co., 1944. Pp. 150. $2.50. 


Occupations 23:374 Mr "45. Personnel rec- 
ord sheets giving space for pertinent informa- 
tion on home background, health, aptitudes, 
personality, test results, scholastic record, voca- 
tional training and experience, educational and 
vocational plans. The homeroom teacher trans- 
fers information to these sheets from her cumu- 
lative folder. Each classroom teacher, therefore, 
has a record for each of her pupils. New data 
may be added. The reverse side of the sheet may 
be used for any significant anecdote the teacher 
wishes to record. The record sheets are enclosed 
in a 3-ring loose-leaf notebook cover. This de- 
vice is useful in gaining the cooperation of the 
classroom teacher in the pupil personnel pro- 
gram. It is intended to serve “as one of the tools 
which may be used by persons who are not spe- 


cialists....” 
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Pp. ix, 85. Paper. $1.25. * (PA 21 :2775) 


J Consult Psychol 11:153 My-Je '47. * Psy- 
chologists and educators who need a clear and 
brief summary of the achievements of the armed 
forces in the selection and classification of per- 
sonnel during World War II will find this 
monograph enlightening. Chapter I is a remark- 
ably full and concise account of the procedures 
used during the war, in relatively nontechnical 
language. * Two technical appendices will be 
of great interest to test construction specialists 
and statisticians. 
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*Davis, FnEpERICK B., Eprron. The AAF Qualifying 
Examination. Army Air Forces Aviation Psychology 
Program Research Reports, Report No. 6. Washington, 
D. C.: Government Printing Office, 1947. Pp. xvii, 266. 
$125. * (PA 21:3711) 


J Consult Psychol 11:223 Jl-Ag "47. * de- 
Scribes the successive forms of the examination, 
the principles and techniques underlying its 
construction, and the data concerning the types 
„Of test items used: verbal, information, judg- 
ment and reasoning, mechanical, perceptual, and 
others. 
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Am J Orthopsychiatry 13:558 J1 43. John 
Gray Peatman. * replete with tables and graphs 
* should provide those interested in child devel- 
opment with a wealth of factual information not 
ordinarily available. 

Brit J Ed Psychol 13:109—10 Je ’43. C(yril) 
B(urt). * In the curves for physical develop- 
ment one of the most striking features discov- 
ered was the well-marked spurt appearing at 
what the authors call the ‘maximum growth pe- 
riod.’ It proves to be far more abrupt than has 

hitherto been supposed * In general, it is found 
that the maximum growth period “coincides 
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with the advent of the menarche in girls and of 
pubescence in boys”; and the writers suggest 
that it is probably one of the best available 
indications of impending adolescence. The cor- 
relations between the eight physical measure- 
ments are of great interest. The authors do not 
attempt any formal factor analysis; and content 
themselves with demonstrating that the crude 
proposals to classify individuals into two ‘bod- 
ily types,’ by means of rough criteria such as 
those proposed by Kretschmer and his follow- 
ers (e.g., ratio of height to weight), find little 
justification in actual fact. "The difficulty,” as 
they say, “lies in determining just when one 
type ends and another begins.” Such a difficulty 
can only be overcome by the use of a factorial 
method. I have ventured to factorise their table 
of correlations (Table 64, p. 301, 533 boys, aged 
16). The results....reveal a remarkable agree- 
ment with those obtained from English children. 
(i) They plainly contradict the common as- 
sumption (explicitly adopted by Spearman and 
others) that there is‘no general factor underly- 
ing physical measurements analogous to the 
general factor found with mental tests. (ii) 
They show a clear factorial basis for the distinc- 
tion between the leptosomic (asthenic) and 
brachysomic (pyknic) types. (iii) Further, 
standing height appears to be largely dependent 
on leg-length, and sitting height on trunk- 
length. (iv) Of single measurements, height 
and weight are the most suggestive for assessing 
both the general factor and the Kretschmerian 


'type-factor, though, as Dearborn and Rothney 


have argued on other grounds, they would by 
themselves have a very poor diagnostic value. 
It would be interesting to know whether an ex- 
amination of the annual records would show 
that boys they call D and E, for example (two 
obvious pyknic types), and G (an asthenic 
type), have always exhibited the same type- . 
tendencies as are suggested by the measurements 
taken when they were sixteen. To fit the curve 
of mental growth the authors finally adopted 
the Gompertz formula (suggested by Courtis in 
America and used in this country by Richard- 
son and others). At first sight, as they observe, 
their conclusions differ rather widely from those 
of Thurstone, who based his inferences on anal- 
ysis of Binet results obtained with 3,000 Lon- 
don children (published in Mental and Scho- 
lastic Tests). Dearborn and Rothney, however, 
point out that, if Thurstone had taken as his 
unit the variability of the 8-year-old Londoners, 
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instead of the 3-year-old, his results would have 
been very much the same as theirs, On the basis 
of their curves, they deduce that mental growth, 
so far from stopping short at 15 or 16, must 
continue (though with a rapidly diminishing 
rate) up to about 30 years: "approximately 2 
per cent of mental growth may take place after 
the age of 21." Actually, however, their figures 
could be fitted almost as well by a logistic curve ; 
and this would obviously show a less abrupt rise 
during the earlier years of life and would indi- 
cate a much earlier arrest. * Against the current 
tendency to rely on a simple I.Q. as a final 
measure of ‘intelligence’ they rightly register 
strong protests. As they remark, "in minds of 
many of the makers, intelligence tests were de- 
signed to determine innate differences in intel- 
ligence, and not to be affected by environmental 
influences"; yet, beyond question, “socio-eco- 
nomic conditions do influence intelligence, as 
the commonly used tests of intelligence test it.” 
They suggest that this influence is probably 
strongest during the pre-school period, and may 
be partly neutralised later on by the uniformity 
of school conditions. Thus, between the ages of 
8 and 12 the I.Q.’s of their children have re- 
mained moderately constant; but before school 
age and during adolescence the I.Q. is a highly 
unreliable measure. In its place they propose to 
substitute the percentage of the maximum 
growth reached at the age of testing. The whole 
elaborate investigation has been admirably 
planned, industriously carried out, and most 
carefully reported. Its general outcome has far- 
reaching implications. We now have a clear 
demonstration of what many have for long sus- 
pected, namely, that both physical and mental 
development are highly individual characteris- 
tics, with features peculiar to each particular 
child. This is especially true during adolescence. 
Consequently, the customary methods of predic- 
tion, based on averages or ‘norms’ for succes- 
sive ages, obtained by the usual ‘cross sectional 
method,’ appear liable in individual cases to lead 
both the teacher and the educational psycholo- 
gist very far astray. 

Ed Res B 21:243-4 N 10°42. Horace B. Eng- 
lish. The monumental study of physical and 
mental growth carried on for twelve years by 
the Harvard Psycho-Educational Clinic has 
been expertly analyzed in technical monographs 
by Shuttleworth and others. There is need, 
however, for a semitechnical presentation for 
educators and psychologists not specifically spe- 
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cializing in the field. Such was, apparently, the 
purpose of Dearborn and Rothney—a purpose 
not very well achieved. The book is too heavy 
for most of the intended readers. Yet it must 
not be lightly dismissed. Here we have the rec: 
ord of 1,350 cases studied over twelve years. 
In all, 190,000 physical and 30,000 mental 
measurements are reported, This is a rich mine 
of information. * few new general principles 
have emerged from the Study. Proof has been 
adduced, however, for principles which had 
rested on insecure bases. Most important for 
the future is the wealth of detailed fact which, 
in the fullness of time, will be integrated into a 
richer picture of human development. 

J Appl Psychol 26:716-8 O '42. J. R. Gentry. 
* An interesting feature of..[the Harvard 
Growth Study Curve] is that mental growth 
continues to 30 years of age. * Considerable 
space is given to a discussion of a regression 
equation by which "normal weight for body 
build" can be determined. But its practicability 
is questionable. * The applications of the for- 
mula for developing a growth curve and pre- 
dicting normal weight are important additions 
to the rest of the data (many of which have been 
published in other volumes) of this book. 

J Consult Psychol 6:303 N-D ’42. Harold E. 
Jones. Perhaps the outstanding conclusion of 
Predicting the Child's Development is that it 
can't safely be done. "We have established the 
fact that variability rather than consistency is 
the rule, that prediction except for group aver- 
ages is extremely hazardous (and even this is 
hazardous at the adolescent period)." (P. 346.) 
To those who have read the title of this book as 
carrying a more positive implication, it may be 
frustrating to learn that there are still so many 
unknowns in the growth process. Some consola- 
tion, however, is to be gained from the fact that 
we are beginning to have a better knowledge 
of the factors which limit prediction. One of 
these factors is, of course, the variable age of 
physiological maturity, as brought out $0 clearly 
in the earlier monographs by Shuttleworth. * 
While it cannot substitute for a study of the 
original monographs, particularly those by 
Shuttleworth, the present volume is useful as an 
introduction to the more detailed technical pub- 
lications, and also as providing a survey of 
problems and methods. 

J Ed L(ondon) 75:40-2 Ja’43. C. M. Flem- 
ing. * Occasionally....a report emerges whose 
findings are so far from obvious as to challenge 
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belief. Of such a type is a description of a re- 
cent "growth study" undertaken by investiga- 
tors at Harvard. Its conclusions in many cases 
run counter to the comfortable presuppositions 
of contemporary practice; but the evidence upon 
which the conclusions are based is presented in 
iluminating detail with the reassuring title: 
Predicting the Child's Development. 'The sup- 
porters of selective Special Place examinations 
and the advocates of opportunity "suitable to 
the capacity of the pupil" will find here material 
worthy of the closest study. * The document as 
a whole deserves careful reading by teachers, 
administrators, and research workers ; but men- 
tion can be made here only of the conclusions 
which seem most relevant to the problems which 
confront English workers at the present time. 
Of these the most important seems to be that 
(with respect to every characteristic measured) 
variability rather than consistency in growth is 
the rule, that prediction (except for averages of 
groups) is extremely hazardous, and that com- 
parisons with averages have little value in the 
study of the development of individuals. This 
provides a useful corrective to recent over- 
confident attempts at vocational or educational 
prediction, associated as these often are with 
somewhat facile promises of "equality of edu- 
cational opportunity" in the sense of the pro- 
vision of education "suitable to the ability of 
the child", The disconcerting truth seems to be 
that physical and mental growth is essentially 
individual. Performance in any one test or 
measurement of any single attribute cannot rea- 
sonably be used as a pointer to certain perform- 
ance at a later date. This is related not merely 
to imperfections of the means of measurement 
but also to the fact that growth is characterized 
by cycles. The chronological or physiological age 
at which acceleration or deceleration will occur 
is a function of the life history of the individual. 
* If, however, groups are selected on the basis 
of relative performance at any given age, a cer- 
tain regularity of fluctuation in the growth 
rhythm may be detected. * The composition of 
any group is at the same time subject to fluctua- 
tion, and individuals may make surprising 
changes upward or downward. Each person fol- 
lows a different path in relation to the yearly 
averages of the group as a whole. A determina- 
tion of such averages for physical, mental, emo- 
tional, and scholastic performances (while of 
little value for predictive purposes) will prob- 
ably prove a valuable instrument for use in the 


diagnosis of abnormalities of development— 
provided that it is accompanied by a cumulative 
record of measurement over a period of years. 
If a pupil's growth for several years has fol- 
lowed in general the course which previous 
measures of comparable groups would lead one 
to expect, a sudden change of relative rate of 
growth provides a challenge to the observer 
and indicates the desirability of a careful in- 
vestigation of health, social environment, eco- 
nomic status, emotional condition, or glandular 
balance. 

Loyola Ed Digest 17:10 Mr '42. Austin G. 
Schmidt. * The report is a thoroughly scientific 
one, full of truly significant data on the un- 
reliability of the IO as a measure of mental 
growth and on other kindred subjects of vital 
importance to the educator. 
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Brit J Ed Psychol 12:80 F '42. * a full, 
elaborate, detailed report of research into impor- 
tant questions * will be of service to other re- 
searchers in cognate subjects, and to students of 
child development * 

For additional reviews, see 38:B343 and 
40:B879. 


[ 839 ] 
*DeeMer, WALTER L., Jr, Eprrog. Records, Anal- 
ysis, and Test Procedures. Army Air Forces Avia- 
tion Psychology Program Research Reports, Report 
No. 18. Washington, D. C.: Government Printing 
Office, 1947. Pp. ix, 621. $225. * (PA 22:4142) 

[ 840 ] 
*DESING, Minerva F. The Relation of Pupil 
Achievement Gain to Certain Personal and En- 
vironmental Elements. Philadelphia, Pa.: Univer- 
sity of Pennsylvania, 1940. Pp. xii, 169. Paper. * 

[ 841 ] 
*Drake, CHARLES A. Personnel Selection by 
Standard Job Tests. New York: McGraw-Hill Book 
Co. Inc., 1942. Pp. ix, 147. $2.25. * (London: McGraw- 
Hill Publishing Co., Ltd., 1943. 145.) (PA 17:657) 


Adv Mgmt 8:67-8 Ap-Je ’43. Edward N. 
Hay. * There is some controversial material, 
such as Drake's criticism of the use of co- 
efficients of correlation. He uses the test as the 
criterion, thus establishing the performance of 
which that individual is capable. Then he com- 
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pares the actual performance of each operator 
with the test results to see if the individual is 
working up to his full capacity. This inverts the 
usual procedure in which the test is standard- 
ized against the production performance of the 
subject. The claims for this method will not be 
conceded by most industrial psychologists. 
Drake is sound when he stresses the importance 
of utilizing the phenomenon of individual dif- 
ferences, and points to the common situation in 
which the best operators produce twice as much 
as the poorest ones. He adds that it is high 
time for industry to develop methods of identi- 
fying these superior individuals. * The author 
urges close association between time and motion 
study and the use of tests, and argues with some 
reason that the individual who is basically 
trained in time and motion study is better 
equipped to devise these special aptitude tests 
than is an academic psychologist. In his warm 
advocacy of tests modeled after the job he fails 
to warn the reader against tests too much like 
the actual operation itself. In some cases of that 
kind an experienced operator may have bene- 
fited so much from practice on the job that he 
will do better on the test than one who has had 
no experience on the job. A test of this kind, 
therefore, will be useful only in testing inex- 
perienced applicants or experienced ones, but 
not both. Drake makes some interesting sugges- 
tions regarding the development of tests for 
inspection. He also restates his hypothesis re- 
garding accident-proneness, which he defines 
as a condition in which an individual has a 
lower perceptual level than motor response level. 
There is as yet insufficient substantiation of this 
hypothesis. Another interesting hypothesis is 
that social intelligence is a faculty of perception. 
He thinks the poor record of so-called sales 
tests is that they are all expressed verbally, 
whereas sales situations involve varied and 
subtle relationships between individuals. The 
ability to sell, he thinks, depends on a special- 
ized kind of human perception. This book will 
be valuable to anyone concerned with industrial 
tests, particularly those engaged in selecting 
operators for manual jobs. 

J Consult Psychol 7:245-6 S-O '43. C. L. 
Shartle. * presented as a "guide for practice." 
As such it appears to apply to the initial steps 
of analyzing the job, and constructing the tests, 
and to the design of machines which will make 
possible the efficient use of operators' abilities. 
Photographs and descriptions of a number of 


tests are presented, some of which are quite 
similar to tests previously described in the liter- 
ature. The author's procedure is usually con- 
sidered by others as a preliminary step in test 
development, but he regards it as the final 
thing. * The author recommends the work- 
sample type of test which is designed to meas- 
ure the basic abilities required by the job. He 
presents his technique as a new one, although 
other investigators such as Viteles have previ- 
ously constructed and reported tests of this type. 
He further ignores the literature when he states : 
“Tt is now well established that tests of the 
pencil-and-paper variety have very limited util- 
ity in selecting operating personnel for indus- 
try." Whereas some of his criticisms of various 
criteria of occupational proficiency are no doubt 
cogent, the shortcomings are not sufficiently 
great to justify discarding criteria entirely. (P. 
14.) He opposes the checking of his hypothe- 
ses by accepted scientific procedures, but pre- 
fers his own judgment to determine the ade- 
quacy of his tests. Instead of determining the 
validity of his tests by checking his test results 
against occupational proficiency measures, he 
proposes an "inversion of the usual point of 
view": “It is proposed, therefore, that the test 
results be used as the criterion against which 
the efficiency of the productive process may be 
measured.” (P. 15.) Whether the author actu- 
ally holds this point of view is not clear, since 
he later compares rank on Machine Operator 
Test with rating by foreman, and reports: 
“Thus, this test correlates with foreman’s rat- 
ings .40, which is not impressive. In spite of 
this low figure, a study of the individuals at the 
high end of the classification indicates that the 
test is doing very well what it is intended to do, 
that is to select the applicants who will have a 
high assurance of success on the job.” (P. 72.) 
His implication that a correlation of .40 is of 
little value is apparently contradicted by the 
author’s finding “that the test is doing very 
well what it is intended to do.” Earlier in the 
book (p. 12) he also maintains that a coeffi- 
cient of correlation between +.30 and +.50 is 
low and “in the customary procedure, the size 
of the correlation coefficient between the final 
battery of selected tests and the sample of em- 
ployees used as a criterion was usually not im- 
pressive, often ranging from only .30 to .50.” 
Tt should be pointed out that many investigators 
have found tests which correlate .40 to .50 with 
industrial proficiency, and have found them use- 
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ful in differentiating between successful and un- 
successful workers, just as the author does in 
the case of his Machine Operator Test. Techni- 
cians in test construction will be interested in 
the photographs and descriptions of test ap- 
paratus. The layman who reads the book may 
become oversold on the use of tests. 

. Occupations 21:575 Mr '43. Patricia Lee 
Scharf. This work presages new emphases in 
the selection of employees for industrial plants. 
The foremost of these are the new discoveries 
in the field of perception and the importance 
of determining the dexterities necessary for in- 
dividual jobs. * will be invaluable to those 
about to enter the field of business psychology 
for its strict subordination of theoretical ex- 
perimental work to the viewpoint of the profit 
motive in business. To undertake his technique 
of constructing tests, Mr. Drake says that the 
personnel worker “must have a working knowl- 
edge of time and motion study, of machine de- 
sign, of industrial psychology of the conven- 
tional sort, of human anatomy and physiology, 
and of elementary statistics." Although many 
personnel workers possess two of these, I think 
those possessing all are as rare as his good dual- 
hand operators. To label those who do not have 
all of this background as inadequately trained 
does not solve the problem. If this technique 
is to be feasible for smaller plants, it will be 
necessary to' market many slightly differing 
tests which can be easily adapted. 

Personnel 19:630-40 Ja ' 43. R. S. Uhrbrock. 
"This book is a mixture of special pleading for 

- what is described as “a new technique for the 
selection of industrial personnel," and descrip- 
tions of many ingenious performance tests. It is 
a disturbing book—chiefly because it is based 
upon a reversal of orthodox assumptions that 
have been accepted when considering tests as 
aids to personnel selection. For reasons that 
appear sufficient to him, Dr. Drake rejects fore- 
men's ratings, production records, earnings, 
percentage efficiencies, and quality estimates as 
criteria for deciding whether or not a test is 
any good. In italits, he states his thesis: “It is 
proposed, therefore, that the test results be used 
as the criterion against which the efficiency of 
the productive process may be measured" * At 
their present stage of development, can tests 
assume the burden of labeling men with the de- 
gree of finality that is implied in this statement? 
Dr. Drake’s hypothesis is simple and intriguing, 
although his methods are not entirely new. * 


Little enthusiasm developed when Dr. Drake 
discussed his procedures before the American 
Psychological Association in 1940. He now 
“believes that greater progress in the continued 
development of this technique will be made if 
it is adopted as part of the standard curriculum 
for the training of industrial engineers than if 
it is left to persons whose training is primarily 
psychological and whose interests are largely 
clinical.” He may be right. Certainly there is a 
fertile field for the development of performance 
and manipulative tests, such as he has used in 
his experiments. Unquestionably, they will re- 
ceive wide acceptance if their predictive value 
is high. * When engineers design and construct 
a new machine, they subject it to rigid tests to 
make sure that it will perform according to 
specifications. If they become interested in 
Drake’s “new technique,” it is to be hoped that 
they will follow the same procedure and test the 
tests through controlled experiments, instead of 
accepting them as valid measures of important 
behavior characteristics merely because they 
seem to conform to some undefined standard of 
common sense. In the field of personnel selec- 
tion they should follow F. W. Taylor’s dictum 
and “question everything.” 


[ 842 ] 
*Draxe, Frances S. Manual of Employment In- 
terviewing. Research Report, No. 9. New York: 
American Management Association, 1946. Pp. 75. 
Paper. $2.25. * 


Personnel Adm 9:45 N 46. James W. Eike. 
* primarily intended for the use of industrial 
and commercial companies and executives * 
contains much that will be of value to anyone 
in a personnel office or in an operating position 
whose work involves selection for employment, 
whether his role be the preliminary interview 
or the final selection. The various factors in 
finding the right man for the job are dealt with 
in logical order, and annotated throughout with 
bibliographical references for more extensive 
reading. The author states that “this manual 
has been prepared in response to the requests 
of the American Management Association mem- 
bers and others for a concise exposition of 
working principles and techniques of employ- 
ment interviewing.” In my opinion this goal has 
been attained. * a compact fund of sound ideas 
on such points as the nature of the interview, 
the objectives, pitfalls, types of interviews, 
physical setting, techniques, evaluation, related 
selection procedures, etc. I was impressed not 
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only by the excellent presentation of the impor- 
tant phases of the subject, but by the indica- 
tions that the selection of personnel in private 
industry is, in many instances at least, done 
with greater care, more scientifically, and is 
more thoroughly planned than in many Federal 
offices. It appears that much remains for us to 
achieve in closer working relations among rep- 
resentatives of placement sections, personnel 
relations staff, and operating officials before we 
can be sure that we are finding the right man 
for the job. Our efforts seem somewhat lacking 
in coordination and thoroughness when we com- 
pare them with those attributed to large indus- 
trial concerns. Whether you feel that all of the 
practices described in this manual are neces- 
sary, you will certainly find yourself engaging 
in a healthy appraisal of your own performance 
and that of your particular agency. For that 
reason the “Manual of Employment Interview- 
ing" is recommended. 


[ 843 ] 
*Dnaxz, RALEIGH M. Work Book in Measurements 
of Human Behavior. For use with Edward B. 
Greene's Measurements of Human Behavior. Kent, 
Ohio: the Author, Kent State University, 1945. Pp. ii, 
38. Paper, mimeographed. $0.75. * (PA 19:2390) 

[ 844 ] 
*Drever, JAMES, AND Cortins, Mary. Performance 
Tests of Intelligence: A Series of Non-Linguistic 
Tests for Deaf and Normal Children, Third Edi- 
tion. Edinburgh, Scotland: Oliver and Boyd Ltd., 
1946. Pp. 59. 5s. * 

[ 845 ] 
*Driscott, Gertrupe, How to Study the Behavior 
of Children. Practical Suggestions for Teachers, No. 
2. Edited by Hollis L. Caswell. New York: Bureau of 
Publications, Teachers College, Columbia University, 
1941. Pp. ix, 84. Paper. $0.60. * (PA 15 14833) 


Ed Res B 22:140 My 12 '43. Horace B. 
English. This booklet is intended to sharpen 
the teacher's direct observation of children. It 
calls attention to the manifold opportunities for 
observation, helps one to see the varied signifi- 
cance of children's actions. The best review I 
can make of this little book is to urge every 
elementary-school teacher to buy and study it. 
It is practically guaranteed to increase her un- 
derstanding of children. ) 

Loyola Ed Digest 17:9 Ja "42. Austin G. 
Schmidt. The title might suggest that this is a 
treatise on research technics, but as a matter of 
fact it is a simple discussion of what a teacher can 
do in his own classroom to learn more about chil- 
dren and to use his knowledge for their greater 
good. 


Progressive Ed 18:401-2 D "41. Hubert C. 
Armstrong. * Chapter II is....the most valuable 
section of the book. It deals with how and what 
to observe. The methodology recommended 
points out the sources of most errors in judg- 
ment, namely : inaccurate observation, observa- 
tion confused with interpretation, observation 
of a situation or of behavior per se rather than 
the response of a child in a situation, isolated 
or non-typical examples, and lack of a develop- 
mental or sequential observation. Assistance in 
interpretation is given in the sense that all be- 
havior is viewed as natural phenomena due to 
natural causes. Under the heading “Clues For,” 
are sections covering these topics: physical de- 
velopmental age, intellectual development, social 
development, emotional development, personal- 
social relations, aggressive behavior, withdraw- 
ing behavior, the significance of teasing, the 
response of interfering or thwarting, and the 
relationship of a child to a teacher. Under these 
headings are interesting and usable suggestions 
for observation and also ideas as to meanings 
and significances that will materially aid the 
teacher in understanding what she observes in 
a child. * The book, as a whole, does not talk 
down to teachers. Neither does it presume that 
teachers do not face, as one of the major prob- 
lems of teachers, the skillful management of the 
children or good classroom discipline. The 
point of view is not dogmatic; it is not ‘selling’ 
a belief. It is aimed to be used. It is short and 
readable. Its liabilities consist in omissions 
rather than commissions. Too little is said 
about the signs and significance of vision, hear- 
ing, speech, general health, chronic disabilities 
and such difficulties as are frequently found in 
acquiring abilities with words and numbers. 


[ 846 ] 
*DuBors, Anson M. Three Specialized Projects 
in Measuring Accounting Aptitude. Human Engi- 
neering Laboratory, Inc., Technical Report No. 147. 
Boston, Mass.: the Laboratory, June 1945. Pp. iv, 76. 
Paper, mimeographed. * 


[847 ] 
XDuBor, Pume H., Eprrom The Classification 
Program. Army Air Forces Aviation. Psychology 
Program Research Reports, Report No. 2. Washing- 
ton, D. C.: Government Printing Office, 1947. Pp. xiv, 
394. $1.50. * 

du 1 Stud 
*DunKERLEY, Mary DOROTHEA, A Statistical Study 
or Leaderstip Among College Women. Catholic 
University of America, Studies in Psychology and 
Psychiatry, Vol. 4, No. 7. Washington, D. C.: Catholic 
University of America Press, 1940. Pp. vii, 65. Paper. 
$125. * (PA 14:5096) 
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[ 849 ] 
XEaroN, Merritt T. Survey of Abilities of High 
School Seniors in Twenty-One Counties of In- 
diana. State of Indiana, Department of Public In- 
struction, Research Bulletin No. 2. Indianapolis, Ind.: 
the Department, September 1942. Pp. vi, 48. Paper. 
Out of print. * (PA 17:3244) 

[ 850 ] 


*kEaton, Merrit T. A Survey of the Language 
Arts Achievement of Sixth Grade Children in 18 
Counties and Six Cities in Indiana. State of In- 
diana, Department of Public Instruction, Research 
Bulletin No. 3. Indianapolis, Ind.: the Department, 
1942. Pp. vii, 75. Paper. Out of print. * (PA 17:3245) 


[851 ] 

KEBERT, ELIZABETH, AND SIMMONS, KATHERINE. The 
Brush Foundation Study of Child Growth and 
Development: I, Psychometric Tests. Mono- 
graphs of the Society for Research in Child Develop- 
ment, Vol. 8, No. 2, Serial No. 35. Washington, D. C.: 
the Society, National Research Council, 1943. Pp. xiv, 
113. Paper, lithotyped. $1.50. * (PA 18:3322) 


Am J Orthopsychiatry 15:183 Ja '45. Hen- 
riette Woolf. * The material..is thoroughly 
reported statistically with elaborate tables and 
graphs. These are analyzed carefully but the 
data are so detailed that even the conclusions do 
not summarize the results clearly and concisely. 
Research workers using similar techniques or 
these same tests will be interested in the study, 
but it is not well written or explained for any 
but the most statistically interested interpreter 
of psychological test results, 

Am Sociol R ro:117-8 F '45. Mapheus 
Smith. * Conclusions of most interest to soci- 
ologists are (a) that boys show greater variabil- 
ity in scores, (b) that there is no significant sex 
difference in mean scores over the entire period 
of study, (c) that IO gains with increasing age 
are the rule rather than the exception, and (d) 
that level of parental education is closely related 
to intelligence of children. These findings, like 
most of the others reported in the monograph, 
do not constitute original discoveries but pro- 
vide support for previous investigations. * 
Statisticians will be surprised: at several inade- 
quacies in the data. The children entered the 
study at widely varying ages, and the records 
were obtained for widely different lengths of 
time. Furthermore, no one group of children 
was tested and no one test was administered 
throughout this entire age range (p. xiv). The 
sample was also very heavily weighted by the 
presence of siblings, since a number of families 
of children were accepted as subjects. How- 
ever, it is probable that the statistical meth- 
ods used in analyzing the data compensate 


for these defects and insure generally depend- 
able results. Sociologists will be disappointed 
at the absence of any attempt by the authors to 
indicate the general meaning of their work for 
human behavior or for students of social psy- 
chology. There is no general summary of inter- 
pretation of the analysis nor the delineation of 
a frame of reference into which their work or 
the findings fit. The result is merely a group 
of separate essays, substantially meaningless to 
those who are not specialists in psychometric 
tests. It is to be hoped that later portions of the 
complete Brush Foundation Study will provide 
better orientation. Sociologists will also feel that 
an excellent opportunity has been lost to relate 
psychological tests to other sorts of data, such 
as socio-economic scales, or other sociologically 
significant facts. In this respect, the original 
purpose of the organization of the Society for 
Research in Child Development, which was to 
co-ordinate the efforts of all kinds of specialists 
interested in research on child development, 
seems to have been forgotten. 

Brit J Psychol 35:49 Ja "45. The two main 
studies of this monograph are of the variation 
in LQ. on the 1916 and 1937 Stanford Binet 
tests and the Otis S-A test at various ages from 
3 to 15 years; and on the Merrill-Palmer scale 
for pre-school children and the Kent-Shakow 
Form Board series for children aged 5 to 14. 
* Additional studies included in the monograph 
are of the prediction of scholastic achievement 
by various intelligence tests given at the same 
and earlier ages; of the Minnesota Mechanical 
Ability and Porteus Maze tests, which were not 
found to be very satisfactory ; and of sibling re- 
semblances in the various intelligence tests. 

Ed Res B 24:26-7 Ja 17 '45. Francis N. 
Maxfield. * a prodigious quantity of psycho- 
metric data * With such extensive research data 
before them, the authors must have exercised 
great restraint in keeping their presentation of 
results so objective. Readers interested in prob- 
lems of constancy of the intelligence quotient, 
prediction in general, or “mental growth” (sic), 
and the like will be tempted to speculation, for 
this research raises many questions. For exam- 
ple, one may ask, why does the formula 

y = LII + 27/1200, 
where y is the mean mental-age score and x is 
the chronological age of a group, predict ap- 
proximately the Brush Foundation means for 
mental ages on the 1916 Stanford-Binet for 
age groups three to nine inclusive? 
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J Am Med Assn 125:1161 Ag 19 44. * This 
important study....is of interest to the research 
worker in child development; one will await 
with interest clinical elaboration in connection 
with the later studies of the series. 

J Nerv & Mental Dis 104:342 S '46. S. 
Brecher. * the first of a series of reports sum- 
marizing the findings of the Brush Foundation 
Study of the growth and development of more 
than 800 Cleveland children of similar back- 
grounds. The reports are based upon the test 
scores of all ages from two to 15 years. How- 
ever, no one group of children was tested and 
no one test was administered throughout the 
entire age range. The material is treated in 
five sections and tables are liberally used to 
supplement the data. The first section deals 
with intelligence test constancy and the com- 
parability of the 1916 and 1937 revisions of the 
Stanford-Binet Scales and the Otis Self-Admin- 
istering Tests. * The second section deals with 
performance tests and data on the constancy of 
these scores, their inter-relationships and rela- 
tionships with verbal intelligence. * 

Q R Biol 20:105 Mr '45. Theta Wolf. * The 
work has been painstaking, with hardly a detail 
unturned that might yield a possible relation- 
ship. * In light of the recent interest in using 
terminal status as one of the criteria for the 
selection of test items, the data are interesting 
that show correlations between previous test 
scores (Binet and performance) and standing 
on Progressive Achievement batteries in the 
junior and senior high school grades. The work 
of these investigators in straightforward report- 
ing of statistical relationships is valuable in this 
period of a renewed critique of mental tests. It 
is unfortunate, however, that they failed to use 
the concentrated insight that this large study 
must have given them, either to put their own 
results into a clearer perspective or to project 
with vision the problems for the future raised 
by their own work and that of others! 

Sch R 52:504-5 O '44. Charles D. Flory. * 
The authors explain fully the limitations of their 
data, including changes in methods, changes in 
sampling, changes in tests, and a combination of 
cross-sectional and longitudinal data. The facts 
have been presented with accuracy and in detail. 
The caution exercised, however, in interpreting 
data gives the impression of overcautiousness 1n 
generalizing and highlighting the implications of 
the results. It is difficult for the reader to maintain 
sufficient interest to follow the detailed statistics 


and, in some instances, the repetitious presenta- 
tion. The difficulties encountered in integrating 
the work of separate aspects of a research pro- 
gram are evident in this monograph. A more 
thoroughly integrated and cohesive style would 
have made the study more readable. There is a 
lack of consistency in presenting tables and fig- 
ures. The tabular data are not always clear to 
the reader without close comparison of text and 
table. The text is well supported by tabular and 
graphical evidence, there being a total of 70 
tables and 19 figures in 113 pages. The salient 
findings in each section are not readily acces- 
sible, since there are inconsistencies in the 
methods of summarizing the results. No at- 
tempt has been made to bring together the 
over-all conclusions from the psychometric data 
assembled by the Brush Foundation. The re- 
sults reported in this study will be of value to 
the research worker interested in psychometric 
measurement. The findings are, in most in- 
stances, consistent with the general trends in 
the published studies of the past decade that 
have been made from similar data. The authors 
have made a few direct comparisons with other 
studies but have not attempted to compare their 
findings with research trends in the literature. 
They have stated that their objective is to pre- 
sent facts from the Brush data without generali- 
zation, and they have stayed by this purpose. 
This monograph should be of special interest 
to the research workers who have had some 
contact with the work of the Brush Foundation 
and of general interest to all students of child 
development. This initial report based on Cleve- 
land children has made a commendable start in 
disseminating facts for general use. 
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EpucationaL Recorps BUREAU. Fourth Report of 
the Committee on School and College Relations 
of the Educational Records Bureau. New York: the 
Bureau, February 1943. Pp. v, 55- Paper. $0.25. * 
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*EpucationAL Recorps BUREAU. 1940 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 30. New York: the Bureau, June 1940. Pp. 
xii, 75. Paper, lithotyped. $1.50. * (PA 15:3579) 
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xi monar Recorps BumkAU. 1941 Achievement 
Pathe Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 33. New York: the Bureau, June 1941. Pp. 
x, 76. Paper, lithotyped. $1.50. * (PA 15:4388) 
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*xEpucaTioNaL Recorps Bureau. 1942 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 36. New York: the Bureau, June 1942. Pp. 
xiii, 59. Paper, lithotyped. $1.50. * 


[ 856 ] 
*kEDUCATIONAL Recorps Bureau. 1943 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 38. New York: the Bureau, June 1943. Pp. 
xiii, 53. Paper, lithotyped. $1.50. * 


[857 ] 
*xEpucaTIonaL Recorps Bureau. 1944 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 40. New York: the Bureau, June 1944. Pp. 
xii, 58. Paper, lithotyped. $1.50. * (PA 18:3898-9, 
3907) 

[ 858 ] 
*EpucarioNAL Recorps Bureau. 1945 Achievement 
Testing Program in Independent Schools and 
Supplementary Studies. Educational Records Bul- 
letin, No. 43. New York: the Bureau, June 1945. Pp. 
B Paper, lithotyped. $1.50. * (PA 19:3512, 3514, 
35r 


[ 859 ] 
*Epucationat Recorps Bureau, 1946 Achievement 
Testing Program in Independent Schools and 
Supplement Studies. Educational Records Bul- 
letin, No. 45. New York: the Bureau, June 1946. Pp. 
x, 63. Paper, lithotyped. $1.50. * (PA 20:4353-4, 4356) 


[ 860 ] 
xEpucationat Recors Bureau. 1947 Achievement 
Testing Program in Independent Schools and 
Supplementa Studies. Educational Records Bul- 
letin, No. 48. lew York: the Bureau, June 1947. Pp. 
xii, 66. Paper, lithotyped. $2.00. * (PA 22 :463-4, 449) 


[ 861 ] 
XEDUCATIONAL Recorps Bureau. 1940 Fall Testing 
Program in Independent Schools and Supple- 
KELA Ga poe Records sae No. 
3r. New York: the Bureau, January 1941. Pp. xi, 41. 
Paper, lithotyped. $1.00. * (PA 15:3580) 3 k 


[ 862 ] 
XEDUCATIONAL Recorps Bureau, 1941 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Educational Records Bulletin, No. 
35. New York: the Bureau, January 1942. Pp. xiv, 57. 
Paper. $1.00. * (PA 16:2055) 


[ 863 ] 
XEpUCATIONAL Recorps Bureav. 1942 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Educational Records Bulletin, No. 
E New York: the Bureau, January 1943. Pp. xii, 50. 
aper, lithotyped. $1.00. * 
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xEpucationat Recorps Bureau. 1944 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Educational Records Bulletin, No. 
42. New York: the Bureau, January 1945. Pp. x, 49. 
Paper, lithotyped. $1.00. * (PA 19:1811-3) 
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xEpucationaL Recorps Bureau. 1945 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Educational Records Bulletin, No. 
44. New York: the Bureau, January 1946. Pp. xiii, 66. 
Paper, lithotyped. $1.00. * (PA 20:2105-7, 2112-3) 

[ 867 ] 
XEpucATIONAL Recorps Bureau. 1946 Fall Testing 
Program in Independent Schools and Supple- 
mentary Studies. Educational Records Bulletin, No. 
47. New York: the Bureau, February 1947. Pp. x, 58. 
Paper, lithotyped. $1.50. * (PA 21 :2484, 2497-8) 

[ 868 ] 
X*ErxiscH, Paura. Children's Drawings in a Pro- 
jective Technique. American Psychological Associa- 
tion, Psychological Monographs, Vol. 58, No. 1, Whole 
No. 266. Washington, D. C.: the Association, Inc., 1945. 
Pp. v, 31. Paper. $1.00. * (PA 19:3526) 

[ 869 ] 
*E us, ALBERT, A Comparison of the Use of Direct 
and Indirect Phrasing in Personality Question- 
naires. American Psychological Association, Psycho- 
logical MORSU Vol. 61, No. 3, Whole No. 284. 

ashington, D, C.: the Association, Inc., 1947. Pp. iv, 

41. Paper. $1.00, * 

[ 870 ] 
*Emmett, W. G. An Inquiry Into the Prediction 
of Secondary-School Success. Preface by Godfrey 
H. Thomson. London: University of London Press, 
Ltd., 1942. Pp. 58. Paper. 2s. 3d. * (Toronto, Canada: 
Clarke, Irwin & Co., Ltd. $0.45.) (PA 17 :2882) 


Brit J Ed Psychol 13:53 F ’43. Ll. W (ynn) 
J(ones). The subjects of the inquiry were 765 
children who had remained for three years in 
certain West Riding secondary schools. The 
LQ., English score, and arithmetic score of 
each child at the entrance examination, together 
with the head teacher's assessments at the end 
of the second and third year in the secondary 
school, were the variables, which have been 
treated with the masterly skill and competence 
expected from a Moray House investigator. * 
The booklet will prove valuable for research 
workers, but we must warn readers that only 
those conversant with matrix algebra can hope 
to understand the calculations in the three ap- 
pendices. A generally accepted meaning of “suc- 
cess in secondary school life" will never be 
obtained, and even if we agree on the way to as- 
sess the intellectual aspect of it in the secondary 
School there still remains the task of devising 
entrance tests, A common method is to add up 
English, arithmetic and intelligence marks. The 
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reviewer has suggested another (this Journal, 
Vol. VI, Part III, November 1936, p. 312). 
Whatever method is adopted there will still be 
scope for techniques such as that so successfully 
carried out by Mr, Emmett. 

J Ed (London) 76:38-42 Ja '44. C. Birche- 
nough. * an important investigation dealing 
with some 765 children to ascertain how far the 
Special Place Examination in the West Rid- 
ing, consisting of papers in English, arithmetic, 
and a group intelligence test, predicts the per- 
formance of children two and three years later. 
Its appearance is most opportune, for it pro- 
vides a fitting complement to the larger Scottish 
Report. Thus it finds the group intelligence test 
has approximately equal predictive value to 
that of the English and arithmetic papers com- 
bined. * 

Occupational Psychol 16:199-200 O "42. 
C(harles) S. M(yers). * an admirable account 
of an important enquiry. * 

Psychometrika 7:297 D '42. Max D. Engel- 
hart. * The research reported by Emmett is 
unique in that techniques are applied which 
yield estimates of the regression coefficients, 
their standard errors, and the correlations which 
would be obtained had selection not operated. 
Use was made of Aitken's matrix formulation 
of Karl Pearson's selection equations. The pro- 
cedure is illustrated for three independent vari- 
ables—intelligence quotient, English score, and 
arithmetic score. Appendix III of the mono- 
graph shows how Aitken’s method of pivotal 
condensation can be used when extending the 
procedure to more than three independent vari- 
ables. This scholarly little monograph should be 
required reading for all persons engaged in at- 
tempts to use regression equations in the pre- 
diction of academic or vocational success. 


[871] 
XExcLIsH, Horace B., AND RAIMY, VICTOR. Studying 
the Individual School Child: A Manual of Guid- 
ance. Foreword by Daniel A. Prescott. New York: 
Henry Holt and Co., 1941. Pp. vii, 131. Paper. $1.00. * 
(PA 16:1253) 


Am J Psychol 56:155-6 Ja '43. Rose G. An- 
derson. * "The manual has developed out of 
several years' experimentation in a course for 
pre-service teachers. It bears throughout the 
stamp of use checked against and revised by 
experience. The common pitfalls and misunder- 
standings attendant upon the efforts of the 
novice are set forth and offset, so far as this 
can be accomplished by verbal admonition and 


direction. * The study proposed is somewhat 
ambitious in recommending both intelligence 
and achievement tests at this stage of training. 
Discussion should result, however, in the cau- 
tion necessary to the use and interpretation of 
results. The scope of the study, too, should lead 
to the desired reserve in judgments of the chil- 
dren until adequate information is available. 
The chapter on character is the least adequate. 
This criticism is borne out by the material re- 
ported in the student's sample case-history under 
this heading, most of which is relevant rather 
to personality-characteristics or to intellectual 
habits and attitudes. It seems preferable to re- 
serve 'character' for behavior having moral im- 
plications, and to include such behavior under 
the broader heading of 'behavior-patterns' or 
‘personality-characteristics.’ It is not clear how 
much the student learns about the individuality 
of the child observed. The emphasis throughout 
is on his ‘normality.’ Unless the student learns 
through experience something of the similari- 
ties and differences of this child in comparison 
with others under observation, the reviewer 
believes that one important phase of ‘learning’ 
is sacrificed. This opportunity might come from 
class reports and discussions of different chil- 
dren studied. It is not clear whether this pro- 
cedure is an integral part of the course. The 
reviewer agrees with Prescott's foreword that 
the manual and the related course are steps 
in a needed direction, that of providing first- 
hand experiences to supplement ‘read-and-talk’ 
courses. The subtitle, A Manual of Guidance, is 
misleading, however, in that it implies pupil- 
guidance rather than teacher-guidance. 

J Ed Psychol 33:152-4 F "42. Stanley G. 
Dulsky. * The table of contents is unusually 
detailed to facilitate rapid reference. * The art 
of observing and its pitfalls are thoroughly dis- 
cussed and the distinction between fact and 
interpretation is sharply pointed. Answers will 
be found to any questions a student may ask, 
from making the case-history to how to write 
the final report. The weakness of the manual is 
its failure to consider more fully the less ap- 
parent, but vitally important, inter-familial rela- 
tionships. These are less objective than test 
scores but just as, if not more, meaningful in 
understanding the "whole child." It is difficult 
to obtain data about a child's relationships to 
his parents and siblings and even more difficult 
to interpret them, but that is no reason for giv- 
ing less attention to them. Perhaps, in this area 
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the students will need help in learning what to 
observe despite the authors' emphasis on how 
to observe. They should be sensitized to psy- 
chological subtleties. The “Sample Case Study” 
in the Appendix is a beautiful illustration of 
how hollow and sterile a study of a child can 
.be when'such vital data are ignored or receive 
too little attention. This manual will undoubt- 
edly be welcomed by those responsible for 
teacher-training. Educators realize that a “good” 
teacher must understand the child as well as 
know how to teach him. Giving the prospective 
teacher a broad, vital understanding of the child 
poses a difficult problem for pedagogy—the 
authors have considerably aided its solution. 

J Ed Res 36:636-7 Ap '43. Carter V. Good. 
* Methods of gathering data are outlined briefly, 
including diary observations, time samples, 
rating instruments, interviews, and analysis of 
documents. A series of chapters deals with in- 
tellectual, physical, social, and character de- 
velopment, and with preparation of the final 
report. The appendix includes a group of in- 
struments for studying the individual child and 
an illustrative case study. An index might well 
have been prepared, an aid especially important 
in a manual. The style is simple and direct, 
with many of the suggestions addressed to the 
reader in the second person. The usefulness of 
this guide extends beyond students of child psy- 
chology and teachers to others who may wish a 
brief outline of the case method as applied to 
school pupils. 

Loyola: Ed Digest 17:11 Mr "42. Austin G. 
Schmidt. * the purpose of the present manual 
is to instruct the teachers in making case stud- 
les of the whole child, with special emphasis on 
the psychological aspects * While we fear that 
few teachers will have the time and inclination 
to undertake studies such as the one suggested 
here, the book, containing as it does full direc- 
tions and numerous rating scales, will prove of 
much help to those who are more happily cir- 
cumstanced. 


[ 872 ] 
*Essex [Enctanv] Epucation Committee. Report 
of a Subcommittee on the Selection of Children 
for Secondary Education. Chelmsford, Essex, Eng- 
land: Essex Education Committee, November 1944. 
Pp. 40. Paper. Gratis. * 


[ 873 ] 
*xEysencx, H. J. Dimensions of Personality. A rec- 
ord of research carried out in collaboration with H. T. 
Himmelweit and W. Linford Rees with the help of 
M. Desai, W. D. Furneaux, H. Halstead, O. Marum, 


M. McKinlay, A. Petrie, and P. M. Yap. Foreword by 
Aubrey Lewis. London: Kegan Paul, Trench, Trubner 
& Co. Ltd., 1947. Pp. xi, 308. 25s. * (PA 22 :210) 


Occupational Psychol 21:147-8 Jl '47. P. E. 
Vernon. * presents the results of a highly fruit- 
ful collaboration between psychologists and psy- 
chiatrists at the Mill Hill Emergency Hospital 
for neurotic patients. Another unusual feature 
is the author's integration of the factor analysis 
and direct experimental approaches. Most psy- 
chologists would nowadays agree that factor 
analysis alone cannot establish fundamental or 
primary psychological traits (genotypes), and 
that additional experimental evidence is re- 
quired showing that factorial entities are selec- 
tively affected by certain stimuli. This book can 
claim then to constitute a very real advance in 
the scientific analysis of personality in that it 
first establishes two important dimensions, and 
then by a series of tests and experiments dem- 
onstrates that they can be measured objectively 
and that they have measurable effects upon a 
great variety of psychological and physiological 
functions. The first dimension is ‘general neu- 
roticism’ which reveals itself not only in psy- 
chiatric diagnoses and in selí-ratings of the 
psycho-neurotic inventory type, but also in high 
suggestibility, poor dark adaptation, abnormal 
responses to the multi-choice group Rorschach 
test, low persistence, low fluency, slow personal 
tempo, high colour/form ratio, etc. The second 
dimension runs from hysterical personality on 
the one hand to dysthymic personality—typical 
of anxiety or obsessional neuroses—on the 
other. Eysenck believes this to be identical with 
Jung's extraversion-introversion, but points out 
that it is not the same as the conventional so- 
cial vs. intellectual dichotomy (lack of socia- 
bility goes rather with neuroticism), nor is it 
the same as Kretschmer's cycloid vs. schizoid 
typology. It is related to body build (hysterics 
tend to belong to the pyknic or 'eurymorph' 
type), and to certain autonomic functions such 
as high salivary secretion, good exercise toler- 
ance and low choline esterase secretion. Hys- 
terics also tend to show lower intelligence than 
dysthymics, particularly on vocabulary tests; 
they emphasize speed rather than accuracy, 
have low levels of aspiration and overrate past 
performance; they show distinctive apprecia- 
tion of visual art and of humour, and produce 
distinctive mosaic designs and handwritings. 
They do not, however, differ in some of the 
characteristics alleged to correlate with extraver- 
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sion-introversion such as suggestibility, per- 
severation, personal tempo, colour/form ratio, 
or oscillation with reversible perspective figures. 
An obvious criticism of Eysenck's theories and 
results is that they are based primarily on neu- 
rotic patients. The book might have been better 
called Dimensions of the Neurotic Personality. 
Nevertheless he is frequently able to cite con- 
firmatory evidence from the literature of experi- 
ments on normals, and he has already taken the 
first steps towards constructing batteries of tests 
for measuring these two dimensions, which 
could be applied to, and used for carrying out 
similar experiments on, normals. When these 
become available there is no doubt that they 
will have most interesting vocational applica- 
tions. This review touches only on the high- 
lights of a book which is packed with points of 
interest, many of them decidedly controversial. 
The amount of research which Eysenck and his 
collaborators have accomplished in about three 
years is quite astonishing. Over three dozen 
separate, yet closely integrated, experiments are 
described, often involving very large groups of 
subjects, and all showing a high level of tech- 
nical competence and an exhaustive knowledge 
of the literature (the bibliography includes over 
700 references). Moreover the author seems to 
have a flair for planning the right experiment, 
which yields just the results needed to prove or 
disprove some important point of theory. The 
cost of the book is unfortunately high, but its 
printing and layout, under present circum- 
stances, are excellent. 


[874] 
XFram, RrcHARD A., AND JorpAN, Byron. Employee 
Evaluation Manual for Interviewers. New York: 
Psychological Corporation, 1943. Pp. 39. Paper, spiral- 
bound. $1.75, postpaid. * (PA 19:222) For reviews, 
see 686, 

[ 875 ] 
*Fercuson, Georce A. The Reliability of Mental 
Tests. London: University of London Press, Ltd., 
1041. Pp. x, 150. 7s. 6d. * (PA 16:4201) 


Brit J Ed Psychol 12:136 Je ’42. S. J. F. 
P(hilpott). The first part is theoretical. The 
second part is more experimental, dealing with 
the reliability and constancy of Intelligence, 
English and Arithmetic Quotients as measured 
by Moray House Group Tests. Comparison is 
also made between group and individual tests in 
this respect. A note by the publishers points 
out that, owing to the war, the author could 
not revise his MSS. or see the proofs. This 


perhaps explains some of the difficulties the re- 
viewer met with in tracing the author's lines of 
thought. Thus, eg., although in his preface he 
defines his second part as we have quoted him 
above, using the terms “reliability and con- 
stancy" as co-equal, he re-defines on p. 10 by 
saying reliability is the "accuracy (not con- 
stancy) with which a test measures the abilities 
which it measures at the time when it measures 
them." Or again, on p. 141, he opens a chapter 
by saying "Examination of data concerning the 
reliability and constancy of Moray House Intel- 
ligence, Arithmetic and English Quotients 
brings to light the fact that of the three types 
of tests those regarded as measures of intelli- 
gence are least reliable." In general, one would 
not mind which way he expressed himself, were 
he "reliable or constant" in it. As it happens, 
of course, his statement on p. 141 leaves the 
door open to yet another kind of misunderstand- 
ing. I have already heard him quoted as pro- 
ducing evidence against Intelligence Tests in 
favour of the more old-fashioned academic tests 
in English and Arithmetic. As most people 
agree, the term reliability is misleading. If a test 
puts a group of children in the same order of 
merit every time it is applied to them it is com- 
pletely ‘reliable.’ But as obviously it would have 
been better to refer to its ‘constancy’ or ‘con- 
sistency.’ A thing can be ‘consistent’ but yet bad 
or useless. To say it is ‘reliable’ brings in an 
unfortunate implication of goodness, And so, 
of course, with his statement that English and 
Arithmetic Tests are more reliable than Intel- 
ligence Tests. In any case, the coefficients he 
quotes only differ in the second decimal place 
(being 0.96, 0.95 and 0.94 for Arithmetic, Eng- 
lish and Intelligence respectively). But even so, 
to those who would quote him in a reactionary 
sense, one must point out that it is conceivable 
that a test in (say) ability to turn cartwheels 
might give a reliability of even 0.97, thus beat- 
ing them all. But it would still not be a very 
useful test for Junior County Scholarship pur- 
poses. However, although the author may have 
been a little remiss in not realising the needs of 
those of us with more elementary minds, he 
does at any rate work faithfully through the 
various problems in this, a most complicated 
field. Sometimes he is obviously not doing more 
than open up vistas, indicating avenues not yet 
explored. But even that is worth while. We 
have been asked to note a correction made in 
some but not all the copies'of the book so far 
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issued by the Press. Equation 6 on p. 14 should 
read 


ye 
e-ily 


J Ed (London) 74:427-9 S '42. J. W. Jen- 
kins. * The first part of the book is devoted 
mainly to the theoretical aspect. In this he con- 
siders methods of determining reliability, basic 
formulae, and the estimation of reliability from 
answer pattern data. He brings together a num- 
ber of useful suggestions....but it is doubtful 
whether he fully realizes his aim in this theo- 
retical section. At the outset he sets down a 
number of assumptions which must be made. 
Mathematicians and scientists entertain a healthy 
scepticism of such assumptions and one would 
have liked a more thorough examination of them 
before passing on. The mathematical proofs in- 
volving a limit process are expressed in forms 
which may make mathematicians feel rather 
uncomfortable. The formulae may be rigorously 
deduced by methods in which the limit opera- 
tions are explicitly formulated. In the experi- 
mental section which follows he incidentally 
supplies further evidence of the reliability of 
Moray House Group Tests, investigates the 
variation in LO. at different ability levels, the 
constancy of LO., and compares the results 
with those obtained from the Terman and Mer- 
rill revision of the Binet Scale. This is evidence 
of value for those interested in these topics. The 
results of experimental research may be chal- 
lenged on other grounds than those of reliabil- 
ity. A criticism, which is often levelled at studies 
involving a questionary, concerns not so much 
the reliability as the validity of the method of 
measurement, i.e. how far tests and question- 
aries measure what they purport to measure. 

J Nerv & Mental Dis 103:206-7 F '46. 
Simon H. Tulchin. * The author concludes that 
the usual assumption in favor of individual tests 
on grounds of their higher reliability appears to 
be unwarranted. While this conclusion is justi- 
fied when large groups are concerned, it is 
doubtful whether the same can be said when an 
individual child is concerned. * 

Occupational Psychol 16:150 Jl’ 42. P. Slater. 
* in two parts. The first deals with the general 
theory of test reliability, the second with the 
reliability of the Moray House tests of Intelli- 
gence, English and Arithmetic. * When relia- 
bility coefficients are obtained by splitting tests, 


correlating the split halves and applying the 
Spearman-Brown formula, errors due to per- 
sonal variance cannot be regarded as uncorre- 
lated. This point is not fully brought out, but 
Ferguson uses the apt term “boosted” for such 
reliability: coefficients. Part II will be of par- 
ticular interest to individuals and departments 
making use of the Moray House tests. It pro- 
vides, amongst other things, data for comparing 
the Moray House Intelligence tests with the 
Binet test, and shows that although the former 
can be used as Group tests, they are as reliable, 
on the whole, as the Binet, which has to be 
given individually. The force of this evidence 
is weakened because I.Q.s below 70 could not 
be included in the Moray House results. The 
Binet is particularly favoured for testing defec- 
tive and borderline cases, and it is with them 
that the advantages of individual testing are 
specially extolled. When tests are combined in 
batteries, they may be validated by reference to 
an external criterion. The recognized principle 
is that each test should correlate as highly as 
possible with the criterion and as low as pos- 
sible with the other tests in the battery. Why 
should not the same principle be applied in se- 
lecting the individual items in a test? Practically 
every question which can be treated by analysis 
of reliability can be treated more effectively as 
a problem in validation. Thus the comparison 
between the Binet and the Moray House Intel- 
ligence tests shows that the reliability of the 
two tests is approximately the same. It does 
not show whether they are testing the same 
function or which is the better test of any given 
function. Unless both tests are compared with 
an external criterion, and the problem treated 
as one of validation, such questions cannot be 
answered. Reliability coefficients provide a basis 
only for formal comparisons. Increasing a test's 
reliability without increasing its validity only 
damages it. Dr. Ferguson, however, defines his 
subject matter in such a way as to exclude 
questions of validation altogether. This he is 
perfectly entitled to do, provided that he is con- 
tent to confine himself to almost purely aca- 
demic issues. Within its chosen terms of refer- 
ence his book is excellent. His use of the pooling 
square is most instructive, and his revision of 
Kuder and Richardson's work is particularly 
interesting. His Chapter VI will be a thorn in 
the flesh of people prejudiced in favour of nor- 
mal distributions. 


—Me 
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[ 876 ] 
*kFisHeER, Frances. Scenario on Word Straggled 
95 A D Word Number 83. Human Baginesing 
Laboratory, Inc, Technical Report No. 208. Boston, 


Mass.: the Laboratory, January 1946. Pp. 4. P: 
mimeographed, * (PA 2 946. Pp. 4. Paper, 


[877 ] 
XFrirrs, PauL M., Eprtor, Psychological Research 
on Equipment Design. Army Air Forces Aviation 
Psychology Program Research Reports, Report No. 
19. Washington, D. C.: Government Printing Office, 
1947. Pp. xii, 276. $1.25. * 


[ 878 ] 

xFrrMiNG, C. M. Cumulative Records: Notes on 
Their Content and Use: A Guide to the Findings 
of Research. Educational Research Pamphlets, No. 1. 
Published under the auspices of the Foundation for 
Educational Research and the Department of Higher 
Degrees and Research of the Institute of Education. 
London: University of London Press, Ltd., 1945. Pp. 
48. Paper. 1s. 6d. * (Toronto, Canada: Clarke, Irwin 
& Co., Ltd. $0.45.) 


[ 879 ] 

XForEev, JEANNE. A Description of a New Tech- 
nique in Obtaining Practice and Conversion 
Factors for All Trials of Wiggly Block Beginning 
With Worksample 3 the Preliminary Factors. 
Human Engineering Laboratory, Inc., Technical Re- 
port No. 364. Boston, Mass.: the Laboratory, January 
1947. Pp. i, 25. Paper, mimeographed. * 


[ 880 ] 
*Forp, Mary. The Application of the Rorschach 
Test to Young Children. University of Minnesota, 
Institute of Child Welfare Monograph Series, No. 23. 
Minneapolis, Minn.: University of Minnesota Press, 
1946. Pp. xii, 114. $2.00, (London: Oxford University 
Press, 125.) * (PA 21:807) For reviews, see 80. 


[ 881 ] 
*Forrest, Heren ve Sates. Correlations Between 
the Constants in the Curve of Learning. Studies 
in Psychology and Psychiatry From the Catholic Uni- 
versity of America, Vol. 5, No. 1. Washington, DCi 
Catholic University of America Press, May 1941. Pp. 
v, 43. Paper. $1.00. * (PA 15:4153) 

[ 882 ] 
XFowrzn, Fren M. Selection of Students for Voca- 
tional Training. Federal Security Agency, Unite! 
States Office of Education, Vocational Division Bulle- 
tin, No. 232; Occupational Information and Guidance 
Series, No. 13. Washington, D. C.: Government Print. 
ing Office, 1945. Pp. iv, 156. Paper. Out of print. 
(PA 20:315) 


Ed Adm & Sup 32:314-5 My 46. Silas 
Hertzler. In the past, all too often, selection 
has been approached as if it were a matter of 
choosing from the misfits and the inefficient, 
those, if any, capable of success—especially 
academic success. The emphasis of this brief 
study is that vocational preparation demands 
constructive and forward-looking selection. This 
type of selection uses slightly different tech- 
niques than does the measurement of intelli- 


gence, but it must be just as wisely and carefully 
done. Vocational intelligence and vocational 
aptitude, are just as specific, and just as useful, 
as is the older academic intelligence. Its meas- 
urement is just as important, and because the 
aptitudes needed are more objective, their 
measurement, may be more reliable. This bulle- 
tin approaches selection as a living process to 
be accomplished through guidance. * An out- 
standing and entirely worth-while concept of 
this booklet is that selection is a continuous 
process. * The booklet aims to be practical for 
the use of both school and student; hence it in- 
cludes long lists of admission, enrollment and 
counseling blanks, as used in various schools 
and life situations. Cumulative records, rating 
scales, placement blanks, blanks for recording 
interviews, are among the items included in the 
later pages of the book. 


[ 883 ] 
XFox, Wıram H. An Analytical Study of Janu- 
ary-February (1943) Entering Freshmen at 
Indiana University. Bulletin of the School of Edu- 
cation, Indiana University, Vol. 20, No. 1. Blooming- 
ton, Ind. : Indiana EATER Bookstore, January 1944. 
Pp. 46. Paper. $0.50. * (PA 18:3880) 


[ 884 ] 
*Fox, Wiuram H. The Washington, Indiana, 
Testing Program: An Experiment in Educational 
Measurement in an Average-Sized City School 
System. State of Indiana, Department of Public In- 
struction, Research Bulletin No. 16. Indianapolis, Ind. : 
the Department, 1945. Pp. x, 41. Paper. $0.15. * 

[ 885 ] 
Francis, Tuos. G. How to Pass Examinations. 
London: Sir Isaac Pitman & Sons, Ltd., 1944. Pp. 15. 
Paper. Is. * 

[ 886 ] 
XFnANKLIN, LILLIAN. PINE Distribution of 
Pairs of Related Individuals. Human Engineering 
Laboratory, Inc., Technical Report No. 153. Boston, 
Mass.: the Laboratory, June 1945. Pp. 6. Paper, mime- 
ographed. * 

[ 887 ] a 
*FriepMAN, BertHA B. Foundations of the Meas- 
urement of Values: The Methodology. of Loca- 
tion and Quantification. Columbia University, 
Teachers College, Contributions to Education, No. 914. 
New York: Bureau of Publications, the College, 1946. 
Pp. ix, 227. Out of print. * (PA 21:1550) 


J Abn & Social Psychol 42:373-4 Jl "47. 
Babette Samelson Whipple. * [The book] is so 
full of methodological sophistication....that in 
parts it reads like a philosophical treatise. * for 
those who either do not mind wading or who 
are adept at stone-hopping Friedman’s book 
makes very stimulating reading. The author's 
ultimate aim is to seek "for ways of improving 


888 THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 820 


the educator's means of guiding the conduct of 
his students" (p. 2). And since the educator 
should know the kinds of things that motivate 
them to action, he should know how to measure 
their "values." Hence Friedman examines vari- 
ous problems in the field of "value" measure- 
ment. The reader soon becomes aware that they 
are the same ones customarily treated under the 
titles of "attitude" and "interest" measurement. 
* The first problem of measurement Friedman 
discusses is the question of what should be 
measured. Predictions regarding future value- 
behaviors, she says, may be based upon an indi- 
vidual’s past or present value-behaviors and/or 
his verbal statements expressing either value- 
behavior or feelings of various types. Omitted 
from this list is an important source of evi- 
dence, namely, statements of information and 
belief. The inadequacies of both behavioral and 
verbal indices as bases for predictions of future 
value-behavior are discussed with admirable 
fullness. The whole field of attitude measure- 
ment would be in a stronger position were 
more investigators as clearly aware of them. 
Much is to be gained also from the five chapters 
on ways of quantifying the above indices. * 
* The final chapter, likewise, is full of interest. 
Results obtained from using different scales 
with the same samples are compared. And three 
original experimental investigations are re- 
ported which in themselves are a significant 
contribution to the field of attitude measure- 
ment. The book’s major weakness, in the mind 
of this ‘reviewer, lies in the choice of a be- 
havioral definition of “value.” With such a defi- 
nition, difficulties are bound to arise in connec- 
tion with the problem of validating the various 
scales, And they do. Friedman: stresses the 
necessity for validating studies but never indi- 
cates how they might be made. And she never 
can as long as overt behavior is rejected as a 
criterion. Friedman does not consider behavior 
scales valid because of the possibility of observ- 
ing behavior along several dimensions and sub- 
sequently of arriving at different equally valid 
scale measurements to represent the individual’s 
real value. Ironically enough she thereby im- 
plicitly rejects her own definition of “value.” At 
` one point she even writes, “....when we say that 
A values X,....we mean that this is character- 
istic of him; it is his disposition to do so..." 
(p. 61). Were values consistently regarded as 
intervening variables, confusions such as these 
would not exist and the real contributions of 


the book to the field would stand out more 
clearly. 

Psychol B 44:290-1 My ' 47. Elizabeth Duffy. 
* contains a comprehensive analysis of many 
problems basic to the measurement of values. 
Pitfalls and difficulties of great variety are 
pointed out. * To many readers the chief inter- 
est of the book will lie in the specific questions 
it raises in regard to the Thurstone scales and 
other widely used scales for the measurement 
of attitudes or values. While the author has un- 
doubtedly made a contribution to the field of 
value-measurement, the discussion at times 
seems unnecessarily circuitous, and the reader 
is led to wonder whether the longest way round 
is really the shortest way home. And it must 
be added in all sadness that the exhaustive con- 
sideration of theory does not appear to the 
reviewer to have improved the "practice" in 
values measurement as represented in the tech- 
niques used by the author in her own experi- 
mental investigations. 


[ 888 ] 
XFnoEHLICH, Gustav J. The Prediction of Aca- 
demic Success at the University of Wisconsin, 
1909-1941. Bulletin of the University of Wisconsin, 
Serial No. 2574, General Series No. 2358. Madison, 
Wis.: Bureau of Guidance and Records, the Univer- 
sity, October 1941. Pp. 44. Paper. * (PA 16:3775) 

[ 889 ] 
XGANNoN, Jose Trvornv. A Statistical pinay of 
Certain Diagnostic Personality Traits of College 
Men. Catholic University of America, Studies in Psy- 
chology and Psychiatry, Vol. 4, No. 4. Washington, 
D. C.: Catholic University of America Press, 1940. 
Pp. v, 45. Paper. $1.50. * (PA 14:2473) 

[ 890 ] 
XGaxs, Roma. A Study of Critical Reading Com- 
prehension in the Intermediate Grades. Columbia 
University, Teachers College, Contributions to Educa- 
tion, No. 811. Irving Lorge, faculty sponsor. New 
York: Bureau of Publications, the College, 1940. Pp. 
vii, 135. Out of print. * (PA 14:6205) 

Chicago Sch J 22:138 Ja-Mr "41. J(ohn) J. 
D(e Boer). * an intensive and painstaking 
analytical study * 

J Ed Psychol 32:79-80 Ja "41. Miles A. 
Tinker. * This type of study is distinctly of the 
kind that should be encouraged in the field of 
reading. Many will agree that the “traditional” 
tests of reading ability, while contributing much, 
are far from adequate for giving a complete 
picture of the reading skills required in contem- 
porary school programs. It is now timely to 
devise techniques for measuring proficiency in 
these skills and to organize teaching to over- 
come deficiencies where they are discovered. 
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While the present investigation may be incom- 
plete in some respects, it is a movement in the 
right direction. 

Social Ed 5:238-40 Mr '41. Helen M. Car- 
penter. * This study, while primarily concerned 
with the problem of measurement, has signifi- 
cant implications for the development of criti- 
cal-mindedness in pupils. * The results of the 
study support the original assumption that a 
high score on a standardized reading-compre- 
hension test does not necessarily indicate the 
presence of ability to read with discrimination 
in a problem-solving situation. Probably more 
significant to social studies teachers is the fur- 
ther revelation that of the five types of material 
submitted, students in both elementary school 
and college failed most frequently in seeing the 
connection of the remotely relevant and in re- 
jecting the fanciful. The implications of the 
findings are far-reaching. As Dr. Gans points 
out, “gullibility in reading functioning con- 
jointly with high ability in reading comprehen- 
sion is an incongruity which should not be 
tolerated in a process directed toward education” 
(p. 119). While social studies teachers have 
accepted major responsibility for developing the 
ability to detect authenticity as a part of the 
reading process, it has often been assumed that 
standardized reading tests now commonly used 
were adequate measures of this kind of growth. 
This study indicates that different instruments 
must be found. The results also show the neces- 
sity for increased attention to training in the 
use of sources. Competence in the evaluation of 
sources implies the availability and use of a 
variety of materials in the classroom. Critical 
reading ability can not be developed through 
the use of a single text. The difficulty of stu- 
dents in recognizing the remotely relevant and 
fanciful is of particular interest to those pre- 
paring social studies materials. It raises the 
question of how specific the presentation must 
be to have meaning for pupils reading without 
discernment. This investigation makes an im- 
portant contribution to the literature on meas- 
urement. It suggests a new approach to the 
evaluation of a complex ability. The findings of 
the study are not to be interpreted as discredit- 
ing the utility of standardized reading compre- 
hension tests. It serves to refine the field of 
their application. 

Social Studies 32:188 Ap "4r. F. Melvyn 
Lawson. * a searching study * offers some sig- 
nificant suggestions on how to improve the criti- 
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cal, selective reading power that reference read- 
ing entails * The study is adequately supplied 
with charts, tables, and statistical summaries. A 
highly valuable bibliography of references relat- 
ing to reading in the curriculum, mental and 
semantic aspects of reading comprehension, and 
test construction complete the work. 


[ 891 ] 
*Garpner, D. E. M. Testing Results in the Infant 
School. London: Methuen & Co. Ltd., 1942. Pp. ix, 
158. 6s. 6d. * (PA 16:4526) 

Brit J Psychol 33:140-1 O ’42, * a carefully 
controlled piece of testing....to compare the 
capabilities of Infant school children educated 
in the formal manner with those educated in 
‘free activity’ schools * it appears that ‘free 
activity’ does not have a deleterious effect either 
on learning specific school subjects, or on power 
of concentration and on social behaviour. It 
seems that the general superiority of these chil- 
dren in liveliness and.keenness affects all the 
work they do. Miss Gardner has done a valua- 
ble piece of work in demonstrating this result 
so clearly and conclusively. Only one control 
was missing—that of the capabilities of the 
teachers, Miss Gardner states that none of the 
teaching in the schools studied was bad. But it 
does seem probable that the more lively and 
enthusiastic teachers would be found in the 
‘free activity’ schools—possibly a further recom- 
mendation for the latter. It would, however, be 
interesting to have an assessment of teachers’ 
capabilities to compare with that of the chil- 
dren’s. 

J Ed (London) 75:392 Ag '43. * This book 
will be of value to students in training colleges 
and to teachers in the two types of schools se- 
lected; it is written by a person who has long 
been interested in her subject and who has had 
wide experience in teaching children and in the 
training of teachers. Its contribution to the sub- 
ject with which it deals will prove useful to 
future investigators in that limited sphere which 
the writer claims for it. 


[ 892 ] 
*&GARDNER, WARREN H. Instructions for Conduct- 
ing Audiometer Tests. Washington, D. C.: Ameri- 
can Hearing Society, 1940. Pp. i, 14. Paper. $0.25. * 
[ 893 ] 
Awnnerre. Interviewing: Its Principles 
and Methods. New York: Family Service Association 
of America, 1942. Pp. 123. Paper. $1.25. (PA 17:2062) 


Fed Probation 7:34 O-D '43. Henry Coe 
Lanpher. Everyone who deals with fellow hu- 
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man beings, and that means everyone, could 
profit from reading this little book. It is a book 
on interviewing, the author tells us, and not, 
except incidentally, on social case work. Nev- 
ertheless, the book is outstanding, in this re- 
viewer's opinion, mainly because it is an excep- 
tionally straightforward, simple and workable 
exposition of the case work approach and 
attitude. This social case work approach and 
attitude is essential, obviously, to persons in case 
work, and parole and probation officers; but it 
can be a most important asset also, if they only 
knew it, to doctors, nurses, counselors, school 
teachers, ministers, prison custodial officers, 
and many others. Each needs to understand non- 
rational motivations in human behavior, to rec- 
ognize that how a person feels about a situation 
may be as important a fact as the situation it- 
self. Each needs to know the significance of con- 
flicting pulls toward different courses of action 
—ambivalence—which everyone experiences. 
Each needs to be on his guard against his own 
prejudices and to be able to show by his actions 
that he realizes his own personal standards are 
no more inherently "right" than those of the 
other fellow. Trained case workers may have 
learned these things, but in this little book the 
essence of the case work gospel is set forth so 
clearly and sensibly that it should prove an 
inspiration to every worker with human beings, 
trained or untrained, who reads it, and it should 
definitely improve the case-work performance 
of every one who studies it. 

J Am Med Assn 122:68 My r ^43. * con- 
tains many points which might be helpful to the 
physician, especially in relation to history 
taking. 

J Ed Res 36:636-7 Ap ’43. Carter V. Good. 
A practical, concrete handbook * both the prin- 
ciples discussed and the illustrative interviews 
reported have pertinent applications to many 
other fields where case study, case work, and 
interview techniques are employed * The treat- 
ment of appropriate procedures for interviewing 
is clear cut, but brief, with only three references 
to the literature of this area. The organization 
and titles of the chapters and smaller subdivi- 
sions leave something to be desired, in terms 
of logic and sequence. An index would have 
rendered this manual more usable. 

Social Service R 17:107-8 Mr '43. Jeanette 
Hanford. * strikes a timely note * a sound, well- 
organized, and helpful discussion of the recog- 
nized insights and processes necessary to suc- 
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cessful interviewing at any level * excellent 
presentations....on “Understanding Human Na- 
ture” and “The Interviewer’s Attitudes” * 
[The interview,] “Only a Conversation,”....is 
not only a healthy reminder that insight is not 
the sole prerogative of the present-day social 
worker but suggests that the process of case 
work, properly interpreted, is still within the 
comprehension of the reading public. The other 
selections are no less readable and vital in their 
presentation. * Altogether, this is a book that 
should be well received and of use to a larger 
group than the average book on case work. It 
should be basic reading, not only in the train- 
ing of case aides and volunteers in case work, 
but in beginning case-work courses in the 
schools. It should also hold the interest of any 
student of human behavior, if only for its excel- 
lent distillation of known truths. It is brief 
enough, readable, and practical, and may well 
stand as a handbook for anyone who is using 
interviewing as a tool for helping people. 


[ 894 ] 
*Gasxinc, D. A. T. Examinations and the Aims 
of Education. Melbourne, Australia: Melbourne Uni- 
versity Press, 1945. Pp. 61. Paper. rs. * (London: 
Oxford University Press. rs.) 


[ 895 ] 
*Garzs, ARTHUR I. The Improvement of Reading: 
A SOEUR of Diagnostic and Remedial Methods, 
Third Edition. New York: Macmillan Co., 1947. Pp. 
xxi, 657. $4.25. * (PA 22:3195) 


[ 896 ] 
*GeLMAN, BABETTE. A Preliminary Comparison of 
Vocabulary Levels With Choice of Reading Ma- 
terial. Human Engineering Laboratory, Inc., Techni- 
cal Report No. 176. Boston, Mass.: the Laboratory, 
October 1945. Pp. 24. Paper, mimeographed. * 


[ 897 ] 
*Gernet, SrerLING K. Musical Discrimination: At 
Various Age and Grade Levels. A Dissertation Sub- 
mitted in Partial Fulfillment of the Requirements for 
the Degree of Doctor of Education in Teachers Col- 
lege, Temple University. Angwin, Calif, : the Author, 
oor Union College, 1940. Pp. xv, 160. * (PA 14: 
9 


[ 898 ] 
XGESELL, ARNOLD; IN COLLABORATION WITH CATHERINE 
S. Amatrupa, The Embryology of Behavior: The 
Beginnings of the Human Mind. New York: Har- 
s Brothers, 1945. Pp. xxi, 289. $6.00. * (PA 19: 
13 


[ 899 ] 
XGESELL, ARNOLD, AND AMATRUDA, CATHERINE S. 
Developmental Diagnosis: Normal and Abnormal 
Child Development: Clinical Methods and Prac- 
tical Applications. New York: Paul B. Hoeber, Inc., 
1941. Pp. xiii, 447. $6.50. * (PA 15:4836) For reviews. 
see 277. 
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[ 900 ] 
GESELL, ARNOLD; AMATRUDA, CATHERINE S.; CAST- 
NER, BURTON M.; AND THoMPsoN, Heren, Biogra- 
phies of Child Development: The Mentai Growth 
Careers of Eighty-Four Infants and Children; A 
Ten-Year Study From the Clinic of Child Devel- 
opment at Yale University. New York: Paul B. 
Hoeber, Inc., 1939. Pp. xvii, 328. Out of print. * (PA 
13:3325) For reviews, see 278 and 40:Bor3. 

[ 901-2 ] 

GESELL, ARNOLD; HALVERSON, Henry M.; THOMPSON, 
HxrLEN; Irc, Frances L.; Castner, Burton M.; 
Ames, Louise BATES; AND AMATRUDA, CATHERINE S. 
The First Five Years of Life: A Guide to the 
Study of the Preschool Child. From the Yale Clinic 
of Child Development. New York: Harper & Brothers, 
1940. Pp. xiii, 393. $4.00. * (London: Methuen & Co. 
Ltd., 1941. 21s.) (PA 14:3804) For reviews, see 279 
and 40:Bor4. 

[ 903 ] 


*GESELL, ARNOLD, AND Irc, FRANCES L.; IN COLLABO- 
RATION WITH Louise BATES AMES AND GLENNA E. 
Burris. The Child From Five to Ten. New York: 
Hepa & Brothers, 1946. Pp. xiii, 475. $4.50. * (PA 
20:4391 


J Am Med Assn 133:505 F 15°47. * a com- 
panion work to the earlier volume “Infant and 
Child in the Culture of Today” (1943). It is 
based on studies of 50 or more children at age 
levels 5, 534, 6, 7, 8, 9 and 10 years. The ob- 
servations at each level are systematically re- 
ferred to ten major "fields of behavior," * The 
study evidences the carefulness of observation 
that has made the work of the Yale Clinic such 
an important contribution to our knowledge of 
child development. Authentic, lively, realistic 
descriptions of how children behave, what they 
say and do and how they react to their environ- 
ment are presented. The emerging patterns of 
behavior, modified by the maturing of the de- 
veloping organism, are traced. The authors are 
sound in their emphasis on the total complex 
of a behaving organism in a changing environ- 
ment. Yet the student of child life and the clini- 
cian will sense the deep lack in this work, the 
lack that consistently characterizes the fine work 
of the Yale Clinic. Despite the authors’ empha- 
sis in the final chapter on “developmentalism” 
the work remains consistently on the descrip- 
tive level and lacks the dynamic quality that can 
be imparted to any account of the mental life 
of a human being only through a synthesizing, 
dynamic psychology. Such a psychology lurks 
at the fringes of the authors’ comments and is 
almost desperately sought for in an attempt to 
attain the needed synthesis. Those aspects of 
childhood life that have to do with the more 
subjective mental life, i.e. emotional expression, 
fears and dreams, self and sex and ethical 


sense, carefully documented as they are, lose 
much of their meaning unless understood in 
terms of the dynamic interplay of psychologic 
forces actually observable in childhood life. This 
is not specifically acknowledged or recognized 
by the authors in relation to their own work. 
That they are by no means insensitive to this is 
evidenced by the fact that the best presentation 
of the subjective state of a child is not derived 
from their observations but quoted from the 
autobiographic work of Edmund Gosse. The 
profound implication to the child on discovering 
that the father is not infallible is made vividly 
clear by Gosse’s account. Perhaps the “clinical 
observer” might with some justice maintain 
that we cannot “observe” a child’s subjective 
state—perhaps we must rely on his ability to 
report it at a later date. However, a wealth of 
clinical observation bearing on the subjective 
life of the child is available in the works of 
child psychologists and psychoanalysts, together 
with precisely the sort of integrating dynamic 
concepts of the: development of the childhood 
personality useful to an understanding of the 
facts reported so carefully in this work. Indeed, 
such recording of precise observation is what 
is required to correct, modify or confirm the 
current theories of child development. Also re- 
vealing the lack of a basic dynamic psychology 
is the statement of the authors, in commenting 


.on the possible fate of mankind, that “the in- 


trinsic charm and goodness of childhood still 
constitute the best guaranty of the further per- 
fectibility of mankind” (page 453). This is a 
charming sentiment but a highly unscientific 
statement, The development of ethical concepts 
furthering a higher order of society is depend- 
ent on a process involving the relationship of 
developing children with ethical parents; proc- 
esses which involve early conditioning, intro- 
jection and identification eventually enable the 
child to assume the responsible attitudes of the 
mature adult. It is this process that could 
profitably be an object of study by the authors 
of this important book. 


[ 904 ] 
GESELL, ARNOLD, AND THOMPSON, HELEN ; ASSISTED BY 
CATHERINE STRUNK AMATRUDA. The Psychology of 
Early Growth: Including Norms of Infant Be- 
havior and a Method of Genetic Analysis. New 
York: Macmillan Co., 1938. Pp. ix, 290. Out of print. * 
(PA 12:3211) For reviews, see 280 and 40:Bo16. 


[ 905 ] 
*XGrsson, JAMES J., EDITOR. Motion Picture Testing 
and Research. Army Air Forces Aviation Psychology 
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Program Research Reports, Report No. 7. Washing- 
ton, D. C.: Government Printing Office, 1947. Pp. xi, 


267. $1.50. * 


[ 906 ] 
*GriLLETT, ALsERT N. How to Evaluate Supervisory 
Jobs: An Executive and Supervisory Appraisal 
Manual. Deep River, Conn.: National Foremen's In- 
stitute, Inc., 1945. Pp. 141. Loose-leaf. $7.50. 


[ 907 J 

*GorprARB, WiLLIAM. An Investigation of Re- 
action Time in Older Adults and Its Relationship 
to Certain Observed Mental Test Patterns, Co- 
lumbia University, Teachers College, Contributions 
to Education, No. 831. Irving Lorge and Rudolf Pint- 
ner, faculty co-sponsors. New York: Bureau of Pub- 
D the College, 1941. Pp. ix, 76. $1.60. * (PA 
5:4015 


J Appl Psychol 25:728-30 D "4r. J. R. 
Gentry. * An obvious criticism, recognized by 
the author, is that the experimental groups used 
in this study are not representative of the gen- 
eral adult population. But the author has done 
a careful piece of research, and provides valu- 
able data upon a timely subject. 


[ 908 ] 
KGOLDSTEIN, Kurt, AND SCHEERER, Martin. Abstract 
and Concrete Behavior: An Experimental Study 
With Special Tests. American Psychological Asso- 
ciation, Psychological Monographs, Vol. 53, No. 2, 
Whole No. 239. Washington, D. C.: the Association, 
Inc., 1941. Pp. vi, 151. Paper. $2.25. * (PA 16:1004) 
For reviews, see 41 and 42. 

[ 909 ] 
*GoonENoucH, Fiorence L., AND Maurer, KATHA- 
RINE M. The Mental Growth of Children From 
Two to Fourteen Years: A Study of the Predic- 
tive Value of the Minnesota Preschool Scales. 
University of Minnesota, Institute of Child Welfare 
Monograph Series, No. 20. Minneapolis, Minn.: Uni- 
versity of Minnesota Press, 1942. Pp. xv, 130. Out of 
print.* (PA 16:4562) For reviews, see 287. 


[ 910 ] 

*GorrscHALK, Louis; KLUCKHOHN, CLYDE; AND ÀN- 
GELL, Ropert. The Use of Personal Documents in, 
History, Anthropology, and Sociology. Prepared 
for the Committee on Appraisal of Research, Social 
Science Research Council, Bulletin 53. New York: the 
eau 1945. Pp. xiv, 243. Paper. $1.50. * (PA 19: 
1520 


Psychol B 44:185-7 Mr '47. Arthur F. Jen- 
ness. * social scientists frequently use historical 
method, and Gottschalk's suggestions regarding 
the use of documents should help them to avoid 
pitfalls in their excursions into historical fields. 
* The documents employed by anthropologists 
are usually written at the instigation of field 
workers, or are merely records of interviews. 
Kluckhohn’s critical review of such documents, 
including a number of biographies, is an excel- 
lent guide to the literature. The suggestions 


which he makes for future research, and for 
making available more of the original and de- 


p 


tailed notes from field work, represent no mean - 


contribution to methodology and technique in 
anthropology. * No brief review can do justice 
to the importance of these monographs or to 
their usefulness as guides to users of documents. 
Nor can they be evaluated properly without con- 
sidering them in the light of publications of a 
similar nature by Dollard, Blumer, and G. W. 
Allport, the latter two of which are authors of 
previous SSRC Bulletins. The definitions of 
personal document (or human document, as 
the case may be) given by these six writers are 
so different and distinct that they suggest the 
analogy of the blind men describing the ele- 
phant. * in stating their definitions boldly and 
precisely these writers have provided the key to 
an understanding of the distinctive methods and 
techniques characteristic of sociology, psychol- 
ogy, anthropology, and history applied to docu- 
ments. In these days when psychologists, social 
scientists, and historians are being encouraged 
to pool their concepts and techniques in grand 
“cooperative projects,” there is strong tempta- 
tion to urge that all of them should get together 
and agree on fundamental concepts and meth- 
ods. While such cooperation is desirable, it must 
be remembered that data, interpretations, and 
generalizations are in part functions of the con- 
cepts, methods and techniques employed in the 
research which produced them. Pooling con- 
cepts, methods, and techniques may result in 
data of doubtful value, interpretations devoid 
of clarity, and generalizations lacking precision 
and validity, unless the cooperators have scru- 
pulous regard for the various methodologies. 
These monographs are excellent guides to such 
methodologies, at least in so far as the use of 
documents is concerned. 


[ 911 ] 

AGREAT BRITAIN. COMMITTEE OF THE SECONDARY 
ScHooL EXAMINATIONS Counc, Cyr Nomwoop, 
Carman. Curriculum and Examinations in Sec- 
ondary Schools: Report of the Committee of the 
Secondary School Examinations Council Ap- 
pointed by the President of the Board of Educa- 
tion in 1941. London: His Majesty's Stationary 
Office, 1943. Pp. ix, 152. Paper. 15. 6d. * 


Brit J Ed Psychol 13:159-62 Je ?43. Frank 
Smith: * In general, the report is marked bya 
lofty idealism, a cautious conservatism, and a 
balanced outlook. Tt puts first and foremost the 
varying needs and capacity of the child. It seeks 
to redress many of the faults that are now pres- 
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ent, to reduce the examination evil, to curb the 
specialist, to nurture individual gifts, and to 
diminish the financial limitations that create 
such large inequalities among our gifted chil- 
dren. It is condemned in some quarters for be- 
ing too timid, in others for being over-bold. Its 
critics complain that their particular demand is 
not satisfied. The report will survive these at- 
tacks, and its balanced wisdom will stand the 
test of time more successfully than the partisan’s 
exaggeration of particular faults. It is, however, 
both too long and too short: too long in its 
repetitions and its elaborate explanations and 
leisurely expositions, too short in its treatment 
of the technical and modern schools, which are 
shown to be included in secondary school pro- 
vision and then put on one side. In the expan- 
sion of secondary education that is imminent, 
which will raise the population of our secondary 
schools from half a million to two millions, and 
later perhaps to two and a half millions, it is 
the technical and modern schools that will need 
most guidance and undertake the most difficult 
task. The grammar school has a tradition and 
a reputation that will serve it for the time be- 
ing; the new schools have a tradition to create 
and a reputation to earn, Their function and 
purpose need careful enunciation and their cur- 
riculum much thought. To say this is to praise 
the Norwood Committee for what they have 
done and to urge that they or somebody else 
should accomplish what still remains to do. 


[ 912 ] 
*GnrENE, EnwAnD B. Measurements of Human Be- 
havior. New York: Odyssey Press, Inc., 1941. Pp. 
xxi, 777. $4.00. * (PA 15:4433) 


Adv Mgmt 7:191 O-D '42. Herman Feld- 
man. * most valuable as a reference book for 
educators and counselors but it is also for per- 
sonnel officials and others concerned with the 
research aspects of studies in the field-selection, 
training, promotion and other industrial rela- 
tions problems. * Much of the book necessarily 
concerns itself with technical features. While 
these are judiciously handled the pages on such 
subjects require a previous background in sta- 
tistical methods. A useful bibliography and an 
excellent combination of glossary and subject 
index add to the usefulness of this sourcebook. 
The author's purpose was to compile and inter- 
pret the results of research in the various fields 
and in this he has done an outstanding job. — 

Am J Psychol 55:605-6 O "42. J. P. Guil- 
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ford. * The author of this volume is surely to 
be commended for not compromising the pro- 
fessional status of testing as it is practiced. He 
correctly realizes, for example, that without 
statistical language, in which the fundamentals 
of mental measurement must be phrased, one 
cannot present a really sensible picture. Conse- 
quently, at least three chapters are devoted to 
statistical ideas, including factorial analysis. 
Factor theory is frequently the important frame 
of reference, which, in the opinion of this re- 
viewer, is as it should be. It is questionable, 
however, how much serviceable statistical lan- 
guage the average student can master in so 
short a time as can be devoted to it in the use 
of this text. * It is undoubtedly much too diffi- 
cult for the average sophomore and will tax the 
average senior. * Part I (7 chapters) deals with 
fundamentals of test theory, including statistics. 
Considering the difficulty of the subject and the 
limited space in which it can be presented, the 
author has accomplished a remarkably able peda- 
gogical task. * he is to be commended for using 
the more consistent term 'centile' in place of 
the traditional ‘percentile’ * Part II (13 chap- 
ters) is the heart of the book and in it are de- 
scribed all the types of mental tests, with 
detailed descriptions of many of the standard 
tests and scales, some of them being well known 
and others not. Evaluations are liberally in- 
cluded, and a frequent criterion offered for 
judging a test is its intercorrelations with other 
tests and the related criterion of factor loadings. 
* well provided with documentation * At times, 
however, readers familiar with the field will 
miss references to important studies. Part p 
(3 chapters) concerns....the effects of practice, 
the effects of growth and decline, and methods 
of absolute scaling. The attention of the student 
is very well called to the importance of the 
changing individual and the difficulties which 
result for measurement. Few discussions of these 
problems have been equally competent and sys- 
tematic, but the reader is necessarily left with a 
feeling of incompleteness for lack of solutions. * 
There is no other [text] exactly like this one. 
Those of Garrett and Schneck, of Symonds, and 
of Thelma Hunt come first to mind. The pres- 
ent volume is more comprehensive than any of 
these, combining as it does test-theory with 
practical guidance in selection of tests. Among 
such volumes it is a step forward and points 
toward further advances in the future of its 


kind. 


912 


a 
THE THIRD MENTAL MEASUREMENTS YEARBOOK  [ 826 


Am Sociol R 7:441-2 Je '42. Thomas C. 
McCormick. * the author has included practi- 
cally every variety of test that any instructor 
could wish to find. For this reason the book 
should be useful as a handy source for students 
and others who desire a first acquaintance with 
the kinds of tests used for any given purpose. 
The very wide range of the text, however, pre- 
cludes any but very sketchy treatment of each 
test or technique. Since the problems of testing 
are really complex, the knowledge that can be 
gleaned from it is quite inadequate for prac- 
tical use. * The author might have expanded his 
discussion of tests and testing methods in the 
space given to subjects that should require a 
separate course. For example, chapters 3 and 4 
provide a hasty view of the principles of ele- 
mentary statistics, a thorough mastery of which 
ought to be a prerequisite to any serious study 
of the measurement of human behavior. The 
pedagogical value of the text is considerably 
lessened by tendency toward abstract exposition 
in a field where concrete illustrations are espe- 
cially needed. Chapter 5 may be cited in this 
connection. One is also surprised to find at vari- 
ous places rather pointless elaboration of re- 
search findings at the expense of an explanation 
of the techniques employed. In spite of such 
objections, however, this text is probably the 
most up-to-date and complete summary of the 
testing field now on the market. 

El Paso Sch Standard 19:29-30 My ’42. 
M. E. Broom. * Emphasis is placed throughout 
the book on a clear presentation of facts about 
interpretation and on their interpretation. The 
book contains a wealth of material. The treat- 
ment provides opportunity for the student to 
become familiar with many widely used meas- 
uring instruments and emphasizes the methods 
of measurement and evaluation in terms of 
actual results of measuring. This book should 
have wide use among persons working in edu- 
cation, in personnel work in business and in- 
dustry, and in clinical and welfare work of vari- 
ous sorts. 

El Sch J 42:313-5 D "41. Andrew W. 
Brown. * One distinctive feature of the book is 
the discussion of the methods and the results of 
factorial analyses. No attempt is made to cover 
the mathematics of factorial solutions, but the 
treatment “covers the logic, which is very im- 
portant in analytical thinking, and the results, 
which are significant in test construction and 
interpretation” (p. ix). In Part II, “Instru- 


ments and Results,” the author covers all the 
important types of tests, scales, and inventories 
in this country. No attempt has been made to 
list every test. * Measurements of Human Be- 
havior is a book which should cover a long-felt 
need. The author has achieved a prodigious feat 
in combining in one textbook the wealth of ma- 
terial which has been previously largely scat- 
tered in psychological and personnel journals. 
There are appended an excellent bibliography, 
a glossary, and a subject index, which make 
the book invaluable for reference purposes both 
for scientific research workers and for persons 
engaged in practical guidance and personnel 
work. As a textbook for a course in mental 
measurements or psychological tests, the book 
will probably prove to be too comprehensive. It 
is, in reality, two textbooks condensed into one. 
Students in the second, third, or fourth year in 
college cannot be expected to secure in one 
course (1) a comprehensive and thorough grasp 
of basic considerations underlying psychologi- 
cal measurements, including statistical methods 
and techniques; (2) a knowledge of the differ- 
ent types of instruments used in intelligence, 
educational, and vocational testing ; (3) a famil- 
iarity with the interpretation of test results ; and 
(4) an understanding of the persistent problems 
arising from the use of psychological tests. 
These should, and often do, constitute at least 
two separate courses, and in the reviewer's mind 
this procedure is better. A thorough course in 
statistics, as applied to psychology and educa- 
tion, should precede a study of the instruments 
themselves. It has been the reviewer's experi- 
ence that, when the two are combined into one 
course, the students usually secure only a super- 
ficial knowledge of each. From the point of 
view of teaching technique, it seems best to 
have two textbooks covering these fields. 

H Sch J 25:188-9 Ap ’42. A. M. Jordan. * 
lt is to systematize and evaluate the whole 
range of the measurement movement that Dr. 
Greene has attempted in, Measurements of Hu- 


man Behavior. * Its greatest contribution lies . 


in its all-inclusiveness. Some textbook writers 
in the field of tests and measurements are satis- 
fied with describing the test, furnishing their 
norms together with their opportunities for ap- 
plication. Greene does more. He has taken the 
tremendous amount of trouble to gather the 
evaluative data as expressed in word and corre- 
lation on which one can base a critical judg- 
ment. * Among these coefficients are those 
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which indicate the predictive values of tests. A 
perusal of these figures will make any student 
slow to place a great deal of weight upon pre- 
dictive judgments based on tests. In such a 
large undertaking some areas of measurement 
will be stretched pretty thin. It could hardly be 
otherwise. In some cases the tests are simply 
listed * More errors than necessary have crept 
in. I have noted errors in formulas, in graphs, 
in tables, in the omission of pronunciation signs 
in foreign words, and in pagination. Such errors, 
however, are not serious. They are merely irri- 
tating to the precision (one who likes his third 
decimals correct). As a whole Measurements 
of Human Behavior is a book for adults. It 
uses without apology the more recent techniques 
of measurement whether they come from de- 
tours into mathematics, biology, or any other 
area of knowledge. It fulfills well its fundamen- 
tal purpose of surveying adequately and thor- 
oughly the measurements of human behavior. 
Its use will be in advanced courses in measure- 
ment and for those who have had good courses 
in elementary statistics. 

Harvard Ed R 12:424-5 O '42. William 
J. E. Crissy. * The book differs from previ- 
ously published textbooks..in several ways: 
(a) More topics are included; (b) more fre- 
quent citations of studies involving the uses of 
tests are made; (c) the topics covered are or- 
ganized within a framework of differential psy- 
chology rather than according to the more 
usual operational development * While....un- 
questionably....a major contribution to the field 
and..a book which every person working in 
measurement will want to own, there are sev- 
eral criticisms of it which may be made. First, 
the compendious treatment of a multiplicity of 
topics in measurement is sometimes accom- 
plished at the sacrifice of clarity and cohesive- 
ness of presentation, especially in the case of 
the aspects of measurement covered in Part II. 
Second, the relationship between specific sub- 
topics in many of the chapters is too frequently 
left to the inference of the reader. Third, minor 
instances of inaccurate citations of names and 
organizations occur. These are not likely to 
confuse the instructor but they might easily mis- 
direct the student. Fourth, the author uses in- 
exact phraseology in some of the statements of 
measurement principles, especially in the first 
part of the book. * The chapter of the book 
which..leaves most to be desired in terms of 
clarity of treatment of topics as well as actual 
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content is the one entitled “Measures of 
Achievement". This chapter does not give a 
very up-to-date picture of developments in this 
area of measurement. The tests cited for illus- 
tration too frequently do not reflect the best 
measurement procedures available for use in 
the fields covered. The problem of compara- 
bility of scores is raised....but no mention is 
made of some of the better attacks on this prob- 
lem. There is no direct treatment of the gen- 
eral problem of what constitutes validity in the 
case of achievement tests. Though numerous 
correlation studies are cited, no mention is 
made of the fact that the ultimate validity of an 
achievement test is really determined by the 
content of the items! There are several chap- 
ters which should prove useful as reference 
reading for persons interested in the topics cov- 
ered; namely, two chapters on measuring pro- 
cedures in fine arts entitled "Measurement in 
Fine Arts: Design" and “Measurements in Fine 
Arts: Literature and Music” and three chapters 
concerned with the measurement of personal and 
social adjustment entitled “Modes of Adjust- 
ment: Free Association and Inventories,” 
“Modes of Adjustment: Conduct Measures, 
Observation, and Laboratory Methods,” and 
“Modes of Adjustment: Evaluation of Methods 
and Results”. Students of measurement will be 
pleased with Professor Greene's excellent treat- 
ment of the uses as well as the limitations of 
several widely used clinical instruments. This 
book might well be adopted for use as a text- 
book in a one-year course in measurement 
where the emphasis desired is upon psycho- 
logical research applications. It is not well 
adapted for use in a practitionary course in 
educational measurement, especially if only one 
semester is allotted for study of the subject. 

J Appl Psychol 25:477-8 Ag "41. Amos C. 
Anderson. * It is significant that Greene has 
entitled his book Measurements (plural) of 
Human Behavior (not intelligence). The impli- 
cation is that measurement has expanded in so 
many directions that the concept of tests is too 
limited ; and that so many forms of behavior are 
being measured, more or less well, that a book 
on psychological measurement can no longer be 
restricted to intelligence with perhaps a chap- 
ter thrown in on educational achievement tests. 
* This book meets the needs of the student and 
the worker in the field of measurements in that 
it presents the methods of measurement, it en- 
ables one to become familiar with most of the 
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tests, scales, and other measuring devices now 
being widely used, and it evaluates in terms 
of actual results, or, where necessary, lack of 
results. Supplemented by adequate laboratory 
work and the use of demonstration materials, 
a course conducted with this as the basic text 
will give the student a wide range of familiarity 
with tests and a diversified competence in meas- 
urement. For the old style course in intelli- 
gence testing, it would not do as a text. For a 
revamped course in measurement, or for an 
additional course beyond intelligence testing, it 
is the only book this reviewer has seen which 
would do at all. While making no attempt to be 
popular or easy, difficult chapters are clearly 
presented. Any student who has intelligence 
and background such that he has any business 
using the book will probably be able to read 
it with no other difficulty than that of applica- 
tion to the task. * the cooperation of author and 
publisher is reflected in an unusually good job 
of bookmaking. 

J Ed Psychol 33:398-400 My '42. Julian B. 
Rotter. * In his attempt to acquaint the student 
with a wide variety of psychological and edu- 
cational tests and methods the author is very 
successful. The [presentation] is comprehensive 
and thorough, and the style is clear and under- 
standing. * From the point of view of one inter- 
ested in the clinical application of these tests 
one could wish that fewer specific tests were 
presented and these examined more critically. 
Although reliability and standardization are 
often discussed, the lack of information con- 
cerning the validity of many of the tests is not 
emphasized. Instead of a healthy scepticism and 
understanding of the limitations and misuses of 
‘these tests the elementary student can get an 
impression of a highly developed field where 
one can measure almost anything with great 
accuracy. Individual factors influencing test re- 
sults are largely neglected. Part III, “Persistent 
Problems,” provides a stimulating discussion of 
such major problems as: Effects of Practice on 
Test Scores; Measurement of Growth and 
Senescence; Standard Deviation or Absolute 
Scaling; The Evaluation of Judgments: and 
Measurement of Native Differences. * This 
book provides a clearly-written and comprehen- 
sive introductory text for students in courses 

dealing with mental measurements. It provides 
the student with knowledge of a wide variety 
of tests and methods, statistical procedures in 
the construction and evaluation of tests, and 


experimental results in the various fields where 
such techniques have been applied. The failure 
to stress individual factors limiting the use of 
many test scores and the absence of information 
concerning the validity of most of the tests in 
practice can be compensated for by the in- 
structor. 

Loyola Ed Digest 17:9 D "41. Austin G. 
Schmidt. Here is a really excellent book on 
tests of all kinds and types, with short but good 
chapters on statistics. The typography is supe- 
rior. All in all, the work is one which we pre- 
dict will enjoy a wide popularity. 

Occupations 21:424-5 Ja '43. J. Gustav 
White.-Read parts of this book, look through 
the remainder carefully, then keep Greene's 
volume handy for reference. For any counselor 
or teacher in the field of applied psychology this 
will prove a valuable tool. It is a technical trea- 
tise, popularly written. * While written for 
college students it serves counselors by bridg- 
ing the gap between scientists and practitioners. 
The definitions and classifications under “Basic 
Considerations” are helpful. * Each test is de- 
scribed and an analysis of its reliability indi- 
cated. The hurried counselor may not care for 
the tedious descriptions of varieties of research 
or the scholarly documentation of authorities 
but this section is a storehouse of pertinent in- 
formation. * has an excellent bibliography and 
also a good glossary and subject index. 

Psychol B 38:992-3 D ’4r. Robert M. Lind- 
ner. The expressed intention of the author was 
to prepare a book in which behavioral measure- 
ment techniques and procedures would be 
treated more comprehensively than they are in 
available volumes. It is a credit to Professor 
Greene that such an ambitious project has been 
realized in a manner that will please the three 
most interested groups: instructors who are 
seeking a textbook suitable for the more mod- 
ern course in measurement; students who have 
expressed (at least to this writer) their despair 
with texts which from preface to bibliography 
credit them with too much in the way of prepa- 
ration; and clinicians who have long awaited a 
convenient and descriptive practical guide. In 
contrast to the majority of volumes, in this 
work the application of measurement principles 
and procedures beyond the field of intelligence 
per se is discussed with commendable restraint. 
The author recognizes that, while measurement 
has been extended into many new fields, we 
have not yet come far enough to make positive 
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claims with unmodulated voices. The result is 
a work that covers the field of measurement in a 
critically objective fashion, resorting for the most 
part to exposition based on primary sources 
carefully evaluated. * the chapter on factorial 
analyses is the weakest part of the book * The 
volume is liberally supplied with illustrative 
material, graphs, charts, and tables. For a re- 
freshing change these appear for the most part 
on the same pages with expository matter relat- 
ing to them, thus obviating bothersome turning 
of pages to find the graphic referrals mentioned 
in the text. On the whole, the format reflects 
the excellent cooperation of author and pub- 
lisher, resulting in a book of pleasant, yet func- 
tional, aspect. 
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*GrEENE, Harry A., AND CRAWFORD, Jonn R, Work- 
Book in Educational Measurements and Evalua- 
tion. Revision of Work-Book in Educational Meas- 
urement by Harry A. Greene. New York: Longmans, 
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Ed Res B 22:52-3 F ’43. James B. Burr. * 
In their introduction the authors take the posi- 
tion that “the evaluation concept has doubtless 
been stimulated by the recent attention of edu- 
cators and psychologists to the whole child and 
his behavior. This tendency to consider the 
child as a whole, rather than as an individual 
whose behavior and abilities can be catalogued 
into a number of different compartments, places 
a responsibility upon the user of tests and other 
instruments of evaluation for considering the 
child in this broad sense. It is through the ap- 
plication of the evaluation concept rather than 
the narrower concepts of measurement and test- 
ing that this result is most effectively obtained." 
In the light of this statement the reader will be 
disappointed to learn that practically all of this 
long volume is devoted to the use and inter- 
pretation of tests. In fact, in the chapter head- 
ings of 22 of the 25 chapters the terms fests, 
examinations, and measurement appear. The 
revision has not altered significantly the origi- 
nal contribution of the volume—a thorough 
discussion of tests and measurement. Noticeably 
absent is a comprehensive treatment of what 
constitutes a modern program of evaluation 


for the elementary school. Newer types of eval- 
uation instruments and techniques—the anec- 
dotal record, the interview, the questionnaire, 
the rating scale, pupil profile, cumulative record, 
and case study—are briefly discussed and con- 
ceded to have a significant place in the evalua- 
tion of pupil behavior and achievement. The 
sketchy references to these essential evaluation 
instruments, however, seem to the reviewer to 
be damning with faint praise. The elementary- 
school worker who uses this volume will be 
much encouraged in an outmoded over empha- 
sis upon testing as such, and will receive little 
help in developing a forward-looking philos- 
ophy and program of evaluation. 

El Sch J 42:712-3 My’42. Gustav J. Froeh- 
lich. * Among the book’s strong points the re- 
viewer has found the following: (1) There is a 
profusion of illustrative material. Approxi- 
mately a hundred facsimile excerpts from cur- 
rently available standardized tests at the ele- 
mentary-school level appear. The reviewer was 
disappointed, however, to note that some of 
these excerpts are fairly old. For example, in 
the chapter dealing with testing in the social 
studies, only one illustration, among the eight 
given, involves a test published after 1931. (2) 
The publishers and the dates of publication are 
given for practically all the standardized tests 
referred to. (3) A handy topical outline ap- 
pears at the beginning of most of the chapters. 
(4) Ample supplementary bibliographies are to 
be found at the end of each chapter. (5) A glos- 
sary of almost three hundred technical terms 
will aid the reader whose knowledge of statis- 
tical and technical test terminology is limited. 
Measurement and Evaluation in the Elemen- 
tary School, written from the point of view of 
the classroom teacher, is a handy reference 
book on testing at the elementary-school level. 
The reviewer recommends it as a desirable ad- 
dition to the professional library of the class- 
room teacher. It might well serve as a basic 
textbook in a first course in testing, but ad- 
vanced students in education and specialists in 
testing will find the book somewhat elementary. 

Harvard Ed R 12:425-6 O ’42, William E. 
Young. * completely revises the 1935 edition * 
the 1942 revision comprises the most elaborate 
coverage of elementary school testing which has 
thus far appeared. However, so many topics are 
included that the treatment of almost every one 
has to be economical. Many topics are only in- 
troduced, and many important phases are not 
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considered at all. The economy has been 
achieved somewhat at the expense of perspec- 
tive or background so that the book emphasizes 
the doing rather than the understanding. It is 
a manual rather than a treatise. * The book 
would be improved with chapter summaries. 
These are attempted in the case of two chap- 
ters. If suggestions for student work other than 
topics for discussion were extensively used, the 
chapter exercises would be more in harmony 
with the objective tone of the book. The Work- 
Book to accompany the text will no doubt meet 
this need of problems and case studies. The 
volume contains an excellent introduction by 
the editor, a helpful glossary and index and 54 
tables and 31 diagrams or figures. 

J Ed Res 35:634-6 Ap 42. Gerald V. Lann- 
holm. * a revision and expansion of a volume 
published in 1935 under the title The Use and 
Interpretation of Elementary School Tests. * 
some rather fundamental changes and additions 
have been made. The new topics treated in- 
clude personality testing, physical and health 
education testing, rapid scoring devices, and 
certain guidance and adjustment techniques. 
The area of practical arts....is not treated. The 
authors have employed an organization which 
combines the school-subject-fields approach 
with a semi-functional approach. * In several 
respects this new volume is a distinct improve- 
ment over the former edition. The order of 
presentation of the topics treated should result 
in its being more effective than its predecessor 
as a teaching manual. For the most part, the 
authors have presented concepts in the order in 
which they are needed by the student to com- 
prehend succeeding concepts. * The increased 
emphasis on the importance of the informal 
objective test and the improved treatment of its 
construction and use constitute another note- 
worthy improvement. The authors also make 
a better distinction between standardized and 
informal objective tests in this volume than they 
did in the earlier edition. While retaining some 
of the illustrations from older tests, this revised 
edition includes illustrative material from many 
of the recently published tests. Finally, the im- 
proved table of contents will make for greater 
convenience in using the book for reference 
purposes. While....an obvious and decided im- 
provement over the earlier volume....it still pos- 
sesses certain definite limitations. * for students 
not all of whom are preparing to teach at the 
elementary school level....a book which presents 


and develops the principles of measurement 
common to all levels and all subject fields would 
be more satisfactory. The authors of this book 
do devote several chapters to problems of meas- 
urement that are common to all levels and areas. 
In some instances, they even draw their illus- 
trations from subjects that are taught at the 
high school level only. Because the typical class- 
room teacher has more frequent need for infor- 
mal tests than for standardized tests, problems 
related to the former probably should be given 
even more extensive treatment than they were 
by these authors. * If the authors of this text- 
book had used examples of only the best prac- 
tice in illustrating different types of objective 
test items, the book probably would exert a 
better influence upon practice in test construc- 
tion. * For those desiring an up-to-date and 
rather thorough treatment of the problems of 
measurement in the elementary school only, 
this revised edition should fulfill a real need. 
Those who used the former edition as the basic 
textbook....will not need to make serious revi- 
sions of their syllabi when they change over to 
this new edition of a widely-used textbook. 

Loyola Ed Digest 17:9 Ap ’42. Austin G. 
Schmidt. Although issued under a new title and 
as a new work, this book is in fact a revision 
of one first published in 1929 and revised in 
1935. We can unreservedly recommend the 
work as equal to any other in thoroughness. 
The only thing we regret—and there may have 
been some good reason for the omission—is the 
lack of a bibliography of tests giving prices and 
publishers’ names and addresses. 
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Sch Sci & Math 43:695-6 O ’43. Joseph G. 
Phoenix. * an excellent revision * The histori- 
cal background and development of the testing 
program is quite adequately treated. Of special 
interest is the addition of a chapter on Person- 
ality Testing, as well as testing in physical and 
health education and in business education. * 
the direct and straightforward nature... makes it 
a valuable addition to the field. * should be of 
considerable interest to students of Education, 
as well as those engaged in teaching or super- 
vision at the secondary level. 
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Am J Orthopsychiatry 12:375 Ap "42. 
L(awson) G. L(owrey). * Despite the fact 
that no specific approach to emotional problems 
is indicated, there is a wealth of material. * 

J Ed Res 32:611+ Ap ’39. Carter V. Good. 
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J Appl Psychol 29:318 Ag '45. Charles (o 
Gibbons. * contains material of value to anyone 
interested in employee ratings. * Numerous 
helpful excerpts and sample forms are pre- 
sented, mostly from the field of public personnel 
administration. The author gives proper em- 
phasis to such practical points as how to train 
supervisors to rate employees and to discuss 
the ratings with the individuals. * The effi- 
ciency rating system of the federal government 
ss explained in considerable detail. It is evident 
that this plan is superior in every way to the 
graphic rating scale used almost universally in 
industry. With regard to the problem of scor- 
ing a rating form, the approach is both labori- 
ous and naive. From an attempt to attach nu- 
merical grades to employee ratings problems 
arise which cannot be overcome ina practi 
situation. Logical and statistical considerations, 
which the author completely overlooks, prevent 
one from obtaining a total score "by adding the 
numerical values of the ratings on the separate 
qualities." There is a wide-spread misconcep- 
tion on this point among industrial people. It is 
not clear why it is necessary to grade the rat- 
ings in order to serve the training or adminis- 
trative purposes outlined above. It is unfortu- 
nate that the book does not touch on the 
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problem of using employee ratings in a unionized 
organization. The broader problem is how can 
the merit of certain individuals be appraised 
and rewarded under conditions of collective 
bargaining. This book is a worthwhile contribu- 
tion to the literature on rating industrial em- 
ployees. Although the approach is not schol- 
arly, most of the content is sound and, above 
all, it is realistic. 

Occupations 23:430-1 Ap '45. Richard S. 
Schultz. * Of special interest to the readers of 
Occupations will be a description in the preface 
of the “Averaged-opinion Plan of Vocational 
Guidance,” and the section on employee self- 
rating. Suggestions in these pages may be par- 
ticularly useful as content in student courses on 
improvement of work habits and development 
of satisfactory personal qualities for employ- 
ment. * A commendable emphasis is placed on 
the full cooperation of employees, supervisors, 
and executives in the preparation, installation, 
and administration of a rating program. This is 
well illustrated in a merit rating form which 
was at first rejected by supervisors and then 
revised by them for actual use. * An adequate 
inclusion of psychological research, statistical 
methods and bibliography would enhance the 
value of this book. 

Pub Personnel R 6:63 Ja "45. O. Glenn 
Stahl. * a conventional treatment of a profes- 
sionally over-refined subject. Its principal value 
is in bringing together in one volume some ele- 
mentary principles of service rating as they 
apply both in private industry and in govern- 
ment. Its principal weakness is the absence of 
any significant contribution to advancement of 
the art of evaluating human effort. * The book 
is unique in treating a personnel subject with 
neither a predominantly “business” or "govern- 
ment" point of view. This is an approach that 
has rarely been taken in other personnel fields 
and is one that deserves more use. The author 
probably devotes too much space to lengthy 
quotations from various articles or rating hand- 
books (see especially the chapter on federal sys- 
tem), but in so doing he has produced a sort 
of symposium on the subject. The most lucid 
and best organized chapters are those on “Fun- 
damental Requirements for a Successful Serv- 
ice Rating Program," "Steps in Installing or 
Revising a Service Rating Program,” and “Ad- 
ministration of the Service Rating Program.” 
Several features and omissions in Mr. Halsey’s 
discussion unfortunately keep it out of the van- 
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guard of thinking on performance evaluation. 
For one thing, the use of numerical scores (e.g., 
pP. 11, 38, 45) and weighting of items (eg., 
p. II) seem to be taken for granted as essen- 
tials in any system. No recognition is given to 
the systematic use of objective evidence simply 
to analyze and report, rather than to rate, an 
employee's performance. There is little refer- 
ence to the need and technique for scrutinizing 
employee performance as a continuous method, 
practically the whole attention being given to 
annual or periodic ratings (e.g., p. 119). An 
inadequate chapter on “self-rating” slights this 
intensely interesting idea. There is little discus- 
sion of the psychological fundamentals of hu- 
man beings appraising the work of other human 
beings. Here and there (pp. xiv. 15, 50) we 
find the idea that "health" is an appropriate 
factor in evaluating an employee's work, with- 
out any distinctions being made between those 
factors that are within and those that are be- 
yond the employee's control. The chapter on 
administration concludes that ratings should be 
Shown to employees, but is much too mild to 
squelch the absurd idea that any purpose is 
served in keeping them confidential. The author 
draws some proper distinctions between “per- 
sonal quality" and “job description type" rat- 
ing forms, the latter referring to those that 
focus attention on performance and results 
rather than on abstract traits or personal char- 
acteristics. But he disposes rather cavalierly of 
the progressive and promising California “Re- 
port of Performance" plan, as an example of 
a disadvantage of the "job description type," 
merely because numerous special forms for in- 
dividual occupational groups are required. Yet 
the California plan offers as much hope as any 
to get service ratings out of the woods. From 
some examples cited and from some of the 
author's comments, one would gather that he is 
more interested in "rating" the individual, as 
such, than he is in evaluating the individual's 
productivity of his efforts. The emphasis in the 
final chapters on using ratings for training and 
correction is all to the good. But this is one of 
the places where Mr. Halsey might have 
brought out the need for continuous, not just 
spasmodic, appraisal. Also, the discussion at the 
end on personality and ego; handling com- 
plaints; meeting cases of bluff, conceit, sensi- 
tiveness, stubbornness, and the "older em- 
ployee” problem—while sound in itself —seems 
to be a little out of place in a book on service 


ratings, because it gets into various phases of 
supervision, only a few of which have to do pri- 
marily with "making or using" ratings. Perhaps 
Mr. Halsey had a few general points he wanted 
to get off his chest and used his last chapter as 
the vehicle for them. After taking him to task 
on more important aspects of the book, I per- 
haps run the risk of being captious to deny him 
this one minor indulgence. In any event, I can- 
not escape the conclusion that the book would 
have been much more valuable if it had omitted 
or referred historically to some of the out- 
moded ideas and refinements of ratings and had 
emphasized and analyzed several of the newer 
concepts that are more realistic. 


[ 919 ] 
X*HAMALAINEN, ÁRTHUR Emm. An Appraisal of 
Anecdotal Records. Columbia University, Teachers 
College, Contributions to Education, No. 801. Irving 
Lorge, faculty sponsor. New York: Bureau of Publi- 
cations, the College, 1943. Pp. vii, 87. $1.85. * (PA 18: 
2579) 


J Appl Psychol 28:538 D '44. F. S. Beers. 
* one of the best analyses of anecdotal records 
yet published. He gives the facts with great 
scrupulousness and he gives his authorities so 
that the reader can make his own judgment of 
their reliability. The book is not merely useful ; 
it is readable. It has the rare merit of being 
impartial. A moderately complex experiment is 
reported in which statistical techniques are em- 
ployed to advantage. Among the conclusions 
drawn is the following: “The anecdotes are 
often as much a reflection of the teacher's out- 
look as they are of the child's behavior." This 
supports the generally accepted dictum that no 
instrument for use in guidance has merit in its 
own right; its effectiveness is conditioned upon 
the skill of the operator. The anecdotal record, 
it appears, has complementary advantages if 
used under optimum conditions—a device for 
guidance of the pupil, and a device for evalua- 
tion of the teacher. 


[ 920 ] 
*HAMILTON, Muppen Ecknanpr. The Contribution 
of Practice Differences to Group Variability. 
Archives of Psychology, No. 278. Washington, D. C.: 
American Psychological Association, Inc., March 1943. 
Pp. 40. Paper. $0.80. * (PA 17 23028) 
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*HaMrrN, Roy M. An Analysis of Age-Progress 
Curves as Related to the Mental Growth Curve: 
A Study of Vocabulary. Archives of Psychology, 
No. 293. Washington, D. C.: American Psychological 
Association, Inc, June 1944. Pp. 37. Paper. $080. * 
(PA 19:574) 
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[ 922 ] 
*Hamuin, Tareor F. A Further Study of the Pyra- 
mid Worksample 68. Human Engineering Labora- 
tory, Inc, Technical Report No. 255. Boston, Mass.: 
the Laboratory, August 1946. Pp. 37. Paper, mimeo- 
graphed. * 

[ 923 ] 
X*HANFMANN, EUGENIA, AND KASANIN, Jacos. Con- 
ceptual Thinking in Schizophrenia. Nervous and 
Mental Disease Monograph, No. 67. New York: Nerv- 
ous and Mental Disease Monographs, 1942. Pp. viii, 
115. Out of print. * (PA 16:2289) For reviews, see 28. 


[ 924 ] 
X*Hannowzn, M. R., AND STEINER, M. E. Manual for 
Psychodiagnostic Inkblots (A Series Parallel to 
the Rorschach Blots. New York: Grune & Strat- 
ton, Inc, 1945. Pp. 112. Paper, lithotyped. $2.00. * 
Accompanying Psychodiagnostic Inkblots, $4.50. (PA 
20:839) For reviews, see 81. 


[ 925 ] 
*Harrower-Ericxson, M. R., AND STEINER, M. E. 
Large Scale Rorschach Techniques: A Manual 
for the Group Rorschach and Multiple Choice 
Test. Springfield, Ill.: Charles C Thomas, Publishers, 
1945. Pp. xi, 419. $8.50. * (PA 19:1720) For reviews, 
see 82. 

[ 926 J 
*Hanr, Hornet. Chart for Happiness. New York: 
Macmillan Co., 1940. Pp. xi, 198. $2.00. * (PA 15:925) 
For reviews, see 37. 
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*kHartiey, Ruts Epira, Sociality in Preadolescent 
Boys. Columbia University, Teachers College, Con- 
tributions to Education, No. 918. New York: Bureau 
of Publications, the College, 1946. Pp. vii, 117. Out of 
print.* (PA 21:1558) 
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*Hanroc, Parir; BALLAR, P. B. ; Gurrey, P.; HAM- 
LEY, H. R.; AND Surrn, C. Esstewatre, The Mark- 
ing of English Essays: A Report of an Investiga- 
tion Carried Out by a Sub-Committee of the 
International Institute Examinations Enquiry 
Committee Consisting of Sir Philip Hartog 
(Chairman), Dr. B. P. Ballard, Dr. P. Gurrey, 
Professor H. R. Hamley, and Dr. C. Ebblewhite 
Smith. With a statistical report by C. Ebblewhite 
Smith. London: Macmillan and Co., Ltd., 1941. Pp. 
xv, 165. Paper. 3s. 6d. * (New York: Macmillan Co. 
$1.00.) (PA 16:1667) 


Brit J Ed Psychol 11:233 N "41. P. E. 
V(ernon). This report on essay examining... 
represents a definite advance on previous ones 
in that it is constructive rather than merely 
critical. The primary object of the investigation 
was to test Hartog's theory that compositions 
written *with a given audience and with a given 
object in view' (which he calls 'directed' essays) 
are more valuable pedagogically and more easy 
to mark consistently than are the conventional 
‘undirected’ essays. Also that the chief category 
for which marks are awarded should be the ex- 
tent to which they achieve this object, in other 


words the extent to which pupils write ‘sense.’ 
The eight professional examiners who took 
part, as well as the authors of the report, are 
convinced as to the value of marking for sense 
and of stimulating the writing of directed com- 
positions. But the statistical analysis failed to 
show any superiority in the marking of this 
category. The investigation was carried out in 
great detail and is replete with results and 
recommendations that are of interest to all 
teachers and examiners. These concern, for ex- 
ample, the earliest age at which reasonable com- 
positions can be written and marked, the cate- 
gories for which marks should be given, the 
impossibility of deciding whether or not a School 
Certificate candidate's essay should be regarded 
as ‘literate,’ and changes which might advan- 
tageously be made in English section of the 
Certificate examinations. Perhaps the most 
striking finding is that high or low marks for 
such categories as vocabulary, sentence struc- 
ture, grammar, etc. (in fact, for all categories 
except sense and general impression) depend 
to a greater extent upon the idiosyncrasies of 
examiners and on their random errors than 
they do upon the actual merits of the pupils. 
The failure of the sense marks and the directed 
essays to show up better is attributed by the 
authors to the examiners being too steeped in 
old traditions of teaching and marking. To the 
reviewer another factor seems to be that the 
sense marks were awarded first in every in- 
stance, and this would tend to make them less 
reliable and consistent than, say, vocabulary, 
sentence structure, and general impression 
marks, which were awarded last. Every educa- 
tional psychologist is likely to have his own 
ideas about the adequacy of the methods of 
analysis. The reviewer's chief complaints are, 
first, that no use was made of the fact that 
every pupil wrote six essays; the analysis was 
carried out for each essay separately. Inter- 
comparisons should throw valuable light on the 
consistency of pupils, and on the far-reaching- 
ness of the examiners’ ‘halo effect.’ Secondly, 
no deductions whatever were made from the 
intelligence and objective English tests which 
were also taken by the pupils. It is to be hoped 
that other members of the Sub-Committee will 
re-work the data and publish a supplementary 
report on these and other points which might 
have been studied. 

J Ed (London) 74:37-8 Ja ’42. Clarissa C. 
Bell. * Stress is laid on the constructive, as 
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opposed to the critical, purpose of the investi- 
gation, and the final recommendations are so 
soundly based in the writers' own experience of 
teaching problems that the reader is inclined 
rather to accept their good sense than to ques- 
tion their relevance to the ascertained results 
of the investigation. The recommendation, for 
instance, that pupils shall be required to write 
a plan or summary of a composition is one 
that many teachers of English will wish to see 
widely practised, but to recommend this "in 
order to teach the pupils how to present their 
thoughts in orderly sequence" is surely to sug- 
gest that the results of the investigation support 
this practice as a means of achieving orderly 
sequence of thought and expression. In fact, 
candidates whose scripts were the subject of 
the investigation were not required to present 
a plan or summary, only two out of the eight 
examiners allocated marks under this heading, 
and these marks were not submitted to any sta- 
tistical analysis. If the report is read from cover 
to cover, the distinction will be clear between 
those recommendations which have their basis 
in the information yielded by the investigation 
and those which are the fruit of the writers’ 
own views on teaching English composition. 
Since however some readers will certainly weary 
of the fifty or so pages devoted to sample scripts, 
and others will be intimidated by the ‘Statistical 
Report’, it is to be feared that having ‘cut the 
cackle’ these skippers may read into the last 
chapter a more precise reference to the results 
of the investigation than is actually to be found 
. in certain of the final recommendations. Dr. 
Ebblewhite Smith is in fact the most humane 
of statisticians. However complex the calcu- 
lations, however abstruse the analysis, he always 
finds words in which to explain to the un- 
initiated where he is going and what he is after. 
Unlike so many of his tribe, who communicate 
almost wholly in curves and equations, he can 
speak plain English and be understood by plain 
people. The chief object of the experiment was 
to discover whether by the introduction, into 
the marking, of the element ‘Sense’, i.e. the de- 
gree to which the writer of a composition at- 
tains his object, the validity and reliability of 
the examination could be enhanced. The results 
show that examiners, in marking ‘Sense’, are 
little more consistent than in marking other ele- 
ments such as grammar, spelling, and punctua- 
tion, and that the examiner’s personal idiosyn- 
crasies and random errors combined influence 


his judgment of the ‘Sense’ of a composition 
almost as much as the actual merit of the script. 
* Tt seems indeed to be doubtful whether a good 
team of examiners can be produced unless some 
satisfactory method can be devised of teaching 
the necessary scientific skill. The technique of 
examining is undeveloped. The introduction of 
the element ‘Sense’, for instance, in the hope 
of providing a focus for the roving judgment of 
the examiner, did not achieve its purpose. It 
would be interesting, in the light of these re- 
sults, to investigate more fully a possible tech- 
nique of teaching examiners how to isolate 
‘Sense’ and award marks for this element. * 


[ 929 ] 
*Harvey, Lucy Jean. The Mental Hygiene of 
Higher Learning as the Student Sees It, Purdue 
University, Division of Educational Reference, Stud- 
ies in Higher Education, [No.] 53. Lafayette, Ind.: 
the Division, March 1945. Pp. 68. Paper. $0.75. * 

[ 930 ] 
*Haynen, Howarp, The Evaluation of Education 
in Barbados: A First Experiment. Bridgeton, Bar- 
bados, B.W.I.: Department of Education, 1945. Pp. ii, 
34. Paper. 1s. * 


Brit J Ed Psychol 16:103 Je ' 46. This report 
describes the results of applying Moray 
House Intelligence, Arithmetic and English 
Tests to a large sample of the Elementary and 
Secondary School population in the colony in 
the age groups ten to twelve. It includes also 
the results of the testing of a number of adults 
in the colony who volunteered for R.A.F. 
ground staff duties. Those who are interested 
in the effects of cultural background upon intel- 
ligence test results and upon levels of attain- 
ment among English speaking peoples will find 
much of value in this courageous piece of pio- 
neer work, the first of its kind to be carried out 
in the islands of the Caribbean. 


[ 931 ] 
XHavzs, Samuet Perkins. Contributions to a Psy- 
chology of Blindness. New York: American Found. 
ation for the Blind, Inc., 1941. Pp. viii, 296. $2.50. 
(PA 16:336) 

[ 932 ] 
XHavzs, SAMUEL P. Vocational Aptitude Tests for 
the Blind. Perkins Publications, No. 14. Watertown, 
Mass.: Perkins Institution and Massachusetts Schoo! 
for the Blind, February 1946. Pp. 32. Paper. $0.25. 
(PA 20:2456) 


J Appl Psychol 30:663-5 D '46. Salvatore G. 
DiMichael. In a small readable book written in 
non-technical language, Hayes briefly surveys 
the attempts to develop vocational aptitude 
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tests for the blind. The approach is historical, 
and gives quiet testimony to the prominent part 
played by the author in the development of 
psychological tests for this group. The scope 
of the treatment is indicated by the opening 
definition of vocational aptitude tests as those 
"designed to measure the special abilities needed 
for success in some specific occupation." Hayes 
intentionally excludes not only general intelli- 
gence and achievement tests which he has dis- 
cussed in numerous other articles, but also per- 
sonality "tests," which need little change for 
use with the blind except their transformation 
into braille. The survey encompasses psycho- 
logical measurements under the classifications 
of mazes and formboards, musical, manual and 
mechanical, and scholastic aptitude tests. The 
book is not a test administrator's manual but a 
description of the research efforts made to date 
within the scope of the given definition. At 
appropriate points in the book the author makes 
thoughtful appraisals. based upon his experi- 
ments and experiences. * This newest book by 
an experimenter who has worked almost alone 
for three decades in a psychology of the blind 
ought to be read by all professional persons 
interested in the education, guidance and voca- 
tional rehabilitation of the blind. Clinical and 
Industrial Psychologists will learn from this 
book where they may find source information— 
first, on the experimental adaptations for the 
blind of some tests now widely used with the 
sighted, secondly, on the literature which de- 
scribes past attempts to construct special tests 
for the blind. Research workers interested in 
the construction of a battery of vocational apti- 
tude tests for the visually impaired will find 
excellent leads for further study. Graduate stu- 
dents should have little trouble in locating a 
wealth of “problems” that have both academic 
and practical significance. The reader may be 
disconcerted to learn that so few have turned 
their energies to the vocational testing of the 
blind. It is hoped that the book will stimulate 
more systematic, widespread and coordinated 
endeavors to conduct studies in this field. Such 
investigations will make contributions not only 
to theoretical knowledge but also to human and 
social values. A psychology of the blind gives 
fresh insight into the psychology of all human 
personality. 
[ 933 ] 


*Havxrs, BeNJAMIN R.; Broom, M. E.; AND HarD- 
Away, Mature. Tests and Measurements in Busi- 
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ness Education. Cincinnati, Ohio: South-Western 


Publishing Co., 1940. Pp. 400. $240. * (PA 15:2781) 


Bus Ed World 21:257 N '40. Written as a 
guide for students in teacher-training institu- 
tions and for classroom teachers of business 
subjects, this book is a real find for those of us 
who wish to improve our testing practices. * 
most of us will find Chapters 5 to 8 aimed 
directly at our problems * may be used as a 
reference book or as a textbook. Each chapter 
includes "Readings for Further Study" and 
“Questions and Exercises." Those interested 
in the training of business teachers will wish to 
scan the book with an eye to its possible class- 
room use, 

El Paso Sch Standard 18:39-40 O "40. 
Louise Green. * Chapter VII....merely lists the 
available standardized and unstandardized tests _ 
in the different business subjects * This chapter 
would have been more helpful to teachers if ex- 
cerpts from the different tests had been ap- 
pended to the brief descriptions of the instru- 
ments * Chapter IX....discusses needed research 
in testing and measuring in business education. 
This chapter could have been improved greatly 
by summarizing the types of research already 
completed in the field and the kinds of research 
that now are under way, and by indicating spe- 
cifically the types of research tasks that yet 
remain to be done. The authors throughout the 
volume have given careful footnote references 
and bibliographical materials within and at the 
end of chapters. These reference materials and 
the exercises at the end of each chapter will 
prove helpful to persons seeking to master 
tests and measurements in the field of business 
education * The index is adequate, but it neg- 
lects to list authors and titles mentioned in foot- 
notes and in chapter bibliographies. This over- 
sight makes it difficult for the reader to locate 
readily the first mention of a title to obtain the 
correct citation; the authors do not give the 
complete citation at each reference point. This 
volume should prove of great value to class- 
room teachers in business subjects and to stu- 
dents of business who are preparing to teach, 
since it contains a minimum of theory and a 
maximum of practical materials. 

J Bus Ed 16:47 O go. * The theory of tests 
and measurements is actually applied to busi- 
ness education, making the book very practical. 
* A special chapter on the construction of tests 
for class use will prove a valuable aid to every 
teacher. * 
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Loyola Ed Digest 16:11-2 My "41. Austin 
G. Schmidt. * a complete, clear, and scholarly 
presentation of the entire subject of tests and 
measurements as applied to business education. 
A good chapter on statistical and graphic meth- 
ods is included. The discussions of the uses and 
limitations of tests are intelligent. * should 
prove fully satisfactory for courses in tests and 
measurements conducted for teachers of com- 
mercial subjects. 
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servations in the Kindergarten: A Manual for 
Teachers. New York: American Book Co., 1942. Pp. 
139. Spiral binding, paper. $1.50. * (PA 17:2930) 
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*Hertz, Marcuerire R. Frequency Tables to Be 
Used in Scoring the Rorschach Ink-Blot Test, 
. Revised Edition. Cleveland, Ohio: Western Reserve 
. University Bookstore, 1942. Pp. v, 275. Paper, mimeo- 
graphed. $4.00. * 
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*Hitpeets, GERTRUDE H. A Biblio, aphy of Mental 
Tests and Rating Scales, 1945 Sapnlemenr: New 
York: Psychological Corporation, 1946. Pp. ix, 86. 
Paper. $1.75, postpaid. * (PA 21:1310) 
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*Hinton, EucENE Marx. An Analytical Study of 
the Qualities of Style and Rhetoric Found in 
English Compositions. Columbia University, Teach- 
ers College, Contributions to Education, No. 806. Per- 
cival M. Symonds, faculty sponsor, New York: Bureau 
of Publications, the College, 1940. Pp. ix, 135. $1.85. * 
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*Hosss, NicHOLAS, EDITOR. Psychological Research 
on Flexible Gunnery Training. Army Air Forces 
Aviation Psychology Program Research Reports, Re- 
port No. 11, Washington, D. C.: Government Printing 
Office, 1947. Pp. viii, 508. $175.* 
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*Hoox, J. N. How to Write Better Examinations. 
d Pie. PR pares & Noble, 
nc., 1941. Pp. iii, 32, Paper. $0.25. * (Originally pub- 
ine in 1940 by Stipes Publishing & CHR pei 


Loyola Ed Digest 17:12 N "41. Austin G. 
Schmidt. A simple and helpful collection of 
suggestions for students preparing for exami- 
nations, 
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XHonN, Arıce McAnutty. Uneven Distribution of 
the Effects of Specific Factors. University of 
Southern California Education Monographs, No. 12. 
Los Angeles, Calif. : University of Southern California 
Press, 1941. Pp. x, 107. $1.75.* (PA 16:412) 
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PAUL; WITH THE COLLABORATION OF PAUL 
WALLIN AND Louis GUTTMAN; ASSISTED BY FRIEDA 
Brim WALLIN, JOHN A. CLAUSEN, Roser? REED, 


Erca RoseNTEHAL. The Prediction of Personal åd- 


justment: A Survey of Logical Problems and Re- 
search Techniques, With Illustrative Application 
of Problems of Vocational Selection, School Suc- 
cess, Marriage, and Crime. Prepared for the Com- 
mittee on Social Adjustment under the direction of the 
Subcommittee on Prediction of Social Adjustment. 
Social Science Research Council, Bulletin 48. New 
York: the Council, 1941. Pp. xii, 455. Paper. $2.00. * 
(PA 16:1968) 


J Abn & Social Psychol 37:280-1 Ap '42. 
Daniel Horn. * The monograph is divided into 
two parts, the first of which requires a mini- 
mum of statistical sophistication on the part of 
the reader. This section, most of which was 
written by Dr. Horst, defines the predictive 
process as that of (1) developing an index of 
success in the activity under study, (2) deter- 
mining which background factors, abilities, and 
personality characteristics are useful in predict- 
ing success, (3) combining the predictive items 
into a predictive score which is highly corre- 
lated with the index of success, (4) checking 
the predictive formula on at least one other 
sample of subjects (the utmost stress is laid 
on this point), and, if the predictive formula is 
successful in predicting for the check sample, 
(5) applying it to the general population which 
has been sampled. The many practical and the- 
oretical problems which arise in applying this 
procedure are discussed systematically. Solu- 
tions which have been developed in studies of 
vocational guidance, marriage adjustment, 
school success, and criminal recidivism are cited 
from the literature and evaluated; in many 
cases new ways of meeting these problems are 
suggested. There is offered, also, the broad out- 
line of a program for the coordination of pre- 
dictive research into a systematic attack on the 
problems.of human adjustment. * With the ex- 
ception of a section on the case study, contrib- 
uted by Mr. Wallin, Part II consists of a num- 
ber of supplementary studies which are highly 
technical in nature and require a sound statisti- 
cal background on the part of the reader. The 
sections contributed by Mr. Guttman are espe- 
cially valuable in bringing together in a sys- 
tematic fashion those aspects of statistical the- 
ory which have a direct bearing on the problem 
of prediction, plus some original contributions 
to sampling theory and scale construction. * 
Horst’s mathematical contributions to the vol- 
ume include some which have important appli- 
cations beyond the problems of prediction, espe- 
cially his mathematical solution of a multiple 
rating problem and his discussion of the treat- 
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ment of incomplete data. The bulk of the mono- 
graph is concerned with so-called actuarial pre- 
diction. What is the role of the case study in 
this type of approach? The position is taken 
that the primary value of the case study is in 
furnishing new insights and new hypotheses for 
group investigation. The aim, as stated by Mr. 
Wallin, is “the development of prediction in- 
struments which will approximate more closely 
the ‘individual’ prediction of the case study but 
the application of which will be more stand- 
ardized....” Despite this statement of the aim, 
the general content of the monograph, espe- 
cially in the mathematical sections, is not con- 
cerned with any attempt at a systematic formu- 
lation of what might be called the “statistics 
of the individual" Professor Stouffer has con- 
tributed a stimulating chapter on the unique 
case in which he draws a parallel between the 
logic of group prediction and individual pre- 
diction, the latter depending on intra-individual 
trends. A formulation of individual behavior 
according to intra-individual trends as suggested 
by F. H. Allport and others should lead not 
only to better prediction, but to a better under- 
standing of human behavior as well. The dis- 
cussions of experimental methodology and sta- 
tistical analysis in this monograph and the 
suggestions for further research recommend it 
to anyone interested in the problem of predic- 
tion and related problems in the field of human 
behavior. * Whether or not we share the op- 
timism of the authors in the applicability of 
group prediction methods to the problems of 
human adjustment, we look forward to the re- 
ports of studies which are carried out along the 
lines suggested in this monograph. 

J Am Stat Assn 37:154-5 Mr "42. Douglas 
E. Scates. Paul Horst, editor of Psychometrika, 
engaged for some years in personnel selection 
research for a large industrial concern, and a 
consistent contributor of statistical articles in 
the field of mental measurement, was called to 
Chicago last year to assume charge of the short- 
time production of a monograph on methods of 
prediction in the general field of adult adjust- 
ment. With the assistance of several workers 
and the counsel of various committees, the 
monograph was put together in something over 
three months. The resulting volume reflects 
the competence of the contributors, the breadth 
arising from the mingling of various back- 
grounds, and the diversity of approach found 
in different disciplines. No effects of haste are 


apparent, other than possibly the unreconciled 
frames of reference of the several writers. * 
Part I, comprising two-fifths of the text, pre- 
sents the logic of prediction. It is relatively 
simple, straightforward, non-mathematical ex- 
position. It presents....a comprehensive and criti- 
cal statement of considerations and methods; 
some of the fundamental material is new, and 
certainly there is no other single treatmentthat 
approaches the compass and definition of this. 
Part II....is technical, and illustrates both the 
mathematical background, and the application 
to empirical data, of the methods described in 
the earlier part. Part II is highly original, in- 
genious, and resourceful. It marks an advance 
over methods ordinarily understood and em- 
ployed. * Richardson's section on combination 
of measures is the most realistic treatment of 
weighting which the reviewer has seen. The 
entire volume may be regarded as lying, not 


- solely in the field of prediction, but in the emerg- 


ing statistical field of combinations of measures 
to index a complex trait through appropriate 
composites, The development of this area—accu- 
rate, sensitive, and faithful indexing or meas- 
urement of variables representing general con- 
cepts—is of as much basic importance to those 
interested in the maturing of statistical science 
in the social fields as is the somewhat narrower 
problem of estimating over an interval of time. 
Though somewhat aside from its avowed pur- 
pose, the entire study represents a wholesome 
and timely recognition of the complexity of 
traits in social or psychological fields, and the 
technical developments in the work may be re- 
garded as a contribution to much needed refine- 
ment of indexing or measurement in areas that 
are still somewhat nebulous. Done in connec- 
tion with a personnel which varied in interests 
from case-study methods to almost pure mathe- 
matics, the work represents something of an 
attempt to bring together quantitative and non- 
quantitative approaches, In this it is only partly 
successful; the combination is more in the na- 
ture of an assemblage than an integration. The 
transfer, on the part of a writer, from thinking 
in pure mathematics to thinking in practical 
fields is not accomplished easily. One feels, 
without prejudice to the excellence of the mathe- 
matical contributions, that certain portions of 
Part II represent an over-confidence or over- 
enthusiasm for quantitative methods, even if 
mechanically performed; the volume is clearly 
concerned more with the development of theory 


a 
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y thaà with the improvement of immediate prac-- 


tice, Through its two distinct parts, the volume 
makes an appeal to both the general and the 
technical worker. It is made readable by excel- 
lent organization, clear exposition, and meticu- 
lous editing. 

J Consult Psychol 6:305-6 N-D ’42. Percival 
M. onds. * Part I deals with the predic- 
tion problem as it may be specifically applied 
to problems of personal adjustment. Part II 
contains five supplementary and technical stud- 
ies. * These supplementary studies represent 
valuable contributions in statistical theory which 
may be used in any prediction problem. * This 
reviewer was most interested in the discussion 
of the case method and "actuarial" prediction. 

ne of the first questions which comes to mind 
is, why is there all this emphasis on prediction? 
Tt has been said, of course, that the aim of sci- 
ence is control through prediction. But the so- 
cial scientist interested in prediction almost of 
necessity sees social control in terms of selec- 
tion. Can the interest in predicting the outcome 
of parole be for any other reason than to decide 
who shall be paroled? Or is there any other 
reason for predicting success in marriage than 
to advise those contemplating marriage whether 
to go ahead with it with a fair chance of its be- 
ing successful? Prediction, in a sense, presup- 
poses a certain rigidity of human nature, and 
eliminates interest in the possibility of helping 
the individual with poor parole prospects to 
make a go of it, or counseling the engaged 
couple so that their marriage may turn out to 
be a success. In a sense, then, prediction and 
education are working at cross purposes. On 
the other hand, our only knowledge of the effec- 
tiveness of educational and therapeutic proce- 
dures stems eventually out of the results of pre- 
diction methods. The supplementary study on 
the use of the case method is a unique and valu- 
able discussion. One has the feeling, however, 
that Wallin, who prepared this section, has a 
somewhat narrow concept of case study meth- 
ods. For instance, in referring to the interview 
he says, “the extent to which the interview is 
guided varies according to the number and 
Specificity of questions asked by the investigator 
(p. 189).” The possibility that an interview 
could be held in which questions were not asked 
was not even considered. Later he says, “the 
psychoanalyst has certain definite hypotheses in 
mind, and will direct the attention of the per- 
son being analyzed to areas of behavior which 
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by the hypotheses are considered relevant (p, 
19)." This conception of directing a psycho- 
analysis would seem to be directly at variance 
with free association, which is the principle tool 
of psychoanalysis. The point of view with re- 
gard to case studies “that they are invaluable to 
the process of hypothesis formation, not only in 
giving initial hunches about interrelationships 
of factors, but also in interpreting the excep- 
tional cases, after a given stage in an analysis is 
completed, thus leading to a new hypothesis and 
an improved analysis (p. 29),” seems funda- 
mentally sound and consistent. According to 
the current trend, the use of life history mate- 
rial and personal documents will be more and 
more common for scientific studies of personal 
adjustment. 

Psychol B 39:412-4 Je’42. Dewey B. Stuit. 
This monograph is divided into two quite dis- 
tinct parts. In Part I is presented a systematic 
discussion of the nature of prediction and the 
factors influencing its efficiency. The psycholo- 
gist with some advanced training in statistics 
will find this part relatively easy reading. Part 
II consists of a number of supplementary stud- 
ies, several of which make new contributions 
to the theory and practice of prediction. With 
the exception of the section on the case study 
method, this part will not be readily understood 
by the typical psychologist. The present review 
is limited largely to the material found in Part 
I, the main body of the text. In his discussion 
of the theory and techniques of prediction the 
author has made several important contribu- 
tions to the understanding of the prediction 
problem. Most of the specific points mentioned 
are not new, but, presented in organized form 
for the first time, they are likely to affect bene- 
ficially the practical application of prediction 
techniques. The first, and one of the most im- 
portant, of these contributions is the thorough 
discussion of the problem of the criterion. * A 
second contribution is concerned with the im- 
portance of the temporal relationship of the pre- 
dictive factors to the criterion factors. The 
author emphasizes that it should be possible to 
assess the factors on which prediction is based 
before the individual engages in the activity. As 
stated, the principle sounds simple, but its oper- 
ation is subtle, as shown by the fact that it has 
not been recognized by many investigators. * 
The discussion of linear and nonlinear relation- 
ships constitutes a third important contribution. 
* Pointing out that most prediction studies end 
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without ever attempting to predict is a fourth™ a very considerable variety of methods for prog- 


significant contribution. * The chapter on re- 
search suggestions contains a number of im- 
portant contributions. One of the most interest- 
ing of these is the proposal that coóperative 
research studies be instituted. * Perhaps the 
chief contribution made by the publication of 
this volume is the pointing out of the fact that 
our techniques of prediction are quite inade- 
quate. Large numbers of studies have been 
made, but they do not follow a systematic plan. 
There is a real need at the present time for 
someone to summarize what we know in each 
area of prediction and to suggest what the next 
steps should be. In a sense, this volume does 
point the way in which research energies should 
be directed, but most of the topics are treated 
briefly. The chapter on research suggestions is 
especially disappointing in this respect. The 
only subject which is treated in a very com- 
plete manner is the case study method. Anyone 
interested in making predictions from case stud- 
ies will find this treatment worthy of his careful 
study. In reading this book one cannot help 
but sense the implication that progress in the 
prediction of personal adjustment will be very 
slow as long as nonmathematically-trained 
people continue to dominate (in numbers) the 
field. Without thorough training in mathe- 
matics an individual could hardly hope to solve 
some of the problems outlined. Perhaps the day 
is past when the nonmathematically-minded stu- 
dent could turn to the social sciences for escape 
from quantitative thinking. At least for those 
who go on to the Ph.D. degree in psychology it 
would seem that some training in college mathe- 
matics would be highly desirable. For the more 
intricate prediction problems teams of psychol- 
ogists, sociologists, economists, and mathemati- 
cians would be in the best position to carry on 
the necessary research studies. The illustrations 
used in the monograph are limited to those 
involving predictions of success in educational 
and vocational adjustment, marriage, and law 
Observance, but the methods discussed would 
be equally applicable to other activities in which 
a criterion can be established. While the list of 
references is small, it was carefully selected. 
The presentation of the material, both new and 
old, is scholarly. One can only regret that more 
Space was not available for most of the topics. 

Sociol & Social Res 26:481-2 My-Je 42. 
E(rle) F. Y(oung). * This volume well sub- 
Stantiates the claim that there is now available 


nostic use, * Clearly a command of a certain 
amount of applied mathematics is essential to + 
the development of prediction methods. In a. 
series of supplementary studies several mathe- 
matical techniques are presented and discussed. 
This material should stimulate sociologists to 
acquire more command of mathematics, since 
they can thereby greatly increase their contribu- 
tion to the understanding and solution of a 
whole new range of problems which the new 
techniques make possible. Younger students 
especially should acquire the necessary mathe- 
matical and statistical backgrounds on which 
much of the scientific sociology of the future 
will be built. For the mature scholar the vol- 
ume will serve as both a careful summary and a 
critique of the methods now being tested’ by 
research sociologists in many fields. There are _ 
ample selected bibliographies and the discus- — 
sions are fully documented. The entire study is 
an excellent example of co-operative scholar- 
ship, since the task was divided among a group 
of specialists whose work was all carefully 
co-ordinated into a well-integrated exposition 
of the central problem of prediction of personal 
adjustment. It remains for ways and means to 
be devised by which social workers, penolo- 
gists, and practicing sociologists generally are 


in daily practice. 


[ 943 ] ANS 
Horton, Crank W. Achievement Tests in Relation: 


$0.50. * (PA 18:292) 


Ed Abstracts 6:60 F "41. Victor C. Smith. 
Methods of achievement testing are briefly in- 
troduced. The first function considered is test- 
ing of information. Tests of vocabulary and of 
recognition of plant structures and knowledge 
of facts about them are discussed. Thought 
processes suggested for testing are use of in- 
formation in the solution of problems, interpre- 
tation of data, and the nature of proof. Tests 
of work and study abilities, and of attitudes, 
appreciations, and interests are considered. 
Almost half the report is devoted to an appen- 
dix giving examples of testing devices of value. 

Sci Ed 24:293-4 O ’40. E(lliott) R. D(own- 
ing). The committee approaches its task with 
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this question in mind, What changes is the’ with retention and use of information some time 


teacher really trying to bring about in students 
in a college course in general botany? Unless 
-test papers are scored, returned to the student 
and his errors discussed with him tests are rela- 
tively valueless, the committee thinks. "Detailed 
information....is lost early, while knowledge of 
important generalizations which the student has 
learned to apply are retained longer." * There 
are given 92 examples of tests collected from 
various colleges and universities and classified 
under the several chapters. * The bibliography 
of about 70 titles is confined with few excep- 
tions to articles dealing with teaching at col- 
lege level. The report is very worthwhile for 
every teacher of biological subjects in high 
school or college. It will help him think in terms 
of objectives other than the mere acquisition 
~ of information. 
* 


[944] 
*Howanp, Frenerick Tomas, Complexity of Men- 
tal Processes in Science Testing. Columbia Uni- 
; Teachers College, Contributions to Education, 
No. Irving Lorge, faculty sponsor. New York: 
Bureau of Publications, the College, 1943. Pp. v, 54. 
$175.* (PA 18:292) 


J Ed Psychol 35:254-5 Ap '44. Miles A. 
Tinker. The relationship between the ability to 
recall specific information and the ability to per- 
form more complex mental tasks was investi- 
gated in this study. Items from the Codperative 
General Science Test for college students were 
used. ‘Expert’ judges rated the items for com- 
plexity. Reliability of combined rating by six 
judges was .92. * Among the implications of 
the findings are the following: (1) Contrary to 
previous views, the more complex items in a 
Subject-matter test are neither more nor less 
valid than other items as measures of mental 
ability. (2) The ability to organize (use facts, 
make applications) in a field appears no differ- 
ent from extent of information ( recalling spe- 
cific facts) in that field. (3) This suggests that 
if the student is given the opportunity to ac- 
quire information, concepts and understanding, 
he can and probably will use them. The author 
seems well aware of the limitations of the 
study: (1) He was not concerned with measur- 
ing retention at the end of a specific course 
where intensive review presumably occurs. (2) 
There were some limitations in range of com- 
plexity of the items employed. (3) The results 
do not imply that instructional methods are 
unimportant. Nevertheless, if one is concerned 


after exposure to the material, the findings of 
this investigation are highly significant. They 
should help to correct certain misconceptions 
that have arisen from previous work. 

Q R Biol 19:179 Je *44. * Significant among 
the findings is that, in spite of popular belief to 
the contrary, the more complex test problems 
are neither more nor less valid as measures of 
mental ability, than the simple, specific-item 
tests. Also, these data have shown somewhat 
conclusively that the ability to use and organize 
facts (in problem solving) is essentially the 
same as that required for recalling specific facts 
(remembering). * 


[ 945 ] . 
*Howes, Sattyann. The Validation Study of 
Worksample 307, Form AC. Human Engineering 
Laboratory, Inc., Technical Report No. 310. Boston, 
Mass.: the Laboratory, June 1946. Pp. ii, 24. Paper, 
mimeographed. * 

[ 946 ] 
XHsü, Ew Hsr The Construction of a Test for 
Measuring Character Traits. Catholic University 
of America, Studies in Psychology and Psychiatry, 
Vol. 6, No. 1. Washington, D. C.: Catholic University 
of America Press, January 1943. Pp. vii, 55. Paper. 
$0.75. * (PA 17:2391) 

[94 ] 
*Hupson, HOLLAND, AND GELDER, ROSETTA VAN. Coun- 
seling the Handicapped: A Manual on Aptitudes: 
Their Discovery and Interpretation. New York: 
National Tuberculosis Association, 1940. Pp. x, 55. 
Paper. $0.50. * (PA 16:750, title only) 


Am J Psychol 55:308 Ap '42. Rose G. An- 
derson. A...manual of selected standard tests 
used by the Rehabilitation Service of the Na- 
tional Tuberculosis Association. The norms in 
most cases are those developed by the New 
York City Adjustment Service. Four cases 
illustrating use of the testing battery and re- 
sults are included. 


[ 948 ] 

*Hurp, Arcuer Wiis, What the Testing Pro- 
ram in the Schools of Nursing Has Taught Us. 
ichmond, Va.: Bureau of Educational Research and 

Service, Medical College of Virginia, January 1946. 

Pp. viii, 51. Paper, mimeographed. $0.79, postpaid. * 

[ 949 ] 

*INBAU, Fren E. Lie Detection and Criminal Inter- 

rogation. Baltimore, Md.: Williams & Wilkins Co; 

1942. Pp. vii, 142. $3.00. * (PA 17:264) 


J Appl Psychol 30:193-4 Ap 46. Howard 
F. Hunt. This book is intended primarily as a 
practical manual for criminal investigators. Its 
secondary objective is the stimulation of inter- 
est and research in the arts of lie detection and 
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criminal interrogation. The first section de- 
scribes the technique of scientific lie detection 
using the Keeler Polygraph. * According to 
Inbau, an accurate deception diagnosis can be 
made by a competent examiner in about 70% 
of the cases; while the indications may be too 
ambiguous or indefinite in about 20% of the 
cases to justify a definite statement. And in 
about 10% of the cases the best examiner may 
be wrong. * The second section describes prac- 
tical techniques for criminal interrogation with- 
out lie detector aid. * As a practical manual 
this book is outstanding, but the lack of theo- 
retical emphasis and sophistication may impair 
its research stimulating value. Inbau states, “The 
lie detector technique is predicated upon the 
theory that deception criteria appear in a record 
because of the emotional disturbances resulting 
from the subject’s consciousness of lying and 
his fear of detection.” But he also implicitly 
recognizes the importance of emotional condi- 
tioning or association, for instance, in his dis- 
cussion of the possible after-effects of “third 
degree” examinations on the records of inno- 
cent, truthful persons. After such an experi- 
ence, a truthful denial of guilt may be accom- 
panied by emotional responses and polygraph 
records like those characteristic of deception. 
Moreover, Inbau states that persons who have 
successfully rationalized their crimes or who 
have been relieved of their emotional conflicts 
regarding an offense by confession to a clergy- 
man may show few or no deception indices. 
And these indices of deception are absent while 
the criminal consciously lies concerning his 
offense presumably because of his fear of de- 
tection. Though "consciousness of lying" and 
"fear of detection" are important sources of 
emotional reactions, they may have been over- 
emphasized in previous theoretical discussions 
of this subject. They seem to lack sufficient gen- 
erality to be entirely suitable bases for either 
explicit or implicit theories of lie detection. Per- 
haps a theory formulated in terms of emotional 
conditioning or association might have both 
greater generality and greater stimulus value 
for psychologically meaningful experimentation. 

J Clin Psychol 1:349-51 O '45. Howard F. 
Hunt. * The book easily fulfills its primary 
purpose as a clear, comprehensible manual. It 
should be profitable reading for any educated 
person seriously interested in these problems. 
Attainment of this primary end is facilitated by 
numerous textual and pictorial illustrations as 


“well as by the chapters on legal precedent and 


opinion. The orientation is predominantly prac- 
tical, relatively little space being devoted to 
theorization. While not detracting from the 
book's immediate practical value as a manual, 
however, the lack of theoretical emphasis and 
sophistication may impair its value as a stimu- 
lator of research, Without becoming specula- 
tive or academic, Inbau might have emphasized 
more explicitly the role of emotional condi- 
tioning or association in his theoretical treat- 
ment of lie detection. Implicit recognition of 
this factor is indicated, for instance, in his dis- 
cussion of the possible after-effects of “third 
degree” examinations. “Consciousness of ly- 
ing” and “fear of detection” do not adequately 
account for the appearance of these misleading 
specific responses. Also, Inbau states that guilty > 
persons who have achieved some relief from ~ 
emotional conflicts regarding offenses by means 
of rationalization or confession to a clergyman 
may later show few or no polygraphic indices 
of deception while lying in order to avoid de- 
tection of their guilt. Exceptions such as these 
suggest that “consciousness of lying” and “fear 
of detection” may have insufficient generality 
to be entirely suitable descriptive or explana- 
tory principles of which to base an explicit or 
an implicit theory of polygraphic lie detection, 
Though important as sources of the emotional 
reactions utilized in many instances of detected 
deception, these factors probably have been 
overemphasized in some of the earlier discus- 
sions of this problem. Reformulation of lie de- 
tection theory in terms of emotional condition- 
ing or association might increase its generality 
as well as its value as a stimulus to psychologi- 
cally meaningful research. 

Prison World 7:31 Ja-F '45. * Especially 
interesting and of direct application to our read- 
ers are the sections on the ethics and the law 
concerning criminal interrogations. * 

Psychol B 40:457 Je ’43. William A. Hunt. 
This is a technical manual in an applied field. 
It is technical in the sense that it deals soundly 
and carefully with a professional subject. It is 
applied in that it accepts the phenomena in 
question (the bodily manifestations of emo- 
tions) and makes no attempt to investigate 
their fundamental causes and relationships, 
merely discussing their application in the prac- 
tical situation of lie detection. The experimental 
psychologist will be disappointed in that it con- 
tains no analysis of the intricacies of the physi- 
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ni oen ed to fond that the book general trend of the investigation, the conclu 
sions to be drawn, and the recomendations 
wt a new standard to he 
the construction and administration 
divided into two sections. fret of tests, paychological and educational alike * 
Ed Digest 17:11 My ‘42. Austin G. 
* will be of interest to statisticians 
and to all those who wish to perfect themselves 
the use of tests * 
Occupational Prychol 16:202-3 O "42 
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Am J Prychol 54:310 Ap ‘pi. Stewart H. 
Britt, The author....is perfectly willing to apply 
Grapbology to everything from mentality to 
matrimony, and from personnel to psychother- 
. * intended as an introduction to the at 
[=] ence ol graphology * Science, however, lies out 
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evidence on which the conclusions are based is, 
however, not given, and for this reason the scep- 
tic will still remain unconverted. The book 
makes fascinating reading. From each feature 
of handwriting in turn we are shown how the 
scribe’s idiosyncrasies may be interpreted as 
expressions of outstanding character and per- 
sonality traits, There are very many illustra- 
tions, and a complete analysis is given of a 
specimen of handwriting. There is a small sec- 
tion on the writing of children which will be of 
interest to teachers. Mr. Jacoby has produced a 
most entertaining book. 

New Statesman & Nation 18:256-7 Ag 12 
'39. William Plomer. * Mr. Jacoby combines 
moderation, thoroughness and exactness with 
no slight ability as an imaginative psychologist. 
He is anxious to stress the usefulness of graph- 
ology. * The usefulness of graphology, so ably 
explained by Mr. Jacoby and so easily con- 
firmable by experience, can hardly be contested. 
One would like also to stress its pleasures. * 

For additional reviews, see 40:B9ó9. 
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*Jacosy, H. J. Self-Knowledge Through Hand- | 


writing. London: J. M. Dent & Sons Ltd., 1941. Pp. 
xv, 71. (New York: William Salloch.) Out of print. 
(PA 16:2731) 


Loyola Ed Digest 17:11 My '42. Austin G. 
Schmidt. * Today there are indications that 
graphology is beginning to be held in greater 
respect in the United States. It is to be hoped 
that an instrument so full of promise will be 
still further encouraged and developed. * Mr. 
Jacoby has not given us in the present book a 
complete treatise on graphology, but he has pro- 
duced the most fascinating presentation of the 
subject with which we are familiar. By means 
of well-chosen and beautifully reproduced illus- 
trations from nature and daily life he dfaws a 
parallel between the revelations found in hand- 
writing and those found in tracks in a road, in 
facial expression, in gesture, and in animal life. 
The book is the work of a scholar who knows 
his subject. 


[ 955 ] 
*Jarviz, L. L., anp Extrncson, Marx. A Handbook 
on the Anecdotal Behavior Journal. Chicago, Ill.: 
University of Chicago Press, 1940. Pp. xii, 71. Out of 
print. * (PA 15:1509) 


Am Sch Board J 102:75 Ap ' 41. * a substan- 
‘tial contribution to the field of personal better- 
ment by a careful program of records as they 


function in a going educational program. Nu- 
merous leads are provided for those teachers 
who are seriously concerned with the personal 
development of students on a front much 
broader than academic achievement. 

Clearing House 15:561-2 My "41. J(ohn) 
C(arr) D(uff). * an informative account of the 
procedures by which the faculty of the Roches- 
ter Athenaeum and Mechanics Institute have 
constructively and critically adapted the anec- 
dotal technique. 

Ed Abstracts 5:324-5 N ’40. Dewey B. 
Stuit. * This clear and practical treatise de- 
serves the attention of every faculty member 
or special counselor interested in the welfare 
of students. The authors have indicated the pós- 
sibilities and limitations of the method. If used 
in accordance with their suggestions, it cannot 
fail to be helpful in the guidance program. 

Harvard Ed R 11:395-6 My ’41. W. H. 
Burton, * the most adequate account now avail- 
able of that recent addition to the school record 
—namely, the behavior journal. It is presented 
in direct, lucid language which cannot be too 
highly complimented. * The first chapter defines 
the behavior journal and gives concrete illus- 
trations taken from the record. Three chapters 
then follow on “observing behavior, recording 
behavior, and interpreting the record.” Here 
again are presented specific examples, processes, 
and criteria. Vague generalizations are nowhere 
present. The fifth chapter on “implementing 
the record” discusses the guidance processes 
which should follow analysis. Three case studies 
and six generalizations about guidance proce- 
dures are included. This is the only chapter 
which might be criticized as being too abbrevi- 
ated. The sixth chapter, also sharply abbrevi- 
ated, gives the outline of procedures desirable 
in putting the journal into operation. The last 
chapter consists of twenty-four numbered sum- 
mary statements. This volume should be widely 
read by all types of teachers interested in the 
current effort to improve school records, diag- 
nostic and guidance data, and indirectly to im- 
prove teaching. The widespread attack upon 
school records is but one phase of the revolu- 
tionary movement which has been for some 
time reorganizing curriculums, improving class- 
room practice, and in general modernizing the 
educative process. This volume is by all odds 
the single most helpful discussion on behavior 
records yet to appear. 

J Am Assn Col Reg 16:344-5 Ap "41. W. C. 
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Smyser. * a stimulating and suggestive hand- 
book for counselors. Both values and shortcom- 
ings of anecdotal records are pointed out, and 
there is outlined a constructive program for the 
installation of such records—teaching a faculty 
to select and record anecdotes and the coun- 
selors how to use them, and to base upon them 
practical individual programs for personal and 
intellectual growth. The anecdotal record is 
conceivably far too cumbersome for adoption 
on many campuses. Nevertheless, any counselor 
and any registrar, interested in a practical coun- 
seling program, will in every case find much 
that is valuable in this comprehensive book. 

Jun Col J 11:355 F '41. This practical man- 
ual interprets a method of gathering informa- 
tion as a basis for the understanding of students 
and their problems, which is applicable as part 
of the personnel program in many junior col- 
leges. * 

_ Progressive Ed 18:62-3 Ja *41. Eugene R. 
Smith. * describes the experimentation carried 
on at the Rochester Athenaeum and Mechanics 
Institute in its attempt to improve faculty un- 
derstanding of students and the counseling pro- 
gram of the institution. Based, as is common 
among newer ways of studying individuals, on 
description of behavior, this method requires 
that instructors record in regular sequence 

anecdotes” that seem to have significance as 
Source material for understanding students. To 
the best of the reviewer's belief, no other insti- 
tution has given such complete trial and study 
to anecdotal recording. The care with which 
objectives, both for an individual and for the 
Program of the school, have been analyzed, and 
the unwavering manner in which all procedures 
have been focused on the improvement of the 
guidance of students are evident throughout the 
book. * Of considerable importance is the way 
in which the general text is illustrated by sam- 
Ple cases, It is gratifying that the authors, 
while firmly convinced of the value of this ap- 
Proach to a better understanding of people, 
nevertheless allow flexibility in the use of the 
method and make no extravagant claims for it. 
They recognize the need for more extensive 
experimentation, and speak of the anecdotal 
technique as a “valuable adjunct to personal 
records rather than as a substitute for many of 
those now in use.” This book should be read 
not only by personnel and guidance officers but 
by teachers in general. It has significance for 
better educational procedures whether or not 


the reader ever uses the particular techniques 
described. 

Pub Personnel R 2:87 Ja ’41. * The anec- 
dotal journal, providing a running record of 
behavior and including the composite evalua- 
tion of a person’s activities under a variety of 
situations, might conceivably form a point of 
departure in attacking the problem of rating 
employee efficiency and service. * freely illus- 
trated with forms and charts * written with a 
minimum of theoretical discussions. 
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*Jennincs, Heren HarL, Leadership and Isolation: 
A Study of Personality in Inter-Personal Rela- 
tions. Foreword by Gardner Murphy. New York: 
Longmans, Green and Co., Inc., 1943. Pp. xv, 241. Out 
of print. * (PA 18:202) 


Am J Psychol 57:602-3 O '44. Martha L. 
Lemmon. * The extent to which an individual 
chooses others is called his "emotional expan- 
siveness." This is distinguished from his "so- 
cial expansiveness,” which is taken to mean the 
extent of all social contacts initiated or main- 
tained by the individual. It is measured for each 
subject, somewhat dubiously, by records of the 
number of persons she has spoken to of her own 
accord. As 133 girls were measured twice, with 
an intervening 8-month interval, there is vast 
opportunity for statistical analysis of the 
gleaned results. The author rolls into place the 
heavy artillery of fractionation and calculation 
in order to split the ‘social atom,’ thus reducing 
the entire middle section of the book to a long, 
dry Sahara of figures and tabulations to be read 
with much forward and backward reference to 
tables and definitions. The statistical revelation 
of a more or less constant ‘emotional expansive- 
ness’ for each individual is worthy of note, al- 
though only two measures hardly seem suffi- 
cient for drawing final conclusions. The very 
use of the term (emotional expansiveness), is, 
in fact, open to question, inasmuch as the author 
herself admits that choices for others were 
sometimes made without expectation or even a 
wish for mutual responsiveness, but might be 
determined by the chooser's enjoyment of the 
chosen one's conversation and the like: The per- 
sonal characteristics discovered for leaders and 
isolates are interesting. * The chief values of 
this book...lie in the gross classifications into 
groups of leaders, average, and isolates by so- 
ciometry and then the inquest into their attri- 
butes by questionaries submitted to house- 
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mothers and interviews with the choosing and 
rejecting subjects. 

Am J Sociol 49:372 Ja '44. Robert E. L. 
Faris. * Jennings manages...to make a very 
successful investigation of the roles of leader 
and of "isolate" through the use of sociometric 
methods. Her results sound almost as if they 
had been written under the guidance of Cooley 
—which may be a tribute both to Cooley and 
to her. * The conclusions emerge quite neatly 
and are satisfactorily supported by the statisti- 
cal analysis. Some case studies add a rounded 

image of the results and afford some relief in a 
discussion otherwise heavy with figures and 
new language. * The persons chosen frequently, 
the leader types in this sort of community, are 
those who are aggressive enough to push them- 
selves ahead but at the same time are solicitous 
of others—and skilful in both respects. This is 
in agreement with the advice in' most of the 
“How To Win Friends" books and also with 
traditional or folk sociology. The methods used 
in making the study are novel, however, and it 
may be that we are observing in such studies 
the development of some valuable machine tools 
of sociological and social-psychological research. 
Ed Res B 24:54 F 1445. H. B. English. * 
The techniques and concepts are those of “so- 
ciometry"—an interesting movement which un- 
doubtedly is producing valuable insights, but a 
movement which tends to a certain amount of 
cult-like rigidity. Moreover, the book is far 
from easy to read ; it is full of rather technically 
used terms, and its sentence structure is elabo- 
rate. * That having been said, it is time to rec- 
ognize the genuine importance of this little mon- 
ograph. It outlines and discusses procedures of 
admirable simplicity which yield valuable data 
for both theory and practice. And the analysis 
of how and why we make choice of companions 
in work and play; of our actual contacts with 
people and of the contacts we desire—all of this 
is immensely valuable. The kind of study of the 
interactions of individuals here carried forward 
will presently revolutionize—the reviewer means 
that seriously—the concept of the educational 
process. An eloquent and simple restatement of 
Miss Jennings’ findings would be a first-rate 
contribution hardly second to her own pioneer- 
ing research. 

J Abn & Social Psychol 39:382-4 Jl’44. Jean 
M. Arsenian. * As an analysis of the choice 
process, Jennings’ study rescues sociometry 
from a miasma of undefinable, elective affinities; 


Its contributions in this respect can receive no 
more generous an evaluation than is presented 
by Professor Gardner Murphy in his Foreword 
to the book. As an analysis of leadership this 
study is at once reassuring and disappointing. 
Generalizations about leaders as "group-ori- 
ented” as contrasted with “self-bound” individ- 
uals are familiar—hence reassuring. What is 
disappointing is the confusion in interpretation 
which seems to stem from inadequate concep- 
tualization of two theories of leadership, a role 
theory and a trait theory. The data as pre- 
sented wear now the one aspect, now the other, 
with neither clearly defined. 

J Clin Psychol 1:166-9 Ap '45. Ben Karp- 
man. * Only in recent years has the literature of 
social psychology begun to extend beyond the 
best traditions of armchair research. * A most 
valuable example of the newer trends can be 
found in Jennings' recent book. This reports a 
research that richly deserves to be called ex- 
perimental and which goes far beyond the trite 
and conventional approaches to make a genuine 
contribution to the dynamics of psycho-social 
forces. No person lives alone, and the analysis 
of inter-personal relations in meaningful terms 
is crucial to an understanding of individual and 
group behavior. Jennings' investigation gives 
broad experimental evidence as to how indi- 
viduals differ in the number and kinds of rela- 
tionships they may establish with others, and 
how leadership and followership grow out of 
these relationships. The volume under review 
reports a continuation and extension of the 
techniques she pioneered earlier in collaboration 
with Dr. J. L. Moreno. This investigation goes 
considerably beyond the original sociometric 
tests, for it attempts to learn on a rather elabo- 
rate scale the social and inter-personal factors 
which determine why certain individuals in a 
community are chosen by many as their leader, 
and why others are rejected. * Of extreme inter- 
est is the finding that like, even identical traits, 
may produce widely differing effects on others, 
showing that the manner of expression is fully 
as important as the trait itself. * What Doctor 
Jennings calls emotional expansiveness must 
relate to those personality factors, in the long 
run entirely emotional in nature, which draw 
people to one another, or repel them, and form 
the basis of like and antagonistic groups. Why 
Jean chooses Olga and rejects Vera can only in 
part bc accounted for by the personalities of 
Olga or Vera, but in larger measure yet, the 
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explanation must be sought in Jean, her per- 
sonality make-up, her loves and hates, her anxi- 
eties and her insecurities. The search for these 
factors is not the author’s task, but it is an im- 
portant problem which the reviewer hopes the 
author will take up in her next work. * Addi- 
tionally, and very important, we should like to 
know more about the unconscious factors which 
underlie one’s choice and rejection from which 
our larger social attitudes grow. How much of 
identification, even introjection, is in these 
choices? How much and what kind of hostility 
and other emotions is there behind the rejec- 
tions? All of these problems are basic and posit 
for consideration the problem of the constitu- 
tion versus the environment in the broad sense 
of internal and external environments, of family 
versus society, of the individual versus the com- 
munity, and so forth. Because this is essentially 
an inquiry into the anatomy of psycho-social 
relations and as yet does not consider the physi- 
ology of the situation, the future consideration 
of these factors will have to be made a psycho- 
dynamic focus. The reviewer suspects that the 
author herself would agree. That she is aware 
that this is so is evident from her protocols in 
which the personalities of some of the members 
of the group are described in insightful terms. 
Yet the reviewer insists that what is needed is 
a complete psychoanalysis of each over-chosen 
and each rejected under-chosen individual. The 
re-test of the community after eight months 
shows that there was but little change in the 
choice and rejection structure, and suggests that 
the social fabric has a fairly fixed pattern of its 
own. This finding is exceedingly important for 
the study of social psychology and should pro- 
vide a fruitful attack on the problem of inter- 
personal relations. Another important problem 
is that of the psychology of followership. Every 
over-chosen person is selected by a definite 
group of people and for definite reasons. This 
intimate study of the varied reasons for choice 
by the different people of different leaders illu- 
minates a very obscure social problem. Here, 
again, a comparative psychoanalytic study of 
the differences between followers in their needs 
and expectations of securing fulfillment through 
leaders should be undertaken. The work raises 
further interesting questions. Granted that lead- 
ership and isolation do not change in a period 
of eight months, but what about eight years, or 
twenty-eight years? From what we know of 
Hitler’s early life, he was more of an isolate 


than a leader, yet later we find him a volcanic 
leader. Does this not mean, then, that each per- 
son's psychological situation depends upon his 
own stage of development as well as upon the 
environment in which he finds himself ? Viewed 
broadly, leadership and followership are per- 
haps the most important social problems we 
have before us today. They determine whether 
we shall have a Hitler or a Roosevelt, war or 
peace, an enslaving dictatorship, or a free de- 
mocracy. In this light, Doctor Jennings’ study 
assumes an importance far greater than is ordi- 
narily given to a work dealing with a specific 
psycho-social problem. She has given us in this 
work the basic pattern along which these rela- 
tions are established; she has also presented a 
detailed analysis of a technique by which these 
relations may be determined. By providing us 
with new insights into the problems of psycho- 
social relations, this analysis suggests what lines 
we have to pursue in the future. We urgently 
need to know to what extent and how the 
leader influences the follower and to what ex- 
tent the follower determines the nature and kind 
of leadership. * Of exceptional importance are 
the protocols giving graphic personality descrip- 
tions of some individuals. Would that we had 
been given more of such! For these studies pro- 
vide us with significant clues as to basic person- 
ality make-up of individuals in specific relation 
to other persons. Along with such mean- 
ingful insight into the lives of the subjects, 
Doctor Jennings needs but one step further to 
get into a field which alone can give the basic 
explanation for the phenomena which she has 
so carefully elicited—the field of psychoanalysis. 
By emphasizing the basic sources for the de- 
velopment of social reaction, psychoanalysis has 
shown that our relations to others are only an 
echo of the original relations we had with those 
who were close to us in our family life, an up- 
ward extension of childhood attitudes into adult 
life. Every psychiatrist would profit from a 
reading of this book; it offers the most pro- 
found analysis of leadership and isolation in the 
social process this reviewer has discovered in 
the field of social psychology. From this point 
on, the work must be done in psychoanalytic 
focus. 

J Ed Psychol 36:62-4 Ja '45. H. Meltzer. * 
an interesting and thorough job in studying 
the problem of choice more thoroughly than 
ever done by the sociometric approach. Her 
book....is a contribution to sociometric research, 
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also a contribution to methods in social research 
in general. * Leaders are not perfect individ- 
uals. That is one of the more interesting results 
found by the author. * All in all, this is an in- 
teresting study, an interesting follow-up of 
Moreno’s sociometric approaches. It represents 
a more intensive approach of the whole prob- 
lem of choosing in interpersonal relationships. 
And the fact that these over-chosen and under- 
chosen are related to behavior characteristics 
makes of it a more enlightening study and some- 
what broader than the ordinary sociometric 
approach where choices are merely asked for 
without any correlations to behavior character- 
istics. Considering the fact that Moreno’s basic 
concept is that effective functioning depends 
on spontaneity, one wonders why, instead of 
merely ratings by house mothers, the soci- 
ometric research group does not depend more 
on either dramatic or projective techniques for 
evaluating spontaneity and naturalness in human 
beings and correlating that, rather than ratings 
with sociometric choices of the people who are 
leaders or who turn into isolationists, the over- 
chosen and the under-chosen. Aside from that 
additional suggestion, the book is a valuable 
contribution to sociometric research and should 
be of use to individuals who do not think of 
themselves as being exclusively sociometrists. 

Psychiatry 6:448 N ’43. Clara Thompson. * 
The book gives the impression of a careful and 
well-controlled study. It is not easy to read 
because of the frequent use of abbreviations, 
references to tables, and a concise brevity of 
presentation. However, for those who find in- 
terest in the statistical approach to problems of 
human behavior, this book can be highly recom- 
mended. 

Psychoanalytic Q 14:252-3 Ap '45. Ruth 
Loveland. * This study is carefully thought out 
and executed. The author stops about where the 
psychoanalyst would begin to study the dy- 
namics of choice. The style of writing, the spe- 
cial vocabulary employed by the author and the 
presence of large numbers of complicated tables 
in the text make this a very difficult book to 
read. 

Psychol B 41:263-4 Ap’44. Clarence Leuba. 
* Exception might be taken to the use of the 
term “leaders” to describe the persons with 
whom people like to work and live. The author 
furnishes no evidence to indicate that all such 
persons would necessarily be chosen for posi- 
tions of leadership. The use of the term leader- 


ship is but one illustration of the use of a rather 
grandiose terminology to cover relatively sim- 
ple and limited phenomena. One wonders, also, 
whether the “sociometrist” is justified in coin- 
ing as many elaborate terms as he does. Many 
chapters bog down with definitions and with 
slight correlations among unimportant but im- 
pressively worded variables. This sometimes 
prevents the significant contributions made by 
the author, in this important area of compati- 
bilities and antipathies, from standing out as 
clearly as they might. 

Sociometry 6:450-4 N '43. Ronald Lippitt. 
This book is not an essay on leadership but the 
report of a noteworthy research project. * The 
clinical and social psychologist will both find 
much that is suggestive in Jennings' analysis 
of how the proportion and direction of a given 
person's positive choices and rejections “meet” 
or “pass by” the incoming rejections and choices 
from others. Several clearly different patterns 
emerge from the data. * In her final chapter 
Dr. Jennings attempts to orient her findings in 
terms of the social-field statements of Gardner 
Murphy and the psychobiological field theory of 
Kurt Lewin. Here she runs into difficulties, in 
the opinion of this reviewer, because her own 
data are primarily a structural framework for 
the analysis of social dynamics, and lack the 
facts necessary to draw out interpretations con- 
cerning resultants in terms of social behavior of 
individuals or.trends in group behavior. It is 
hard to utilize a field concept without having 
any systematic concept of direction. The treat- 
ment of leadership, for example, is narrowed to 
the idea of supplying emotional needs because 
the idea of direction of leadership or direction 
of group locomotion is missing. Certainly the 
emotional leadership syndrome is a vitally im- 
portant and sadly neglected constellation, but a 
researcher oriented in terms of field theory 
would feel obligated to collect behavior data 
on the field forces determining specific leader- 
ship behavior in specific situations to see what 
could be predicted about actual social and 
coordinative leadership from a knowledge of 
the structural network of affectional and rejec- 
tive feeling tone centering around various indi- 
viduals. If the prediction value of such data is 
found to be high we will know we have hold 
of one of the most important variables in the 
field determining leadership emergence and 
behavior. It seems to this reviewer that we al- 
ready have enough data from anthropological 
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analysis of the leadership role, institutionalized 
forces of leader-role-expectation, and of many 
other'selection-forces besides the emotional ones 
to raise real doubt as to the validity of building 
up any real field theory of leadership upon the 
basis of verbal choice material, even when it is 
as closely related to behavioral and situational 
criteria as in the present study. Anthropologists 
and research social psychologists have been just 
as negligent in taking into account the impor- 
tant variables studied in this project which 
throw light on several completely unexplored 
areas of group life. It is certainly safe to say 
that this report pioneers in social research 
methodology and in revealing the promise of 
fruitfulness in correlating sociometric choice 
data with actual behavioral data in the areas 
of leadership behavior, social rejection dy- 
namics, and “goodness” of community function- 
ing. Educators, group workers, agency and com- 
munity administrators, and other applied social 
scientists are all in need of the findings of this 
study, and of the even more fruitful fact-finding 
expeditions that should flow from it if social 
scientists get from it the cues they should. 
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XJeNNINGs, Heren H. Sociometry of Leadership: 
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Sociogroup. Sociometry Monographs, No. 14. Re- 
printed from Sociometry, Vol. 10, 1947. New York: 
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XJENNINGS, ISABEL. An Experiment in the Meas- 
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X*JENSs, RACHEL M., AND Soutuer, Susan P. Meth- 
ods of Assessing the Physical Fitness of Children: 
A Study of Certain Methods Based on Anthropo- 
metric, Clinical, and Socioeconomic Observations 
Made of 713 7-Year-Old White Boys and Girls in 
New Haven, Conn., Over a period of 19 or 20 
Months During 1934-36. United States Department 
of Labor, Children’s Bureau, Bureau Publication No. 
263. Washington, D. C.: Government Printing Office, 
1940. Pp. vi, 121. Paper. Out of print. * 


J Am Stat Assn 36:309 Je "41. John R. 
Miner. In any program for improving the physi- 
cal condition of children an important factor is 
a practicable procedure to indicate those chil- 
dren who are most in need of attention. In an 
attempt to develop an easily applied method of 
asséssing physical fitness, various indexes of 
body build have been devised, based on the rela- 
tion of weight or girth of arm to age and height 


or to age, height, and some lateral measurement. 
This study is an evaluation of four of these in- 
dexes: The Baldwin-Wood Tables, the A C H 
Index, the Nutritional Status Indexes and the 
Pryor Width-Weight Tables. It is based on 
observations made on 713 seven year old school 
children of New Haven, Connecticut. The cri- 
teria used in evaluating the four indexes were 
based on a clinical judgment of poor or very 
poor nutritional condition made by a pediatri- 
cian, on a low rate of growth in weight or girth 
of arm or on a combination of the two elements. 
All the indexes were found to lack both sensi- 
tivity and specificity: that is, many of the chil- 
dren whose condition was indicated by the 
criteria as unsatisfactory were not picked out 
by the index and conversely many of the chil- 
dren whose condition was indicated by the 
criteria as satisfactory were picked out by the 
index as tinsatisfactory. There was also con- 
siderable disagreement in the clinical judgments 
of different pediatricians. 
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*Jounson, Loaz W. The Administrative Function 
of English in the University of California: The 
Evolution of the Examination in Subject A. Uni- 
versity of California Publications in Education, Vol. 
7, No. 4. Berkeley, Calif.: University of California 
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*JoiT COMMITTEE OF THE West Ripinc TEACHERS" 
ASSOCIATION AND THE West YorKSHIRE County As- 
SOCIATION OF THE NATIONAL UNION oF TEACHERS. 
Special Place Examinations: The Report of an 
Investigation by a Joint Committee of the West 
Riding Teachers’ Association and the West York- 
shire County Association of the National Union 
of Teachers, of the Merits and Defects of Some 
Forms of Special Place Examination. London: 
University of London Press, Ltd., 1941. Pp. viii, 71. 
2s. 6d. * (PA 16:1673) 


Brit J Ed Psychol 11:223-6 N "41. C. W. 
Valentine. * It is a happy sign of the times 
that teachers' organizations should be carrying 
out such useful research, and the Committee is 
to be congratulated on its enterprise, and on the 
planning of the enquiry. Some 2,000 children 
in West Riding schools were tested by means 
of: (a) ordinary types of examination papers 
in English and arithmetic; (b) ‘new type’ ex- 
amination papers in the same subjects; and (c) 
several Moray House intelligence tests. * the 
statistical analysis of the results of the exami- 
nation papers and of the intelligence tests was 
carried out by Professor Godfrey Thomson and 
Mr. W. G. Emmett. This was done with all the 
thoroughness and exactitude we should expect 
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from these investigators. * The inclusion of the 
‘new type’ examination papers....and the compar- 
ison of their results with those of the ordinary 
types. of examination and with the intelligence 
tests give a special interest to this investi- 
gation: * The correlations were calculated be- 
tween the performances in the various tests and 
the orders of merit drawn. up by the primary 
teachers of the schools the children came from 
* Admittedly this was not nearly so satisfactory 
a criterion as the actual records of the children 
in their subsequent secondary school careers, 
but the present results are offered as a kind of 
interim report. * It is well that this limitation 
of the value of the criterion should be kept in 
mind in considering these reports. * The main 
result is that all the correlations....give corre- 
lations round about .8 with the estimates of the 
teachers. * Here undoubtedly is one very inter- 
esting fact, namely, that the ‘new type’ exami- 
nation papers in English and arithmetic are as 
useful (according to this criterion) as are the 
ordinary types of examination papers. In view 
of the greater ease with which the ‘new type’ 
papers can be standardized and marked, this 
surely is a very strong reason for their being 
preferred. * There seems to be nothing in the 
correlations afforded by the investigation to 
justify the first suggestion, namely, the reliance 
on the recommendations of the heads of schools, 
even if they have the intelligence quotients and 
education quotients before them. * It seems to 
me, however, that the main object of the Com- 
mittee could be obtained in a relatively simple 
manner of the type I have already recommended 
elsewhere [Examinations and the Examinee, 
1938]. There it was pointed out that all the 
results which had come under my own en- 
quiries and that of others so far published, 
agreed in showing that the addition of papers 
in arithmetic and English to intelligence tests 
improved very little, if at all, the correlations 
between (a) the order of merit in the whole 
entrance examination, and (b) the order of 
merit at the end of four or five years in the 
secondary school. It was argued, therefore, that 
one could rely upon the intelligence test alone— 
for the general testing of the particular area. 
That would have the great advantage of avoid- 
ing the evils of cramming in some primary 
schools, emphasized again in the report before 
us, and of special coaching arranged by par- 
ents or by teachers in certain schools. * the 
intelligence tests would determine the quota of 


school A and of school B, etc.; but in the selec- 
tion of school A’s quota, the head’s estimates 
should have an important influence. May sug- 
gest to the Committee that some such plan 
would meet the weakest point in their scheme, 
namely, the variability in the standards of dif- 
ferent heads. In the meantime, we may welcome 
this most useful contribution to what remains 
a difficult and complex problem. It ‘seems 
doubtful, however, how far further improve- 
ments can be made in our technique of selection 
for secondary education (unless valid tests of 
specific abilities can be devised for eleven-year- 
olds) and that one must look for the solution 
of the problem (or, rather, for compensating 
for the inevitable errors) in the lessening of 
great differences between the education of those 
selected and not selected at eleven years by 
lengthening and improving senior school educa- 
tion and extending it in continuation schools, 
and by facilitating later transference at least 
upwards from the senior to the secondary or 
technical school. i 

J Ed (London) 73:274-5 Je"41. Philip Har- 
tog. The West Riding has a high reputation for 
its educational work, and the present Report... 
does credit to all concerned though its conclu- 
sions will certainly not be accepted without 
criticism. The central feature of the Report is 
a statistical investigation by Prof. Thomson and 
his staff. They compared, by means of correla- 
tion co-efficients, the ranking of head teachers, 
taken as the best criterion for the present 
(though not.for the future), with (a) the exist- 
ing ‘County Minor Scholarship Examination’, 
which comprises ‘old type’ attainment tests in 
English and Arithmetic and a Group Intelli- 
gence Test, and with two other examinations; 
(b) a combination of standardized objective 
tests and a Group Intelligence Test; and (c) a 
Group Intelligence Test (or Tests) taken 
alone. The committee recommend that the pres- 
ent examination (and also, it would appear, 
though it is not clearly stated, the present 
method of selecting candidates for that exami- 
nation, based on a Preliminary Test in the 
schools, or on school reports, or on a combina- 
tion of the two), should be replaced by a single 
standardized Group Intelligence Test, followed 
by supplementary tests for children in whose 
case there is a discrepancy between the exami- 
nation results and the reports of the teachers. 
The Chief Examiners in English and Arith- 
metic dissent from the new proposals on grounds 
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which deserve consideration. They suggest in 
_ particular that the danger of missing examiners’ 
blunders has been exaggerated, and refer in this 
matter to a report on the 1939 examination, of 
which precise particulars are not given, but they 
do not appear to have tested their own self- 
consistency. * makes suggestions as to the 
method of selecting the appropriate kind of 
school-for each pupil....and....the form of school 
record card which should be used, a matter on 
which the Kent and Wiltshire County Coun- 
cils have made interesting experiments. The 
present Chief Examiners point out that there 
is no place in the record proposed for “ability 
to write English"—a serious omission. * de- 
serves careful consideration by all those con- 
cerned with Free Place Examinations. * 


[ 962 ] 
*Jones, HaroLn E.; ASSISTED BY THE STAFF OF THE 
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in Adolescence: Appronchias to the Study of the 
Individual. New York: Appleton-Century-Crofts, 
Inc., 1943. Pp. xix, 166. $2.00. * (PA 18:344) 


Am J Orthopsychiatry 14:374 Ap ' 44. ( Rich- 
ard L.) Jenkins. Concerned with the physical, 
mental and social growth of an adolescent boy 
Írom his eleventh to his eighteenth year, this 
study is the result of painstaking observation, 
measurement, and comparison of 200 adoles- 
cents which provide many norms upon which 
the boy's growth is projected. * The usefulness 
of the wealth of material presented is a little 
difficult to appraise. Included are some growth 
norms for adolescence. While the case study is 
careful and detailed for the methods chosen, the 
possibilities of investigative interviews and 
autobiographical narrative appear to a psychia- 
trist to be almost conspicuously neglected. The 
book should serve as a useful reference case 
study illustrating the problems of adjustment 
which exist "within the normal range." It 
throws healthy emphasis upon seeing the sub- 
ject against the background of his group. The 
variety of methods of investigation is provoca- 
tive and stimulating, and well worth the con- 
sideration of students in the field. Studies of 
this nature should in sufficient number indicate 
more clearly which trends and problems tend 
to be self-remedying, and what measures are 
likely to be therapeutic. The present study, 
alone, is perhaps to be valued more for present- 
ing comparative material than for analyzing it. 

Am J Sociol 50:324 Ja ’45. Ruth Shonle 


Cavan. * Although the interpretation of the data 
secured is excellent, the reader has the im- 
pression of gliding over the surface of the 
boy’s life and suspects that much hidden data 
remain unrevealed. A great lack is a clear state- 
ment of the complete plan of the study and of 
the method used. Reference is made to another 
publication in which this information appears, 
but previous publication does not aid the reader 
who comes upon this book without having first 
happened to read the previous statement. Tests 
are mentioned incidental to the presentation of 
scores, but nowhere may the research worker 
learn all that was done, how it was done, or 
what obstacles were met and overcome. 

J Abn & Social Psychol 39:267-9 Ap '44. 
Robert W. White. * the Reputation Test....[is] 
an ingenious method of discovering each year 
by a sort of ballot “what John’s fellow pupils 
thought of him as compared with others of the 
same grade level.” This is augmented by a ques- 
tionnaire on best friends which permits the 
construction of “sociograms” revealing groups, 
cliques, pairs, and isolates from year to year. *- 
The thoroughness and richness of the observa- 
tions of social behavior are an outstanding merit 
of the book. * A chapter is devoted to “underly- 
ing tendencies”—tendencies which Jones views 
with suspicion, surrounding the phrase with a 
cordon of quotation marks. Particularly valu- 
able in this section is the analysis of drive- 
patterns according to a technique developed by 
E. Frenkel-Brunswik; tendencies and wishes 
not expressed in overt behavior, often very 
different from what is expressed, come to light 
by this method of analysis. Less complete and 
satisfactory are the reports on the Thematic 
Apperception Test, a voice record, the Ror- 
schach Test, and emotional reactivity as meas- 
ured by the galvanic skin response. * Let us 
now see how Jones in his concluding chapter 
draws this material together and makes it ask 
significant questions. From the “complex fed- 
eration of factors” which determined the fall 
and rise of the boy’s personal welfare, he selects 
for initial mention the delay in growth and bod- 
ily maturity which occasioned, in “the com- 
petitive adolescent culture,” a loss of status 
with others and a possible undermining of per- 
sonal security. Next he points out the predomi- 
nant ectomorphy of John’s physique and shows 
that much, though not all, of his behavior can 
be ordered to the temperamental component, 
cerebrotonia, recently proposed by Sheldon. To 
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economic pressure is assigned a rather diffuse 
influence over John's development. The prob- 
lem of escaping from the “overprotective atti- 
tudes” and “self-centered demands" of the 
mother is pointed out as another possibly sig- 
nificant factor. Lastly, there is a return to the 
physical factor, the lack of the athletic ability 
and strength that is so much prized by adoles- 
cents. That is all. Jones makes no attempt to 
weigh these factors, to determine their particu- 
lar contributions, or even to squeeze them for 
important questions that might be answered 
by future research. "The concluding chapter," 
he warns us in the preface, “deals primarily 
with interrelationships basic to construction of 
‘the individual as a whole’; however, since any 
such interpretation must include a fair amount 
of conjecture, the reader whose interests lie in 
this field is for the most part left free to de- 
velop his own hypotheses.” This freedom is cer- 
tainly misplaced in a study from which, in the 
interests of anonymity, “we have found it neces- 
sary to exclude a large amount of recorded per- 
‘sonal data." In its stead this reviewer would 
have welcomed a weighting of factors and a 
most likely interpretation of the case as a whole, 
arrived at after discussion and mutual criti- 
cism by all members of the staff. But if it is 
necessary to be content with a mere pointing- 
out of important interrelationships, we can still 
justly demand that what is pointed out should 
be theoretically capable of explaining the out- 
standing features of the case. Nothing in Jones’s 
enumeration explains John’s constructive ac- 
tivities. None of the mentioned factors lead us 
to suppose that John is rapidly becoming a 
well-adjusted college student rather than a 
schizophrenic or an eccentric isolate. John has 
an overprotective, self-centered mother, but 
although he is somewhat dependent he is also 
very strong in the drive for autonomy. Whence 
this autonomy? John is rejected and even hu- 
miliated by the social group, but he continues 
to seek recognition and to strive for social skills. 
Whence this persistence and this shunning of 
pathways to escape? John is serious-minded, 
reading and arguing about the world and its 
problems; he is ambitious to make something 
of himself: Whence these sober aspirations? Sig- 
nificant questions fairly burst from this account 
of a developing personality, But Jones sets it 
all down to “the toughness of the human or- 
ganism"—and no questions asked. Serious 
shortcomings in the data lie behind this curious 
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unwillingness to investigate constructive be- 
havior. The material on social behavior, to be 
sure, with the annual testing and the constant 
reference to norms, sets a standard at which 
future studies of personality should aim. The 
material on family relationships, on the other 
hand, falls distinctly below the line which sepa- 
rates scientific data from guesswork. Did the 
mother give rewards of praise for good school 
work? Did the father encourage the serious in- 
tentions? Were there conditions at home which, 
even though not wholly satisfactory, conferred 
a basic sense of security? The material does not 
seem to be equal to these and other urgent 
questions. Another grave weakness is the lack 
of interview data; John is practically never al- 
lowed to speak for himself, except through 
questionnaires. Perhaps it is the interviews that 
have been excluded, but at all events we can 
only regretfully imagine how much better the 
case might be understood, how much more 
unity might be perceived in it, how much 
knowledge might be gained about the toughness 
of the human organism, if we but knew what 
John said about himself when talking with a 
trusted adult who had become a friend. Even 
when his growth was most retarded, his basal 
metabolism at its lowest, and this social rejec- 
tion most acute, did John have sources of in- 
terest and satisfaction in his private life? Did 
he feel rewarded by things that he did or made 
or read, and did he find strength in secret plans 
and aspirations? Unfortunately, these matters 
remain secret. 

J Appl Psychol 28:438-9 O ’44. Henry E. 
Garrett. This book...deals with the history of 
one boy, John Sanders, chosen from the 200 
boys and girls of the California Adolescent 
Growth Study. We first see John at the age of 
10 and leave him at the age of 18 when he is 
presumably preparing to enter college. * The 
techniques employed in the study are numerous 
and well chosen, and are certainly worth the 
attention of anyone interested in the psychol- 
ogy of individual differences. A wealth of meas- 
urement data is presented, ranging from precise 
physical, mental and educational measurements 
to ratings and observations of John’s social be- 
havior. Whenever possible John’s development 
is appraised against the norms of his group. 
Scores are uniformly in terms of standard de- 
viation units, the norm for the group being set 
at 50 and the standard deviation at 10, so that 
the range of scores is approximately from 20 to 
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8o. The use of standard scores is especially in- 
teresting since, at the present time, many social 
psychologists who emphasize the need for ob- 
taining a picture of the "whole personality" 
decry mental measurement as such, the projec- 
tive techniques being championed as the only 
methods in which an individual is described as 
a person. It is noteworthy, however, that unless 
John is compared with a norm it is impossible 
to tell whether he deviates sufficiently from the 
behavior of boys of his age group to be char- 
acterized as atypical. The author does not hesi- 
tate to make use of any techniques, subjective 
or objective, which will throw light upon John's 
development. Throughout the book the various 
measurements are skillfully interwoven and 
compared. * Chapters five and six, which pre- 
sent a detailed and graphic picture of John's 
physical development and of the growth of his 
mental and motor abilities, are particularly val- 
uable. * One gets the impression that John's 
social and emotional behavior was considerably 
restricted by various factors, environmental as 
well as personal But these restrictive factors 
were probably not more numerous than those 
faced by most adolescent youngsters. To the 
reviewer, the objective and observational meas- 
ures come out better than the projective tech- 
niques. The account given of John by the Ror- 
Schach Test (the records were analyzed “blind” ) 
are not too helpful; and this is true also of the 
description of John's personality obtained from 
the Murray Pictures, As judged by objective 
records, the conclusions drawn from these ac- 
counts are often wide of the mark, and some- 
times are reminiscent of the "fortune" which 
one gets from the penny weighing-machines. 
For example, the Rorschach interpretation de- 
scribes John as follows: “His superior intelli- 
gence and his strong inner. life (inner creative- 
ness) serve to stabilize his labile affectivity.” 
When we find that repeated measures of John’s 
intelligence give him the relatively modest I.Q. 
of approximately 110, it is hard to see how his 
intelligence can be described as "superior." Just 
what is meant by saying that inner creativeness 
serves to stabilize “labile affectivity” is beyond 
the ken of the reviewer. The final picture of 
John, given in Chapter ten, is that of a young 
man, who, despite ups and downs in his social, 
emotional and physical development, has be- 
come a relatively effective young person. We 
are left with the feeling that John has somehow 
came through his adolescent stresses and strains 


successfully. The author writes: “His (John’s) 
college years also brought a successful record 
in courses and an enterprising variety of out- 
side activities. So marked an up-turn in John’s 
personal fortunes is evidence not only of the 
toughness of the human organism but also of 
the slow, complex ways in which nature and 
culture may come into adaptation.” The re- 
viewer suspects that this statement represents 
more often than not the “pilgrim’s progress” of 
most adolescents. 

J Consult Psychol 8:126 Mr-Ap ’44. Peter 
Blos. * In almost all instances the technique or 
procedure which is used in the research is well 
explained * All discussion happily avoids any 
psychological jargon but is easily understand- 
able to the educator to whom this book seems 
particularly addressed. * Applications of the 
findings in terms of educational method, cur- 
riculum, guidance, group life, et cetera are not 
made part of the presentation, and it is hoped 
that the data one day may be interpreted in 
terms of their educational implications rather 
than in terms of individual development. To the 
reviewer it is unfortunate that an adolescent 
boy is reconstructed by an additive process, by 
stringing together the scores of innumerable in- 
genious measurements. The person J. S. does 
not assume flesh and blood after careful read- 
ing, ie. the data have not brought light into 
“what makes J. S. tick.” Most of the objective 
findings can be correlated only to age (chrono- 
logical or physiological), sex or grade place- 
ment and therefore are not differentiated suffi- 
ciently to allow inferences which describe the 
dynamics of individual conduct. In the discus- 
sion of data there is hardly any attempt made 
to advance psychological interpretations, and 
motivational analyses make almost exclusive 
use of the stereotype of compensation. Projective 
methods like Rorschach or Murray Pictures 
are included in the book but are barely related 
to the rest of the findings; indeed one doubts 
the author’s confidence in the validity of such 
techniques. Despite these shortcomings the ob- 
jective findings collected by the Study are of 
great significance, in fact they provide infor- 
mation of and insight into group trends and 
maturational characteristics without which a 
thorough understanding of the individual is un- 
attainable. If John Sanders is considered the 
screen on which the research data were con- 
veniently projected, the reviewer thinks that 
with this in mind every person interested in 
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adolescence will find most valuable material in 
this small and handy volume. 
J Ed Psychol 35:123-5 F '44. Leonard S. 
Kogan. * perhaps one of the most thorough 
investigations of an individual in the history of 
psychology. The treatment is not limited in its 
perspective, however, since discussion is con- 
stantly focussed against the background of a 
group of children making up the "California 
Adolescent Growth Study." * The methodology 
of the study is to lead off with two orienting 
chapters and then devote separate chapters to 
the various modes of developmental expression. 
This type of presentation is much more fertile 
than a straightforward chronological treatment 
could be. Thus the book handles the single in- 
dividual much as the general field of personality 
is treated in most standard books on adolescent 
psychology. * The author has no psychological 
axe to grind in this treatment of adolescence. 
No special thesis or school of psychology colors 
any of the discussion. As a rule, material is in- 
terpreted from the practical, matter-of-fact point 
of view of ‘common sense.’ Jones’ theory of per- 
sonality is professedly a multifactor one with 
special emphasis on the factor of ‘struggle for 
maturity.’ However, he has organized the com- 
bined efforts of psychologists, physicians, a 
physiologist, and a school counsellor in a very 
objective manner. As a presentation of a con- 
crete case in all the manifold aspects of adoles- 
cent development, the study provides very use- 
ful material for both students and teachers. 
Mental Hyg 29:6907-9 O 45. Frank K. Shut- 
tleworth. * a masterpiece of superb writing, 
extensive documentation, and vivid portrayal of 
personality development. The volume should 
be on the “must read” list of all who work with 
adolescents and youth, of all who are concerned 
with personality development, of all who would 
be scientific and critical in their methods of 
studying personality. * The vivid description 
of multiple handicaps, the four-year trend to- 
ward progressively poorer adjustment followed 
by recovery, the great wealth of detailed test, 
measurement, and observational data so well 
integrated into the total picture, and an ex- 
traordinarily skillful job of writing, make the 
development of John Sanders a fascinating 
story indeed. In addition to providing an out- 
standingly excellent case study, the volume is 
an outstandingly excellent illustration of a 
maturing method for the scientific study of per- 
sonality. It combines (1) a great variety of ap- 


proaches, including appraisals of community, 
home, and school environments, physical growth, 
health, mental and motor abilities, interests, 
social adjustment, friends, social aspirations and 
skills, and emotional and intellectual develop- 
ment; (2) massed statistics and group norms, 
providing points of reference relative to a con- 
siderable population at each age; (3) follow-up 
or longitudinal studies of the same children 
over a period of years, revealing trends and ups 
and downs of adjustment ; and (4) the focusing 
of all of this material on the life history of a 
single individual. The method has two serious 
difficulties. It is impossible to present all the 
details; hence, the danger that conscious or un- 
conscious selection may yield a biased picture. 
This difficulty seems to have been adequately 
surmounted: the manuscript was critically re- 
viewed by four members of the staff, each of 
whom knew John and the total body of data. 
The second difficulty is one of exposition. 
There is enough technical procedural detail and 
original observation here to fill dozens of 
learned and abstruse volumes. Extensive use 
is made of these data. Nevertheless, the reader 
is rarely aware of any interruption to the smooth 
flow of the exposition, which is always alive 
and human. 

Psychoanalytic Q 14:256 Ap "45. Elisabeth 
R. Geleerd. * The selection of this particular 
case is interesting because the boy was emo- 
tionally and physically as well as socially handi- 
capped. The correlation between these factors 
is clearly demonstrated. * The book is of value 
to all who are interested in the maturation of 
the adolescent. 

Psychol B 41:334-5 My '44. John B. Wolfe. 
* The method of presenting the growth phe- 
nomena is difficult to describe. It is not bio- 
graphical in the strict sense of being undisci- 
plined artistry ; yet there is presented the social 
setting and, to some extent, the stream of life 
in which the case developed. The reader at 
times becomes engrossed with cultural factors: 
general rather than specific effects of home con- 
ditions, school clubs, unattainable ambitions 
created by our society, and the like. The chap- 
ters are somewhat like a number of short stories 
appearing in one volume; some sequent, some 
not. * The author achieved his aim of present- 
ing specific aspects of growth phenomena con- 
cretely and did so in an interesting fashion. * 
The reviewer is of the opinion that a layman 
or a novice in psychology will accept the ac- 
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count as either fact or fancy depending upon 
whether he is already inclined toward determi- 
nism or magic. It is improbable that the book 
could induce him to ask questions or formulate 
hypotheses as would the trained person when 
considering single cases. This brings up the 
more fundamental question of the treatment in 
psychology of the unique case. The traditional 
approach which seeks laws or principles regard- 
ing the operation of processes is skeptical of it 
as the apparent relationship between various 
features is not subject to empirical verification. 
The dominant mother, for example, appeared 
as an adverse influence on John Sanders, but 
one can wonder how many dominant mothers 
have brought up dominant sons. So can one 
question the seeming effects on development of 
the various cultural factors mentioned. The very 
fact that such questions do arise, however, 
gives the book its value. 

Q R Biol 19:179 Je 44. * The present pub- 
lication is no substitute for a complete tabula- 
tion of the records of the study, nor is it 
intended as such. Jones has here brought to- 
gether, under a single authorship, the results 
of many interdependent investigations, and he 
has chosen to present the story of one child's 
growth rather than to attempt to portray the 
development of two hundred. * What is lacking, 
of course, is a careful treatment of individual 
deviants from the: norms in each category and 
an analysis of the causes of their deviation. But 
such a program would have involved a much 
longer and a much less coherent work. We may 
hope that Tones and his colleagues will let us 
have the further data in time. Meanwhile, they 
are to be commended for having provided an 
unusually stimulating and instructive little book 
summarizing what might have been extraordi- 
narily unwieldy statistics. This book should be 
extremely useful to teachers of courses on the 
psychology of adolescence. 

Sch R 52:317 My 44. Robert J. Havighurst. 
* This book and other case reports which may 
be published by the California Growth Study 
will be valuable to secondary-school teachers 
and will be profitably used in the training of 
prospective teachers. 

[ 963 ] 
*Kanack, Gorni RuTE. Vocational Personalities: 
An Application of the Rorschach Group Method. 
Columbia University, Teachers College, Contributions 
to Education, No. 924. Harry D. Kitson, faculty spon- 
sor. New York: Bureau of Publications, the College, 


1946. Pp. xi, 116. $2.10. * (PA 21:1929) For reviews, 
see 82a. 
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X*KaMar, V. V. Measuring Intelligence of Indian 
Children. Foreword by John McKenzie. Calcutta, 
India: Oxford University Press, 1940. Pp. xvii, 290. 
Rs. 4. * (London: Oxford University Press. 6s.) 


A.M.A. 35:208 S '40. A careful and thorough 
study, giving details of the testing of over 
1,000 children. 

J Ed (London) 73:510-2 N "41. C. M. Flem- 
ing. * a detailed account of what is probably 
their [Binet tests] most recent metamorphosis— 
into the two Indian languages of Kannada and 
Marathi. The book is interesting to experts as 
a description of test material suitable for Indian - 
children; but it is also worth reading for its 
own sake as a simple introduction to the mean- 
ing and history of individual measurement. The 
English version is not intended to be used with 
children whose mother tongue is Indian; but 
its publication permits comparison between it 
and other revisions, and the author deserves 
congratulation upon the detail with which he 
has recorded the nature and distribution of 
results from the population of 1,074 Dharwar 
children upon whose testing the restandardiza- 
tion is based. * It is to be regretted that repre- 
sentation was not also given of the pictorial 
Binet material after its adaptation to Indian 
form. * 

Scottish Ed J 24:110 F 14 '41. * The first 
four chapters present a simple and succinct 
account of the history of the development of 
mental testing, a discussion of the nature and 
definition of intelligence and an enumeration of 
the precautions to be observed in the interpre- 
tation of mental age and LO., and in the 
application of the results of mental tests. The 
remaining chapters give instructions for the test- 
ing of children in India by means of vernacular 
adaptations of the 1916 Stanford Revision of 
the Binet Scale. The author, who was a student 
of Edinburgh University under Professor God- 
frey Thomson, is to be congratulated on tlie 
production of a sound piece of work; it was 
unfortunate that the later Terman-Merrill Re- 
vision was not available when he undertook 
the task of making the Indian version. 

Teaching 13:41-2 S ^40. The title of this 
book (which was not very happily chosen) 
would lead one to expect a much broader treat- 
ment of the subject than one actually finds. On 
the other hand, Dr. Kamat has done an excel- 
lent piece of work in a restricted field, and a 
scientist need not apologize for restricting his 
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range of experiment to narrow limits. * en- 
riches in several respects the data and technique 
of intelligence-testing. It is unfortunate, how- 
ever, that there should have been such a time 
lag between the author's original experiment 
(1933), the preparation of the present book 
(1938), and its final appearance (1940). In 
1933 Dr. Kamat was justified in thinking that 
the only work of any note which had been done 
on mental testing in India was that of Professor 
Rice, but by 1938 studies of some significance 
had been published by Messrs. Jalotra, Kum- 
ria, Jha and Mandry, while an account of adap- 
tations of the Goodenough Test and of the 
Oliver General Intelligence Test had appeared 
in Teaching. In 1939, tests by Perera, Jalaludin, 
Moshin, Maiti, the Dacca Training School Staff 
and others were published, but Dr. Kamat’s 
introduction, which appears to have been writ- 
ten some years ago, makes no mention of recent 
developments. It is true that he is mainly con- 
cerned with the Binet-Simon tests and that 
only one page is devoted to a mere mention of 
some other tests. Yet the first three chapters 
(and part of the sixth) deal with the theory, 
history and practice of intelligence-testing and 
give the reader the impression that the author 
is writing on intelligence-testing in general. 
Chapters four and five (and part of six) deal 
specifically with the author's revision of the 
Binet-Simon tests and will be read with eager- 
ness by those who are interested in the develop- 
ment of intelligence testing in India. They rep- 
resent a real contribution to education in this 
country and will be appreciated by many who 
know neither Marathi nor Kannada. 
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*KaNpzL, I. L. dpt Aptitude Tests in 
Medicine, Law, and Engineering. New York: Bu- 
reau of Publications, Teachers College, Columbia Uni- 
versity, 1940. Pp. x, 78. $1.60. (PA 15:1960) 

Ed Abstracts 6:93 Mr "41. Roger M. Bellows. 
* This book has value as a summary of find- 
ings and as a. source of references to valuable 
original reports (about 5o references are cited). 
The statement that “In selection of candidates 
for the study of medicine greater progress has 
probably been made than for any other pro- 
fessional field” is a questionable opinion ; dental 
schools have in the past decade made great 
progress in selection procedures. In his attempt 
to render a nontechnical account the author has 
in a few places adduced conclusions which ap- 
pear on the basis of internal evidence to be mis- 


interpretations (see especially material on p. 48 
on the value of the college record, and on p. 63 
on mortality). 

J Assn Am Med Col 16:208 My "41. * For 
medicine, most of the material is garnered from 
the Journal of the Association of American 
Medical Colleges. 

J Ed Psychol 32:224-5 Mr "41. Stephen M. 
Corey. * Kandel's book is informal and not 
very critical of the methods involved in the con- 
struction or evaluation of tests. He writes with 
a decided historical emphasis and his non- 
technical discussion will be of interest to the 
administrative officers of professional schools 
as a description of the status of academic pre- 
diction in medicine, law, and engineering. The 
monograph will make little appeal to profes- 
sional students of measurement or guidance be- 
cause the author does not attempt “to under- 
take the more technical study of the validity 
and reliability of the tests used.” (p. ix) 

Loyola Ed Digest 16:10 Ap ’41. Austin G. 
Schmidt. There is always a certain scholarly 
quality and a certain scientific restraint in what 
Dr. Kandel writes. * 
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*KELLEY, Ina B., AND Perkins, Kerra J. An Investi- 
gation of Teachers’ Knowledge of and Attitudes 
Toward Child and Adolescent Behavior in Every- 
day School Situations. Edited by H. H. Remmers. 
Purdue University, Division of Educational Refer- 
ence, Studies in Higher Education, [No.] 42; Further 
Studies in Attitudes, Series 4. Lafayette, Ind.: the 
Division, June 1941. Pp. 99. Paper. $1.00.* (PA 16: 
344) 

Loyola Ed Digest 17:9 F '42. Austin G. 
Schmidt. The purpose of this study was to con- 
struct and evaluate an instrument for measuring 
teachers’ knowledge of the age characteristics, 
the needs and interests, the incipient maladjust- 
ments, the motivation of behaviour, and the 
personality problems of children of elementary 
—and high-school ages. The study was con- 
scientiously made, with reasonably large groups, 
and the resultant inventory is one that should 
prove of value. 

[ 967 ] 
*Kemp, Epwarp H., AND Jounson, A. PEMBERTON, 
Eprrors, Psychological Research on Bombardier 
Training. Army Air Forces Aviation Psychology 
Program Research Reports, Report No. 9. Washington, 
D. C.: Government Printing Office, 1947. Pp. x, 204. 
$125. * (PA 22:4149) 

[ 968. ] 
XKENNEALLY, KATHERINE G. The Construction and 
Evaluation of a Diagnostic Test of Study Skills 


for Grades 4, 5, and 6. Catholic University of Amer- 
ica, Studies in Psychology and Psychiatry, Vol, 8, No. 
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8, Washington, D. C.: Catholic University of America 
Press, 1947. Pp. vii, 60. Paper. $1.00. * (PA 21:3751) 
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*Kennepy-Fraser, D. The Terman-Merrill Intel- 
ligence Scale in Scotland. Publications of the Scot- 
tish Council for Research in Education, [No.] 23. Lon- 
don: University of London Press, Ltd., 1945. Pp. viii, 
47. Paper. 1s. * (PA 19:2393) 
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*Krircong, WiLLIAM Amrow. Identification of Abil- 
ity to Apply Principles of Physics. Columbia Uni- 
versity, Teachers College, Contributions to Education, 
No. 840. S. Ralph Powers, faculty sponsor. New York: 
Bureau of Publications, the College, 1941. Pp. ix, 34. 
$1.60. * (P.A 16:2064) 


Loyola Ed Digest 17:9 Mr '42. Austin G. 
Schmidt. 'The author developed a test to meas- 
ure ability to apply scientific principles in new 
situations and organized a controlled experi- 
ment which produced evidence that science can 
be taught in such a way as to develop pupils' 
ability to apply principles to new problems. 


[971 ] 
X*KriwsALL, Lors I. Developmental Stages of Nine 
Experimental Worksamples With Three Revi- 
sions. Human Engineering Laboratory, Inc., Technical 
Report No. 173. Boston, Mass.: the Laboratory, Octo- 
ber 1945. Pp. ii, 20. Paper, mimeographed. * 


[ 972 ] 
*Kimpatt, Lors I. Origin and Description of Fig- 
urines, Second Sight and Poser. Human Engineer- 
ing Laboratory, Inc., Technical Report No. 268. Bos- 
ton, Mass.: the Laboratory, March 1946. Pp. 28. Paper, 
mimeographed.* , 

[ 973 ] 


X*Krorreg, Bruno. The Rorschach Technique: A 
Manual for a Projective Method of Personality 
Diagnosis. With clinical contributions by Douglas 
McGlashan Kelley. Yonkers, N. Y.: World Book Co., 
1942. Pp. xi, 436. * (London: George G. Harrap & Co. 
Ltd.) (PA 16:3170) For latest edition, see 974. For 
reviews, see 84. 
[ 974 ] 


*Kuorrer, Bruno. The Rorschach Technique: A 
Manual for a Projective Method of Personality 
Diagnosis. With clinical contributions by Douglas 
McGlashan Kelley. With 1946 Supplement by Bruno 
Klopfer and Helen H. Davidson. Yonkers, N. Y.: 
World Book Co., 1946. Pp. xi, 475. $4.80. * (London: 
George G. Harrap & Co. Ltd. 215.) 


[ 975 ] 
XKroPreg, Bruno, AND Davinson, Herren H. The 
Rorschach Technique: 1946 Supplement. Yonkers, 
N. Y.: World Book Co., 1946. Pp. 431-75. Paper. 
$0.48. * For a review, see 83. 


[ 976 ] 

XKxorr, VincrN1A Bercstresser. Physical Meas- 
urement of Young Children: A Study of Anthro- 
pometric Reliabilities for Children Three to Six 
Years of Age. University of Iowa Studies, Studies in 
Child Welfare, Vol. 18, No. 3, [New Series] No. 394. 
Iowa City, Iowa: University of Iowa Press, 1941. Pp. 
99. Cloth, $1.35; paper, $1.00. * (PA 16:2111) 


Loyola Ed Digest 17:10 Mr ’42. Austin G. 
Schmidt. With the purpose of determining the 
reliability of physical measurements of young 
children and of making suggestions for their 
improvement, the author took 35 physical meas- 
urements of 131 children between 3 and 6 years 
of age. Each measurement was taken twice, 
either by different technicians or by the same 
technician. The reliabilities of the different 
measurements are computed afd a schedule is 
offered indicating the frequency with which 
different measurements should be made. 

Q R Biol 17:282 S '42. 


[977 ] 
*Knowzes, Asa S. Merit Rating of Supervisors, 
Foremen and Department Heads. Northeastern 
University Publications, Bulletin No. 4. Reprinted 
from Personnel, Vol. 17, 1940. Boston, Mass.: Bureau 
of Business Research, Northeastern University, No- 
vember 1940. Out of print. * 


[ 978 ] 

*Knarr, Merwyn A.; Royster, Rosert F.; AND 
CrEETON, GLEN U. Selection and Employment of 
Operating Personnel. A manual prepared under the 
direction of the Committee on Selection and Train- 
ing of the Operation of Service Division. New York: 
American Transit Association, 1946. Pp. v, 51. Paper. 
$3.50. * (PA 21:860) For related reviews, see 696. 


[ 979 ] 
*Krause, Lawrence J. The Correlation of Adjust- 
ment and Achievement in Delinquent Boys. Cath- 
olic University of “America, Studies in Psychology 
and Psychiatry, Vol. s, No. 2. Washington, D. C.: 
Catholic University of America Press, 1941. Pp. xiv, 
74. Paper. $1.25. * (PA 15:4764) 

[ 980 ] 
*Kusuner, Rose Estrin. The Relationship Be- 
tween Content of an Adult Intelligence Test and 
Intelligence Test Score as a Function of Age. 
Columbia University, Teachers College, Contributions 
to Education, No. 933. New York: Bureau of Publica- 
tions, the College, 1947. Pp. vii, 59. $2.10. * (PA 22: 


621) 
[ 981 ] 

XLaLEGER, Grace ErizasETH. The Vocational In- 
terests of High School Girls: As Inventoried by 
the Strong and Manson Blanks. Columbia Uni- 
versity, Teachers College, Contributions to Education, 
No. 857. Harry D. Kitson, faculty sponsor. New York: 
Bureau of Publications, the College, 1942. Pp. vii, 102. 
$1.60. * (PA 16:5023) 

J Ed Res 39:231-5 N '45. D. Welty Lefever. 
School counselors and guidance workers may 
well be discouraged on reading the conclusions 
* The investigator compared the letter ratings 
of 703 eleventh-grade girls on The Manson 
Vocational Interest Blank for Women with 
those from Strong’s Vocational Interest Blank 
for Women. * For comparable keys the Man- 
son and Strong blanks did not agree to any 
desirable extent. The correlation coefficients 
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to distinguish this type of science from many 
of its phony predecessors that have tried to 
judge personality qualities from handwriting. 
* a really valuable book * 

Psychiatric Q 16:615-6 Jl 42. * the results 
and conclusions derived from long and inten- 
sive effort to study the scientific sub-strata 
underlying a practice which has been in as 
much disrepute in America as that of astrology, 
palmistry or phrenology. To students of the 
Rorschach, of play therapy or of other person- 
ality projection methods, it might seem obvious 
that one cannot write by hand without project- 
ing something of the personality into the result ; 
and graphology, therefore, should be a valid 
field for research and a promising one for the 
development of clinical psychiatric criteria, 
once a scientific system of analysis can be estab- 
lished. "The purpose of this book," say the 
writers, “is to provide an objective method for 
classifying handwriting into factors which 
graphologists have found useful in their evalu- 
ations.... The plans have been laid out, but much 
work will have to be done before the under- 
growth that has collected can be cleared and 
basic progress in rearing the graphological 
structure made." The claim is modest and ap- 
pears thoroughly justified by the studies re- 
ported. And so is the conclusion that “....when 
carefully controlled measures and rating scales 
are applied, it becomes possible to discern an 
underlying differentiating pattern in the hand- 
writing of an individual which may well prove 
to be the basis for understanding underlying 
personality structures.” * the authors have at- 
tempted to devise scales which will measure 
objectively differing degrees of contraction and 
release in the “four components” of handwrit- 
ing—the form, vertical, horizontal and depth 
(pressure) components. * The authors appear, 
as they have noted, to have provided a method 
of study assuring a high degree of objectivity 
and reliability. The question of validity of the 
conclusions is untouched except for the indirect 
method of comparing the handwritings of five 
normal persons and 15 psychiatric patients 
and noting the differences in the groups. Mrs. 
Lewinson, in a chapter for which she alone is 
responsible, draws on her long experience to 
set down some “working hypotheses” as to 
what the graphological factors mean in reflect- 
ing the personality studied. It is a chapter 
which suggests that this method may some day 
have important clinical applicability. * The 


photolithographic work is excellently done and 
the volume easy to read. The board binding is 
sturdy and the work of a convenient size for a 
desk manual. 

Psychoanalytic Q 13:519-22 O '44. Walter 
W. Marseille. * we....welcome the association 
of Miss Lewinson, a graphologist with a good 
reputation as a practitioner, with Mr. Zubin, 
research psychologist at the New York State 
Psychiatric Institute. Since it is teamwork of 
this kind which can eventually bridge the gap 
between the lofty claims of graphology and the 
scepticism of academic psychology, we regret 
that the authors confined themselves to such a 
limited program. Their book contains two fnain 
parts of unequal length. In the much longer 
first part, scales for measurement of certain 
handwriting features are described, and the in- 
terpretative value of these features is presented 
in a perfunctory fashion in the second part 
under the name of ‘working hypotheses’. Both 
the selection of the relevant handwriting fea- 
tures and their psychological interpretations are 
admittedly based on the graphological system 
of Ludwig Klages. In the authors’ treatment 
both parts are entirely unconnected since they 
do not reproduce the cardinal part of Klages’ 
theory : the assumption of a psychomotoric com- 
ponent (Antriebserlebnis) in all psychological 
phenomena. For this they substitute general 
references to the concepts of expressive move- 
ment and symbolism, leaving the reader with 
two incongruent pieces of doctrine each of which 
thus becomes still more arbitrary and dogmatic 
than it was in Klages’ original context. The 
painfully elaborate definitions of the various 
handwriting features and the truly pedantic 
accuracy of the measuring procedures leave the 
reader completely in the dark as to why just 
those features should be significant from a psy- 
chological point of view. On the other hand, 
the doctrinaire presentation of the charactero- 
logical meanings in Miss Lewinson’s special 
chapter is not amended by calling these inter- 
pretations ‘working hypotheses’. The authors 
admit (Miss Lewinson reluctantly, Mr. Zubin 
willingly) that sufficient empirical evidence to 
substantiate them is not yet available. However, 
this deficiency does not make them working 
hypotheses. These interpretations have never 
been used as a basis for research in the realistic 
spirit of an empirical science. Indeed, it is hard 
to see how Miss Lewinson can call the master’s 
deductions from his general metaphysical prin- 
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ciples ‘working hypotheses. One attempt is 
made in the book to establish handwriting analy- 
sis on a new fundament. According to the 
authors, every one of the twenty-two handwrit- 
ing features (e.g. size, slant, width) is to be 
measured on a scale reaching from extreme 
‘contraction’ (e.g., smallness of the letters) to 
extreme ‘release’ (e.g., largeness of the letters). 
The middle values on these scales are considered 
to represent 'rhythmic balance' and negative 
interpretive significance is connected through- 
oüt with any deviation from these standard 
values. Why this should be so is not explained. 
The statistician would naturally rejoice over the 
symmetry of this perfection when he actually 
discovers it somewhere, but he would not ex- 
pect to find it in a gestalt such as handwriting, 
much less would he postulate it. At this point of 
their book it becomes obvious that the authors 
are involved in wishful theorizing for they re- 
quire normal handwriting, if not rhythmically 
balanced, to have at least one and the same 
value of the contraction tendency for all of the 
twenty-two handwriting features. This assump- 
tion is so much at variance with the facts that 
they subsequently reduce it to ‘one or two 
values’. If no single pair of values of the con- 
traction tendency can be found to fit a given 
handwriting, the authors take it as an indica- 
tion of a personality disturbance. It is difficult 
to understand how they could blind themselves 
to the fact that handwriting with only one or 
two values of the contraction tendency, as de- 
fined by them, is a rare exception and that their 
basic assumption immediately leads into absurd 
consequences. This reviewer is inclined to infer 
from the incoherence and formalism of the 
book, that the two authors found it impossible 
to agree between themselves on the material is- 
sues of handwriting analysis. The Discrepancy 
between an uncontrollable intuitive approach 
and a blind statistical approach to the phenome- 
non of handwriting, the very contradiction the 
authors set out to resolve, stands out blatantly 
as the weakness of their own effort. Thus the 
microcosm of their authorship team reflects 
after a fashion the general situation of graphol- 
ogy with its antagonism between the scholar’s 
scepticism and the practitioner’s belief. Indi- 
rectly, the authors have tried to anticipate criti- 
cism of this kind with the claim that their 
measuring scales retain their value if used in 
connection with an interpretive system different 
from that of Klages. The trust in this sort of 


objectivity is sheer naiveté. Without an under- 
standing of the problems of psychomotility it 
does not even make sense to measure any fea- 
tures of the traces which the writing move- 
ments leave on the paper. The authors pay lip 
service to the relevance of the concepts of ‘ex- 
pressive movement’ and ‘gestalt’ to their prob- 
lem, but they have entirely neglected them in 
their actual work. Not even so fundamental a 
factor as the speed of the writing movement 
has been included among their twenty-two 
handwriting features. The psychoanalytic point 
of view does not predispose the student in favor 
of graphology. Neither the theoretical interest 
in psychogenesis nor the practical interest in 
psychotherapy would support the concern with 
a psychological method which is, at best, purely 
diagnostic in nature. The psychoanalyst is fur- 
thermore inclined to believe that remnants of 
magic thinking and desires of omniscience play 
a role in the motivation of graphologists. On 
the other hand, he would be less afraid than 
others of accepting manifestations of ‘irration- 
ality’ in human beings. He would not be sur- 
prised to learn that the activity of writing offers 
a good many opportunities for both the open 
manifestation and symbolic expression of vari- 
ous personality constellations and disturbances. 
He would doubt, however, whether these hand- 
writing symptoms are so uniform among indi- 
viduals and at the same time specific enough in 
their characterological significance to allow of 
interpretation without a knowledge of the writ- 
ing individual’s case history. It seems that in 
this situation progress could be made only 
through statistical validation of the graphologi- 
cal diagnosis of personality traits, in a field 
where the predictive value of handwriting 
analysis could be compared with that of stand- 
ard tests or other diagnostic methods of proved 
merits. It is regrettable that the authors have 
not turned their efforts in that direction. 

Psychoanalytic R 30:116-7 Ja’43. Although 
psychoanalytic insight and methods have offered 
the most thorough instruments for judging per- 
sonality, short cut and haphazard methods do 
not have to be thrown into the discard. Hence 
the value of a work of this kind. * In the pres- 
ent urgent need for more reliable methods of 
testing personality, the addition of scientific 
graphology to psychological procedure should 
prove to be of great practical as well as theo- 
retical value. * 

Psychol B 40:456-7 Je '43. H. D. Carter. * 
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The treatment of details of handwriting is thor- 
ough, entirely reasonable, and at times tedious. 
The explanation of the method of using the 
scale is clear-cut. It is accompanied by specific 
suggestions as to statistical procedures which 
may be useful in testing various hypotheses. * 
This book may be regarded as a manual for a 
set of objective scales. Judging it as one does a 
newly-published test and manual, one notes that 
the standardization is extremely tentative, the 
scale-making procedure is competent, and the 
idea behind it is a set of hypotheses which con- 
stitutes the basis for graphology. The weakest 
point in the whole approach appears to be the 
basic postulate, namely that “if a person's hand- 
writing is characterized by balance between 
contraction and release, his personality too is 
rhythmically balanced and consequently well 
adjusted.” In the judgment of the reviewer, 
this postulate is contrary to all the tendencies 
of research findings concerning personality, in 
that research on personality fails to show any 
close integration which would permit diagnosis 
of personality type from study of any one lim- 
ited aspect of behavior. The graphological sys- 
tem of Klages and Pulver, presented in chapter 
nine of this book, appears absurd and mystical, 
at least insofar as its application to normal sub- 
jects is concerned. One may therefore be grate- 
ful for the construction of a set of objective 
scales for use in testing theories which might 
otherwise be defended by evasion and general 
vagueness, The scales herein provided are prob- 
ably both reliable and objective to a degree 
satisfactory for the purposes in mind; their 
validity is yet to be established. 
Q R Biol 18:100 Mr ’43. 


[ 987 ] 
*Licut, Marie H. Corresponding Percentiles for 
Inherited Characteristics. Human Engineering Lab- 
oratory, Inc., Technical Report No. 250. Boston, Mass. : 
the Laboratory, January 1946. Pp. 5. Paper, mimeo- 
graphed. * 

[ 988 ] 


*Licur, Marie H. Further Study of Abstract 
Imagination Worksample 329 AB. Human Engi- 
neering Laboratory, Inc., Technical Report No. 306. 


Boston, Mass.: the Laboratory, June 1946. Pp. 6. 
Paper, 'mimeographed. * 


[ 989 ] 
*Licur, Marie H. Further Study of Blocked Re- 
sponses 35 AE, Human Engineering Laboratory, Inc., 
Technical Report No. 281. Boston, Mass.: the Labora- 
tory, June 1946. Pp. 16. Paper, mimeographed. * 


[ 990 ] 


*Licut, Marte H. A New Scoring Method for Per- 
sonality Worksample 35AE. Human Engineering 


Laboratory, Inc., Technical Report No. 233. Boston, 
Mass.: the Laboratory, November 1946. Pp. 8. Paper, 
mimeographed. * 

[ 991 ] 
*Licut, Marie. The Relationship Between Tonal 
Memory and Pitch Discrimination. Human Engi- 
neering Laboratory, Inc, Technical Report No. 178. 
Boston, Mass.: the Laboratory, July 1945. Pp. 4. 
Paper, mimeographed. * 

[ 992 ] 
*Licut, Marw, The Relationship of Vocabulary 
and Wiggly Block Scores of Seven Test Admin- 
istrators. Human Engineering Laboratory, Inc., Tech- 
nical Report No. 177. Boston, Mass. : the Laboratory, 
October 1945. Pp. 7. Paper, mimeographed. * 

[ 993 ] 
*Licut, Marie. The Relation of Wiggly Block to 
Black Cube. Human Engineering Laboratory, Inc., 
Technical Report No. 194. Boston, Mass, : the Labora- 
tory, December 1945. Pp. 4. Paper, mimeographed. * 

[ 994 ] 
*Lire OFFICE MANAGEMENT ASSOCIATION. The Ap- 
plication of Psychological Tests to the Selection, 
Placement, and Transfer of Clerical Employees. 
Report No. 6 of the Committee on Tests of the Life 
Office Management Association. New York: The As- 
sociation, July 15, 1942. Pp. iv, 26. Paper. Out of 
print. * (PA 16:4963) 


Adv Mgmt 8:68 Ap-Je '43. Edward N. Hay. 
* The report touches briefly on three new tests 
adapted from Thurstone’s Primary Mental 
Abilities Tests. Of these the N test (number 
facility) indicates probable success in posting 
and checking operations. The V test (verbal 
facility) deals with the clerical operations of 
discussion and correspondence. The M (mem- 
ory) test "appears to give some indication of 
probable success or failure in machine and typ- 
ing operations." Unfortunately, these tests are 
not generally available but since they stem from 
Dr. Thurstone's work, which is generally avail- 
able, it is likely that other investigators will be 
stimulated to experiment with them. * While 
this report is written primarily for insurance 
people, it has general application and is one of 
the best simple descriptions of the use of tests 
yet written. Tt could, therefore, be profitably 
read by anyone interested. Unfortunately the 
research facilities of the Life Office Manage- 
ment Association are available only to its mem- 
ber insurance companies. This, however, does 
not reduce the value of this splendid study. 
The report does not say so, but the guiding 
hand behind the work of the Committee on 
Tests is that of Dr. Marion A. Bills. She has 
certainly contributed as much to the successful 
use of tests in the clerical field as any other 
person. 
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[ 995 ] 
*LorvixGER, JANE. A Systematic Approach to the 
Construction and Evaluation of Tests of Ability. 
American Psychological Association, Psychological 
Monographs, Vol. 61, No. 4, Whole No. 285. Washing- 
ton, D. C.: the Association, Inc., 1947. Pp. iii, 49. 
Paper. $1.25. * (PA 22:2835) 

[ 996 ] 
*Loxen, Rozert D. The Nela Test of Color Vision. 
Comparative Psychology Monographs, Vol. 17, No. 6, 
Serial No. 9o. Berkeley, Calif.: University of Cali- 
fornia Press, September 1942. Pp. ii, 37. Paper. $0.75. * 
(PA 17:50) 

[ 997 ] 
Lomparpr, MaryetLAN Mauer. The Inter-Trait 
Rating Technique. Columbia University, Teachers 
College, Contributions to Education, No. 760. William 
A. McCall, faculty sponsor. New York: Bureau of 
Publications, the College, 1938. Pp. vii, 99. $1.60. * (PA 
13:5775) 


Ed Res B 19:446 O 23’40. Francis N. Max- 
field. * This monograph is the doctoral disserta- 
tion of one of Mr. McCall’s students, who has 
addressed her research to various questions 
raised by the use of this inter-trait technique. 
Looking down a list of fifty traits the rater 
chooses the one he feels he is best able to judge 
in the person to be rated. This trait becomes a 
base-line, or “calibrator,” and the other forty- 
nine traits are rated plus or minus as they seem 
to him more or less conspicuous respectively 
than the calibrator in the person rated. The 
scale becomes quantitative through the addition 
of estimates of certainty expressed as percent- 
ages from o to roo. Mr. McCall's own exposi- 
tion of the technique is in a somewhat playful 
mood, and he rather fantastically suggests a 
"personality quotient" which is no quotient at 
all Mrs. Lombardi wisely omits any "quo- 
tients" (sic) and sticks to criteria, validity, re- 
liability, and the like. Perhaps unwisely, she is 
entirely serious. 

Pub Personnel R 1:61-3 Jl '40. Margaret 
Marshall. * The study represents primarily an 
exploration in the field of personality measure- 
ment and may suggest an approach for subse- 
quent investigations. The report is clearly writ- 
ten and well organized. It has ample figures 
and tables and has a selected bibliography of 
eighty-five titles. Many will disagree with the 
basic assumptions underlying the study, and 
the newness of the approach is questionable in 
view of the extensive use which is being made 
of “pattern” techniques in personality measure- 
ment. To the public personnel technician who 
is concerned with the problem of personality 
measurement, such experimental work as this 
should he of interest—not because it solves his 


problem, but because it may give some valuable 
clues to him in his quest for more valid tech- 
niques of measurement. It would seem that the 
chief value in the technique lies in the person~ 
ality pattern or profile which it yields. This 
would be useful for diagnostic purposes when- 
ever individuals were being dealt with accord- 
ing to the case method. The fact that individ- 
uals cannot be safely compared on the basis of 
scores, because scores are not meaningful in 
relation to a universal scale, makes such an ap- 
proach of doubtful value to the personnel ex- 
aminer who must evaluate individuals relatively 
and rank them in the order of their fitness to 
meet the requirements of a particular job. 
For an additional review, see 40:B997. 


998 
*Lourrrr, C. M. axe c MPs of Children's 
Behavior Problems, Revised Edition. New York: 
Harper & Brothers, 1947. Pp. xviii, 661. $4.50. * (PA 
21:1914) 

J Consult Psychol 11:155 My-Je’47. Louttit's 
long-useful book has undergone a minimal re- 
vision. Some new material has been added, as 
on the 1937 Stanford-Binet and on techniques 
of treatment, There are a few well considered 
deletions of dead wood. A number of sections 
have been rearranged and revised. Since the 
book's field is defined as that of children's be- 
havior problems, the very brief mention of the 
Bellevue-Wechsler Scale can perhaps be ex- 
cused. Less justified is the scant attention given 
to new and lively issues, such as projective 
methods, play techniques for diagnosis and 
treatment, and relationship therapy. The revi- 
sion hardly seems to present a true picture of 
clinical psychology in 1947. 

Psychiatric Q 21:152-3 Ja '47. * an excellent 
textbook * The first [part] deals with methods, 
describing the various conduct disorders of 
children and indicating their incidence by means 
of a number of well-chosen tables. Anamneses, 
Clinical examinations, psychometric tests, and 
treatment procedures also are described. None 
of these processes is described in sufficient de- 
tail to make it unnecessary to consult other 
texts or an instructor, but the descriptions are 
very clear and adequate for a good orientation 
in the functions, purposes, and achievements of 
the techniques. * The emphasis is upon practi- 
cal procedures rather than upon an understand- 
ing of the etiology of children's personality and 
conduct disorders. Personality dynamics are not 
discussed. Onlv two pages are devoted to a 
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brief presentation of the projective techniques. 
Psychoanalysis is outside the scope of the book. 
Many psychoanalytic authors are quoted, but 
only if their remarks are accessible to statisti- 
cal descriptive verification, In several places, 
Louttit speaks disparagingly (and without 
proof) about the analysts’ theories correlating 
personality disorders with psychosexual disor- 
ders and immaturities. These deviations from 
the author’s sober, clear and objective presenta- 
tion are few and unimportant and by no means 
detract from the value of his book as a very 
good introductory text for students in clinical 
psychology. 

For reviews of the first edition, see 38:B418 
and 40:B998. 


[ 999 ] 
*LovELL, Constance. The Effect of Special Con- 
struction of Test Items on Their Factor Com- 
position. American Psychological Association, Psy- 
chological Monographs, Vol. EE No. 6, Whole No. 259. 
Washington, D. C.: the Association, Inc., 1944. Pp. iii, 
26. Paper. $0.50. * (PA 18:3650) 

[ 1000 ] 
McCatt, Wikram A. Measurement. A revision of 
How to Measure in Education. New York: Macmillan 
Co., 1939. Pp. xv, 535. $4.50. (London: Macmillan & 
Co., Ltd. 215.) * (PA 13:3301) 


Ed Abstracts 6:60-1 F '41. Lee J. Cronbach. 
* The content presented is treated clearly and 
forcefully but the reviewer considers many 
essential elements of a course in measurement 
have been omitted. The book will in many ways 
serve as a "cookbook" for school testing, but 
passes over the scientific literature of the test- 
ing movement. Facts and principles on which 
there is general agreement are not carefully 
distinguished from the writer's own opinions. 
The tests generally considered to be basic ex- 
amples are given little attention. The material 
on classification of pupils and statistical inter- 
pretation for school purposes will be found ex- 
cellent by those administrators whose program 
requires them. 

J Ed (London) 72:200 Ap '40. * can be 
recommended to all who are interested in edu- 
cational research * Among the many useful fea- 
tures in the book should be mentioned the com- 
prehensive list of American tests. 

Occupations 19:233-4 D '40. E. R. Henry. 
* Several new departures appear in this edition, 
including emphasis on educational and social 
philosophy of measurement. Despite the magni- 
tude of the task that Professor McCall has set 
for himself, the book as a whole succeeds in 


achieving the purposes stated in the dedication 
and the introduction. The philosophy of meas- 
urement is clearly set forth in seventeen theses 
in Chapter I. Especially significant is the em- 
phasis on the thesis that “To the extent that the 
pupil's initial abilities or capacities are unmeas- 
urable a knowledge of him is impossible" (The- 
sis 12) and "Measurement in education is 
broader than educational tests" (Thesis 11). 
The remainder of the volume is devoted to an 
extensive as well as intensive treatment of the 
usual subject-matter of a first course in meas- 
urement. And despite the apparently cynical 
attitude toward statistical intricacies of measure- 
ment, there is no slighting of the basic statistical 
concepts involved. * Included..are excellent 
summaries of the arguments for and against 
homogeneous grouping * This volume deserves 
and will probably receive wide use as a text for 
courses in measurement. 

Univ Wash Col Ed Rec 7:46—7 Ja '41. Albert 
Bailey. * contains a chapter headed “A Com- 
prehensive List of Tests and Test Publishers" 
that makes available a supply of ready refer- 
ence material that is invaluable * McCall has 
written some excellent books dealing with tests 
and measurement. Measurement, in the opinion 
of this reviewer, is his best. It is trite, of course 
to say of a book in review that it is “invalu- 
able," that "it should be in every educational 
library." This reviewer spares Measurement 
that dubious critical distinction. Suffice it to 
say, as no faint praise, that when any point con- 
cerning a program of tests and measurement 
comes into question, the answer and its elabora- 
tion, if answer there is, will be found in a re- 
cent book by William A. McCall, entitled 
Measurement. 

For additional reviews, see 40:Brooo. 


[ 1001 ] 
*McCanrRY, Tuomas J. Personality Traits of 
Seminarians. Catholic University of America, Studies 
in Psychology and Psychiatry, Vol. 5, No. 4. Washing- 
ton, D. C.: Catholic University of America Press, July 
1942. Pp. vii, 46. Paper. $2.00.* (PA 17:592) 

[ 1002 ] 
XMCCLELLAND, WirLrAM, Selection for Secondary 
Education. Publications of the Scottish Council for 
Research in Education, [No.] 19. International Exam- 
ination Inquiry. London: University of London Press, 
Ltd., 1942. Pp. xxiv, 264. 5s. * (PA 19:2371) 

Brit J Ed Psychol 13:51-2 F '43. * McClel- 
land's report is far more than a mere corrobo- 
ration of earlier methods or conclusions, tenta- 
tively arrived at in this country or elsewhere. Tt 
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is a highly original contribution, first to certain 
urgent issues raised by the Scottish 'qualitying 
examination' as at present carried out, and, 
secondly, to the whole problem of awarding 
scholarships for secondary education. * The 
whole inquiry is unquestionably one of the most 
important of all recent contributions to educa- 
tional psychology, and forms a model for future 
investigations. Not only Scottish readers, but 
all who are concerned in the theory or practice 
of education should study it, no matter where 
their field of work may lie. To have carried to 
a successful conclusion a co-operative research 
on this extensive scale in war-time is indeed a 
remarkable feat. 

J Ed (London) 76:38-42 Ja '44. C. Birche- 
nough. * has the rare merit of presenting a 
difficult and technical subject in a form un- 
derstandable by the layman. * The book is a 
model of exposition. It shows a care in analys- 
ing its problem, an ingenuity in devising meth- 
ods of attack, a knowledge and dexterity in the 
use of its instruments, and a scientific temper 
that are wholly admirable. Though directed 
primarily to an investigation of conditions in 
Scotland and using the City of Dundee as the 
main theatre of operation its findings and meth- 
ods are of general applicability. * The report 
performs a service of inestimable value in re- 
storing a sense of proportion in regard to the 
different measures to be used in selection, and 
in stressing the vital importance of encouraging 
teachers to form objective judgments of their 
pupils, checking their assessments of individ- 
uals with the help of standardized tests, and so 
bringing the day nearer when the examination 
will occupy a less important place than at the 
present time. As things are it seems impos- 
sible to make use of teachers' assessments when 
candidates come from small schools. * 

Occupational Psychol 17:42-5 Ja '43. Cyril 
Burt. * Prof. McClelland's report is written 
primarily for teachers and educational adminis- 
trators. But it contains much that has an inti- 
mate bearing upon industrial problems. Indeed, 
the whole subject is closely related to that of 
vocational guidance and selection. To award a 
place in a secondary school is in itself as much 
a vocational as an educational decision ; and the 
experimental and statistical devices here adopted 
to study the existing methods of selecting pupils 
for certain schools might serve as a model for 
the vocational psychologist in his studies of 
various methods of selecting applicants for dif- 


ferent occupations. * To the vocational psychol- 
ogist...the most instructive chapter is that dis- 
cussing “the general principles of selection 
technique.” Here, in Professor McClelland’s 
view, the first and most difficult problem is to 
fix the pass mark. A guiding principle in all 
educational policy, he holds, ought to be the 
diminution of what he terms ‘educational wast- 
age’. Of pupils who took a senior secondary 
course, over two-fifths turned out to be fail- 
ures: here the education was wasted. On the 
other hand, among those who did not take such 
a course a still larger number, he calculates, 
would have been successful, had they been ad- 
mitted; and possibly two or three times as 
many really possess the necessary ability, but 
owing to unfavourable home circumstances 
would probably not succeed under present con- 
ditions: here talent is wasted. The total number 
of ‘misfits’, therefore, should include both the 
failures among those who have been admitted 
and the potential successes among those who 
have been rejected; and Prof. McClelland ac- 
cordingly suggests that the ideal pass mark 
would be one which reduced this total to a mini- 
mum. There is a simple statistical device which 
has often been used for this purpose: in techni- 
cal language, what may be called the ‘minimum 
discrepancy’ borderline is located at the point 
where the ‘regression line’ of success on test- 
scores cuts the criterion borderline dividing suc- 
cesses from failures. In dealing with similar 
problems in the London educational system, 
this method of drawing borderlines for scholar- 
ship children, trade school pupils, certifiable de- 
fectives, and the like, was suggested as a con- 
venient basis for theoretical discussions. It has 
a similar value in vocational problems: thus, 
some time before Prof. McClelland’s volume 
appeared, when newly appointed psychologists, 
working for the Army, were considering where 
the pass mark should be drawn with certain 
vocational tests, in order to minimize what they 
also termed ‘wastage’, I ventured to suggest the 
same procedure. Nevertheless, this is not, in my 
opinion, the borderline that should be used for 
practical decisions, either in vocational or in 
educational work. The effort to minimize dis- 
crepancies by means of a pass mark deduced 
from this formula would inevitably result in 
fewer candidates being accepted than would 
otherwise have been the case. Now, if a certain 
industry calls for a hundred skilled engineers, 
it is absurd for the vocational psychologist to 
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say: "I am only passing 80, because, if I send 
you more, I shall increase the number of cases 
about which you and I might disagree." Simi- 
larly, in selecting children for the secondary 
school, if the teacher draws a border-line at an 
L.Q. of 125, it would seem absurd for the edu- 
cational psychologist to argue: "Let us then 
accept a pass mark of 130 I.Q., since in that 
way we shall diminish our misfits.” Most of the 
misfits thus avoided are not cases of gross dis- 
crepancy. Admittedly, with a more generous 
borderline we should be accepting a proportion 
of pupils who are not quite up to the desired 
level; yet they are not likely to fall much be- 
low. And it would seem better to risk including 
a number who are somewhat under the desired 
level than to exclude a number who are defi- 
nitely above it; for, with our present imperfect 
methods of testing, there is bound to be an ap- 
preciable number of good pupils among those 
who fall just below the higher borderline as de- 
termined by the tests. For practical purposes, 
therefore, my own view rather is that we should 
first decide the number of free places to be 
offered ; and then, if the number be, say 50 out 
of a total age group of 5,000, take the best 50 
on our list. Thus the pass mark will be fixed 
by the number to be accepted, not the number 
to be accepted fixed by the pass mark. Prof. 
McClelland would, I think, admit the force of 
this argument, since, when he comes to esti- 
mate the percentage in the qualifying group 
who had the ability necessary for success in a 
senior course, he inverts the regressional for- 
mula, and deduces a percentage of just over 30. 
He adds that he is “fully conscious of the statis- 
tical weakness" of his determination, but puts 
- it forward as providing the ultimate aim. His 
final conclusion he states as follows: “At pres- 
ent the number of pupils fit for a senior sec- 
ondary school course, who embark on such a 
course, is 10.6 per cent. of the Qualifying 
Group ; by a proper use of the bursary schemes, 
and of the best existing methods of selection 
and guidance, we should be able to raise this to 
15.4 per cent.: to reach the final goal of 30.1 
per cent., we require: (a) a far-reaching im- 
provement in social and environmental condi- 
tions; (b) an increase in the variety and at- 
tractiveness of the senior secondary courses.” 
Tt is impossible to put aside this suggestive re- 
port without congratulating both its author, the 
Committee of the International Examinations 
Inquiry, and the Committee’s Convener, on the 


way in which the research has been planned and 
brought to a successful termination, The ardu- 
ous work of applying the tests, correcting the 
test-papers, calculating the statistics, and pre- 
senting the conclusions, has been admirably 
carried out. Professor McClelland has availed 
himself of every technical device to safeguard 
his analysis against any fundamental criticism 
that might be urged by the psychological or the 
statistical expert; and yet he has succeeded in 
describing both his methods and his conclusions 
in a manner that is at once lucid, readable, and 
intelligible to the practical man. 


[ 1003 ] 
X*McCrov, CHanLEs Haroto. Tests and Measure- 
ments in Health and Physical Education, Second 
Edition, New York: Appleton-Century-Crofts, Inc., 
1942. Pp. xxiii, 412. $3.25. * (PA 16:3295) For reviews 
of the first edition, see 40 :B1003. 

[ 1004 ] 
X*McCLov, CHARLES HAROLD, AND CARPENTER, AILEEN. 
Laboratory Manual for Tests and Measurements 
in Health and Physical Education. New York: 
Appleton-Century-Crofts, Inc., 1941. Pp. 140. $1.25. 

[ 1005 ] 
MACFARLANE, JEAN WALKER. Methodology of Data 
Collection and Organization: Studies in Child 
Guidance, I. Monographs of the Society for Research 
in Child Development, Vol. 3, No. 6, Serial No. 19. 
Washington, D. C.: the Society, National Research 
Council, 1938. Pp. vii, 254. Paper, lithotyped. $2.00. * 
(PA 13:3334) 

Am J Orthopsychiatry 12:375 Ap ’42. 
L(awson) G. L(owrey). * The form of the 
experiment—mental hygiene guidance for one 
group of children, with a matched control group 
—is exciting, but the organization to carry it out 
is seriously lacking. It tends to become just an- 
other study. Unfortunate. Nevertheless, it is 
excellent material for young workers if ade- 
quately discussed. 

Brit J Ed Psychol 12:80 F ^42. * a full, 
elaborate, detailed report of research into im- 
portant questions * will be of service to other 
researchers in cognate subjects, and to students 
of child development * 


[ 1006 ] 

*Macuover, Soromon, Cultural and Racial Varia- 
tions in Patterns of Intellect: Performance of 
Negro and .White Criminals on the Bellevue 
Adult Intelligence Scale. Columbia University, 
Teachers College, Contributions to Education, No. 875. 
Percival M. Symonds, faculty sponsor. New York: 
Bureau of Publications, the College, 1943. Pp. v, 9r. 
$1.60. * (PA 17:2831) 


Am J Sociol 49:274 N ’43. Robert E. L. 
Faris. * There are some interesting ways in 
which the southern Negroes show superiority 
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to the northern Negroes in some of the sub- 
tests. * 

Am Sociol R 8:736-7 D '43. Raymond F. 
Sletto. * This new evidence regarding the rela- 
tive weights of race and culture upon intelli- 
gence test performance reinforces the position 
of those who have maintained that comparisons 
between racial groups are invalidated by cul- 
tural differences. Another interesting finding is 
that southern Negroes compare more favorably 
with northern Negroes on verbal subtests of the 
Bellevue Scale than they do on some of the 
non-verbal tests, a finding of particular signifi- 
cance since it bears upon the assumption that 
non-language tests are relatively free from the 
limiting effects of culture. * Machover points 
out that the test pattern found to be associated 
with marked cultural restriction is similar to 
patterns which in clinical practice have been 
found to be indicative of mental pathologies, 
and cautions against use of subtest profiles in 
clinical diagnoses without first inquiring into 
the cultural antecedents of the subject. This 
monograph is, indeed, a contribution to more 
intelligent use of intelligence scales. 

J Speech Disorders 10:56 Mr '45. W. Elias- 
berg. * The author rightly points out the re- 
strictions of mere quantification (p. 88). Par- 
ticulars about the quantification method used 
by the author must be read in the original. * 
Derived from an investigation of the perpetra- 
tor, this book is apt to throw light on the gen- 
eral theory of intelligence. 

Prison World 7:35 S-O '45. 

Q R Biol 19:75 Mr '44. 


[ 1007 ] 
*McHucn, Geroto. Changes in I. Q. at the Public 
School Kindergarten Level. American Psychologi- 
cal Association, Psychological “Monographs, Vol. 55, 
No. 2, Whole No. 250. Washington, D. C.: the Asso- 
ee Inc., 1943. Pp. v, 34. Paper. $0.75.* (PA 18: 
327 

[ 1008 ] 
XMcKoww, Harry C. How to Pass a Written Ex- 
amination. New York: McGraw-Hill Book Co., Inc., 
1943. Pp. xi, 162. $1.50. * (PA 18:299) 


H Sch ] 27:112-3 My "44. A. M. Jordan. 
* Many of the suggestions in this book are 
good, some of them are puerile, and some can't 
be demonstrated to be true. There are entirely 
too many to be remembered. Imagine trying to 
remember 150 suggestions about how to pre- 
pare for an examination while preparing for a 
difficult examination in chemistry. I don't like 
the book because the whole emphasis is a false 


one. We have enough gadgets in education to 
declare a moratorium on them for 1000 years. 
The problem of education is not one of prepar- 
ing for examinations; it is one of acquiring and 
making permanent worthwhile learnings. How 
to Pass a Written Examination takes some of 
the well substantiated principles on how to 
study and incorporates them in its suggestions 
for passing an examination. 

J Bus Ed 19:33 Ja '44. 

J Higher Ed 15:397-8 O '44. Benjamin 
Bloom. * written in a moralistic tone * In the 
three chapters on preparation for the examina- 
tion, the author offers a number of good sug- 
gestions, but these are overelaborated. * The 
general suggestions for taking the examination 
have to do with the physical arrangements, 
habits of work, and ethics. The obvious nature 
of these suggestions is somewhat annoying ; for 
example, "get to the examination on time," 
“make yourself physically comfortable,” “ignore 
all distractions,” “when time is up, stop,” and 
“be honest.” * It is unlikely that the average 
student as a result of reading this book, will 
change his examination marks from failure to 
excellent. He may, if he follows some of the 
suggestions given, earn additional examination 
credit without increasing his comprehension of 
the subject-matter of the course. An examina- 
tion is merely a sample of the information, un- 
derstandings, and skills the student is expected 
to acquire as a result of a particular course of 
study. Preoccupation with the sample causes 
the students to lose sight of the main objectives 
and the whole range of material involved in the 
course. This book is primarily devoted to aid- 
ing the student to demonstrate competence on 
the examination sample without necessarily 
achieving the major objectives of the entire 
course. The majority of the suggestions offered 
are so obvious that most students should know 
them after taking one or more examinations. If 
the student has not already learned most of 
them as a result of a limited experience with 
tests, he will have difficulty in absorbing the 
more than 150 suggestions presented in this 
book. 

Occupations 22:279 Ja '44. Benjamin E. 
Mallary. This reviewer, as a member of a state 
Civil Service Commission, finds the implica- 
tions of Harry McKown's contribution as use- 
ful to him as its practical suggestions should be 
to future examinees. A wide dissemination of 
test-taking techniques will almost through 
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necessity force the test constructors into the de- 
fensive position of strengthening the validity of 
their work. In almost all areas of personnel 
work validity is taken too much for granted and 
our major effort is expended on building up a 
ritual of techniques into an atmosphere of re- 
spectable reliability. * The reviewer agrees with 
the publishers that the McKown book “....pro- 
vides practical help for any student or adult 
taking written examinations. It explains....how 
to prepare for them emotionally, physically, and 
mentally... Written in interesting, informal style, 
the book offers more than 150 definite sugges- 
tions (reviewer found more than twice that 
many)...Reflecting actual practice, with little 
theory, this book can be used for any subject 
field in which written examinations are given." 
The practical experience behind the book and 
its obvious lack of wordiness and theory should 
not make it taboo in academic circles. It is dif- 
ficult to pick out one practical suggestion in 
the book which cannot be supported by the 
findings of scientific research. It is apparent 
throughout that the author had only one goal— 
to assist individuals in developing ability to 
pass any type of examination. He has done that 
job well without any toadying to the ubiquitous 
pseudo-respectable. This book is a "must" in 
practical group guidance classes. It will pro- 
vide basic material for the teacher and serve 
well as a reference for the student. In schools 
without benefit of such courses it should be part 
of the next-best guidance methed, a five-foot 
vocational book shelf in the school library—not 
in the principal's office. 

Pub Personnel R 5:186-7 Jl’44. Norman J. 
Powell. * Both ethical and technical considera- 
tions combine to make real, though currently 
minor, the issue of the extent to which the cen- 
tral personnel agency should concern itself in 
the test preparation methods used by aspirants 
for civil service posts. It is with considerable 
interest, then, that Mr. McKown’s book may be 
approached by the public personnel technician. 
Focussed about the schoolroom teacher-student 
relation, the book is as easy to read as a primer. 
* platitudinous and....faintly comic [statements 
do] not pervade the entire text. Here and there, 
* a rather sensible suggestion is presented. At 
one point,-for example, the student is advised 
to make out a set of examination questions him- 
self and practice writing out the answers. Other 
advice to the examinee includes the exhorta- 
tion that he “read and follow the directions 


most carefully” and that he “plan a time sched- 
ule for the examination.” According to the 
jacket, more than 150 suggestions are set forth. 
The book would have made a first rate pam- 
phlet of some four pages. Expanded into short 
book length, the material is thin as the edge of 
a razor, but less sharp. 

Sch Sci & Math 44:683 O '44. C. A. Stone. 
* an excellent book for those....taking written 
examinations * written in a very interesting 
style and offers a large number of definite sug- 
gestions, nearly all of which are illustrated by 
actual examination questions, settings or condi- 
tions * The reviewer recommends this book as 
an addition to the students' library. 


[ 1009 ] 
*McManama, Maurice. A Genetic Study of the 
Cognitive General Factor in Human Intelligence. 
Catholic University of America, Studies in Psychol- 
ogy and Psychiatry, Vol. 4, No. 2. Washington, D. C.: 
Catholic University of America Press, 1936. Pp. vii, 35. 
Paper. $1.00. * (PA 10:4695) 

[ 1010 ] 
*McManus, R. Louise, XM Guide on Evalua- 
tion: Suggestions for Faculty Committees and 
Other Groups Studying Evaluation in Nursing. 
Printed with the approval of the Committee on Meas- 
urement and Educational Guidance. New York: Na- 
tional League of Nursing Education, 1944. Pp. 32. 
Paper. Out of print. * 

Am J Nursing 44:1202 D '44. H(elen) W. 
M(unson). This useful guide discusses the 
functions of the evaluation program of the 
school of nursing, suggests how committees and 
work groups may be organized, and outlines 
problems for group study. Types of tests are 
described, and their purposes and selection in 
relation to the particular abilities to be ap- 
praised. Examples of test items are given and 


suggestions for giving and scoring them. * 


[ 1011 ] 
MACMEEKEN, A. M. The Intelligence of a Repre- 
sentative Group of Scottish Children. Publications 
of the Scottish Council for Research in Education, 
[No.] IS. International Examinations Inquiry. Lon- 
don: University of London Press, Ltd., 1939. Pp. xvi, 
144. 55. * (PA 14:789) 

Ed Res B 19:499-500 N 20 '40. Francis N. 
Maxfield. * This survey sets a high standard in 
normative procedure. Additional statistical 
analyses of the data are promised. 

Librarian 29:160 Mr "40. 

For additional reviews, see 40:B1006. 


[ 1012 ] 
*McNeEmar, Quinn. The Revision of the Stanford- 
Binet Scale: An Analysis of the Standardization 
Data. Introductory chapter by Lewis M. Terman. Bos- 
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ton, Mass.: Houghton Mifflin Co., 1942. Pp. ix, 189. 
$3.30.* (London: George G. Harrap & Co., Ltd. 
15s.) (PA 16:3329) For reviews, see 293. 


[ 1013 ] 
*Mapison, THuRBER H. Interval Discrimination as 
a Measure of Musical Aptitude. Archives of Psy- 
chology, No. 268. Washington, D. C.: American Psy- 
chological Association, Inc., June 1942. Pp. 99. Paper. 
$1.25. * (PA 17 :238) 


[ 1014 ] 
*MauLER, Warr R. Twenty Years of Merit 
Rating, 1926-1946: Selected, Annotated and 


Classified References From Industry, Business, 
Government, and Education, Experience and Re- 
search. New York: Psychological Corporation, 1947. 
Pp. viii, 73. Paper. $1.75, postpaid. (PA 22:488) 


Occupations 26:130 N ’47. M. W. Richard- 
son. * a competently performed collection of 
bibliographical references * The range of refer- 
ences indicates the recognition of the compiler 
that merit rating has close relationships to test- 
ing and other forms of personnel measurement. 
This condition may confuse the person who 
wants an immediate and "practical" answer to 
the question of what rating device to use. Minor 
faults can be found by anyone reasonably famil- 
iar with the field. Some of the references are 
worthless, in the opinion of this reviewer, for 
one reason or another. Some of them are merely 
enthusiastic blurbs of the amateur, and some 
are rehashes of the older techniques, scarcely 
concealed by a few dress-up. One significant 
omission is Fryer's comprehensive treatise of 
the subject in full-length book form; this was 
clearly an oversight. Other omissions not charge- 
able to the compiler are certain unpublished or 
unpublicized studies known to represent signifi- 
cant advances in the field. The work of the 
Personnel Research Section of the Army in a 
comprehensive and creative study of the merit 
rating problem, culminating in several series of 
restricted rating devices, is notable. Many inves- 
tigators, including Wherry, Henry, Bradshaw, 
Bellows, Remmers, Fryer, and other profes- 
sional personnel men, developed new techniques 
and methods designed particularly to reduce the 
tendency to over-rate personnel. The series of 
investigations conducted by this group prob- 
ably represents the largest research effort ever 
Applied directly to the problem of merit rating. 
The present reviewer is of the opinion that the 
skillful use of certain new forced-choice tech- 
niques and the separation of evaluation from 
reporting represent the greatest single advance 
in choice of method. The techniques and pro- 
cedures are currently being applied to industrv 


by one firm. The reviewer missed references to 
large-scale handling of rating data, particularly 
by use of punched-card techniques including 
mark-sensing and machine scoring. No doubt 
many local applications of merit rating tech- 
niques in industry have been missed, for rea- 
sons not the fault of the compiler. One could 
wish that the excellent factor analysis study of 
rating data by Horst and another by Rogers 
were more generally available. These two stud- 
ies and the Army studies are mutually corrobo- 
rative with respect to the basic factors that can 
be measured by ratings. H. S. Belinsky, here 
cited, has an excellent Master’s Dissertation on 
Ratings in the Library, University of Chicago. 
This reviewer wonders whether the man-to- 
man rating method (at least as originally con- 
ceived) isn't overdue for relegation to ‘the his- 
torical category. The compiler has done a useful 
job, but it is to be hoped that the compiler or 
someone else will write a more critical sum- 
mary of work on Merit Ratings. 


[ 1015 ] 
*MancusE, Irene. Applied Graphology: How to 
Analyze Handwriting. Introduction by Ruth Hamp- 
e New York: Anatolia Press, 1945. Pp. vi, 150. 

.50. 

[ 1016 ] 
*MancuLrs, Heren. Rorschach Responses of Suc- 
cessful and Unsuccessful Students. Archives of 
Psychology, No. 271. Washington, D. C.: American 
Psychological Association, Inc, July 1942. Pp. 6r. 
Paper. $1.00. * (PA 17:963) 

[ 1017 ] 
XMansnALL, M. V. An Introduction to Tests and 
Testing. Toronto, Canada: Ryerson Press, 1944. Pp. 
E fe. (Boston, Mass.: Bruce Humphries, Inc. 

.00. 

[ 1018 ] 
Marston, WitLiIAM M. The Lie Detector Test. 
New York: Richard R. Smith, Inc., 1938. Pp. 179. $2.00. 
* (PA 12:2915) 

J Crim Law & Criminology 29:305-8 Jl-Ag 
'38. Fred E. Inbau. In the opening chapter of 
this book the author makes a favorable com- 
parison between himself and God. To God he 
gives credit for detecting the first lie ever told 
—that related by Adam in the Garden of Eden; 
to himself he pays full respect for “discover- 
ing” in 1915 the world’s first blood-pressure 
test for detecting deception. The reader is then 
whisked away on an Alice-in-Wonderland jour- 
ney through the fields of lies, liars, and lie- 
detectors. Hardly before the reader knows what 
a “lie-detector” might be, he is informed (on 
pages 15, 16, and 17) that it will “cure crime 
itself when properly administered,” that it’can 
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"change criminals into honest men," and that 
“there is no reason why go per cent of society’s 
crime bill cannot be written off by the Lie De- 
tector when it is properly used and universally 
accepted.” Wonderful !—if true. It seems, how- 
ever, that in making such sweeping statements 
Marston forgets that in his book he is supposed 
to play the role of a scientist and not that of 
a popular magazine writer, or newspaper re- 
porter, or special guest on some advertiser's 
radio program. Apparently the author has re- 
peated such exaggerations so often in Esquire, 
in This Week, and over the Nash and other 
radio programs that he really believes them 
himself, For the past five years your reviewer 
has devoted a considerable portion of his time 
to conducting "lie-detector" tests, as an asso- 
ciate of Leonarde Keeler at the Scientific Crime 
Detection Laboratory of Northwestern Univer- 
sity, from 1932 to 1938, and neither he nor 
Keeler is so optimistic of the future of the 
“lie-detector” as Marston purports to be. Al- 
though we have found the device to be very 
valuable in diagnosing deception, our experi- 
ence (Keeler’s for sixteen years, at the Labora- 
tory and elsewhere) in several thousands of 
actual cases, criminal and otherwise, does not 
in any way indicate that the “lie-detector” can 
reduce society’s crime bill by 90 per cent or 
that any such instrüment operates with an ac- 
curacy of 97 per cent as stated by Marston. * 
on the basis of our experience with the method 
claimed by Marston as his “discovery” and with 
other techniques as well, we seriously question 
the validity of his claims both as to the ulti- 
mate value of his “discovery” and also its al- 
leged accuracy. Marston devotes many pages of 
this book to vituperative remarks about many 
other workers in the field who have at some 
time or other been labelled (by newspapers, 
popular magazines, etc.) “inventors of the lie- 
detector.” According to the author these per- 
sons usually avail themselves of the “Marston 
test” with “an assemblage of standard appara- 
tus.” In other words, all that followed Marston’s 
observations in 1915 is only of incidental value 
as compared with his “discovery.” Although 
the author exhibits great concern about others 
wearing his crown of “discoverer” (or "in- 
ventor")—so much so that some of his attacks 
come close to being libelous—he neglects to 
adequately mention the contributions of his 
predecessors who might well have thought they 
had as much right to the crown as Marston 
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himself. Years before Marston’s “discovery,” 
Cesare Lombroso used a plethysmograph for 
the purpose of detecting deception. * In Chap- 
ter 3 Marston reproduces a dialogue which he 
and his former teacher, Hugo Munsterberg, 
were supposed to have had together. The reader 
is given the impression that Marston astonished 
his professor when he informed Munsterberg 
about his “discovery.” Yet, in 1908 Munster- 
berg, in his book, “On the Witness Stand,” dis- 
cusses the effects of emotional changes upon 
blood pressure, pulse, respiration, psychogal- 
vanic reflex, etc., and the possibility of utiliz- 
ing such reactions in detecting deception. After 
reading Munsterberg's chapter “The Traces of 
Emotion” one wonders whether the professor 
himself should not receive a little credit for 
Marston's “discovery.” To avoid any misinter- 
pretation of the foregoing criticisms of Marston 
and his book the reviewer wishes it to be dis- 
tinctly understood that while the facts of his- 
tory tend to contradict Marston's claims of 
“discoverer,” he is deserving of much credit 
for his experiments and early research in the 
detection of deception. And a number of other 
workers in the field have accorded him such 
recognition, ever one of those—Keeler—who 
bears the brunt of many of Marston’s deroga- 
tory remarks concerning "inventors" and “dis- 
coverers” of the “lie-detector.” * As a matter 
of fact, there is no discoverer or inventor of the 
“Jie-detector.” The theory of detecting decep- 
tion by means of observations and recordings 
of physiological changes developed by slow de- 
grees from many and varied sources which 
extend back several centuries. No one person 
has contributed anything sufficiently new or 
revolutionary to merit the acclaim of the world 
as “discoverer” or “inventor.” * When Marston 
states in Chapter 5 (entitled "The Lie Detector 
Goes to Court") that the “lie-detector” is legal 
and that its results are admitted as evidence in 
the courts of four states, and “probably in three 
or four others” he is in error. Although a few 
trial court judges have permitted its use, no 
appellate court in this country has upheld such 
evidence. On the other hand, the Supreme 
Court of Wisconsin has held the “lie-detector” 
and its results inadmissible, and a Federal Cir- 
cuit court of appeals denied its admissibility in 
one of Marston’s own cases. * Marston....recom- 
mends the tools he used in 1915—an ordinary 
sphygmomanometer and a stethoscope, the same 
equipment a family physician carries around 
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with him from day to day. * But whatever ap- 
paratus may be used, Marston's instructions are 
not sufficiently well stated to be of much prac- 
tical value. In the entire book there are only 
four illustrations of “lie-detector” charts or rec- 
ords (one of Marston's “discontinuous” or non- 
recording method, and three of the “continu- 
ous" or recording methods of others). Only 
one of these, that appearing between pages 98 
and 99, gives any-indication whatsoever of the 
criteria used in diagnosing deception by means 
of blood pressure and respiratory recordings. 
This book is practically useless as a guide to a 
person who seeks to detect deception by means 
of a “lie-detector.” It is not possessed of suffi- 
cient interest to warrant the attention of the 
average reader who seeks an insight into 
the catacombs of "science." And because of the 
manner in which it is written—with its self 
praise unconcealed beneath a guise of bashful 
modesty, with its unjustifiable attempts to dis- 
credit the efforts of other workers, and a denial 
of proper credit to the author’s predecessors— 
it can only bring ridicule upon the subject mat- 
ter and disrespect for its author. 

For additional reviews, see 38:B426 and 
40:Bror4. 

[ 1019 ] 

Mase, Wayne E. A Self-Rating Scale for School 
Custodians. Kansas State Teachers College of Em- 
poria, Bulletin of Information, Vol. 19, No. 10; Studies 


in Education, No, 18. Emporia, Kan. : the College, 1939. 
Pp. 24. Paper. $0.20. * 


Am Sch Board J 100:103 Ja *40. * an at- 
tempt to furnish a comprehensive outline by 
which custodians may improve their efficiency 
and their personal qualifications through self- 
rating. The scale makes it possible for the cus- 
todian to check himself and to find weaknesses 
where they exist. 


[ 1020 ] 
*Mastow, Paur. The Analysis and Control of 
Human Experiences: The Individual Seen 
Through Rorschach, Volumes 1 and 2. Brooklyn 2, 
N. Y.: the Author, 16 Court St., 1947. Pp. iii, 233; iv, 
195. Paper, mimeographed. $3.50 per volume. * (PA 
21:2326) For reviews, see 86. 


[ 1021 ] 
*Mastow, Paur. Rorschach Psychology. Brooklyn 
2, N. Y.: the Author, 16 Court St, 1945. Pp. 149. 
Paper, mimeographed. (PA 19:3409) For latest edi- 
tion, see 1020. For reviews, see 87. 


[ 1022 ] 
*Mastow, Paur. Rorschach . Brooklyn 2, 
N. Y.: the Author, 16 Court Dae Bo. Paper, 
ea graphed; For latest edition, see 1020. For re- 
view: e 


[ 1023 ] 

*Maurer, Katuarine M. Intellectual Status at 
Maturity as a Criterion for Selecting Items in 

ool Tests. University of. Minnesota, Institute 
of Child Welfare Monograph Series, No. 21. Minneap- 
olis, Minn. : University of Minnesota Press, 1946. Pp. 
ix, 166. Lithotyped. $2.50. * (London: Oxford Univer- 
E Press. 15s. 6d.) (PA 21:286) For reviews, see 
288. 


[ 1024 ] 
*Meent, Pauk E. An Investigation of a General 
Normality or Control Factor in Personality Test- 
ing. American Psychological Association, Psychologi- 
cal EROS Vol. No. 4, Whole No. 274. 
Washington, D. C.: the Association, Inc., 1945. Pp. vi, 
62, Paper. $1.50. * (PA 21:510) 

[ 1025 ] 
*Menver, Acrrev O. Personality in Handwriting: 
A Handbook of American Graphology. Foreword 
by Rudolf Arnheim. New York: Stephen Daye Press, 


1947. Pp. 375. $3.50. * (PA 22:2156) 

[ 1026 ] 
MexzeL, Eur, W. Eng peetione for the Use of New- 
Type Tests in India, Teaching in India Series, No. 8. 
Bombay and London: Oxford University Press, 1930. 
Pp. xv, 261. For latest edition, see 1027. 


Indian J Psychol 14:235 O '39. H. Maiti. * 
Though meant as a simple non-technical guide 
book for the average teacher, the volume under 
review makes out a very strong case for the use 
of the New-type Tests in our schools and train- 
ing institutions. The language is clear and easy, 
the descriptive illustrations plentiful, and the 
directions are simple and fairly complete. * 
Though the list of Tests already constructed 
given in the pages of the book is not complete, 
one may have from it a fairly good idea of the 
nature and extent of Testing work in India at 
present. There are four appendices, the last one 
giving a very useful bibliography on the subject 
of Testing. The value of the book has been in- 
creased to a great extent by an easily intelligi- 
ble account of the statistical methods which 
usually constitute a bugbear to the average 
teacher in India. On the whole, the book will 
undoubtedly serve a useful purpose at the pres- 
ent stage of growth of Indian education by pro- 
viding much necessary information and guid- 
ance to the average teacher on the subject of 
the New-type Tests. 

J Ed (London) 72:434 S "40. This is a sim- 
ply-worded, clear summary of English and 
American discussions on the nature of new- 
type tests and the reasons for their usefulness 
in the class-room. The author very wisely 
warns Indian teachers of the dangers of indis- 
criminate application of tests of intelligence, 
and indicates the necessity for training in the 
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interpretation of their results. It is to be feared, 
however, that his book may lead to large-scale 
construction of attainment tests which, while 
employing the superficial devices of "stand- 
ardized" tests, lack the preliminary thorough- 
ness of analysis which constitutes their chief 
value. The author mentions the importance of 
diagnosis in education; but his book would have 
helped teachers more had he given definite guid- 
ance as to the use not only of the new-type 
tests themselves but also of the results ob- 
tained from their employment. Representative 
case-studies and suggestions for remedial tui- 
tion and individual work would have lent added 
interest to a book which, in spite of minor in- 
accuracies in test-titles, etc., is obviously writ- 
ten by one who knows Indian schools and is 
familiar with western discussions on educational 
measurement. 
For additional reviews, see 40:B1017. 


[ 1027 ] 
*Menzet, Emm W. Suggestions for the Use of 
New-Type Tests in India, Second Edition. Bom- 
bay'and London: Oxford University Press, 1942. Pp. 


xiii, 283. Rs. 3; 4s. 6d. * 


[ 1028 ] 
*MrrTROPOLITAN LIFE INSURANCE COMPANY, INDUS- 
TRIAL HrALTH Section. Methods of Testing and 
Protecting Eyesight in Industry. Industrial Health 
Series, No. 4, New York: Metropolitan Life Insurance 
Co., 1945. Pp. 70. Paper. Gratis to schools and libra- 
ries. 

[ 1029 ] 
kMer Neat E., Eprtor. Psychological Research 
on Pilot Training. Army Air Forces Aviation Psy- 
chology Program Research Reports, Report No. 8. 
Washington, D. C.: Government Printing Office, 1947. 
Pp. xix, 488. $1.75. * (PA 22:4153) 

[ 1030 ] 
Monror, PauL, Eprrog. Conference on Examina- 
tions: Under the Auspices of the Carnegie Cor- 
poration, the Carnegie Foundation, and the 
International Institute of Teachers College, Co- 
lumbia University at the Hotel Royal, Dinard, 
France, September 16 to 19, 1938. New York: Bu- 
reau of Publications, Teachers College, Columbia 
Men 1939. Pp. xiii, 330. Out of print. * (PA 13: 
5375 


B Int Bur Ed 13:137-8 q 3 ’39. * Some 
first-rate reports were presented * No educa- 
tionist should omit reading this book. 

Brit J Ed Psychol 10:251-2 N '40. A. P. 
B(raddock). * This Conference was no mere 
play. The programme was full and the papers 
contained meat in abundance. * a useful addi- 
tion to the literature that has recently grown 
up round the vexed question of examinations, 
even while one notes that the discussions were 


not confined to this topic solely. At an interna- 
tional conference it is to be expected that one 
would learn what measures are taken by people 
elsewhere to probe the effects of education and 
for purposes of selection. The report does not 
disappoint the reader. * The difficult problem 
and methods of examining composition were 
discussed, yet with the usual lack of finality. 
The conclusion that is left in the reviewer's 
mind is that unless and until a panel of exami- 
ners agrees to mark upon a preconceived plan, 
and adheres closely to it while marking, the 
variations complained about between the marks 
will always occur. * M. Declos' suggestion of 
the writer's personality is worth attention. The 
student of comparative education will find mate- 
rial for study in this report * the deliberations 
of a distinguished body of people like that in 
consultation at Dinard in 1938 cannot fail to 
have manifold interest for educationists * 

J Ed (London) 73:66 F '41. * Among the 
many interesting topics discussed we note the 
inquiry of a French Commission on the bac- 
calauréat, the relations between the École Nor- 
male Supérieure and the examination system, 
the selection of secondary school entrants in 
France, and Sir Philip Hartog's account of an 
investigation of the English essay. The whole 
report appears to be verbatim, even to the ex- 
change of pleasantries and courtesies between 
the members of the conference. 

J Ed Res 34:143-5 O '40. Harold Gulliksen. 
* This book is primarily interesting for the pic- 
ture it gives of the educational system and the 
state of development of test procedures in each 
of the countries represented; and also because 
the verbatim report of the discussions gives an 
impression of the different personalities involved. 

J Excep Child 7:209—10 F *41. Samuel R. 
Laycock. * noteworthy for two things. First, it 
furnishes a great deal of information concern- 
ing the French system of education and particu- 
larly of the French attempts to solve the prob- 
lem of examinations in their secondary schools. 
Secondly, it carries the discussion of examina- 
tions beyond the stage of merely discussing re- 
liability to a discussion of the nature of the abili- 
ties being tested. It would seem that this dis- 
cussion of the nature of the abilities being 
tested is fundamental to further progress in 
refining the method of examinations. The ques- 
tion of the degree to which ability is general or 
special is important as is also the question of 
the nature of linguistic ability—oral and writ- 
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ten. One cannot adequately assess the value of 
viva voce examinations or of the traditional 
essay type of examination unless the relation of 
linguistic ability to other abilities is known. The 
volume is bound to provoke thought among 
those that read it. It is noteworthy because it 
points the way to the next stage of investigation 
rather than for the solutions that it offers. 
For additional reviews, see 40:Brozr. 


[ 1031 ] 
*Moonr, HrnsERT. Experience With Employment 
Tests. Conference Board Reports, Studies in Person- 
nel Policy, No. 32. New York: National Industrial 
Conference Board, Inc., March 11, 1941. Pp. 72. Paper. 
Out of print. * (PA 15:3965) 


[ 1032 ] 

*Moore, HrmsERr. Psychology for Business and 
Industry, Second Edition. New York: McGraw- 
Hill Book Co., Inc., 1942. Pp. xiv, 526. $4.50. * (Lon- 
don: McGraw-Hill Publishing Co., Ltd. 24s. (PA 
17 :9024) 

Occupations 21:618 Ap '43. Joseph S. Kopas. 
* written in a simple, popular style * describes 
a variety of business and industrial testing and 
personnel practices * well illustrated and well 
organized * stresses theoretical ideas that in- 
dustrial people talk about, but which only a few 
in some of the larger industrial organizations 
practice. It is a good reference for someone 
who wants to learn what is going on, but it 
does not go enough into detail to help an indi- 
vidual put some of the procedures described 
into practice. It is up to date, but some of the 
most significant recent developments on the 
practical front have been omitted. * There is 
very little material in the book on recent labor- 
management activities, which are also signifi- 
cant and basic recent developments. In spite of 
these shortcomings, Psychology for Business 
and Industry is one of the best industrial psy- 
chology books in the field today, and should be 
a part of every vocational counselor's library. 


[ 1033 ] 
*MoreEno, J. L. Foundations of Sociometry: An 
Introduction. Sociometry Monographs, No. 4. Re- 
printed from Sociomeiry, Vol. 4, 1941. New York: 
Beacon House, 1945. Pp. 15-35. Paper, lithotyped. $1.25. 
* (PA 15:3521; 19:2004, title only) 

[ 1034 ] 
XMonzxo, J. L. Sociometry and the Cultural Order. 
Sociometry Monographs, No. 2. Reprinted from Soci- 
ometry, Vol. 6, 1943. New York: Beacon House, 1943. 
Pp. 299-344. Paper. $1.25. * (PA 18:2186) 


[ 1035 ] 


*Moreno, J. L., ann JeNNINGS, HELEN H. Sociome- 
tric Control Studies of Grouping and Regrouping: 
With Reference to Authoritative and Democratic 


Methods of Grouping. Sociometry Monographs, No. 
7. Originally printed in Sociometric Review, February 
1936; reprinted in Sociometry, November 1944. New 
York: Beacon House, 1947. Pp. 23. Paper, lithotyped. 
$1.25. * (PA 19:1748) 

[ 1036 ] 
*MonENo, J. L., AND JENNINGS, Heren H. Sociome- 
tric Measurement of Social Configurations Based 
on Deviation From Chance. Sociometry Mono- 
graphs, No. 3. Reprinted from Sociometry, Vol. 1, 1938. 
New York: Beacon House, 1945. Pp. 35. Paper. $1.50. 
* (PA 12:6023; 19:3436, title only) 

[ 1037 ] 
*Moreno, J. L.; JENNINGs, H. H.; AND SARGENT, 
(See Time as a Measure of Inter-Personal Re- 
lations. Sociometry Monographs, No. 13. Reprinted 
from Sociometry, Vol. 3, 1940. New York: Beacon 
House, 1947. Pp. 19. Paper, lithotyped. $1.25. * (PA 
14:3663) 

[ 1038 ] 


*Morcan, Howarp K. Industrial Training and 
Testing. New York: McGraw-Hill Book Co., Inc., 
1945. Pp. x, 225. $2.50. * (London: McGraw-Hill Pub- 
lishing Co., Ltd., 1946. 15s.) (PA 20:543) 


Am J Psychol 59:713-4 O °46. D. R. Miller. 
* The training chapters have one asset in com- 
mon with their sources; they cite concrete pro- 
cedures to illustrate most of the principles dis- 
cussed, but they suffer from the drawbacks of 
popular approaches based on common-sense ob- 
servations rather than on empirical data. * The 
application of this popular approach to topics 
nearer the province of psychology, e.g. testing, 
rating and counseling, leads not only to super- 
ficiality but to many inaccuracies as well. The 
author’s estimate of psychology is best illus- 
trated by his statement that it is not necessary 
for a trained psychologist to direct a firm's test- 
construction division. It is, as he believes, only 
necessary for the director to be well-versed in 
a few commonly recognized principles of psy- 
chology. Among such ‘principles’ he seems to 
include his notion of an extravert as a person 
who is “very arrogant, loud, and thick-skinned.” 
With this curious conception of psychology, it 
is not surprising that less than five pages are 
devoted to the use and interpretation of the nor- 
mal curve, percentiles, standard scores and cor- 
relation, and but seven pages to the construc- 
tion, grading and interpretation of psychological 
tests. The superficiality of the approach to this 
general subject may be judged from the author’s 
statement that job-interest, general intelligence, 
social intelligence, planning, drive and follow- 
through, “in short, the worker’s personality,” 
can be measured by “two simple tests taking 
thirty minutes.” Recent developments in job- 
rating and in counseling are ignored. A five- 
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step rating scale is recommended which the 
author naively claims will tend to break down 
the halo-effect despite the fact that it contains 
such subjective terms as ‘quality,’ ‘versatility’ 
and 'dependability.' The scale is apparently in- 
tended for rating all types of jobs from un- 
skilled, manual and repetitive to highly skilled 
technical and planning. Few psychologists will 
agree with the author's recommendations that 
the counselor be empowered to take such direc- 
tive steps as advising the worker to resign, 
reporting results of certain discussions with 
workers to their immediate supervisors, and 
vigorously condemning the faults of "trouble- 
some workers" during counseling conferences. 

Ed 66:415 Mr '46. William P. Sears. * de- 
scribes how to test new employees simply and 
effectively * Although the book is directed to 
those responsible for training in industry itself, 
vocational educators, especially those in the 
field of trade and industrial education, will find 
the book very valuable as they set up their 
plans for the selection of trainees, for the train- 
ing of those selected, and for the placement and 
follow-up of those who complete their training 
courses. 

Ed Res B 26:82-3 Mr 16 "47. Robert E. 
Smith. * presents many helpful suggestions for 
the inauguration and development of a program 
of selection and training of workers, and should 
be very useful to persons interested in indus- 
trial personnel work. 

Occupations 25:137 N '46. Arthur F. Dodge. 
* The chapters on training programs, training 
department organization, instruction methods, 
and introductory and shop training are espe- 
cially good. Also, because of the increasing use 
of counselors in industry, the chapter on the 
counselor is of interest. The duties, methods, 
and some of the problems of the counselor are 
well stated. The chapter on service and sales 
training contains many helpful suggestions * 
The two chapters on “work review" or rating 
give very careful instruction for the use of a 
specific rating sheet which the author recom- 
mends for rating all employees—supervisors as 
well as rank and file workers. This form re- 
quires each supervisor to rate each of his sub- 
ordinates on twelve factors intended as a meas- 
ure of job performance, and also on twenty-five 
additional factors intended to measure person- 

ality. This reviewer doubts that increased reli- 
ability is secured by extending the number of 
rated items beyond twelve or fifteen. The chap- 


ters on testing programs and selected tests are 
disappointing. The author recommends twelve 
specific tests but fails to indicate why each was 
selected. There is no mention of validity or re- 
liability in describing these tests. It is interest- 
ing to note that the author's entire testing and 
rating program is largely built around two tests : 
The Personality Index and the Kuder Prefer- 
ence Record. * 

Psychol B 43:483-4 S '46. Bruce V. Moore. 
This book is written by an engineer to present 
procedures in testing and training which have 
been empirically developed. To a psychologist 
the treatment is not acceptable as adequate, 
effective, or accurate. * Eleven more or less 
standardized tests are briefly described and 
recommended as the most valuable. There is 
little recognition of the fact that a test is good 
or valid only insofar as it is valid for a specific 
purpose in a particular employment situation. 
Simplified norms are given for dividing appli- 
cants into five groups or levels by the tests. 
Subjective judgment is apparently thought to 
be adequate for deciding what the score or level 
should be for success in any particular type of 
work. The organization of a training depart- 
ment is outlined, and specific instructional meth- 
ods prescribed for shop, sales, and supervisory 
training. In these suggestions is incorporated 
much good sense based on extensive experi- 
ence. * Although not acceptable for college 
classes, it might be used in extension or in-serv- 
ice training courses with considerable explana- 
tion and supplementary discussion. It would 
be difficult for anyone with no previous knowl- 
edge of the subject to get reliable help from the 
book, especially in the use of tests, 


[ 1039 ] 
*Morse, Horace T., AND McCune, Grorce H. Se- 
lected Items for the Testing of Study Skills. 
National Council for the Social Studies, Bulletin No. 
15. Washington, D. C.: the Council, 1940. Pp. 72. 
Paper. Out of print. * (PA 15:531) 

Harvard Ed R 11:264-5 Mr "41. J. Wayne 
Wrightstone. * the authors have provided se- 
lected items for testing such objectives as open- 
mindedness, acquisition of social-studies data, 
use of common reference books, interpretation 
of tabular data, reading of circle, bar, and pro- 
file graphs, evaluating sources, drawing infer- 
ences from socio-economic data, and recognizing 
statements which support generalizations in the 
social studies. In an excellent introductory 
statement to the pamphlet the authors discuss 
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the teaching of study skills and how to use the 
bulletin effectively. They do not pretend to ex- 
haust the possible skills which may be impor- 
tant in social-studies instruction. They have 
provided many types of exercises, however, 
which the busy classroom teacher may use in 
devising tests and scales in order to estimate 
the status and growth of pupils in various study 
skills. The authors are to be commended for 
their initiative and skill in devising or selecting 
exercises of the type which appear in this pam- 
phlet. Such materials will be of benefit in in- 
terpreting the newer trends in appraisal to the 
classroom teacher of the social studies. Some 
of the exercises in the pamphlet may be criti- 
cized because they are awkward in wording and 
do not employ the most effective item construc- 
tion techniques. However, the authors indicate 
in their introduction that improvement and re- 
finement of the present exercises may come 
from the suggestions and criticisms of teachers 
who use this pamphlet. 

High Points 22:68-9 D '40. Howard L. Hur- 
witz. * includes four hundred and seventy-nine 
questions designed to aid in diagnosing the 
study skills of secondary school students. The 
questions fall into twenty-three sections con- 
taining exercises designed to determine open- 
mindedness, distinctions between sources and 
secondary accounts, bibliographical form, inter- 
pretation of tabular data, use of common refer- 
ences, recognition of common abbreviations in 
historical materials, reading of graphs, analysis 
of conflicting statements. Especially commend- 
able is “Section O. Exercise on Consistency.” 
A long statement of campaign promises is given 
in one paragraph, followed by a list of ques- 
tions that might be asked of the candidate for 
office. The problem is to mark those questions 
with an “X” where the candidate would have 
difficulty in answering directly, and to mark 
those with an “O” which would not involve an 
inconsistent answer. The encouragement of this 
type of analysis is especially timely in Novem- 
ber, but could be encouraged with profit as a 
daily exercise. The authors do not pretend to 
exhaust the possible skills which may be taught 
in the social studies, but append an excellent 
bibliography to guide the teacher who feels that 
valuable skills have been omitted. They en- 
courage individual teachers to construct their 
own tests and hope that their efforts will be fol- 
lowed up and criticized. The latter hope, at 
least, is not in vain. The fly-in-the-ointment- 


seeker will have a field day; for if ye seek, ye 
shall find, and it is not hard to find a number 
of items which, at first glance, are confused in 
wording, and at second glance do not measure 
any particular skill or are not answerable with 
anything like dogmatic assurance. It would be 
amazing if this were not the case. The authors 
are to be comniended for their patience and 
skill in drawing up the questions and introduc- 
tory notes. The day of the question-and-answer 
recitation may never pass, but it should move 
aside occasionally for the type of exercise illus- 
trated by the bulletin. * 


[ 1040 ] 
*Moyninan, James F, The Concept of the Syn- 
thetic Sense and a Technique of Its Measurement. 
Studies in Psychology and Psychiatry From the Cath- 
olic University of America, Vol. 5, No. 5. bees 
D. C.: Catholic University of America Press, July 
1942. Pp. vii, 48. Paper. $1.00. * (PA 17:431) 

[ 1041 ] 
X*Mvwnog, Rura Learnep. Prediction of the Ad- 
justment and Academic Performance of College 
Students by a Modification of the Rorschach 
Method. Foreword by Gardner Murphy. Applied Psy- 
chology Monographs of the American Association for 
Applied Psychology, No. 7. Stanford, Calif.: Stanford 
University Press, 1945. Pp. 104. Paper. $125. * (PA 
20:918) For reviews, see 89. 

[ 1042 ] 
Murray, HENRY A., AND OTHER WORKERS AT THE 
Harvarp PsvcHoLocrcAL Crrwic. Explorations in 
Personality: A Clinical and Experimental Study 
of Fifty Men of College Age. New York and Lon- 
don: Oxford University Press, 1938. Pp. xiv, 761. 
$7.50; 42s. * (PA 13:968) 


Am Anthrop 43:118-20 Ja-Mr "41. John 
Dollard. * A brief review can scarcely convey 
the ingenuity and subtlety of the conceptual at- 
tack on the problem; but one sees plainly the 
evidences of intellectual courage, intense appli- 
cation, and high talent for the task. Often, in- 
deed, Murray's paraphrases of the pet ideas of 
others improve upon the originals. For an ex- 
isting psychology which makes a good attempt 
at grappling with important personality prob- 
lems one must turn, of course, to Freudian psy- 
choanalysis, and this Murray and his co-work- 
ers have done. Many of their major orientations 
are drawn from this source, and they give an 
honest (if somewhat ambivalent) recognition to 
the bone labor already done in the field. They 
do not, to be sure, make use of Freud's best 
invention, to wit, his interviewing method, but 
this seems to have been due to the technical 
difficulty of getting so many subjects to accept 
it in a university community, rather than to a 
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lack of appreciation of its utility. * The inter- 
pretations necessary to identify the themas 
(press-need units) surviving from childhood 
often seem forced and do not have the punch 
that they have in actual psychoanalytic work * 
this reviewer sympathizes with the subject who 
said, “Why don't they ask me what they want 
to know; I'd tell them!” (paraphrase). * The 
spooky racial unconscious theory of Jung is 
treated with more respect than it deserves * 
The big problem....is...to test the claims of the 
various psychoanalytic schools, but particularly 
the Freudian, and make their valid knowledge 
public property. In essaying this, Murray has 
shown good sense and great boldness, although 
the result can be described at best (as Murray 
indeed does himself) as a first step. * The rat- 
ing idea is excellent and particularly the ap- 
pended device of the diagnostic council, bor- 
rowed from the clinical conference of physicians 
which often precedes diagnosis. Great ingenuity 
has been shown in devising tests, and great 
labor in carrying them out systematically. The 
correlation device of showing degree of associ- 
ation between needs is frequently used, and the 
operators have been laudably self-critical in car- 
tying out all of the statistical operations. One 
sees here very well in miniature the whole 
process of scientific investigation, in particular 
the exploratory part of science which consists 
of finding and naming variables * One of the 
great and marked talents of the author is the 
invention of new terms for aspects of reality 
once apprehended but never clearly discrimi- 
nated. * I do not find the creation of these new 
terms artificial as some seem to; one is given 
enough practice through well-chosen examples 
and dextrous repetitions of the terms so that 
they sink in. The reader can certainly be assured 
of a desirable expansion of his vocabulary. 
* Himself a chemist, physician, psychologist, 
psychoanalyst, litterateur (what stupendous 
greed for learning!), he has brought all his 
varied knowledge to bear on the research in 
“question. The large frame and scale of the 
effort, its competence in many diverse fields, its 
length and intensity, betoken a gifted individ- 
ual. For example, on the literary side, Chapter 
III begins with the following sentence: “Au- 
thors whose works are read with enjoyment 
cover the bare framework of their thought with 
prose that moves like muscle, employing lively 
images and graceful turns of speech to bring its 
contours to the semblance of palpitating life.” 
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There are few living writers in science, or out 
of it, who can turn out a sentence like that. 
There is much to admire in this book but it - 
misses the clear-cut impression necessary to 
greatness—it is a first step only. When, for ex- — 
ample, the authors finally put the personality 
of Earnst together, after taking so many snap- 
shots of it, it still does not, so to say, make a 
moving picture. One doubts if it really repre- 
sents a correct organization of the facts about 
the man. He does not come to life as the au- 
thors hoped he would (though they don’t seem 
to be deluded themselves on this point). How 
much has he managed to withhold? How differ- 
ent would he seem if one had about ten hours 
of interviewing with him where his cooperation 
was actually freely given because he wanted or 
needed to give it? How lacking is detail of 
actual behavior in natural situations? Might 
one or two telling incidents from daily life (such 
as any intimate could report) have given better 
predictive power as to his next move in any 
defined situation? There is a lot of useful seg- 
mental information about Earnst but the ac- 
count of him does not make sense. In the end 
the experience of enlightenment is still missing. 
Murray should go boldly on, cutting this wide 
swath, searching again and again the tremen- 
dous area he has trampled. He has demon- 
strated, as has no other; what is the essential 
business of a psychologist and has taken all the 
risks of attempting to exemplify the essential 
operations himself. He needs help of two kinds 
—two more skills under that capacious skull. 
He needs to know his modern learning theory 
much better than he does.. Though he uses the 
word “conditioning” frequently, he does not 
employ the major conceptions associated with 
this word. He makes no attempt, for instance, 
to show how the derived psychogenic needs 
arise from the primary physiological needs. 
Here learning theory would help particularly. 
This gap in theory is perhaps the most painful 
in the book. Of secondary, but considerable, 
importance is the necessity for knowledge of 
social structure in undertaking a research of 
this kind. Some very interesting correlations 
between specific needs and class background, 
mobility, and the like might have been forth- 
coming. The book is too big for a review which 
would really do it justice in detail. So the re- 
viewer has had to do what he has often deject- 
edly done—namely, spend a couple of weeks 
on a task and then drop it. 
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Arch Neurol & Psychiatry 44:1152-3 N *40. 
“This Book is Gratefully Dedicated by its au- 
thors to Morton Prince, who had the vision, 
raised the endowment and was the first director 
of the Harvard Clinic, to Sigmund Freud, whose 
genius contributed the most fruitful working 
hypotheses, to Lawrence J. Henderson, whose 
expositions of scientific procedure established 
a methodological standard, to Alfred N. White- 
head, whose philosophy or organism supplied 
the necessary underlying generalities, and to 
Carl G. Jung, whose writings were a hive of 
great suggestiveness." The reader may well 
speculate as to the writhings of these bedfel- 
lows and wonder why certain other deities were 
left out! Nevertheless, the list reveals the au- 
thor's bent and his eclecticism. His theme is 
that psychology should not lose sight of human 
nature as it operates in everyday existence; his 
belief is that progress can be made by conscien- 
tious clinical research and by seeking experi- 
mental evidence for the validity of certain 
general intuitions about human nature. “ Person- 
ology" is the name Murray gives to this branch 
of psychology, and he realizes fully that it “is 
still in diapers, enjoying random movements." 
In the beginnings of such a science exact meas- 
urements are impossible. One must choose the 
most probable hypotheses and test them by new 
methods. Most of the hypotheses chosen by 
Murray are freudian; his shrewd invention of 
methods to test them is the most important con- 
tribution of the book. The very fact that the 
hypotheses are recognized as such and are sub- 
jected to test is important, for psychoanalysts 
are too prone to accept them as proved “facts,” 
when the evidence is all clinical and is so volu- 
minous that it is never adequately presented. 
Murray, like Prince before him, has a flair for 
human psychologic experiment, and this book, 
as the title indicates, is the work of an explorer. 
* The introduction in chapter I gives in 35 
pages the best discussion one can find on con- 
temporary schools of psychology, their points of 
cooperation and their mutual misunderstand- 
ings. In chapter II Murray begins the exposi- 
tion of his own experimental approach, and at 
once his style becomes wary and, unfortunately 
somewhat involved. Pages of remarkably inter- 
esting and important ideas are sometimes con- 
fused by the introduction of new terms, quo- 
tations (though often apt and erudite) and 
parenthetic phrases. The three most important 
new terms seem justified and useful; they are 


"regnant," "press" and "need." * This is a 
pioneering book, difficult but important; it is 
one of the first appreciative, yet objectively 
critical, evaluations of psychoanalysis. 

J Nerv & Mental Dis 93:259-60 F '41. * 
Our first annoyance is to find the concept “the 
normal individual” utilized in the discussion. 
Since, to us, no such bird exists, we get a bit 
pouty and agree with the editor that the at- 
tempt indicated that the concept personality was 
still in diapers enjoying random movements. 
Yet something does come out of it, not as satis- 
fying for some as by the psychoanalytic psychi- 
atric approach, yet a series of formulations 
which embody the chief conceptions of the 
editor. Just why the editor should consider 
McDougall in the dynamic camp, other than by 
his own self assertion, is not plain to us. We 
also strenuously object to the sloppy concep- 
tion “insanity” in a work on psychology. It be- 
longs in the legal menagery of imaginary ani- 
mals. The editor has attempted too much, this 
mélange of dynamic psychiatry, behavioristic 
formulations, Jung, Freud, and Adler—empha- 
sized as to differences rather than similarities, 
philosophies these and those, etc., etc., etc. It 
is informing, very, but spread all over the lot, 
illustrative of the editor's erudition rather than 
directed towards a unified synthesis—the diffi- 
culties in finding which are amply observable. 
Again, why the dualism of viscerogenic and 
psychogenic which some very early biblical ob- 
servers suggested were one? And our chief an- 
noyance the descent into the prevailing political 
alphabetical filing cabinet system—thus, page 
87 (N Nox Sr Inf Sn Aff Sn Coq S.n.Exp.S.n. 
Agg.S.n. Ach.) all in very behavioristic terms 
which tell one very little outside of this particu- 
lar frame of reference. We feel that in general 
the trend to make a harmonious pie of the 
varied tendencies in mural and extramural psy- 
chology is laudable and worthy of much com- 
mendation, but it leaves the present reviewer 
trying to unscramble the omelette—to mix his 
metaphors as well as other things. Others we 
feel may be much more successful. 

N Y Times Book R 44:12 Mr 5 '39. Liv- 
ingston Welch. * To merely begin a study of 
this nature is not a step but a great advance 
in the right direction. If there is any hope of 
dissolving bias and water-tight compartment 
thinking from which psychology has suffered 
and if there is any chance of placing the more 
fluid techniques such as psychoanalysis on a 
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strictly experimental basis, it is along the lines 
that these men have traveled. * 

Q R Biol 15:504-5 D '40. * a detailed and 
technical account of the most original, thorough- 
going, and systematic attempt at a consistently 
scientific appraisal and understanding of human 
personality that has yet been made. It is a col- 
laborative effort of a group of persons * But 
to Murray, the planner, director, analyst, and 
synthesizer of the investigation clearly belongs 
the major part of the credit for the success and 
significance of the outcome. * The primary 
purpose of the study was methodological, and 
in this respect the net result is of first rate 
importance. A methodology has been devel- 
oped that, even though far from being yet per- 
fected, will yield rich and novel rewards to the 
psychologist and psychiatrist if it is consistently 
and intelligently employed. Specifically in the 
present case it has yielded a composite picture 
of the present day individual of college age that 
is clear-cut and illuminating within its defined 
boundaries. The book is so packed with techni- 
cal detail in both its theoretical and objective 
aspects as to make any summary of findings 

impossible in the space here available. But every 
student of human biology, whatever his special 
field of interest, will wish to read and study it. 
For it is a contribution belonging in the abso- 
lutely first rank of significance. There is in- 
cluded at the end a glossary of terms used, 
which the reader will find extremely useful, and 
a competent index, 

For additional reviews, see 40:B1026. 
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*AMorsELL, James L. Psychological Testing. New 
York: Longmans, Green and Co., Inc., 1947. Pp. xiv, 
449. $4.00. * (PA 22:521) 
Music Ed J 34:55 N-D '47. William R. Sur. 
* a scholarly and practical work * It is not a 
book which will be of interest to the larger 
number of school music teachers but will prove 
valuable to those with an interest in the entire 
testing movement. This work is written pri- 
marily for psychologists and others engaged in 
psychological testing as well as students of test- 
- ing. The material presented “covers both the 
basic theories of mental measurement and their 
practical application.” “It lists approximately 
one hundred of the typical and sounder tests 
available today, including those made by the 
Army during the war.” The chapter on Apti- 
tude Testing should be of interest to music edu- 
cators. The section of this chapter on Talent 


Tests includes a description of (1) Measures of 
Musical Talent, Seashore, and (2) Musical 
Memory Test, Drake. The author does not 
clearly indicate whether or not he selected these 
purely as types of talent tests or because he be- 
lieves these tests to be particularly sound. Dr, - 
Mursell states that his purpose was “to present 
a comprehensive and balanced account of the 
testing movement in psychology, its present 
status and its future prospects.” He has achieved 
his purpose and made a distinct contribution to — 
educational literature. 
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Human Engineering Laboratory, Inc., Technical Re- 
port, No. 174. Boston, Mass.: the Laboratory, June 
1045. Pp. 10. Paper, mimeographed. * 
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No. 2. Provincetown, Mass. : Journal Press, November 
1946. Pp. 129-231. Paper. $4.00. * (PA 21 21166) 
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TIONAL Measurements, The First Yearbook of the 
National Association of Teachers of Educational 
Measurements Containing the Papers of 1939- 
1941. Edited by Frank S. White. Fairmont, West Va. : 
the Association, Fairmont State College, 1941. Pp. iv, 
32. Paper, mimeographed. $0.50. * 
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XNATIONAL COUNCIL ON MEASUREMENTS UskD IN 
Epucation, Second Yearbook: Proceedings of San 
Francisco Meeting, 1942, Edited by Frederic Kuder. 
Educational and Psychological Measurements. [Fair- 
mont, West Va.: the Council, Fairmont State College, 
1943.]. Pp. iv, 20-42, 353-70, 401-8. Paper. Out of 


print. 
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XNaATIONAL Counci, ON MEASUREMENTS USED IN 
Epucation. The Third Yearbook of the National 
Council on Measurements Used in Education. 
Fairmont, W. Va.: the Council, Fairmont State Col- 
lege, 1946. Pp. v, 58, vi. Paper, mimeographed. $0.75. * 
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Epucation. The Fourth Yearbook of the National 
Council on Measurements Used in Education. 
Fairmont, W. Va.: the Council, Fairmont State Col- 
lege, 1947. Pp. vi, 62, vi. Paper, mimeographed. $1.00, * 
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UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
COUNSELING, GUIDANCE, AND PERSONNEL Work, Mar- 
caret E. Bennett, CHAIRMAN. Counseling, Guid- 
ance, and Personnel Work. Review of Educational 
Research, Vol. 15, No. 2. Washington, D. C.: the As- 
sociation, April 1945. Pp. 97-192. Paper. $1.00, post- 


paid. * 
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GROWTH AND DEVELOPMENT, AmTHUR T. Jersio, 
CHAIRMAN. Growth and perepnamt Review of 
Educational Research, Vol. 11, No. 5. Washington, 
D. C.: the Asociation, December 1941. Pp. 475-618. 


Paper. $1.00, postpaid. 
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X*NATIONAL EDUCATION ASSOCIATION, AMERICAN ED- 
UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
MENTAL AND PHYSICAL DEVELOPMENT, Fowrer D. 
Brooks, CHAmMAN. Mental and Physical Develop- 
ment, Review of Educational Research, Vol. 6, No. 1. 
Washington, D. C.: the Association, February 1936. 
„Pp. 152. Paper. $1.00, postpaid. * 
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JoNes, CHAiRMAN. Mental and Physical Develop- 
ment. Review of Educational Research, Vol. 9, No. 1. 
Washington, D. C.: the Association, February 1939. 
Pp. 141. Paper. $1.00, postpaid. * 
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"UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
METHODS or RESEARCH AND APPRAISAL IN EDUCATION, 
DoucLAs E. Scares, CHAIRMAN. Methods of Re- 
search and Appraisal in Education. Review of Edu- 
cational Research, Vol. 12, No. 5. Washington, D. C.: 
the Association, December 1942. Pp. 455-596. Paper. 
$1.00, postpaid. * 
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X*NATIONAL EDUCATION ASSOCIATION, AMERICAN ED- 
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InviNc Lorce, Cuairman. Methods of Research and 
Appraisal in Education, Review of Educational Re- 
search, Vol. 15, No. s. Washington, D. C.: the Asso- 
ciation, December 1945. Pp. 333-468. Paper. $1.00, post- 
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X*NATIONAL EDUCATION ASSOCIATION, AMERICAN ED- 
UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
PsvcuotoorcAL Tests, ARTHUR E. TRAXLER, CHAIR- 
MAN, Psychological Tests and Their Uses. Review 
of Educational Research, Vol. 11, No. 1. Washington, 
D. C.: the Association, February 1941. Pp. 132. Paper. 
$1.00, postpaid. * 
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XNATIONAL EDUCATION ASSOCIATION, AMERICAN Ep- 
UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
PsvcuoLoorcat Tests, Warren G. FINDLEY, CHAIR- 
MAN. Psychological Tests and Their Uses. Re- 
view of Educational Research, Vol, 14, No. 1. Wash- 
ington, D. C.: the Association, February 1944. Pp. 128. 
$1.00, postpaid. * 
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XNaTIONAL EDUCATION ASSOCIATION, AMERICAN En- 
UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
PsvcHorocicat Tests, Herpert S. CONRAD, CHAIR- 
MAN. Psychological Tests and Their Uses. Review 
of Educational Research, Vol, 17, No. 1. Washington, 
D. C.: the Association, February 1947. Pp. 128. Paper. 
$1.00, postpaid, * 
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XNATIONAL EDUCATION ASSOCIATION, AMERICAN Ep- 
UCATIONAL RESEARCH ASSOCIATION, COMMITTEE ON 
Puri. PERSONNEL, GUIDANCE AND COUNSELING, RUTH 
SrRANG, CHAIRMAN. Pupil Personnel, Guidance, 
and Counseling. Review of Educational Research, 


Vol. 12, No. 1, Washington, D. C.: the Association, 
February 1942. Pp. 136. Paper. $1.00, postpaid. * 
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X*NATIONAL INDUSTRIAL CONFERENCE Boarn, Inc, Em- 
loyee Rating: Methods of Appraising Ability, 
fficiency, and Potentialities. National Industrial 

Conference Board, Conference Board Reports, Studies 

in Personnel Policy, No. 39. New York: the Board, 

1942. Pp. 119. Paper. Prices on application, * 
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XNATIONAL OPINION RESEARCH CENTER, INTERVIEW 

DEPARTMENT. Interviewing for NORC. Denver, 

Colo.: the Center, 1945. Pp. ix, 154. $2.00. * (PA 20: 

3228 
J Appl Psychol 30:659-60 D '46. Philip H. 

Kriedt. * a manual for the 200 part-time inter- 

viewers who work for NORC in all parts of the 

country. * Although the volume deals specifi- 
cally with NORC problems, some sections are 
of general interest to anyone who is engaged 
in opinion poll interviewing. This is especially 
true of the section called “How to Get a Good 
Interview." The authors have attempted the. 
very difficult task of teaching an interviewing | 
technique which is objective, standardized, free 
from interviewer bias, and at the same time, 
informal and capable of securing more than su- 
perficial responses. * Other sections of the book 
dealing with such problems as quota controls, 
factual data, rural interviewing and telegraphic 
polls do not offer much information of general 

interest, although the material is presented in a 

readable manner. To the best knowledge of the 

reviewer, no other similar manual for opinion 
poll interviewers is now available to the public. 

NORC is to be commended for filling this need 

and for doing a workmanlike job. 
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NATIONAL Soctety rog THE STUDY or EDUCATION, ` 
COMMITTEE ON “INTELLIGENCE: ITs NATURE AND NUR- 
ture,” GEORGE D. STODDARD, CHAIRMAN, Intelligence: 
Its Nature and Nurture: Thirty-Ninth Yearbook, 
Parts I and II. Edited by Guy Montrose Whipple. 
Bloomington, Ill. : Public School Publishing Co., 1940. 
Pp. xviii, 471; xviii, 409, xxxvii. Cloth, $3.00 per vol- 
ume; paper, $2.25 per volume, * 

Ed Trends 8:36-8 N-D ’4o. A. A..Campbell. 
* The thirty-odd original studies presented in 
these books provide no exact formula for the 
prediction of the individual 1.Q. but they do 
illuminate more clearly certain areas of our 
knowledge in this field. Perhaps the most im- 
pressive paper of this series is that of Nancy 
Bayley, a nine-year longitudinal study of 48 
children, tracing the development of the I.Q. * 
It must be confessed that these books will be 
a sorry disappointment to anyone who expects 
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to find in them a clearly stated and soundly 
proved exposition of the relation of mental test 
ability to nature and nurture. On the contrary 
he is apt to feel that the picture has now be- 
come so obscured that psychologists had better 
wipe the slate clean and start all over again. 
However, confused as the whole field may seem, 
progress has unquestionably been made if only 
in a better understanding of the problems in- 
volved. We no longer think of heredity and 
environment as independently contributing cer- 
tain amounts to a total supply of intelligence. 
The: well-nigh universal tendency to reify 
"intelligence" in terms of the I.Q. is passing. 
Mental testers have become increasingly cau- 
tious regarding the instruments which they 
employ as the susceptibility of these tools to 
various influences has become more evident. 
They talk less about intelligence and more about 
behavior-called-intelligent. These things are all 
to the good, and they have been brought about 
by the same kind of apparently exacerbated dis- 
pute which characterizes the present Yearbook. 
The final answer is not at present available 
and one wonders if it ever will be in any exact 
form. The potential range of mental test ability 
set by genetic factors is apparently relatively 
wide, sufficiently so that post-natal influences 
may be of considerable importance. It seems 
probable that this range of possible develop- 
ment is not the same in all individuals and is 
obviously very narrow in some. Those who feel 
inclined to impatience with these attempts to 
arrive at a more definite conclusion regarding 
the nature-nurture controversy should remem- 
ber the fact that, unlike those disciplines whose 
research consists in the investigation of non- 
human material, psychology frequently finds it- 
self unable to do the clearly necessary experi- 
ments because it does not have a free hand with 
the human subjects with which it deals. This 
delimitation often arises in the most crucial 
areas of interest and is nowhere better illus- 
trated than in the attempt to identify the fac- 
tors determining mental development. It is ob- 
vious that given carte blanche over the lives 
of a sizable number of infants (including identi- 
cal twins) research workers could produce a 
most interesting dossier of information. Unfor- 
tunately these studies cannot be done; society 
is not as accommodating to the psychologist as 
the supply house is to the chemist. The result 
is that we do what we can and the conclusions 
are not always completely satisfying. 


Eug News 26:13-7 Mr '41. Gladys C. 
Schwesinger. * a second earnest attempt on the 
part of serious-minded workers, editors and 
sponsors to air the whole problem, and to crys- 
tallize the accumulated knowledge that is still 
floating in solution, and at times apparently 
even disintegrating * A striking feature is the 
inclusion also of very forthright and detailed 
criticism of the methods, findings and interpre- 
tations of the various contributors, one by the 
other. The reports are highly technical. Part I 
is concerned chiefly with theory, Part II with 
experiment and research. The books make far. 
from easy reading. One must be fired with con- 
siderable fervor to persist through to the end 
of their total of 880 pages, plus appendices. One 
is therefore indebted to the indefatigability of 
the contributors who gave of their energy, pro- 
fessional skill and insight, as well as right good 
will and feeling. * The research approach is no 
longer one of attempting to discover what per 
cent is due to nature and what per cent to nur- 
ture, but rather what is the effect of a change 
in cultural environment on the Intelligence 
Quotient. * But what conclusions can be drawn 
from this accumulation of new studies? Taken 
as a whole, and assuming equal validity for all 
of the contributions, the answer, it must be ad- 
mitted, will still lie buried in provocative con- 
fusion and contradiction. From these many 
presentations, one is not able to deduce with 
any greater finality than before that such and 
such a specific change in environment, for such 
and such a designated group, will result in so 
and so many points of change in I.Q. But what 
the reader will be able to do is to develop a 
keener appreciation of the hazards and fallacies 
inherent in nature-nurture data and their fair 
interpretation. From the many analyses offered 
here, he should be able to extract a set of cri- 
teria for evaluating a nature-nurture research 
study, and by applying these criteria, to accept 
or reject those contributions which do not meet 
the test, at least in the Main, of his criteria. * 
the present reviewer....emerges more deeply ad- 
herent to the coastal point of view—namely that 
environmental influences are no more powerful 
in 1940 than they were in 1928 to boost the I.Q. 
beyond the limits set by original nature (given 
normal opportunities for mental development) * 
The most sensational of the Iowa claims, such 
as the one that feebleminded children (diag- 
nosed so at 17 months) can be made over into 
normal-minded when given into the care of 
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moron nursemaids, or that children of feeble- 
minded mothers, if placed early in good foster 
homes, can develop above-average ability, easily 
drop out of the controversy as evidence. Sixteen 
cases are not enough on which to base a general- 
ization; old Stanford-Binet tests at the adult 
levels would ratethe unmarried mothers too low ; 
tests given to pre-school children,—aged in this 
case two years six months at testing, —would 
rate their babies too high. Also in good profes- 
sional child-placement work in America, un- 
promising babies are not offered for adoption, 
thus pointing to a selected group in the sam- 
pling. At an average age of six, the average 
LQ. of the children had dropped to 107.6, 
which reflects a regression to the mean, or a 
change toward greater reliability of the testing ; 
incidentally also, it removes the children from 
superior to average classification! The ex- 
pressed implication is that physically normal 
offspring of feeble-minded mothers can be 
placed early in adoptive homes without risk. 
This suggestion contradicts the crystallization 
of years of experience in social work. Well 
authenticated cases in which this procedure has 
resulted in tragedy are far too common in the 
experience of placement agencies to leave the 
sophisticated reader anything but cold to an 
approach which seems to urge a repetition of 
the mistakes of the past. * The Iowa mothers 
may have been potentially normal, and ipso 
facto, the hereditarian would be the last to ex- 
press surprise that their children should turn 
out normal. Unless one can establish the genetic 
basis of feeblemindedness in the mothers, one 
cannot very well use them as material for 
nature-nurture argumentation. All the relevant 
genealogical and developmental data on moth- 
ers and forbears should be presented. * Results 
of nature-nurture investigations have much 
more than academic interest: new discoveries 
and conclusions, if established, will cast more 
than shadows on educational and social services. 
They can in fact lead to a revolution in practice. 
That is why the research in this field must go 
on in spite of the tremendous amount of time, 
money and critical understanding required. As 
our measures, techniques of analysis, and scien- 
tific insight accrue, more finality can be ex- 
pected in the answers which emerge. Today we 
are not able to answer some of the questions as 
originally stated; but we do have answers for 
relevant questions. One is that wide individual 
differences still persist in spite of the most en- 


riched environment. Another is that previous 
mental tests given to children under school age 
have little predictive power. Recent revisions of 
both the Stanford and Kuhlmann Binet tests 
are much more promising, and already show 
positive improvement. We know also that future 
investigations must be experimentally designed 
and the variables more rigorously controlled 
preceding the event of adoption, special school- 
ing, or whatever environmental experience is 
being investigated. Already definitions, concepts 
and technical approaches have been rather 
sharply delineated, and statisticians have given 
serious attention to valid methods of analyzing 
and interpreting data. The crucial question for 
the future seems to narrow down to one of the 
stability or permanence of improvement in I.Q., 
when obtained, and its expression in life achieve- 
ment. On this point, nearly three decades of 
mental test usage will be available. Children 
tested before 1920 are now approaching or are 
already in their middle ages. Forthcoming con- 
tributions in the 1940's should be revealing, not 
only with respect to gifted children but also for 
those who have average mental endowment. 

Sci Ed 25:170 Mr "41. E(lliott) R. D(own- 
ing). * seems particularly valuable, for it criti- 
cally examines earlier studies that have come 
to contradictory conclusions to see wherein the 
methods have been faulty or the materials in- 
adequate. The whole matter is much more com- 
plex than was at first realized. * It marks a 
distinct forward step and that a large one. But 
investigators in this field must heed its cau- 
tions, otherwise more studies will pile up with 
unreliable conclusions. 

For additional reviews, see 40:B 1034. 
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NATIONAL SOCIETY FOR THE STUDY or EDUCATION, 
COMMITTEE ON THE MEASUREMENT OF UNDERSTAND- 
ING, WiLLraM A, BrowNELL, CHAIRMAN. The Meas- 
urement of Understanding: The Forty-Fifth 
Yearbook, Part I. Edited by Nelson B. Henry. Chi- 
cago, Ill.: University of Chicago Press, 1946. Pp. xi, 
338. Cloth, $3.00; paper, $2.25. * (PA 21:1272) 

J Ed Psychol 37:505-6 N '46. J. B. Stroud. 
* much more than a collection of suggestions 
and illustrations for evaluation purposes. The 
teacher's first task is to bring the student to an 
understanding; measurement of understanding 
is secondary, except as it is envisaged as an 
integral part of the teaching of understanding. 
This volume should be an aid to the teaching 
of understanding as well as to its measurement. 
It is not for the expert in measurement, but 


= 


E 


for the teacher. * contains discussions and many 
helpful illustrative methods of measuring—at 
least of obtaining evidence of—understanding * 
The chapters of Section II vary in quality, as 
“is to be expected. In the writer’s opinion, they 


. vary.from fairly good to excellent. * Each of 


the chapters starts with a listing of the recog- 
nized teaching objectives. In the main, these 
are taken from pronouncements of committees 
of national organizations or other societies 
whose aim has been to foster good teaching in 
the various areas. Subsequently, testing pro- 
cedures and other means of evaluating mean- 
ingful learning with respect to each of the ob- 
jectives are presented. Subject to limitations in 
evaluative procedures the methods of measuring 
understanding should be as valid as the objec- 
tives. The committee is to be congratulated 
upon the securing of this synthesis of measure- 
ment and teaching objectives. The authors have 
made liberal use of techniques other than ob- 
jective tests. In some chapters, for example the 
fine arts, objective tests are scarcely used at all. 
In others, for example science, considerable 
use is made of the better type of objective tests, 
In all, transfer—by which is here meant the 
ability to apply information, principles, pro- 
cedures, etc., learned in one situation in attack- 
ing other relevant situations—has been one of 
the guiding principles. This volume should have 
wide use in professional courses in education. 
It should also prove helpful to classroom teach- 
ers generally. 
[ 1064 ] 
*NeLson, M. J. Handbook of Educational Psy- 


chology and Measurement. New York: Dryden 
Press, Inc., 1941. Pp. vi, 174. $135. * (PA 1 5:4805) 


Ed Forum 6:438 My '42. Unique in its con- 
ception, this book is of value both to the begin- 
ning student and the instructor. A dictionary of 
psychological terms (seventy-five pages) is the 
heart of the book. Approximately an equal 
amount of space is given to summarizing the 
essentials of psychology. As each new term is 
used in the text, an asterisk indicates that a 
definition may be found in the dictionary sec- 
tion. Approximately twenty pages of references 
may be found in an annotated bibliography. 
References in the dictionary section are made 
to a limited number of standard authors. The 
definitions are cross-referenced. 

Loyola Ed Digest 17:9 Ja '42. Austin G. 
Schmidt. * should be useful both for preview 
and for review purposes. 
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[ 1065 ] * 
*Nepove, Fay. Index of Words Used in Person- 
ality Tests Worksample 35. Human Engineering 
Laboratory, Inc, Technical Report No. 335. Boston, 
Mass.: the Laboratory, December 1946. Pp. 19. Paper, 
mimeographed. * 

[ 1066 ] 
XNzw York PunLic SCHOOLS, BUREAU or REFERENCE, 
RESEARCH AND STATISTICS, Division OF TESTS AND 
Measurements. Appraisal of Growth in Reading. 
Educational Research Bulletin of the Bureau of Refer- 
ence, Research and Statistics, No. 2, New York: the 
Bureau, November 1941. Pp. v, 42. Paper. $0.15. * (PA 
16:2091) 

[ 1067 ] 
X*Nrw York PusLIc ScHooLs, BUREAU or REFERENCE, 
RESEARCH AND Statistics, DIVISION or TESTS AND 
Measurements, Determining Readiness for Read- 
ing. Educational Research Bulletin of the Bureau of 
Reference, Research and Statistics, No. 6. New York: 
the Bureau, September 1943. Pp. vi, 49. Paper. $0.15. * 


J Ed Psychol 35:356 Ap '44. Miles A. 
Tinker. * a guide to teachers and supervisors in 
the selection and use of records and tests for de- 
termining reading readiness of first-year pupils. 
* The authors have selected adequately the 
pertinent material from the mass of data avail- 
able and presented a well-organized and clear 
discussion. Throughout there is emphasis upon 
practical classroom situations. The section on 
interpretation (“Putting together the facts 
about reading readiness"), however, might 
have been expanded to advantage, since the 
ordinary teacher is apt to be more inadequate 
in this aspect of the reading readiness program. 
Helpful additions might include: (1) more 
Stress on interpretation of irregular patterns of 
growth, (2) the use of profile charts of abili- 
ties, (3) the inclusion of a few case studies. In 
Some instances the discussion seems too brief, 
ie, too much like an outline, to be most effec- 
tive. It seems to the reviewer that undue stress 
is given the róle of motor development. On 
page 41, where development of 'visual acuity' is 
mentioned, undoubtedly visual discrimination 
is meant since visual acuity is not susceptible to 
training. The bibliography of tests and scales 
would become more useful if material on reli- 
ability and validity were included. This bulletin, 
like earlier ones in the series, will be welcomed 
by teachers of reading. 


[ 1068 ] 

*New York Puric ScHoors, BUREAU oF REFERENCE, 
RESEARCH AND Sratistics, DIVISION or TESTS AND 
MEASUREMENTS. A Supplementary Guide for Scor- 
ing the Revised Stanford-Binet Intelligence 
Scale, Form L. Division Bulletin No. 1. New York: 
ee ee May 1941. Pp. 27. Paper. $o.15. * (PA 
16:363 
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* [ 1069 ] 

X*Nrwnuanr, Horace, AND Recer, Scorr N., Eprrons 
Syllabus of Audiometric Procedures in the Ad- 
ministration of a Program for the Conservation 
of Hearing of School Children. Supplement to the 
Transactions of the American Academy of Ophthal- 
mology and Otolaryngology. Prepared for the Commit- 
tee on the Conservation of Hearing, the Academy. 
Rochester, Minn.: the Academy, 100 First Ave. Bldg., 
April 1945. Pp. 28. Paper. $0.25 (free to public officials, 
medical and educational institutions). * 


[ 1070 ] 

*NewMan, Frances Burks. The Adolescent in So- 
cial Groups: Studies in the Observation of Per- 
sonality. Introduction by Harold E. Jones. Applied 
Psychology Monographs of the American Psychologi- 
cal Association, No. 9. Stanford, Calif.: Stanford Uni- 
versity Press, March 1946. Pp. 94. Paper, lithotyped. 
$1.25. * (London: Oxford University Press. 8s.) (PA 
20:2950) 

Am J Psychol 60:148—50 Ja '47. Donald E. 
Super. * describes the development of the rat- 
ing scales used in studying the personality 
growth of adolescents. It is, to be sure, a meth- 
odological study, and such reports, especially 
when they involve rating scales, are not likely to 
challenge even their professional public. But this 
report does. There are several reasons why 
these studies have special significance. First, as 
Harold Jones points out in the introductory 
chapter, “considerable attention is given to ap- 
praising the usefulness of these techniques, to- 
gether with other types of observational records, 
in depicting individual development, and also 
in studying the characteristic changes which 
occur during adolescence in the behavior of 
social groups." Again, this use of rating scales 
in a real context, combined with other supple- 
mentary techniques, makes them not only more 
interesting to read about but also more mean- 
ingful for diagnosis. Not only the improved 
types of rating scales challenge the reader. The 
method with which they were used also chal- 
lenges him. * To the present reviewer, it is... 
[the] merger of the observational techniques of 
psychology with those of cultural anthropology 
and sociology which—with or without design— 
took place in this study that is significant and 
valuable. * In her conclusions the author lays 
down certain principles followed in her use of 
observational techniques and of rating scales 
which should quickly find their way into the 
literature. It is a long time since textbook writ- 
ers have had anything new to say on these nor- 
mally sterile subjects; they should welcome this 
basis for a change of material and of point of 
view. In his introduction Jones remarks that 
‘Practice in field observation constitutes a de- 


sirable part of the training of clinical psycholo- 
gists.” This is especially true now that a method 
of gathering and recording dynamic observa- 
tional data has been perfected. The methodol- 


ogy described in this monograph should there- Dd 


fore be known, not only by developmental but. 
also by clinical psychologists. Vocational psy- 
chologists as well, who already know these 
techniques from sociological studies of industry, 
will find this more scientific treatment of the 
method rewarding. 

Brit J Psychol 37:109-10 Ja '47. * describes 
an observational study of between 150 and 200 
adolescents of both sexes. * Great use was made 
of rating scales in the recording of observed 
behaviour. These scales were of two kinds: (a) 
composite, in which paired traits such as 'talka- 
tive-silent’, ‘peppy-indifferent’, etc., were to be 
rated as ‘very marked’, ‘marked’ or ‘average’ 
(subsequently rated numerically from 1 to 5)5 
(b) integral, in which behaviour categories 
were rated from 1 to 7. The integral scales in- 
cluded such categories as "Interest in Opposite 
Sex’, ‘Grooming Activity’, ‘Popularity’, ‘Social 
Self-Confidence’, etc. In assigning ratings, the 
judges were asked to keep in mind a normal 
distribution among the scale points. From the 
ratings made, agreements between raters were 
examined by calculating product-moment corre- 
lation coefficients and ‘raising’ the resultant co- 
efficients by the Spearman-Brown formula. The 
whole of this statistical work is exceedingly 
dubious. The forcing of trait ratings into a 
normal scale is most artificial. In what way, for 
example, can fluctuating traits such as ‘talka- 
tiveness’, ‘animation’, ‘eagerness’ and the like, 
be regarded as normally distributed in a rela- 
tively small group of adolescents? Further, the 
underlying assumption of the Spearman-Brown 
formula, that errors of observation are uncor- 
related, seems most unlikely to be true when 
the observers form a small group with similar 
aims and holding frequent conferences to dis- 
cuss their observations. In addition, the ob- 
servers made descriptive records of behaviour 
which were used to supplement the formal rat- 
ings. These were used in attempts to clarify the 
situation when raters disagreed. It is said (p. 
45): "It may be felt that where marked dis- 
agreements are possible, it is inappropriate to 
include the item in question in a rating scale. 
On the other hand, if we keep clearly in mind 
that some characteristics are less easily rated 
than others, and some individuals less easily 
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rated than others, we may find it possible to use 
even the less reliable ratings as providing tenta- 
tive descriptions of behaviour and guides to 
further observation." How far such considera- 
tions were allowed to influence the actual rat- 
ings (which, from the method quoted, must 
have been made retrospectively) it is impossible 
to gather. So much for the methods adopted in 
this study: what of the results? These the 
reader will hunt for with growing bewilder- 
ment. He will find no description of fundamen- 
tal tendencies in adolescent behaviour, no infor- 
mation that parent or teacher may use as a 
guide, and no conclusions about the members 
of the actual group studied. The reader will, in 
fact, find it very difficult to divine the reasons 
for publishing this pamphlet. The writing 
throughout is prolix and involved to a degree. 
J Consult Psychol 11:109 Mr-Ap '47. Dale 
B. Harris. * the monograph does not describe 
the adolescent so much as a method of studying 
him. As a treatise on methodology, it is an im- 
portant contribution to the literature of person- 
ality measurement by the rating scale procedure. 
Certain of the more significant findings include 
the following: Ratings of social behavior can 
be made quite consistently by different observers, 
provided the behaviors are very clearly de- 
scribed. Agreement among raters with respect to 
qualities not requiring simultaneous observation 
increases in relation to the number of situations 
represented. The conference technique, pooling 
the comments and notes of all observers, case 
by case, for a given experience or period of 
time, provides a great deal more information 
than the summation of individual ratings af- 
fords. The use of “comment sheets" along with 
rating scales frequently throws light on inter- 
rater discrepancies. A study of discrepancies 
also indicates that adolescent subjects differ 
appreciably in “ratability” and that these differ- 
ences are fairly stable over a period of years—a 
finding which merits the attention of clinicians 
and others concerned with evaluating personal- 
ity. Although consecutive ratings to delineate 
personality development is not a new procedure, 
this monograph is perhaps the first able experi- 
mental justification of the method. The author 
recommends that the averaging or combining of 
ratings be limited to those made within a cir- 
cumscribed time period, to avoid .concealing 
changes in personality. * The methods described 
in this study will not be practical for most clini- 
cians. Psychologists who have continuing con- 


tacts with subjects in somewhat more varied 
environments than the examining or interview- 
ing room (such as in the school room, the 
boarding school, the detention home, and the 
like) can use with profit the more generalized 
or summary rating techniques herein described. 
Rating methods have limitations, of course. 
Trait ratings give little knowledge concerning 
unique organization of qualities in individual 
cases, provide no basis for studying the ethical 
judgments and cultural codes of adolescents or 
other age groups, and can scarcely delineate 
changes in the reaction of different age levels 
to changing environmental demands, But these 
methods should not be overlooked by students 
of personality dynamics. It would be entirely 
feasible to evaluate with flexible rating proce- 
dures behavior evoked by projective methods. 
Reliability of observation would probably be 
increased without too great loss of clinical in- 
sight. In these days of forced-draft training of 
clinicians, teachers and other personal-service 
workers, it is well to note the author’s insist- 
ence on the training value of systematic ob- 
servation. While there is probably no royal 
road to insight into personality, systematic 
experiences in observation of behavior such as 
here described should do much to reduce the 
period of fumbling which the clinical novice 
inevitably experiences. 


[ 1071 ] 
* Newman, JosePH. The Prediction of Shopwork 
Performance in an Adult Rehabilitation Program: 
The Kent-Shakow Industrial Formboard Series. 
Psychological Record, Vol. 5, No. 12. Bloomington, 
Ind.: Principia Press, Inc, March 1945. Pp. 343-52. 
Paper. $0.35. * (PA 19:2339) 

[ 1072 ] 
Noste, M. C. S. Jr. Practical Measurements for 
School Administrators. Scranton, Pa.: International 
Textbook Co., 1939. Pp. xix, 330. $2.50. * (PA 14:2108, 
title only) 


Am Sch Board J 100:88 My '40. This book 
skims the surface of practical mathematics as 
applied to the measurement of school facts. * 
There may be considerable disagreement con- 
cerning the validity and utility of measuring 
teacher efficiency, of measuring intelligence, and 
of using statistical interpretations for judging 
the ultimate values of many aspects of school 
services. For ordinary reporting purposes and 
for making comparisons, the procedures and 
the types of materials suggested by the author 
are unquestionably the most widely used and 
practical. 
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Nation's Sch 26:97 S ' 40. Extremely elemen- 
tary text on applied statistical measurements 
for school administrators. Would be more effec- 
tive if it covered the entire field and if the 
author had used more extensive source material. 

For additional reviews, see 40:B 1037. 


[ 1073 ] 

%kNosie, Mary Atrrep. Factorial Differentiation 
by Maximal Differences. Catholic University of 
America, Studies in Psychology and Psychiatry, Vol. 
4, No. 6. Washington, D. C.: Catholic University of 
America Press, 1940. Pp. vii, 40. Paper. $1.00. * (PA 
14:4942) ene 
*Nortuway, Mary L. Appraisal of the Social De- 
velopment of Children at a Summer Camp. Uni- 
versity of Toronto Studies, Psychology Series, Vol. 5, 
No. 1. Toronto, Canada: University of Toronto Press, 
1940. Pp. 62. $1.25. * (PA 15:604) 

Sociometry 3:309-11 Jl "40. Joseph Sargent. 
* excellent study * shows how sociometric tech- 
niques have been used in a modified form to 
appraise the social development of some fifty 
girls at a summer camp * describes how the test 
was explained to the entire group and how, 
(apparently immediately thereafter) the camp- 
ers filled out their questionnaires. These called 
for three choices for cabin mates, four choices 
for companions on a canoe trip (of from one to 
eight days), one choice for companion in each 
of five other camp activities and one choice for 
"tribal leader", "tribes" being four activity- 
groups into which the campers were divided. In 
addition, each girl was asked to list her last two 
choices for cabin mate and to name the girl she 
would like least to have on her canoe trip. The 
cabin counsellors were asked to list the girl in 
their respective groups most popular and least 
popular with her cabin mates and in the camp 
at large and to name the leaders in the cabin 
groups and in the camp at large. The instructors 
in the various camp activities were asked to list 
the four best and the four worst campers in the 
activities over which they had charge and to 
name the leaders in the activity. These particu- 
larly intelligent questionnaires covered the 
whole range of the camp’s social situations and 
provided the data for an accurate appraisal of 
the social structure of the camp. It is to be 
especially noted that the counsellors and the 
instructors were asked for information against 
which the spontaneous choices of the girls them- 
selves could be checked. Particularly praise- 
worthy, in the opinion of this reviewer, was the 
manner in which the rejections were determined. 
The campers were asked to note their last 


choices, not to state whom they rejected. This 
seems an unusually tactful method of determin- 
ing rejections, which almost invariably present 
a methodological problem. * Northway points 
out the inaccuracy of the "point" system of 
ranking the choices, but has nevertheless used 
it as being the best method at hand. She also 
comments on the limitations implicit in allow- 
ing only a specific number of choices. Perhaps, 
on another occasion, we may read a report of 
an experiment by Dr. Northway in which there 
is no limit to the number of choices and in 
which another and less arbitrary method of 
gauging the intensity and value of a choice (per- 
haps time) will be used. The study includes 
two sociograms of the camp, one based on the 
choices and the other on the rejections for 
cabin mates; both are taken from the results of 
the second test. * Northway, in her discussion 
of the technique and its use in other educational 
settings which forms Part III of her mono- 
graph, calls attention to the limitations of the 
technique. It might be suggested that these 
limitations lie, to a certain extent, in the inves- 
tigator's hesitancy to draw any conclusions from 
the tests excepting those concerning “accepta- 
bility within a group” (as would seem to be 
indicated by this study), and in the wording of 
the questionnaire. It is quite possible to set 
questions in such a way that the subjects will, 
unknowingly and with complete spontaneity, 
disclose much of the information about them- 
selves which Dr. Northway feels is lacking in 
their responses in this particular case. She has 
made the matter of rejections more acceptable 
than usual by means of adroit phrasing; fur- 
ther experiments along this line should lead to 
a broadening of the scope of the sociometric 
test. 

[ 1075 ] 
*Nortuway, Mary L.; FRANKEL, Estuer B.; AND 
Porasnin, Reva. Personality and Sociometric 
Status. Sociometry Monographs, No. 11. Reprinted 
from Sociometry, Vol. 9, 1946. New York: Beacon 
House, 1947. Pp. 73. Paper. $2.25. * (PA 21 11200) 

[ 1076 ] 
XNovzs, Epwarp SIMPSON ; SALE, WILLIAM MERRITT, 
JR; AND STALNAKER, JoHN MARSHALL. Report on 
the First Six Tests in English Composition: With 
Sample Answers From the Tests of April and 
June 1944, Princeton, N. J.: College Entrance Ex- 
amination Board, 1945. Pp. 72. Paper. $0.50. * 

[ 1077 ] 
*O’Connor, JOHNSON. Aptitudes and the Lan- 


guages. A revision of Uniform Distribution of Apti- 
tudes in Schoolwork. Boston, Mass. : Human Engineer- 


ing Laboratory, Inc., 1944. Pp. x, 115. $2.00. * 
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[ 1078 ] 
XO'Coxwon, JouxsoN. Ideaphoria. A revision of An 
Objective Approach to Group Influencing Fields pub- 
lished in 1937. Boston, Mass.: Human Engineering 
Laboratory, Inc., 1945. Pp. xviii, 116. $4.00. * 

[ 1079 ] 
XO'CoxNom, Jonnson. Structural Visualization. 
Boston, Mass.: Human Engineering Laboratory, Inc., 
1943. Pp. xiv, 182. $4.00. * 

[ 1080 ] 
*O’Connor, Jouwsow. The Too Many Aptitude 
Woman. Boston, Mass.: Human Engineering Labora- 
tory, Inc., 1941. Pp. xii, 185. $4.00. * (PA 18:3633) 

[ 1081 ] 
XO'Cowwom, JouNson. Unsolved Business Prob- 
lems. A revision of Characteristics Common to Execu- 
lives published in 1935. Boston, Mass.: Human En- 
gineering Laboratory, Inc., 1940. Pp. x, 159. $4.00. * 

[ 1082 ] 
XOr»rirp», RrcHamp C. Psychology of the Inter- 
view. London: Methuen & Co. Ltd., 1941. Pp. 159. 5s. 
(PA 15:4776) 


Brit J Ed Psychol 12:78 F '42. The author 
of this very interesting little book became in 
1937 the first holder of a Leverhulme Student- 
ship allocated to the National Institute of In- 
dustrial Psychology. This research subject was 
“The form of the interview best suited for the 
assessment of personal qualities," and this fol- 
lowed previous investigational work on cognate 
subjects at the Cambridge Psychological Labo- 
ratory. This book is not concerned with the 
experimental results of these researches, but 
with the preliminary non-experimental observa- 
tional work necessary. Mr. Oldfield....collected 
much information which was subjected to criti- 
cal and systematic analysis and he is led to the 
conclusion “that improvements in the interview 
are to be sought not by a rigid standardisation 
of technique, but by the attempt to discover the 
nature of, and the conditions affecting, the chief 
mental processes that are at work." This psy- 
chological examination is carefully and scientifi- 
cally assessed in the book. * The real core of 
the book is probably in the analysis of the term 
'attitude' and in the author's views as to its 
importance in “providing the essential basis of 
the interviewers' judgment of the personal 
qualities of the candidate." This portion of the 
book will be criticised by some readers and 
probably the author will wish to clarify his own 
ideas more completely, but the whole work is 
an interesting and stimulating introduction to 
an important and neglected problem in social 
psychology. One looks forward with pleasure 
and anticipation to the publication of the fuller 


results of Mr. Oldfeld's researches. In the 
meantime, interviewers and psychologists will 
find much of value in this volume (to which 
Dr. C. S. Myers contributes an appreciative 
foreword); the former may be encouraged to 
think about their technique and "to perceive 
clearly the psychological implications and con- 
ditions of what comes to their notice"; the 
latter will find suggestions for experimentation 
and research. 

Brit J Psychol 32:176-7 O ’41. This book 
should perhaps be more accurately entitled "The 
Psychological Analysis of the Interview,’ since 
this task of analysis it performs most admirably. 
Tt was indeed opportune to point out that an 
interview is not an interchange of stereotyped 
question and answer, followed by the formation 
of an equally stereotyped assessment in set terms 
of the nature of the candidate's personality or 
fitness for an occupation. Mr. Oldfield stresses 
the fact that the interview is essentially an in- 
terplay of attitudes between interviewer and 
candidate. The former, by a manipulation of his 
own attitudes (of which he is usually only 
partly aware), stimulates the latter to react by 
corresponding changes of attitude. From these 
the interviewer is able intuitively to formulate 
his assessment, apparently by creating in his 
own mind, and then observing and. judging, a 
kind of picture or ‘schema’ of the candidate’s 
personality as a whole. This last conclusion is 
a particularly interesting one, and seems to tally 
well with Reik’s description, in Surprise and 
the Psychoanalyst, of the temporary introjec- 
tion of the patient’s ego into that of the analyst. 
But Mr. Oldfield does not sufficiently discuss 
the essential difficulty of this process, which is: 
The ‘schema’ does indeed take the form of a 
living ‘homunculus,’ characterized by feelings 
and action tendencies which cannot be formu- 
lated in verbal terms. Thus the interviewer may 
be able to say to himself, “I know what sort of 
a man you are,” while groping in the dark for 
words and concepts in which to describe the 
candidate’s personality. Of course if the inter- 
viewer’s task is merely to choose someone to fill 
a particular job, he may be able to judge the 
candidate’s suitability intuitively and immedi- 
ately, and the problem of verbal formulation 
does not arise. But it is disappointing to the 
psychologist that it should be left so much in 
the air; and that Mr. Oldfield has been unable 
to make use of his experimental material to 
elucidate it further, and to investigate the valid- 
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ity of such verbal character assessments as can 
be made. He must, however, be congratulated 
on his analysis of the interviewing process, and 
upon the lucid and attractive manner in which 
he has carried it out. 

J Appl Psychol 26:240 Ap '42. Forrest H. 
Kirkpatrick. * A considerable part of the dis- 
cussion appears to consist of general observa- 
tions which frequently have been made concern- 
ing the art and process of interviewing. Oldfield 
does not include a bibliography or any refer- 
ence to the work of other students of the inter- 
view and one wonders whether he consciously 
or unconsciously ignored the fine research of 
several men in this country in the psychology of 
the interview. * This book will be interesting to 
vocational counselors but they may be some- 
what disappointed in terms of vitality and 
validity. 

J Ed (London) 73:274-5 Je '41. Philip Har- 
tog. * based in part on....experimental investi- 
gation[s]. * Unfortunately no description of 
these investigations is given in the book itself, 
to which they would have lent a sense of actu- 
ality which it lacks. Mr. Oldfield appears to 
have had no experience of his own as a respon- 
sible interviewer. His knowledge is mainly de- 
rived at second-hand. * He tells us....that “any 
basic principle governing procedure in the in- 
terview for the assessment of personal qualities 
«wis best expressed by saying that the task of 
the interviewer is to evoke the display of atti- 
tudes by the candidate, and to form his judg- 
ment upon his apprehension of them". We con- 
fess that a conscientious perusal of Chapters II 
and III which deal with ‘attitudes’ has left us in 
the dark as to Mr. Oldfeld's meaning. We 
cannot help asking if he himself knows what he 
means when he tells us that “if an individual 
be placed in the same situation upon two occa- 
sions, his outward behaviour may be very dif- 
ferent ; and this very change of outward reaction 
may be determined by the fact that the same 
attitude has prevailed" (pp. 29, 30). It is clear 
that a knowledge of a candidate’s ‘attitudes’ in 
the Oldfeldian sense would be of little help in 
foreseeing how he would behave in a situation 
of difficulty as a responsible officer. With re- 
gard to the interview itself, Mr. Oldfield tells 
us that “a game of tennis and an interview 
alike are occasions unsuited to the exercise of 
scientific method”. We know from the context 
what he means though he has not succeeded in 
saying it. And again he converts another obvi- 


ous half-truth into what appears to be complete 
nonsense when he writes, on p. 137, “Never 
mind what the candidate says: note the way he 
says it". Mr. Oldfield’s book itself, though 
more intelligent in parts than the above exam- 
ples would indicate, is itself lacking in scientific 
method. His discussion of the relation of an 
interview to what is often its immediate pur- 
pose—the choice of the candidate most suited 
to carry out the duties of a particular kind of 
job or career seems inadequate. It gives not a 
single illustration of actual conversations at an 
interview. And though he has interviewed in- 
terviewers it seems not to have occurred to 
him to interview candidates and to obtain their 
side of the story. Mr. Oldfield advises inter- 
viewers to “cultivate the art of conversation”. 
Experienced interviewers sometimes observe 
with amusement as onlookers the success in 
conversation of skilful candidates who (quite 
legitimately) evoke ‘attitudes’ favourable to 
themselves in a colleague. The author explains 
that “his especial aim in discussing the essen- 
tials of the interview in more or less psycho- 
logical language has been not so much to arrive 
at practical suggestions or recommendations as 
to induce a spirit of inquiry among interview- 
ers”. How far he will achieve that purpose 
remains to be seen. At any rate the very title 
of his book will help to direct more attention to 
a subject of great importance too much neg- 
lected hitherto. And the disarming modesty of 
his preface, and certain examples of common 
sense in the book itself, suggest that he might 
himself do better when he writes again. But he 
would be well advised to read Descartes’ great 
Discourse on Method, both for its content and 
its style, before he makes his next attempt; and 
a dose of the delightful William James would 
not be amiss. 

J Ed Psychol 32:709-11 D ’4r. Percival M. 
Symonds. * The setting and conduct of the in- 
terview are given interesting treatment and 
there is a chapter on the psychological process 
in forming judgments. A very valuable chapter 
concerns the board interview with a small sec- 
tion devoted to multiple interviews. The discus- 
sion of the group interview seems to be omitted. 
* One can dispute the claim on the wrapper 
that this is a “scientific study of the interview,” 
in the first place, and that it is “the first” study, 
in the second place. I find that except for a few 
comparisons in the chapter on the board inter- 
view there is no reference to data which might 
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have been collected during the period of the 
study which would serve as the basis for the 
conclusions reached. As a matter of fact, a con- 
siderable part of the discussion would appear to 
consist of general observations which frequently 
have been made concerning the art and process 
of interviewing. That this is the first study of 
the art of the interview is certainly contrary to 
fact. Oldfeld does not include a bibliography 
or any reference to the work of other students 
of the interview, but it is difficult to believe 
that a person could have spent fifteen months 
in studying the psychology of the interview 
without having discovered during that time the 
work of Bingham, Young, Strang, Symonds, 
and many others. Although the author states 
that an interview may be used for a number of 
purposes, it is not clear throughout the essay 
exactly what kind of interview he is talking 
about. One would assume during most of the 
discussion that he had in mind the interview 
for the "selection and rejection of applicants 
for employment." It is well known that the 
process of interviewing varies according to the 
use that is being made of it. This functional 
aspect of interviewing seems to have been over- 
looked by the writer. I find that this author 
thinks of the interview largely in terms of ques- 
tion and answer. For instance he says, (page 
72) "A third advantage of routine is that by 
it the interviewer's mind is freed, to some ex- 
tent, from the task of formulating his questions 
and considering his next move." The implica- 
tion here is that the interviewer's task is largely 
that of formulating questions, noting replies, 
and observing the responses of the candidate. 
This is a very limited concept of the interview- 
ing process. There are certain statements in the 
book which would seem to be much more a 
matter of personal opinion than the result of a 
scientific study. For instance, it is stated, (page 
54) “Some interviewers find that movements 
and posture of the whole of the candidate’s 
body and not merely of the upper half are sig- 
nificant for judgment. For this reason, there 
may be some advantage in making the candi- 
date sit so his legs are visible. In the second 
place in order to observe the face of a female 
candidate, removal of the hat may be neces- 
sary.” Notwithstanding the above critical com- 
ments, this book takes its place among others 
of its kind as a compact and well-phrased state- 
ment of the art and process of the interview. 
The convenient size and length of the book 


commends itself to the use of students who wish 
to gain a quick survey of the interviewing 
process. 

Occupational Psychol 15:155-9 Jl '41. Cyril 
Burt, * His book falls into two main sections— 
a theoretical part, addressed primarily to the 
psychologist, and a more practical part, rich in 
useful advice for the employer and the man of 
business. The latter perhaps may feel tempted 
to skip the theory; but, as Dr. Myers points out 
in his illuminating summary, the first object of 
the book is to persuade the professional inter- 
viewer to reflect upon his delicate task. The 
psychologist will look forward to the results of 
the experiments, and to an account of the data 
on which Mr, Oldfield’s conclusions, theoretical 
and practical alike, have apparently been based. 
The book itself is remarkable for the lucid and 
logical way in which the whole discussion is 
unfolded. From beginning to end it is as read- 
able as it is stimulating and suggestive. The 
practical outcome of his research is summarized 
under three heads: the setting of the interview, 
the conduct of the interview, and the assess- 
ment of the candidate. * his detailed hints on 
how to prepare and set the stage are full of 
sound comments on points too often overlooked. 
As regards methods to be followed during the 
interview itself, Mr. Oldfield decides against 
the use of any set or standardized devices. * His 
third problem is the assessment or appraisal of 
the candidate’s personal characteristics. And it 
is here that we reach Mr. Oldfield’s chief con- 
tribution to the subject—a contribution. that 
specially demands the notice of the psycholo- 
gist. He gives a vivid account of the steps, con- 
scious or unconscious, by which the ordinary 
interviewer reaches his final judgments. But 
comparatively little is said about the way such 
methods might be improved by psychological 
devices. What is especially distinctive about his 
theory is the contention that the fundamental 
purpose of the interview is, not to assess the 
candidate’s personal qualities, but to stimulate 
him to reveal his attitudes. Attitudes, unlike 
personal qualities, “are directly displayed and 
directly perceived.” In Mr. Oldfield’s pages the 
word takes on a rather different meaning from 
that which it has received in the writings of 
Binet or Watt, or (more recently) of Bartlett 
or Thurstone. * Each interviewer seems to pos- 
sess a certain “generalized frame of reference 
in regard to which other people are assessed” 
and acquires a “tendency to assimilate people to 
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types." Unfortunately, however, when he comes 
to formulate these attitudes, types, and frames 
of reference, and the predictive judgments based 
upon them, the interviewer nearly always falls 
back on “the resources of ordinary language": 
for, in the field of personality and character, 
“no scientific system of precise and intercon- 
nected concepts as yet exists.” Hence “practice 
in the use of language is one of the main duties 
of the interviewer.” * his discussion scarcely 
does full justice to the help that psychology 
might give. The psychiatric interviewer has al- 
ready developed and defined a system of con- 
cepts, more or less precise, if not altogether 
scientific, without which his assessments of both 
normal and abnormal personalities could scarcely 
be achieved; and educational and vocational 
psychologists between them have hammered 
out a practicable ‘psychographic scheme,’ with 
a more or less reliable ‘frame of reference,’ 
which, for all its obvious imperfections, is cer- 
tainly far better than that of the ordinary inter- 
viewer of the type Mr. Oldfield has in mind. * 
What....the would-be interviewer needs to know 
is, first, where to look for the innumerable clues 
that may be explicitly perceived ; secondly, what 
permanent "attitudes," “types,” or approxima- 
tions to types, may be legitimately inferred from 
such clues, and how they may best be defined 
or described; thirdly, and above all, what reli- 
ance may be placed on these clues and on the 
different inferences deducible from them. The 
Institute itself has already done a good deal of 
work upon these very problems. What Mr. 
Oldfield calls “attitudes,” “types,” and “frames 
of reference,” are very similar to what other 
vocational psychologists are accustomed to call 
“mental factors.” If so, we have an obvious 
experimental technique for testing and confirm- 
ing the detailed conclusions to which his pre- 
liminary observations have led. Admittedly a 
factorial analysis can form only the beginning, 
never the end, of psychological inquiries: for a 
factor as such is a mere logical abstraction. And 
Mr. Oldfield's theory of “attitudes,” if duly 
verified, would yield a great deal more infor- 
mation than could be gained by any mere statis- 
tical inquiry. It not only indicates what factors 
may usefully be looked for; it also suggests 
alternative modes of investigation, which, 
though possibly less definite and conclusive by 
themselves, will nevertheless bring us nearer to 
the concrete mental life of the individual inter- 
viewed, and to the concrete interplay between 


the two personalities who inevitably take part 
in the process. Yet without some kind of corre- 
lational technique, if only as an incidental check, 
it is difficult to see how the validity either of 
the theoretical analysis or of the practical pre- 
dictions can be determined. The factorist, it 
must be granted, too often remains absorbed in 
his tables of figures, and seems trying to turn 
what is really an art into a would-be science. 
Had we perforce to choose between the two 
complementary procedures I would certainly 
prefer, with Mr. Oldfield, to trust to the ‘in- 
tuitions’ of the art rather than to the ‘rigid 
standardizations’ of the science. But are the 
two approaches mutually exclusive. Can we not 
combine them? Can we not, in short, regard 
what Mr, Oldfield rightly calls an art as really 
an applied science? However, these simpler 
statistical inquiries can always be carried out, 
and have been carried out, by the spade-work- 
ers. What psychologists so far have never done 
—certainly never so well as Mr. Oldfield—is 
just this preliminary analysis of the interview 
itself regarded as a psychological process. It is 
a task that badly wanted doing ; and it has been 
excellently performed. The thanks, not only of 
industrial psychologists, but of all who are con- 
cerned with the assessment of personality, are 
due to the Institute, to the Leverhulme Trustees, 
and to Mr. Oldfield himself for the inquiry he 
has undertaken. 

Occupations 20:155-6 N "41. Bruce V. 
Moore. * essentially an essay on the placement 
interview * The psychological discussion itself 
is based not on experiment but on conferences s 
with interviewers. Only one objective study -is 
referred to, in which the correlation between 
the ratings of candidates by one board of inter- 
viewers is reported to have a correlation of 0.43 
with the ratings given by another board. It is 
argued on theoretical grounds that a discussion 
among the board members before recording 
their individual ratings of a candidate might 
make the board’s rating more valid, in spite of 
the fact that the averages of such ratings corre- 
lated only 0.41 with similar ratings by another 
board. This book is not a manual for interview- 
ers. It warns against a naive objectivism that 
would routinize the interview. For one who is 
interested in a more critical and introspective 
analysis of the mental processes in an interview, 
there should be stimulation from this mono- 
graph written by a Cambridge scholar working 
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under the direction of C. S. Myers, who wrote 
the Foreword. 

Personnel J 21:78 Je '42. Forrest H. Kirk- 
patrick. Seven practical suggestions which seem 
to be related to the development of a spirit of 
inquiry and sound judgment on the part of 
interviewers represent the substance of the final 
chapter of this little book, Perhaps these sug- 
gestions represent the major substance of the 
whole book. It is a compact little volume based 
on data obtained by watching numerous skilled 
interviewers at work ; by interviewing the inter- 
viewers about their methods; and by conduct- 
ing interview experiments using certain special 
procedures. It discusses the best types of pro- 
cedure; the attitudes to be adopted by the in- 
terviewer; and the manner in which he may 
formulate his judgments afterwards. * One of 
the best chapters deals with the board interview. 
Unfortunately a considerable part of the discus- 
sion in the book seems to be general observa- 
tions concerning the art and process of inter- 
viewing. There are certain statements in the 
book which seem to be much more a matter of 
personal opinion than the result of a scientific 
study, * 

Pub Personnel R 3:245 Jl "42. James C. 
O'Brien. Not only the paucity of literature in 
the field of interviewing, but the thoroughness 
with which the author has treated his subject 
indicates that this publication is a real contribu- 
tion to present-day writing. While in general 
the author has outlined and given approval to 
what might be referred to in personnel circles 

* as the "psychology approach" to the interview- 
ing situation, he has vigorously refrained from 
pragmatic generalizations in which some psy- 
chologists have been prone to indulge in treat- 
ing of this most complex problem of determining 
the measure of the man. For example, his 
scholarly treatment of the part that attitudes 
play in a social encounter is a definitive and 
thoughtful statement. He likewise candidly ad- 
mits that any individual may in an interview 
situation perform acts and give expression to 
ideas which are inconsistent with his real per- 
sonal characteristics or ideologies. The author 
likewise wisely admits that sensibility to the 
attitudes and states of mind of other people is 
so many times devoid of conscious basis that its 
value as a measurement process is doubtful. * 
The degree of refinement in establishing the 
interview situation, while highly desirable for 
interviews for some purposes, would be too 


costly both in time and money to be used on any 
large-scale public oral examining program. * 
However, the clarity with which this compact 
and well-written book deals with a most com- 
plex problem suggests beyond question that it 
should be required reading for any public per- 
sonnel administrator charged with the conduct 
of oral examinations or interviews for rating or 
recruitment purposes. 
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XOssonxE, Acres ELIZABETH. The Relationship Be- 
tween Certain Psychological Tests and Short- 
hand Achievement. Columbia University, Teachers: 
College, Contributions to Education, No. 873. H. L. 
Forkner, faculty sponsor. New York: Bureau of Pub. 
lications, the College, 1943. Pp. vii, 58. Out of print, * 


Ed Res B 23:140 My 17 '44. Harold E. 
Burtt. About half the book is devoted to a re- 
view of previous studies in this field. These are 
brought together finally in two extensive tables 
which show criteria, validity of tests, kind of 
sample, number of cases, and bibliographical 
references. The author puts her finger on the 
difficulties which are so common in validating 
aptitude tests, for example, securing an ade- 
quate criterion. In her own experiment, she 
used 139 subjects from high-school classes, and 
the following tests: intelligence, reading, cleri- 
cal ability, spatial relations, and visual recep- 
tion. As a criterion, she used a shorthand test 
devised by V. H. Carmichael. * The validities 
of the five tests against this criterion range from 
:38 to .o7. The author is disturbed by the fact 
that visual perception correlates only .o9 with 
her criterion. This contradicts what many 
would consider should be a logical connection 
between perception of symbols and effectiveness 
of shorthand. Combining the tests, we have a 
multiple correlation of only .43, which is not 
much better than the intelligence test alone 
which correlates .38. This is not very encour- 
aging. The reviewer makes the obvious sugges- 
tion that a more satisfactory result can probably 
be obtained by trying out a much larger num- 
ber of tests and finding some with good validity 
and low intercorrelations. 

J Bus Ed 19:33 Ja’44. 

Pub Personnel R 5:241-2 O '44. Thelma 
Hunt. * The introductory part....presents a re- 
view of previous studies of the predictive value 
for shorthand achievement of various tests and 
other factors. This part...may be of value to 
civil service administrators seeking a brief re- 
view of such studies or a bibliographical listing 
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of sources of such information. Unfortunately, 
the summary does not reveal any criteria of 
great promise for selecting good prospective 
trainees for shorthand training. * The conclu- 
sion to be drawn from the analysis presented... 
is that none of the tests, or subtests, or combi- 
nations of tests studied shows correlation with 
the criterion of shorthand achievement suffi- 
ciently high to afford finely discriminating in- 
struments for picking out shorthand trainees. 
However, they probably represent the best in- 
struments the personnel administrator has at 
his disposal, and this study of their comparative 
values may be well worth consideration. Fur- 
thermore, the monograph presents many sug- 
gestions for additional study in this problem 
area and may prove to be a fruitful source of 
ideas for personnel research units and personnel 
workers interested in research. 


[ 1084 ] 
*Perers, RICHARDA. A Study of the Intercorrela- 
tions of Personality Traits Among a Group of 
Novices in Religious Communities. Catholic Uni- 
versity of America, Studies in Psychology and Psy- 
chiatry, Vol. 5, No. 7. Washington, D. C.: Catholic 
University of America Press, December 1942. Pp. vii, 
38. Paper. $1.00. * (PA 17:2047) 
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XPriLres, GiLeert E. The Constancy of the Intel- 
ligence Quotient in Subnormal Children. Publi- 
cations of the *Australian Council for Educational 
Research, Educational Research Series, No. 60. Mel- 
bourne, Australia: Melbourne University Press, 1940. 
Pp. 86. Paper. 5s. * (New York: Stechert-Hafner, Inc. 
$1.25.) (PA 15:2422) 

Brit J Ed Psychol 12:79 F ' 42. The Austral- 
ian Council for Educational Research has once 
more been instrumental in issuing an interest- 
ing and important account of investigational 
work. Dr. Phillips has carried out with thor- 
oughness and skill a research which throws 
light on a much discussed topic in intelligence 
testing. By retesting at definite intervals of time 
a group of subnormal children, who for most 
of the period of investigation were in a residen- 
tial special school, he amassed much information 
about the intelligence quotient of each child. 
Was it constant? His results are interesting and 
cannot be summarised in a necessarily brief re- 
view. But one of his conclusions is "For sub- 
jects whose I.Q.’s range roughly from 50 to 80, 
there is a tendency for a general and substan- 
tial decline of L.Q. with repeated applications 
of a Binet Test. This decline varies directly 
with the length of interval between the tests and 
seems to be symptomatic of a decline in the rate 


of mental growth. The rate of decline increases 
progressively.” There are many other equally 
interesting conclusions and those interested in 
mental testing will give attention to this re- 
search, and will note the implications regarding 
the diagnostic value of Binet tests. One hopes 
that further investigation along similar lines 
will be possible. 

Brit J Psychol 32:262 Ja '42. This mono- 
graph gives a rather lengthy and confused ac- 


.count of a piece of work, interesting in itself, 


which could nevertheless have been adequately 
described in a Journal. The intelligence quo- 
tients of 365 children at a residential Special 
school were measured, by means of the Stan- 
ford Binet test, at intervals over a period of 
nine years (though only 100 children actually 
completed five testings). The average 1.Q. for 
the whole group was found to decline over this 
period. This was due, not to a progressive de- 
cline of all cases, but to the fact that certain 
children, particularly those of low I.Q. and un- 
stable emotionality, ‘matured’ early. That is to 
say, their mental age did not increase regularly 
up to the age of fifteen, but became constant, 
in two cases before the age of eleven. It is clear, 
therefore, that prediction of the future intelli- 
gence level cannot be made with any certainty 
with such children, especially if tested only 
when quite young. However, for nearly 60% 
of the children tested, the Binet test gave a re- 
liable measure of intelligence with a constant 
LOQ. up to fifteen years. It seems a pity that no 
further study was made of the type of child 
who reaches this early ‘maturity’ (‘standstill’ 
seems a better term for it). But the conclusions 
reached seem well established and reasonable. 

J Ed (London) 74:134-6 Mr ’42. J. W. 
Jenkins. * a lucid, concise, and systematic ac- 
count is given of a study of the intelligence 
quotients of a group of subnormal children in 
attendance at a residential school in Sydney. It 
is shown that with subnormal children there is 
a general tendency for the intelligence quotients 
to decline substantially with. repeated applica- 
tion of the Binet Test, and there is a definite 
indication that a considerable proportion of 
them reach maturation point before 15 years. 
These conclusions, among many others stated, 
are significant and make an interesting compari- 
son with those of other investigators. Much 
attention has already been given to ‘constancy 
of intelligence quotients’, and seemingly con- 
flicting evidence has been reported from vari- 
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ous sources. In this admirable little book the 


results of previous research are ably summarized- 


and a useful bibliography is given for those 
who wish to follow up the inquiry. 
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XPnuvsicAL Society Corour GROUP COMMITTEE, J. H. 
Suaxsy, CHAIRMAN. Report on Defective Colour 
Vision in Industry. London: Physical Society, 1946. 
Pp. ii, 52. Paper. 3s. 6d., postpaid. * 


Nature 159:460-1 Ap 5 '47. L. C. Martin. 
A committee of the Colour Group of the Physi- 
cal Society has performed a valuable task for 
the benefit of the community: in preparing its 
recent “Report on Defective Colour Vision in 
Industry”. It is written in a style which will 
be readily understandable by non-technical 
readers. Starting with a clear account of the 
nature and incidence of defective colour vision, 
it goes on to describe various colour vision 
tests, and mentions an almost alarming variety ; 
an interesting chapter deals with colour vision 
in industry and the Services, from which it 
appears that there is a strong case for pre- 
vocational testing and routine examination of 
the colour vision of school children. A conclud- 
ing chapter makes a series of recommendations, 
of which perhaps the most important is that “all 
children should be tested at school at the age 
of 13 or over, using one of the approved Con- 
fusion Chart Tests”. Confusion chart tests are 
also suggested as suitable for pre-vocational 
tests for a great variety of industries and 
trades. * The recommendation for the testing 
of school children is excellent, and there is no 
doubt that it should be implemented as soon 
as possible. * It is possible that the teachers, 
who already nobly carry a great burden of 
duties besides their normal teaching, might be 
able to include the first test as part of some in- 
structive and amusing game in the ordinary 
school curriculum, and this without feeling that 
colour tests were the ‘last straw’; there seems 
scope here for some invention by an education- 
ist who is prepared to study the problem. The 
report speaks in various places of the desira- 
bility of having well-printed cards for confusion 
chart tests, adequate lighting, proper refractive 
corrections, etc.: but these remarks do not 
appear adequately to reflect the immense im- 
portance which is rightly attached by the public 
to uniformity and justice in tests of this kind. 
In regard to one calling, the trade union con- 
cerned has taken a most active interest in such 
matters, and was reputed to go so far as to 


coach candidates in the tests. Even the school 
‘screening’ tests should be uniform in character 
and adequately planned; a set of grimy cards, 
half-seen in a bad light, may make more trouble 
than they are worth. There are people who mis- 
understand the objects of such tests, and are 
capable of purchasing a set of cards for study 
in an attempt to defeat the examiner. If the 
screening test should be carefully conducted, 
this applies even more forcefully to the case of 
the testing of the doubtful cases classed in the 
report as Group Ila. In spite of ‘the conveni- 
ence and undoubted effectiveness of the con- 
fusion chart method, the report leaves one with 
the impression that there is no guarantee that . 
different sets of cards, even of the same sup- 
posed character, will not differ appreciably 
among themselves. This may not matter for a 
first screening test, but seems a serious draw- 
back in the case of the decision (often difficult 
to make) as to whether a candidate is admis- 
sible to a particular vocation. It may be that 
some means can be found of ensuring that con- 
fusion chart methods are properly controlled 
both in the production of the charts and their 
condition of use * Apart, however, from mat- 
ters of secondary importance, the report should 
be of great public service in providing for the 
first time a clear general picture of the prob- 
lem, and the lines along which its solution 
should be attempted. The rewards of success 
will lie in a small but definite improvement. in 
the efficiency of industry, and (much more im- 
portant) the avoidance of the disappointment 
and difficulty which may invade the lives of 
those who are unfortunate enough to suffer 
from defective colour vision. It is much to be 
hoped that the report will be carefully read by 
all educational authorities and persons con- 
cerned with recruitment for industry and the 
professions. 
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X*PrcNATELLI, Myrtte LUNEAU. A Comparative 
Study of Mental Functioning Patterns of Prob- 
lem and Non-Problem Children Seven, Eight, and 
Nine Years of Age. Genetic Psychology Monographs, 
Vol. 27, No. 2. Provincetown, Mass.: Journal Press, 
May 1943. Pp. 69-162. Paper. $4.00. * (PA 17 :3273) 
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see 294. 
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Porteus, S. D, Primitive Intelligence and Envir- 
onment, New York: Macmillan Co., 1987- Pp. ix, 325. 
Out of print. * (London: Macmillan & Co., Ltd.) (PA 
11:4677) 


Brit J Ed Psychol 11:232-3 N "41. A(lbert) 
E. C(hapman). * a valuable contribution to 
our scientific knowledge of racial differences 
among primitive peoples * The introduction, 
consisting of careful statements of principles 
which should govern any researches into the 
mental abilities of uncivilized people, should be 
studied by every student of human nature who 
intends to begin investigating the mental abili- 


“ties or processes of any human beings. The rest 


of the book falls into two main parts: descrip- 
tions of the physical environments of the peoples 
studied, and careful accounts of the tests used 
and of the results obtained. The author... 
through the medium of a narrative of travel 
....Succeéds in enabling readers to build up more 
realistic pictures of the conditions under which 
these primitive peoples live than would be con- 
structed from bald geographical and economic 
data. He convinces us that conditions in Central 
Australia are worse than those in the Kalahari 
Desert, and produces evidence that the bush- 
men inhabiting the latter are inferior to the 
aborigines of Central Australia in the perform- 
ance of all the tests used. From these and other 
similar results Dr. Porteus draws the cautious 
conclusion that the more unfavourable physi- 
cal environment does not appear always to have 
a depressing effect on the mental capacity of its 
inhabitants, and that therefore hereditary fac- 
tors influence the mental capacities of members 
of different groups of human beings. The value 
of this book lies not only in presenting evidence 
against the extremists who maintain that differ- 
ences in environment are the sole determinants 
of racial differences but in making permanent 
the observations and records of a trained scien- 
tific psychologist on the environment and be- 
haviour of rapidly vanishing peoples. 

N Y Times Book R 42:23 Ag 15°37. Martha 
Gruening. * a fascinating story of travel and 
adventure and a serious study of the compara- 
tive mentality of two primitive peoples * The 
more debatable portion of Dr. Porteus’s book 
is that devoted to the comparative results of 
testing in both groups. This is, in part, an an- 
swer to the attack of Klineberg on the value of 
such tests as measures of racial intelligence, an 
attack which Dr. Porteus admits to have been 


highly intelligent and fairly devastating. But it 
does not follow, he says, that there are not racial 
differences in intelligence and that tests so de- 
vised as to give full recognition to environmen- 
tal factors cannot discover these. He believes 
that his results are particularly significant in 
combating the extreme claims made by certain 
environmentalists * Porteus’s book....is disarm- 
ingly undogmatic * 

Times Lit Sup 1874:6 Ja 1 ’38. * It would 
appear, however, that what they [the Bushmen] 
were being tested for was rather their adapta- 
bility to our civilization, very much as if wild 
animals were being rated according to their 
power of breeding in captivity. A good deal of 
the book is somewhat irrelevant to the main 
issue, being concerned with very mild incidents 
of travel * back of all such inquiries [there is] 
the further question whether the value of a 
universal standard of intelligence does not cease 
at the point where the chief racial types and 
sub-types have branched forth from the ances- 
tral stem. In short, does it not take all sorts of 
intelligence to make a world; and is it not the 
one boon conferred on humanity by civilization 
that, by enlarging the possibilities of living vari- 
ously, it enables more kinds of men to fit some- 
where into the scheme? For the rest, let us be 
thankful that Dr. Porteus does not provide 
specimen mazes on which we might, in all pri- 
vacy, test our own claim to normal intelligence. 
Even if one is not thereby directly proved to be 
of the “mental age” of ten, one might suspect 
that from ten onwards one has lost rather than 
gained in the effective power and will to solve 
such puzzles. 

For additional reviews, see 38:B454 and 
40:B1046. 
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of Clinical Psychology. New York: American Book 
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Am J Orthopsychiatry 12:182-3 Ja '42. 
Stanley G. Dulsky. * The book is distinctly per- 
sonal and makes no claim to scholarly objec- 
tivity. Its style is conversational, flavored with 
humor and gibes at professional co-workers. * 
Psychologists will be interested in what a prac- 
titioner of such long and varied experience has 


1091 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 894 


found helpful. The book cannot be recommended 
as a text; it must be regarded as the philosophy 
and practice of S. D. Porteus who has taken 
the pains to tell his colleagues what he does and 
why. 

Char & Pers 10:346-7 Je ' 42. Phyllis Blanch- 
ard. * Porteus states that he has not attempted 
to survey the important aspects of clinical psy- 
chological practice generally but only to present 
his own theories, opinions and methods, realiz- 
ing that other psychologists may differ on the 
basis of their own professional experience. Inso- 
far as Dr. Porteus keeps to his plan and discusses 
his own work, this book is a rather unusual 
document of personal experience and practice. 
When he attempts excursions into allied clini- 
cal fields outside his own personal experience, 
however, inadequate or even erroneous state- 
ments occur far too often. The discussion of the 
work of child guidance clinic psychelogists is an 
illustration of such misstatements of fact. Dr. 
Porteus writes: "These psychologists are usu- 
ally only mental testers, the interpretation of 
their test results being undertaken by the psy- 
chiatrist." * Consultation of the existing litera- 
ture....should have enabled Dr. Porteus to avoid 
such misinformation concerning psychological 
practice in the field of child guidance. This is 
a sample of the kind of error which unfortu- 
nately makes this hardly a suitable text for 
students. Nevertheless, it is a book which af- 
fords decidedly interesting reading for psychol- 
ogists working in the clinical field, because it is 
an intimate account of the actual practice of one 
of the well-known members of our profession, 
who frankly describes his own preferences and 
prejudices without being defensive and without 
making too much effort to justify such personal 
attitudes. 

Fed Probation 7:45 JI-S '43. Milton Gold- 
berg. It is perhaps only natural that the phil- 
osophical distillation of [the author's] experi- 
ences should flavor the entire book. The attempt 
to define and distinguish the role of the clinical 
psychologist from that of the psychiatrist may 
be well taken, but appears to create an artificial 
distinction at variance with the clinical over- 
lapping and complementary nature of the two 
disciplines in actual practice. The field covered 
in the book is so extensive that the presentation 
at times seems stretched rather thin. The chap- 
ter on the biological basis of clinical psychology 
which deals with the neurophysiologic substra- 
tum on which is built the practice of clinical 


psychology is oversimplified and somewhat out- 
moded in view of the recent developments in 
neurophysiology. The chapter on intelligence is 
excellent, particularly in view of the overem- 
phasis frequently placed on the application of 
intelligence tests and I. Q.’s. The author's rich 
experience and practical suggestions are espe- 
cially valuable in his discussion of the various 
tests which may be applied clinically in the 
study of a child. However, in the chapters deal- 
ing with the various specific problems of chil- 
dren, the constitutional elements are overstressed 
without sufficient recognition of the influences 
of early childhood experience in creating healthy 
interpersonal relationships. Three chapters de- 
voted to delinquents and criminals present ma- 
terial of particular interest to those of us who 
are called on daily to deal with these problems. 
* There is postulated a delinquent personality 
whose outstanding characteristic is that of ego- 
centric self-indulgence and greediness with 
little regard for the rights of others. A self- 
indulgence test is formulated which is interest- 
ing and may be used as a range finder. * The 
treatment of delinquents is all too summarily 
presented with not enough distinction between 
those whose delinquency is a manifestation of 
a psychopathologic disorder and those whose 
delinquency more largely is conditioned by 
social-cultural frustrations including the delin- 
quent, his family, his neighborhood, and his 
group. The chapter on the criminal is most 
valuable from the standpoint of parole predic- 
tion. * Porteus presents a critical review of the 
prediction tables devised by Burgess and the 
Gluecks, and also gives his own approach to 
the problem by means of the Criminal Index, a 
system for weighing and summating certain 
factors in the past record of the prisoner as an 
index for successful prediction. Some of these 
factors seem open to question and need a great 
deal more follow-up before the validity and re- 
liability of the index as presented is established. 
However, it is a step in the right direction. Par- 
ticularly ingenious is the author's introduction 
of the chronological crime line which provides 
a graphic guide for prison boards in fixing in- 
dividual sentences. It certainly should be given 
the benefit of extensive trial and evaluation. 

J Abn & Social Psychol 37:286-7 Ap '42. 
Carl Rogers. The field of clinical psychology, 
like any professional field, has need of books 
written by men on the firing line of practical 
work. The more academic and experimental 
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approach needs to be enriched by contributions 
from those whose contacts are with the every- 
day problems of practice. Consequently one 
picks up this book by Dr. Porteus, whose years 
of experience in the Psychological Clinic of the 
University of Hawaii are well known, with 
hope of a rich reward. On the whole, this re- 
ward is not forthcoming. The book is a dis- 
appointing one from a number of angles, and 
its occasional new contributions are not suffi- 
ciently outstanding to overcome this feeling of 
disappointment. In the first place, the work 
seems poorly organized, and this lack of struc- 
ture does not appear to be merely a superficial 
lack, but a surface indicator of poorly organized 
thinking. The order of the chapters, and of the 
topics within a chapter, seems in many instances 
to follow no real pattern. Consequently one 
gains very spotty impressions. A second major 
flaw is the lack in the book of any clear frame- 
work of thinking about the individual. Through- 
out the first part of the volume this reviewer 
was impressed by the fact that there seemed to 
be no consistent viewpoint in regard to the dy- 
namics of human behavior, no clear hypothesis 
as to human motivation. The author is critical 
of the various schools of psychoanalysis and 
psychiatry, but he offers no constructive substi- 
tute. He endeavors to remain openminded to- 
ward truth from any source, but fails to recog- 
nize that isolated "truths" are not particularly 
helpful except to check on the adequacy of some 
hypothesis. It is not surprising to find in the 
latter chapters that the lack of any adequate 
framework in his thinking leads him to be per- 
plexed and discouraged. Discussing the indi- 
vidual, he shows no belief in the consistency of 
predictability of human behavior and believes 
that “man is a creature of whims and caprice as 
well as of habit, and he may suddenly perform 
some act entirely out of character" (p. 480). 
With such a view it is not surprising that in 
discussing the causes of juvenile delinquency he 
concludes, “The matter seems almost hopelessly 
complex" (p. 306). Nowhere does one gain 
the feeling that the author, out of his years of 
work, has acquired a deep understanding of 
human nature and personality. The book has its 
good points. There is much practical wisdom in 
Dr. Porteus's discussion of various intelligence 
and performance tests. His chapters on psy- 
chometrics are not sufficiently complete to serve 
as text material, but they are valuable supple- 
ments which will be helpful to the psychologist 


in training. He suggests a modified form of the 
new Binet test for children who have language 
handicaps. He tells of a test of self-indulgence 
which he has found useful with delinquents. His 
chapter on “Measures of social capacity and 
personality" has many worthwhile comments on 
available instruments. The two chapters con- 
tributed by his collaborators, the late Dr. Mary 
Hunter, and Dr. Colin Herrick, are among the 
best in the book. The chapter on “Examination 
of young children," by Dr. Hunter, should be 
especially helpful to beginning clinicians, and 
Dr. Herrick's chapter in regard to reading dis- 
abilities helps to stimulate thinking beyond the 
diagnostic procedures, in its discussion of re- 
medial techniques. Curiously enough, this dis- 
cussion is classed as an appendix to the chap- 
ter. Although the book contains discussions of 
the feeble-minded, the delinquent, the criminal, 
and the abnormal, this material does not seem 
to be outstanding. Much of Dr. Porteus's think- 
ing seems characteristic of earlier decades. In 
his discussion of “moral defectives,” masturba- 
tion, sex education, and other topics, the author 
makes a definite effort to rid himself of out- 
moded ideas, but this effort is by no means 
entirely successful. One closes this book with 
the strengthened conviction that clinical psy- 
chology cannot be a discipline which merely 
measures discrete aspects of a complex unity 
which it does not understand. It must be based 
upon a deep understanding of structure and 
dynamics of the human personality, an under- 
standing which can be clarified and improved 
through measurements. This volume adds very 
little to such a basic foundation. 

J Appl Psychol 25:269-70 Ap '4r. J. R. 
Gentry. * the author has written easily and in- 
terestingly from his broad experiences in clini- 
cal work * The author intended to write a book 
which “has a practical rather than a scholarly 
aim," and succeeded. From strictly a practical 
point of view, though, there are spots in the 
book which disappoint somewhat. The third 
chapter, “The Biological Basis of Clinical Psy- 
chology," and chapter four, "The Nature of 
Intelligence," are probably the weakest parts. 
Both of these chapters have a good deal of the 
orizing. Another disappointment to the reviewer 
was the analysis of the items in the Binet test 
(p. 120). To enumerate so many items under 
School Attainment, and so many items under 
Common Knowledge and Comprehension of 
Practical Situations is not very enlightening to 
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a student in the introductory clinical course. 
The student would like a more specific identifi- 
cation, if possible, of the mental processes in- 
volved. There is need for about a dozen differ- 
ent cases tested in detail, and the scores fully 
interpreted. In chapter VI the pointed interpre- 
tations of the quality of responses on the Maze 
supply a long felt need. No one doubts the sig- 
nificance of social capacity, but there is prob- 
ably too much space devoted to a discussion of 
Doll’s Social Maturity Scale in chapter VII. 
But this is of minor significance. “The Diagno- 
sis of Reading Disabilities," chapter XVII, has 
much condensed meat in it. It must of necessity 
treat briefly many important points. But it does 
help one to organize material on this topic. For 
the experienced person, the chapters (XVIII 
and XIX) on Psychoanalysis are good reviews; 
but for the undergraduate student there is not 
much of Freud's concepts that are useful. The 
material in chapter XX is excellent but too 
brief. Most consideration is given to predicting 
nursing efficiency. While that is important, it 
is one of many vocations about which there is 
need for information. The book gives a good 
general discussion of functions of the psycho- 
logic clinic, the devices which may profitably 
be used, the types of cases which are most gen- 
erally met, and some excellent suggestions upon 
procedure, points of view of a clinical psycholo- 
gist, and criticisms of a number of widely used 
tests. 

J Consult Psychol 6:54-5 Ja-F '42. C. M. 
Louttit. The keynote of this book is in the first 
sentence of the Foreword: “This book does not 
attempt to report all the significant work that 
has been done in its field of reference, but only 
to epitomize my actual practice in clinical psy- 
chology” (reviewer's italics). This is both the 
strength and weakness of the work. Dr. Por- 
teus' position in clinical psychology is a strong 
one: Several of his contributions, especially the 
Maze Test, are of the highest significance. His 
exceptional scientific history including work 
with primitive peoples as well as with defectives 
and normal children in Australia, Hawaii, and 
the United States gives a unique richness to his 
experiences. In so far as this volume reports his 
practice and several new methods it is valu- 
able. However, the very fact that the orienta- 
tion is so specifically toward the author's own 
work makes it of questionable value as a gen- 
eral textbook. To this reviewer the chapters 
lack organization among themselves although 


each is a coherent essay on a particular topic. 
The chapters differ in value, and only limita- 
tion of space inhibits the temptation to com- 
ment on each one separately. The author does 
well in his attempt in the first chapter to define 
the role of the clinical psychologist. In differen- 
tiation from psychiatry perhaps too much is 
made of the psychologist's need for short ex- 
amination because he has to see so many cases, 
while the psychiatrist may take longer. This is 
certainly not true in guidance clinics, by and 
large. What is apparently the theoretical basis 
for his clinical practice is described in Chapter 
III. The “biological basis" is entirely neuro- 
logical and the neurological analogies are over- 
simplified in the light of modern psychoneural 
theory. This is perhaps the weakest chapter in 
the book. The chapters describing tests and 
clinical testing methods are excellent for the 
instruments included; but one wonders why, 
for example, there is no mention of the Bellevue- 
Wechsler Scale. * The chapter on the feeble- 
minded is the best in the book. * A chapter by 
Dr. Colin G. Herrick on the diagnosis of read- 
ing disabilities is a very practical one in its 
detailed description of special methods. A chap- 
ter called “Psychoanalysis and Psychology" and 
a second on "Psychological Analysis" present a 
clever criticism of Freudian psychoanalysis. For 
the advanced student and the professional psy- 
chologist the descriptions of new methods are 
exceedingly valuable. Worthy of mention are: a 
modified composite Binet (from the 1916, L, 
and M but without supporting statistics), so- 
cial traits scale, local information scale, educa- 
tional attainment scale (first published in 1922 
but never had the recognition it deserved), self- 
indulgence test, criminal index for prognostica- 
tion, visual reading method, nurses social rating 
scale. All of these innovations strike one as being 
potentially useful, but none of them is sup- 
ported by adequate experimental evidence. To 
characterize this book in a few words is diffi- 
cult. As a general text its value is questionable. 
As a coherent systematization of either the the- 
ory or the practice of clinical psychology it 
lacks much. It is really a collection of personal 
essays of unequal value which should prove of 
value to the advanced student and to the pro- 
fessional. 

Psychol B 38:990 1-2 D '41. Francis N. Max- 
field. The author disclaims any attempt to cover 
the entire field of clinical psychology in this 
book. His purpose is technological rather than 
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theoretical in that he seeks to “epitomize his 
own actual practice in clinical psychology.” For 
more than a quarter of a century Dr. Porteus 
has held a frontier post in the field of clinical 
psychology. As director of the Psychological 
Clinic of the University of Hawaii in Honolulu 
his everyday work has brought him in contact 
with a greater variety of clinical material than 
crosses the threshold of any other clinic in the 
world. As professor of clinical psychology he 
has directed the training and research of a con- 
siderable number of graduate students. He him- 
self has done notable research in the field of 
racial psychology * The reader will be dis- 
appointed to find that the author takes his own 
statement of purpose so literally that the book 
deals almost exclusively with aspects of clinical 
work that would be found in any psychological 
clinic. For example, the author is in a position 
to write an outstanding chapter on the advan- 
tages of clinical studies as a research method. 
Or he might well have discussed more fully the 
training that, in the light of his experience, is 
desirable for the clinical psychologist. * [In his] 
discussion of clinical problems and methods... 
the author seems overcautious or overmodest in 
his statement of how much the psychologist 
should undertake to do in regard to these prob- 
lems. “He will function in a diagnostic and con- 
sultative capacity rather than in a directive one. 
He makes no claim that his special training 
gives him the right to dominate the social-adjust- 
ment situation.” Why. speak of a "guidance" 
clinic at all if the work is not to be “directive”? 
It may be better technique to get a parent to 
come to the conclusion that he is too strict with 
his child and to let him feel that trying less 
severe methods is his own idea than to dictate 
this change in treatment, but who will say that 
the indirect method is not directive? Or why 
be overcautious in a discussion of the role of 
the psychologist vs. that of the psychiatrist? 
Why should this question be answered cate- 
gorically? In a given situation the answer should 
should be a practical one. Given a certain prob- 
lem, whether of analysis, diagnosis, or treat- 
ment, and the question of whether a particular 
Psychologist or a particular psychiatrist should 
deal with the case, the answer should depend 
upon the relative merits of the training and the 
professional competence of the two men and not 
on their designations. The author is at his best 
in his discussion of the use of psychometric tech- 
niques and their interpretation and in the ap- 


plication of such methods to the study of vari- 
ous groups. Here he is on more certain ground 
and makes no apology for the role of the psy- 
chologist. These chapters give the book a sub- 
stantial significance. The specious simplicity of 
psychometric scores and careless misinterpreta- 
tion of them have tended to discredit psychology 
as well as to retard the development of thor- 
ough methods in clinical examination. The 
author's very sane treatment of the diagnosis 
of mental deficiency, for example, should be 
read carefully. * Herrick....contributes an excel- 
lent chapter on the diagnosis of reading disa- 
bilities. He might well have supplemented the 
rather meager treatment of "reading readiness" 
appearing earlier in the book. The school psy- 
chologist should play his part in the prevention 
of reading difficulty as well as in its diagnosis 
and remediation. * The critical instructor who 
uses this book as a college text will find dif- 
ferent parts of decidedly uneven merit. This 
may be an advantage in that his students will 
get training in critical evaluation and will find 
excellent material for class discussion. In any 
case they will have a genuine sense of their 
indebtedness to the author for sharing with 
them his long experience in the practice of 
clinical psychology. 
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*PortHorr, Enwarp F. Examinations in Contract 
Law Courses. University of Illinois Bulletin, Vol. 39, 
No. 35; Office of the Provost, Studies in Higher Edu- 
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Preu, Part W. Outline of Psychiatric Case Study: 
A Practical Handbook. Foreword by Eugen Kahn. 
New York: Paul B. Hoeber, Inc., 1939. Pp. xiv, 140. 
(PA 13:4716, title only) For revised edition, see 1094. 


Psychoanalytic R 28:290 Ap ’4r. * a simpli- 
fied outline of the more obvious essential facts 
from the purely objective descriptive stand- 
point. It says nothing concerning unconscious 
material, its symbolization or rationalization and 
as such is definitely dated as prefreudian. The 
psychoanalytically oriented psychiatrist will find 
little of essential value in it. 

Psychosom Med 2:490-1 O *40. S. P. Hunt. 
This book is valuable in its aim of "training 
house officers as to what facts should be re- 
corded in the thorough study of a psychiatric 
case." It is a detailed statement as to what to 
ask the psychotic patient and his relatives. As 
such it will be most useful. However, practi- 
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cally no recognition is accorded the psychoneu- 
roses: the reader is not instructed as to what 
features to look for in the history, mental status 
and physical examination of the neurotic pa- 
tient. The autonomic nervous system is barely 
mentioned. Points aiding in the differential di- 
agnosis between the neuroses, i.e., the anxiety 
neurosis and hysteria, and surgical and medical 
diseases are not taken up. One misses, too, a 
discussion of the interesting psychiatric symp- 
toms that come under the general heading of 
feelings of depersonalization, and of alterations 
in body form, of dysaesthesias and somatic de- 
lusions. The symptomatology of the aphasic 
patient and of those with other cortical or sub- 
cortical lesions is not mentioned. Some treat- 
ment of the subject of the exploratory and the 
psychotherapeutic interview would not be ir- 
relevant in a book on psychiatric case study. 
Instruction as to the relationship of the case 
data to diagnosis, prognosis and therapy would 
have been helpful. The author's book will be of 
considerable aid to the beginner in his attempt 
to clarify for himself the confusing picture pre- 
sented by the psychotic patient. To the more 
experienced physician, it is of problematic value. 
Training Sch B 38:17-8 Mr '41. H. Robert 
Otness. * very readable and understandable and 
should be of great service to physicians, psy- 
chologists and others engaged in social work * 
This case study approach gives detailed consid- 
eration to the whole personality and stresses the 
individuality of each human problem. Studying 
and following such an outline should give one 
a thorough revelation of the cases and help 
clarify the diagnoses and problems of recom- 
mendation and treatment. This volume would 
be a useful addition to the library of any physi- 
cian, psychiatrist, psychologist, sociologist and 
other professional people engaged in human 
problem solving enterprise. The reviewer is 
also reminded of the fact that most problems 
referred to the psychologist or psychiatrist are 
really social in nature having been precipitated 
through some form of social incompetence. Just 
as several other types of examinations are re- 
ferred to in this volume, the Vineland Social 
Maturity Scale would also be of great help in 
revealing the status of the individual's social 
incompetence and thus would aid further in 
clarifying the problem. This instrument has 
many uses with individuals with whom formal 
mental examinations are not applicable and 


where it is impossible to see the individual 
under question. 
For additional reviews, see 40:B1047. 
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Am J Orthopsychiatry 13:375 Ap’43. L(aw- 
son) G. L(owrey). The additions have greatly 
enhanced its usefulness. Nearly two-thirds of 
the book is devoted to the adult patient, the 
balance to examination of the child. The latter 
is useful to orient inexperienced people who 
are not familiar with children, their mental 
processes and behavior. One might take mild 
issue with some of the cautions given, but they 
are certainly needed for those psychiatrists 
who might be tempted to deal with the “prob- 
lem child” as he has habitually done with his 
adult, sick patient. The book covers very com- 
pletely all areas of functioning and disturb- 
ances. It is to be hoped that all who use it will 
heed the author’s earnest admonitions not to 
regard the several sections as questionnaires. 
There is some repetition, but it serves a useful 
purpose. All in all, an excellent book. 

Arch Neurol & Psychiatry 50:632 N ’43. * 
This book is to be highly commended for the 
use of students, nurses, practitioners and physi- 
cians in training in psychiatry. 

J Abn & Social Psychol 39:502-3 O '44. 
Robert W. White. This book is intended to per- 
form a practical teaching service, the training 
of house-officers in the technique of psychiatric 
history-taking and mental examination. * Short 
sections deal wisely with the arts of interview- 
ing, establishing rapport, and guiding conver- 
sation into profitable channels, but the bulk of 
the book consists of highly detailed questions 
constituting a comprehensive survey of the pa- 
tient’s past history, personality, and present 
problems. This is a book to be studied, not 
merely read. It is assumed that the student 
reader is in clinical training with ample oppor- 
tunity for daily practice; if this condition is 
realized, the lack of concrete illustrations in the 
manual will not be felt as a shortcoming. * 
completely rewritten, with considerable material 
added * The reviewer found the outlines for 
adults highly satisfactory, but detected less 
thought and perhaps less experience in the 
methods recommended for children. He did 
not agree that emotional relationships within 
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the family could be sufficiently ascertained by 
questioning the parents, no matter how perfect 
the rapport, and he thought that here there was 
room for an attitude more sympathetic than the 
author's toward play and projective techniques. 
He saw no sign, moreover, of the strong con- 
temporary trend in work with children to con- 
centrate, during the early interviews, on build- 
ing a therapeutic relationship with child and 
parent, letting the formal history take a sec- 
ondary place. Nevertheless, there is a good edu- 
cational wisdom in Preu's statement that “the 
beginner in psychiatry is well-advised to con- 
centrate on learning how to obtain a reliable 
history and how to do a good behavioral exami- 
nation by ordinary methods" ; and many begin- 
ners will be indebted to him for the help af- 
forded by this manual. 

J Am Med Assn 122:142 My 8 43. This book 
attempts to cover succinctly and competently all 
the technics which are used in giving a psychi- 
atric evaluation. The first edition proved useful 
and satisfactory. The new edition is much im- 
proved and extended. Not only is the usual 
routine of psychiatric interrogation covered, 
but there is an extensive guide for history tak- 
ing of both the problem child and the adult 
psychiatric case. In addition to the regular psy- 
chiatric examination of the adult there is an 
outline of the examination to be used with a 
child, and also the methods invoked when a 
patient is stuporous or comatose. The book 
should be valuable to the student who has to 
learn the technic of psychiatric examination, 
particularly if the methods outlined by Preu are 
those which are used with his own teacher, and 
there are individual differences in method. The 
book too should be used in combination with a 
good psychiatric textbook; for, while the ques- 
tions, methods and directions of psychiatric 
investigation are outlined, the expected findings 
in individual differences are not. 

J Speech Disorders 10:59 Mr ’45. W. Elias- 
berg. * a useful book for the beginner * Certain 
repetitions could have been avoided (e.g. the 
literally identical statements on complaints on 
P. 20 and p. 124). On the other hand, the be- 
ginner must be given details for examination. 
We do not agree with the author when he 
states that examination of aphasia does not in- 
terest the beginner. At any rate, it would have 
served the purpose if the author had secured 
permission to reprint such information as must 
otherwise be looked for in other books. 
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Ed Leadership 4:43 N '40. Paul Murphy. * 
will repay the time and effort spent in its peru- 
sal by anyone interested in the improvement of 
educational techniques * the unique contribu- 
tion of the monograph is to be found in the 
suggestions and illustrations of ways of measur- 
ing the more intangible but profoundly impor- 
tant outcomes of education. 
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Am J Psychol 59:306-11 Ap '46. Quinn 
McNemar. * The results for, and the compari- 
sons based on, the Bellevue, Babcock, and sort- 
ing tests are presented in over 100 tables and 
by means of more than 50 graphs. It is the re- 
viewer's opinion that readers will find that the 
graphs are of little use except for the various 
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parts of Fig. 19, which show the Wechsler- 
Bellevue subtest profiles for the several clinical 
groups. In general, the presentation and discus- 
sion of the results, with multiple ways of deal- 
ing with the same data, make for highly tedious 
reading. * It is impossible to summarize in a 
review the findings of such an extensive inves- 
tigation. A hurried reading of this volume is 
apt to lead to the first reaction of the reviewer, 
who found himself so impressed by the claims 
and apparent plethora of evidence regarding the 
diagnostic value of the tests that he anticipated 
a neat, but not to be found, summary which 
would indicate that the psychologist can make a 
correct diagnosis at least 9996 of the time. 
Those who turn to the section (pp. 299-317) 
on the profiles (called scattergrams) based on 
the Wechsler-Bellevue subtest averages for the 
different clinical groups should be warned that 
in the discussion of conclusions from this form 
of summary, which should be highly informa- 
tive and rather inclusive as regards the diag- 
nostic value of the Bellevue Scale, the authors 
are nearly oblivious of the necessity for anchor- 
ing their claims to statistically demonstrated 
differences. In evaluating—by the ordinary 
standards of scientific procedure—the adequacy 
of this monumental research, the reviewer has 
jotted down on the margins many questions, 
but we shall here consider only the few which 
seem basic. Our first reservation concerns the 
sampling adequacy of the study. No evidence is 
presented to indicate that the cases in the vari- 
ous clinical groups are representative of such 
groups in general. Although the study is based 
on 217 clinical cases, the Ns were 207 for the 
Wechsler-Bellevue, 137 for the Babcock, and 
150 for the sorting test. When these cases are 
subdivided into 19 clinical categories, the num- 
ber per group ranges from 5 to 18 for the Bel- 
levue, 2 to 12 for the Babcock, and 3 to 15 for 
the sorting. This calls for great faith in the 
statistics of small samples. For a large number 
of the analyses, the combining of groups avoids 
very small Ns. This leads to our second reser- 
vation, which concerns the rationale for com- 
bining groups. The lack of consistency in this 
would lead one to suspect a lack of rationale, 
but two rules were followed: “(a) We have 
combined only groups which were included in 
the same general nosological category. (b) We 
have combined groups only when they had simi- 
lar distributions” on the tests (p. 32). This ex 
post facto procedure permits the capitalization 


of chance similarities, hence the building up of 
difference between groups by selection, and is 
therefore highly questionable. Our third and 
major criticism has to do with the comparability 
of the several clinical groups among themselves 
and with the control group. In their foreword 
the authors, in attempts to anticipate and pos- 
sibly forestall criticisms, raise the question (p. 
vi) as to the “meaning in the light of modern 
psychopathology” of matched groups. One 
might grant that control by matching is not 
always feasible or possible when dealing with 
clinical material, but this does not justify a 
flagrant disregard of obvious noncomparability 
on variables which are definitely relevant to the 
main argument. The controls are all males of 
the police variety, whereas the clinical groups 
contain both sexes. There are marked varia- 
tions in the educational levels of the several 
clinical groups. Rather than speculate as to 
how the uncontrolled factors of sex and educa- 
tion might affect the results, we shall pass on 
to the matter of age differences. The median 
age for the controls is about 33, whereas the 
median age for the 19 clinical groups ranges 
from 20 to 60! How might such tremendous 
age differences affect the study? The differ- 
ences between the variabilities of the distribu- 
tions of scores for all cases (including the con- 
trols) for the Wechsler—Bellevue subtests (p. 
84) are explicable in part on the basis of age 
heterogeneity (range from 14 to 72 years) 
coupled with the known fact that the Wechsler 
subtests differ markedly in regard to age de- 
cline. In other words, the greater variability 
for some subtests may not be indicative of im- 
pairments due to mental disorders. Since there 
is a differential age decline on the subtests, the 
lack of age comparability means that unknown 
amounts of the group differences in scatter on 
the Wechsler-Bellevue are attributable to age 
impairment. Corrections for age, possible by 
using Wechsler’s age norms, could and should 
have been made in order to ascertain the net 
impairment associated with particular disor- 
ders. The failure to control or allow for age, 
as well as non-comparability of the groups as 
regards education and sex, forces one seriously 
to question the validity of all comparisons of 
item success and weighted scores on the Wechs- 
ler-Bellevue subtests and all comparisons on 
the vocabulary test (pp. 87-109). Since all of 
the scores on the sorting test are used in an 
absolute sense (not relative to some general 
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level of performance of the individual), all the 
results for the sorting test are subject to un- 
known distortion by the non-comparability of 
the groups. Of the subparts of the Babcock test 
it is known that repeating digits shows an im- 
pairment with age, so we suspect the author's 
results for efficiency. Our fourth criticism con- 
cerns the erroneous statistical treatment of a 
sizable portion of the data. The authors seem 
not to have been aware of the fact that the chi 
square comparison of groups requires that the 
sampling units within groups be independent. 
Of the grand total of about 876 chi squares re- 
ported, 339 are significant at or beyond the 1% 
level of confidence. Of these 339, seventy-eight 
are based on inflated Ns, that is, on multiple 
non-independent responses for each case, and 
are therefore definitely not acceptable as indi- 
cating significance. For one set of chi squares 
the frequencies compared add up to 12 times 
the number of cases. The chi squares in the 
following tables involve inflated Ns: Tables 6B, 
7B, 8B, 8E, 13B, 19B, 24B, 36B, 41B, 45, and 
79B. The fact that 78 of the 105 chi squares in 
these tables were significant, coupled with the 
fact that a majority of them were incredibly 
high in significance, should have made the au- 
thors suspicious of the statistical analysis. It is 
interesting to note that, of the 261 remaining 
significant chi squares, 110 spring from com- 
parisons involving the two over-age groups, the 
psychotic and involutional depressives with 
median ages of 52 and 6o respectively. That 
leaves 151 significant chi squares, more or less 
in need of qualifications because of uncontrolled 
factors, but these 151 are not experimentally 
independent—two or more chi squares result 
from different ways of handling the same set of 
data. Since the claimed differential diagnostic 
value of the tests, particularly the Wechsler- 
Bellevue subtests, depends largely upon the chi 
Square comparisons, one can rightly wonder 
about the tenability of the claims. Another vio- 
lation of a condition necessary for defensible 
statistical analysis is the use of the small sample 
or i-technique with non-normal score distribu- 
tions. It is ordinarily difficult with small sam- 
ples to be sure that the assumption of normality 
is tenable, but for 13 of the 15 scores used from 
the sorting test the distributions are so mark- 
edly skewed as to leave no doubt about the fail- 
ure even to approximate the required assump- 
tion of normality. Thus all the 150 t-ratios in 
Tables 82A, 82B, 86, 87, 89, 90, 96, 98, 101, 


105, 106, 110B and 111B are fallible. Finally, 
we ask how clinical psychology can best emerge 
from the private world of clinical experience 
into the objective world of scientific description. 
Granted that some emergence is possible, its 
realization is not aided by a frequent willing- 
ness “to call on experience" or “to present what 
experience teaches” (p. v). Nor can we expect 
much progress unless researchers in this area 
become aware of the necessity of at least ele- 
mental scientific controls and adequate statis- 
tical analyses of their data. It is regrettable that 
so little can be accepted from research which 
involves so much money, time and energy as 
must have been expended prior to and during 
the preparation of this volume. 

J Genetic Psychol 69:123-8 S '46. Clare 
Wright Thompson. This work is “an attempt to 
explore systematically the diagnostic potenti- 
alities of seven psychological tests, used simul- 
taneously” (p. v). * any publication which 
fulfilled this purpose would win acclaim from 
both those psychologists engaged in clinical 
practice and those concerned with the theory 
of mental development and personality; for a 
comprehensive work on the experimental foun- 
dations of clinical techniques is long overdue. In 
spite of its more than 113 tables and-38 figures, 
however, Diagnostic Psychological Testing is 
not experimentally oriented: its purpose is to 
tell and not to find. * The author pleads 
throughout for a non-mechanical use of intelli- 
gence tests. With this one must heartily agree. 
The intelligence test yields not only an IQ, but 
a behavior sample as well, in which the ex- 
perienced examiner can weigh the subject's re- 
sponse against the responses of hundreds of 
other subjects to whom, in the past, he has put 
exactly the same item. Thus Rapaport points 
out that "the response, ‘Wheeled vehicles for 
transportation on land,’ to the question ‘In what 
way are a wagon and a bicycle the same?," 
obtains full credit; but its deviation from the 
usual responses, its overexactness and relative 
redundancy, should raise the question in the 
examiner's mind, 'Is this person really as doubt- 
laden and over-meticulous as this response in- 
dicates?'" (p. 41). From such a use of intel- 
ligence test items it is, however, a far jump to 
"pattern analysis"—to either the demonstration 
of statistically significant differences between 
clinical groups or the diagnosis of clinical en- 
tities upon relationships between psychometric 
scores. What careful studies have been made of 


1102 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 902 


such patterning in clinical groups have resulted 
largely in showing that the rules of thumb com- 
monly used by practicing clinicians cannot be 
demonstrated in clinical groups controlled for 
diagnostic category and for age. * The groups 
on which findings are reported consist of all 
clinical cases (except a few excluded for valid 
reasons) examined at the Menninger Clinic dur- 
ing a three-and-one-half-year period. Results on 
these cases are contrasted with a “control” group 
of 54 Kansas State Highway Patrolmen. One 
wishes, especially in the case of the Wechsler- 
Bellevue, that the large if urban standardization 
population for the scale might have been used as 
a control. Certainly the Kansas patrolmen differ 
from the clinical group in having no female 
members, and in having a mean age of 35 years 
whereas the mean ages of the various clinical 
groups range from 20 to Ór years, neither of 
which considerations enters into the discussion 
of findings; and in showing “a limited range of 
interests, a tendency toward some degree of 
withdrawal, an unusual lack of colorfulness" (p. 
29), and especially in having a lower educational 
and cultural background than the patients of 
a private mental hospital, both of which differ- 
ences are mentioned by the author. * the total 
schizophrenic group includes only 64 members 
ranging from 15 to 55 years. When this group 
is broken down, as it is, into six subgroups, 
none of these has more than 17 members in 
all, or more than seven in a single decade of life 
age. Statistically to demonstrate valid differ- 
ences among such groups is an impossible task. 
In this connection, and indeed when consider- 
ing any portion of the book, it should be kept 
in mind that never is it claimed that the clini- 
cal diagnoses of the groups used were independ- 
ent of the psychometric procedures. Indeed what 
references are made to clinic procedure, as for 
example on page 14, would indicate that such 
independence did not exist. Suggestions are 
made for changes in the administration of the 
tests considered so that the results may be clini- 
cally more meaningful. Again, this is a matter 
of opinion with which one can hardly take issue. 
It should be recognized, however, that in so 
using the tests one renders the results unsuit- 
able for comparison with the standardization 
group of the test under consideration or with 
the results of other workers. * Valuable as is 
any aid to such differential diagnosis, the value 
of the book would be infinitely increased if the 
findings were presented against the background 


of previous work in the field not, as they are, 
ab ovo. This same neglect of the work of other 
investigators detracts from the theoretical chap- 
ters on concept formation and the “psychologi- 
cal rationale” given for each subtest. These lat- 
ter are arm-chair discussions of the "function" 
underlying each subtest: e.g., on the Wechsler- 
Bellevue, the Comprehension test measures 
“judgment,” the Similarities test “verbal con- 
cept formation,” the Picture Arrangement test 
“planning and anticipation,” etc. These sections 
are reminiscent of the writings of an earlier 
psychological era and neglect completely the 
work of other investigators which work is an- 
notated in the bibliography of the book being 
reviewed. * It is apparent to anyone who has 
attempted to explore by statistical means the 
test performances of clinical groups that this 
degree of differentiation [see example on p. 
146] is impossible to obtain statistically. That 
significance is claimed for so many differences 
in the present book should, in itself, suggest 
that the statistical methods used are question- 
able. It might well be argued that the present 
volume would have greater usefulness were it 
simply a handbook of clinical impressions, leav- 
ing to others the statistical validation of these 
impressions. Such a procedure would have the 
advantage of utilizing different subjects from 
those on whom these impressions were formu- 
lated. Even though the statistics in this volume 
are presented only to “advance statistical proof 
for our views, derived from our clinical experi- 
ence” (p. 34) it is impossible to ignore what 
occupies such an appreciable number of pages 
and carries so much prestige in our culture. In 
an effort to understand the statistical methods 
employed, the data in Table 1-B (p. 59) were 
reworked. These data deal with the discrepancy 
between vocabulary score and performance 
score, which Rapaport calls “performance scat- 
ter.” These data, he asserts, indicate that de- 
pressives show a greater discrepancy than do 
other groups. The table gives chi squares to 
assess the magnitude of “performance scatter” 
between the various clinical groups taken two 
at a time. Of 18 such chi-squares, if the stand- 
ard conventions regarding the number of ex- 
pected cases in each cell are observed, only one 
can be computed with three degrees of freedom 
as indicated, and only six more can be computed 
with even one degree of freedom. Of these six, 
one is said to be insignificant and remains SO 
when recomputed with the proper number of 
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degrees of freedom. One of the six is said to 
show a “strong trend approaching a significant 
differentiation” (p. 33). When recomputed with 
one degree of freedom, this probability figure 
drops to .30, or below any level of significance. 
The other four are said to `e “highly signifi- 
cant” (p. 33). When recomputed, the respective 
probability figures are .50, .02, .or and .or. 
That is, in Rapaport’s terms one is insignificant, 
one “significant,” and two remain “highly sig- 
nificant.” Since the errors in this table are not 
consistent, it would seem that only by repeating 
all the calculations would one be able to inter- 
pret the tabular material. When the depressive 
psychotics are compared with seven other 
groups, all these differences are in the direction 
of greater discrepancy between performance and 
vocabulary levels. Of the seven, five are given 
by Rapaport as significant. It was possible to 
recompute three of these, all of which remain 
significant. However, the mean age of the de- 
pressive psychotics is 49.9 years whereas the 
mean ages of the groups with which they are 
compared range from 28.3 to 46.9 years. The 
rank order correlation between the mean age of 
the 19 clinical groups and their mean “perform- 
ance scatter” is + 0.59 + 0.15. Hence it is ap- 
parent that the effect of age cannot be neglected, 
which point is stressed in several of the refer- 
ences making up the annotated bibliography to 
this volume. Rapaport is of the conviction that 
“authors of clinical psychological publications 
are obliged to present raw data if their results, 
as statistically proven, are to become part of the 
used armamentarium of the clinical psycholo- 
gist” (p. 32). This he does in the appendix. It 
is therefore possible to recalculate the values, 
using the age group of 30-39 years, into which 
decade of life age more members of the total 
population fall than in any other. As would be 
expected when dealing with the small groups 
used in this study, only one of the possible com- 
parisons between clinical groups yields a “t” 
Which is significant when age is controlled. 
Since the problem may be stated as that of de- 
termining which groups differ from the total 
population in this regard, “t’s” were computed 
contrasting each group with the total population 
excluding the group in question. When this is 
done, the greatest discrepancy is shown by the 
deteriorated schizophrenic group, next by the 
depressed psychotics, then the coarctated pre- 
Schizophrenics, then the non-deteriorated un- 
Classified schizophrenics. The "scatter" for each 


of these groups is greater than for the rest of 
the population exclusive of the group in ques- 
tion. The four groups do not differ significantly 
from each other. The non-depressed Patrol 
group shows significantly less "scatter" than 
does the rest of the total population. It is not 
possible to differentiate in any way between the 
other seven groups having sufficient members 
in this age range. These findings should be com- 
pared with Rapaport's verbal summary of Table 
I-B...[which] indicates that the most signifi- 
cant drop of the Performance level below the 
Vocabulary level is in the Depressive Psychosis 
groups; next come the Severe Neurotic Depres- 
sions and the Deteriorated Schizophrenias; 
next, the non-deteriorated Paranoid Schizo- 
phrenias; next, the non-deteriorated Unclassi- 
fied Schizophrenias and, on the same level, the 
Neurotic Depressions; next, the Neurotic 
groups, and the subgroup of our Patrol cases 
showing depressive trends. The best of all our 
groups—that is, the group which has the least 
discrepancy between the Performance and Vo- 
cabulary levels—is the Patrol, excluding its de- _ 
pressive cases. The rank order coefficient of 
correlation between this summary and the re- 
viewer's findings with age controlled is +0.51 
0.27, or less than twice its standard error. 
The considerable time spent here in discussing 
these statistics, and the more considerable time 
spent in computing them, has been devoted to 
but one of the 113 tables. Nevertheless, because 
of the need for experimental validation of clini- 
cal variables, it would seem worthwhile to de- 
vote whatever time necessary systematically to 
work up the raw data in the appendix were it 
not for two difficulties which seem insurmount- 
able. The items were not administered accord- 
ing to a standardized procedure and the diag- 
noses of clinical type were not independent of 
the psychometric results. In summary, one is 
left with a feeling of disappointment at how far 
short of a systematic validation of clinical tech- 
niques this work has fallen. One is apprehen- 
sive at the use an inexperienced and credulous 
examiner might make of it. One deplores even 
that two volumes of Diagnostic Psychological 
Testing will be on the shelves of most psychi- 
atric libraries to convey to our colleagues the 
idea that work in this field has progressed to 
the point where two volumes can be devoted to 
it. On the other hand, the book is full of re- 
search ideas. Anyone whose object is to find 
and not to tell can see here a hundred minor 
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research problems for himself or his students. 
It is in the suggestive ideas presented, not in 
the theory or the statistical findings, that the 
contribution of the volume lies. 

Psychol B 43:477-9 S 746. Andrew W. 
Brown. * In the early years of Binet testing 
test variation was assumed to have diagnostic 
significance. On the basis of scatter many claims 
were made which later investigations have 
proved untenable. Even such an astute observer 
as Alfred Binet claimed that scatter on the 
Binet-Simon Test was the “differentiating at- 
tribute of the defective child," and a number of 
American writers following Binet's lead have 
claimed that scatter on a test is symptomatic of 
one kind of mental disorder or another. Some 
have found scatter to be indicative of “poten- 
tial feeble-mindedness,” others have claimed that 
delinquents show a wider scatter than non- 
delinquents, still others claim that psychotic 
patients tend to have a wider scatter than nor- 
mal patients. As a matter of fact, wide varia- 
tions in test performance have been found in 
each of these groups, but what these authors 
fail to point out is that variation in test per- 
formance is the rule rather than the exception 
in all test performance. As psychologists have 
become better acquainted with the nature of 
"general intelligence," and have learned more 
about the theory of mental tests and test con- 
struction, they have paid less and less attention 
to the phenomenon of scatter as a diagnostic 
indicator. For a time it appeared that the argu- 
ments which had been advanced by Richardson, 
Lorr, Meister, and others, had been sufficient to 
convince most psychologists that scatter, par- 
ticularly on the Binet, was as much a function 
of the test as it was the function of the indi- 
vidual tested. Now it seems that the same line 
of arguments must be presented all over again 
for the Wechsler-Bellevue. Let us hope that it 
will be more fruitful. This volume does not give 
much ground, however, for optimism in this 
direction. Part III is a wordy exposition of 
what the author calls Diagnostic Testing of 
Concept Formation. * Since the Hanfmann- 
Kasanin test does not lend itself to a quantita- 
tive approach in the same way as does the 
Wechsler-Bellevue, the author indulges in long 
qualitative comparisons of the reaction of psy- 
chotic patients on these tests. * In one of the 
appendices the author reports the raw scores 
for each of the nosological groups on each of 
the tests—a formidable array of rather mean- 


ingless data. * The experienced clinician, upon 
a superficial first glance at Diagnostic Psycho- 
logical Testing, is likely to hail it with enthusi- 
asm. He will probably exclaim, “Here is some- 
thing really important, something which will 
enable individual psychological testing to con- 
tribute something worth while." Upon more 
careful examination, however, he will be sadly 
disappointed. If he reads carefully and stops 
for a moment to analyze some of the concepts, 
he will find them quite meaningless. It is unfor- 
tunate that such a prodigious amount of work 
and effort should produce so little of real sig- 
nificance. The superficial attempt to apply sta- 
tistical procedures to a series of test data with 
apparently little understanding of the under- 
lying theory gives the book a false appearance 
of objectivity. Many of the figures are almost 
unintelligible and one of them, for example ( Fig. 
2, 56), does not even agree with the text. The 
volume is a good illustration of what happens to 
test interpretation when an investigator starts 
out with the assumption that it is "better to 
present what experience teaches than to by-pass 
it for lack of statistical evidence" (v). The sen- 
tence structure is poor and the exposition is 
almost intolerable. * It is of course necessary 
and important for a psychological examiner to 
analyze thoroughly his test data. At the same 
time he must not be so carried away by his de- 
sire to make a contribution that he makes false 
or untenable claims. Not to make a statement 
or failure to be as astute as perhaps one ought 
to be, can, at least, do no ill, but to draw con- 
clusions and inferences which are not at all 
warranted from the data (and there are many 
such in this volume) may lead to irreparable 
damage. The field of individual psychological 
testing is one in which conservatism is a virtue. 
One could argue with a fair degree of confi- 
dence that Diagnostic Psychological Testing 
has contributed little to the improvement of 
clinical practice. 
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1945: Pp. xi, 516. $6.50. * (PA 20:1712) See also 1102 
and 1104. 


Am J Psychol 59:485-9 Jl '46. D. R. Miller. 
* The discussions of "rationale" are important 


in that they present a theoretical framework 
which occasionally succeeds in making mean- 
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ingful many scoring methods the justification 
of which was heretofore only empirical. It is 
unfortunate that the theory was not more thor- 
oughly developed. Many books pay lip-service 
to the subtleties of administering and scoring 
projective tests. This is one of the few which 
succeeds in conveying the complexities of these 
tests while at the same time offering many con- 
crete methods of dealing with the complexities. 
In addition to careful descriptions of methods of 
administering, scoring and interpreting the 
tests, the book is replete with examples and 
discussions of instances in which the same devi- 
ations from the norm take on different signifi- 
cance in different kinds of adjustments and 
maladjustments and in which the quality of a 
single response, "its location, extravagance, un- 
usualness, or absurdity may carry more patho- 
logical weight than an accumulation of several 
nondescript responses of the same type." Read- 
ers will find in Rapaport's discussion of scoring 
methods a competent defense of the uncom- 
promisingly clinical approach to test-interpre- 
tation. The major contributions of the clinical 
sections are the new methods which are pro- 
posed for scoring and interpreting the tests. 
After stating his preference for Rorschach's 
simple methods, Rapaport proceeds not only to 
utilize most of the recent additions to these 
methods but also to suggest a bewildering num- 
ber of new "diagnostic indications." Some of 
these new methods supply answers to a number 
of present scoring problems. Many of them 
seem to be over-refinements. Very few of these 
new scoring methods are validated. * there is 
not sufficient space in this short review to men- 
tion all the innovations proposed in this monu- 
mental work, to say nothing of an estimation 
of their value. Two more of the outstanding 
innovations should be mentioned, however, 
since they will undoubtedly be the subject of 
much research. The first is the word-association 
test, which is modeled after the Kent-Rosanoff 
and consists of sixty words of familial, home, 
oral, anal, aggressive, and varied sexual conno- 
tations. In addition to the usual free-association 
method, the subject is asked to reproduce his 
original responses to each word. The other major 
innovation is the method of scoring the thematic 
apperception test, which involves analysis for 
noncompliance with test-instructions perceptu- 
ally and in plot, for inter- and intra-individual 
consistency in the stories, and for the extent to 
which the content of stories represents cliches 


and “essential ideational content." Validation 
data are presented only for the Rorschach and 
the word-association tests. In the opinion of 
the reviewer, the experimental method and 
statistical analysis were so carelessly planned 
that the 172 chi squares reported in the ninety- 
nine tables merit little acceptance. The major 
criticism of procedure is the lack of indepen- 
dence between criterion-classification and test 
results. One source of this lack of independence 
is the unconventional system of nosological 
classification which was designed "to bring 
clearly in relief the diagnostic features of the 
tests for different psychiatric nosological cate- 
gories.” The clearest example of such “loaded 
dice" is the statistical finding that "coarctated 
(a rorschachean term) pre-schizophrenics” are 
diagnosed as “coarctated” by the Rorschach. 
There are a number of less obvious examples in 
the criteria of nosological classification which 
are often phrased, not in clinical psychiatric 
terminology, but in terms which are close to 
the language of this test. The procedure of 
classification seems to have been oriented 
toward dividing the subjects into groups which 
could be most easily predicted by the projec- 
tive tests as much as it was toward dividing 
them according to their clinical syndromes. An- 
other source of lack of independence between 
the nosological classification and the test results 
is the fact that the projective tests were admin- 
istered and interpreted after the psychologists 
had had the opportunity to administer other 
tests to the same patients and clinically to 
observe the patients. A final and probably ma- 
jor source of lack of independence between 
classification and testing results was the method 
of classifying the subjects. This was done ini- 
tially in conference between the psychiatrist, 
who was familiar with the tests but not the test 
results, and the psychologist, who was familiar 
with the results but attempted not to be influ- 
enced by them. After the classifications and the 
results had been discussed by the entire staff, 
many diagnoses were revalued “in view of the 
course of the patient in treatment and post- 
treatment follow-up." Finally, when principles 
of scoring and valuation were later changed, 
all the tests were re-examined and re-scored 
according to most recent standards. Despite the 
uncontrolled method, the data might have been 
acceptable if "blind analysis" of test results 
had been performed. This was unfortunately 
not done. The primary criticism of the statis- 
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tics on the Rorschach test is best introduced by 
a statement from the book that, "in subjects of 
great productivity, the absence of inhibition 
will allow for emergence in the record of much 
that in a narrow record would be indicative of 
severe pathology." In other words, the record 
with many responses, because of the large num- 
ber of the responses, would be expected by 
chance to have a larger absolute number of 
pathological responses than the record with few 
responses. Therefore, in order to compare 
meaningfully the relative standing of groups 
differing widely in their average number of 
responses, it would be necessary to compare 
ratios of their pathological responses to their 
average total number of responses rather than 
their absolute number of pathological responses. 
Despite this statement, Rapaport compares the 
absolute numbers of pathological responses of 
his groups in 31 of his 56 tables. As a rough 
measure of the extent to which rank in these 
tables is a function of the number of responses, 
the reviewer tabulated the rank in these tables 
of the group which had the highest average 
number of responses, ie. the overideational 
pre-schizophrenics. In thirty-three comparisons, 
most of them involving from eleven to twenty- 
two groups, the overideational pre-schizophren- 
ics were first in seventeen and within-the-first- 
three in twenty-nine. It may thus be concluded 
that a group's rank in almost sixty per cent of 
the tables in the book depends in great part 
upon its average total number of responses and 
little or none at all upon the theoretical reasons 
advanced by the author. Furthermore, had ra- 
tios been compared instead of absolute numbers, 
the conclusions would have been entirely differ- 
ent in many cases. The remaining criticisms of 
statistical data in this work have been ade- 
quately discussed by McNemar in his review of 
Vol. I (this Journal, pp. 306-311, above). 
They include the use of group N’s as small as 
five in the computation of some of the chi 
squares, the capitalization of chance similarities 
of groups by combining them when they had 
similar distributions, and the questionable com- 
parability of the groups with regard to age, sex, 
education and culture. In view of the poor or- 
ganization of theory, the multiple ways of deal- 
ing with the same data—which make for 
tedious reading-—, and the lack of acceptability 
of most of the statistical data, it would not 
seem that the beginner or, for that matter, the 
average clinician would receive much benefit 
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from the book. On the other hand, individuals 
who are competent in research with projective 
techniques will find themselves amply repaid by 
creative discussions of theory and clinical tech- 
niques and by many suggestions upon topics 
which require further investigation. 

Psychol B 43:479-81 S '46. Robert M. Lind- 
ner. * Much credit must be given to the author 
and his associates for their determination to 
carry through to a conclusion what must often 
have seemed an onerous and difficult task. At 
the same time, just because the effort has been 
so great, a reader cannot help wondering 
whether or not the energy exerted in the fulfill- 
ment of the design was really worth the end. 
For, quite simply, regarded either by halves or 
by the whole, the product falls far short of 
what was intended. The second volume is di- 
vided into five sections, all of them character- 
ized by an unevenness of presentation which 
probably reflects the authors' progression in 
their thinking not by continued development 
but by flashes and eruptions of intuition. This 
is exemplary in literary work and makes for 
reader-interest, but the presentation of sober 
scientific data is harmed rather than helped by 
such a technique. When, moreover, the authors 
engage—as they do in this volume more than in 
the first—in didacticisms for which an earnest 
reader mainly searches evidence, this fault 
emerges most prominently. To lift from con- 
text the numerous instances of this kind of 
thing would be grossly unfair to the authors, 
but it is a fact that the professional reader is 
often left aghast at the recklessness with which 
hypotheses are formulated and interpretations 
made. Such may be forgiven to Sunday Supple- 
ment writers but not in a case where the 
avowed intention of a study is exhaustiveness. 
* an introductory chapter....closes with an in- 
teresting, highly readable brief discourse on 
personality processes underlying projective re- 
actions. Following this, the Word Association 
Test is dissected only to be reassembled on 
page 84 in what the authors believe to be a 
more suitable fashion for yielding not only the 
amount but also the kind of disturbance in a 
patient, and to include more words evocative 
of “aggression....affective lability, fearfulness 
and anal trends..marital relationships... 
oral conflicts..and bodily preoccupations.” 
The Rorschach Test occupies the next three 
hundred and some pages. There is hardly an 
approach to the test which does not concern the 
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authors. Moreover, those among us who are 
and have been somewhat alienated from the 
formal societies and cults of Rorschach by their 
preoccupation with "scoring problems" to the 
exclusion of real psychological and practical 
understanding will find in this section some 
rewarding reading. Were it not for the many 
times the authors "take off and fly blind," 
guided only by the wavering beacon of their 
own thinking and neglecting so much the rich 
literature and findings of others, as well as the 
contributions of scientific psychology, one would 
readily credit them with accomplishing their 
aims. Chapter IV considers the Thematic Ap- 
perception Test of Murray. Its particular field 
of inquiry is ideational content, reflecting as it 
does on motivation of behavior. To enjoy and 
profit from it requires the reader to accept the 
dynamic psychological orientation of the au- 
thors. It suffers, however, from an exclusiveness 
which refuses to admit the contributions of 
other ways and points of view from which 
human behavior may be regarded. Even this 
reviewer, whose bias is similar to Rapaport's, 
must regret the single-mindedness of approach 
herein demonstrated. The last chapter contains 
comments relating to diagnosing with a battery 
of tests. It is essentially a brief recapitulation, 
but wraps and delivers the message of the work 
with more directness than one would have ex- 
pected from their earlier diffuseness of state- 
ment. Students will thank the authors for the 
inclusion of Appendix III in which a critical 
bibliography for the tests discussed is given. 
This is one of the best features of the book and 
will be found to be of invaluable aid in working 
with the material. The section by Martin May- 
man particularly deserves special mention. The 
statement of Dr. Rapaport in the introduction 
that wartime exigencies forced the presentation 
of this text without “statistical substantiating 
material” puts this matter beyond the concern 
of the reviewer: at the same time, it is an in- 
disputable fact that it is in this category of 
appraisal that many serious objections to what 
is presented in Volume II can be raised. This 
writer wishes, out of high personal regard for 
Dr. Rapaport, that he could close his review 
with a recommendation that the volume, “be- 
longs on the shelf of every clinical psychologist 
and psychiatrist.” Perhaps it does—if the clini- 
cian is willing and able to discount many of the 
happy thoughts, sweeping independent insights 
and unjustifiable pronouncements which parade 


through the book. In sum, the field is still wide 
open for a sober and exhaustive survey of “the 
theory, statistical evaluation, and diagnostic ap- 
plication of a battery of tests.” 
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*RAPAPORT, DAVID; WITH THE COLLABORATION OF MER- 
Ton GILL AND Roy Scuarer. Diagnostic Psychologi- 
cal Testing: The Theory, Statistical Evaluation, 
and Diagnostic Application of a Battery of Tests, 
Volumes I and II. Chicago, Ill.: Year Book Publish- 
ers, Inc., 1945, 1946. Pp. xi, 573; xi, 516. $6.50 per 
volume. * (PA 20:929, 1712) See also 1102-3. 

B Menninger Clinic 11:43-4 Ja "47. Paul 
Bergman. The reader will find today's best ego 
psychology in these two volumes, modestly in- 
terspersed in the discussion of test rationales. 
I regretted not finding any syntheses of the 
various tests in the form of case reports. With- 
out access to them, I wonder if the reader 
might not gain the opinion that Doctor Rapa- 
port is chiefly seeking nosological diagnoses, 
whereas in daily clinical practice we actually 
receive insight from consistent, dynamically 
related statements about the total personality. 
As a layman I received from the volumes a 
deep and ambivalent impression of this new 
science which Doctor Rapaport and his collabo- 
rators so competently utilize. * today psychol- 
ogy has become an interpretative, analytical 
science that cannot fail to exert traumatic ef- 
fects on one who meets it unprepared. Freud 
once spoke of the deep lesions which science has 
inflicted in man's narcissism. He mentioned 
Copernicus who shattered man's belief that he 
was center of the universe; Darwin, who de- 
prived man of his belief in his uniqueness. 
Freud added that psychoanalysis has forced man 
to see the dependencies and weaknesses of his 
conscious mind. The source of this pervasive 
injury to man's narcissism, it seems to me, is 
the philosophy of determinism which underlies 
all scientific endeavor. Doctor Rapaport's books 
revive this conviction. In their pages one sees 
hundreds of “unconquerable souls" caught in 
the meshes of the tests and, whether willing or 
not, yielding deeply determined information 
about themselves. Some of the tested subjects 
were eager to be frank and cooperative, while 
some tried to escape diagnosis, but they did not 
know that the test situation itself blocked all 
exits. The subject refusing to tell a story about 
a picture in order not to give away his secret 
does not know that his refusal is better evi- 
dence for the examiner than anything he might 
have told. He attempts to disguise his thoughts, 


1104 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 908 


he tries to be impersonal, he uses cliches, but 
whatever he does he betrays himself and pro- 
jects his weaknesses and strengths indiscrimi- 
nately upon the large screen where the examiner 
analyzes them, compares them with the weak- 
nesses and strengths of previously tested sub- 
jects, and reduces them to statistically work- 
able units in order to emerge with his diagnosis. 
Attention, concentration, concept formation, any 
number of those ego functions which we like to 
consider areas of supreme freedom are easily 
proved by the tests' scatter analysis to be inter- 
dependent with emotional, physical and all 
other human factors. The grandiose narcissistic 
dreams of freedom of the will, of the mind, of 
the spirit must go. In their stead we get an 
instrument of social usefulness. At present we 
may make best use of it for clinical diagnosis. 
Soon, no doubt, we will be able to use it for 
prognosis and for objective establishment of 
treatment results. The time may come when our 
teachers, our economic, social and political 
leaders will be selected on the basis of diagnostic 
psychological testing. The gigantic mistakes 
which we are making today in these selections 
will then have become incredible legends. The 
attraction of living in a world led by untested 
persons may appear to the future as peculiar 
and romantic as life in a smoke-filled cave now 
appears to us. It seems we must leave the old 
cave because social reason is stronger than nar- 
cissistic dreams. Doctor Rapaport’s work is a 
guide out of the cave. 

Brit J Psychol 37:38 S ’46. It is impossible 
to do justice to this immensely long and detailed 
book in a brief review; but anyone who is in- 
terested in diagnostic testing should study it 
carefully, and would find it most valuable. The 
aim of the book is to demonstrate how psycho- 
tics, pre-psychotics, neurotics, and various types 
of these, can be diagnosed and differentiated 
from “normals” by means of their differential 
results on a battery of tests. * The tests used 
were: Wechsler’s Bellevue Intelligence scale, 
and the Babcock test; the Hanfmann—Kasanin 
test of concept formation; the Word Associa- 
tion test; the Rorschach test; and the Sorting 
and Thematic Apperception test. In all these, 
qualitative results and relationships of quantita- 
tive scores were given more importance than 
absolute numerical scores. * Careful study of 
this book does demonstrate how the various test 
results can be fitted together to diagnose various 
pathological categories. It would be rather 


easier to understand if a clearer summary of 
results were given at the end of the book. Even 
more valuable perhaps would be some indica- 
tion of how these test results could be used to 
study the more or less “normal” personality ; 
but that no doubt would constitute another 
major study in itself. 

J Abn & Social Psychol 42: 134-7 Ja '47. 
Henry E. Garrett. * I agree with Rapaport that 
the projective and psychometric approaches to 
testing are parts of the same process and should 
not be treated as though they were quite differ- 
ent, the one significant and satisfying, the other 
superficial and statistical. I am not at all sure, 
however, that the so-called depth psychology 
has clarified what our author calls “functional- 
dynamic relationships” to the extent that he 
seems to believe. Rapaport’s slant is psycho- 
analytic and, in a sense, medical. Chapter I, on 
the nature of intelligence, impresses me as 
sketchy, incomplete, and superficial. The author 
accepts the Iowa studies of the effects of train- 
ing upon the IQ in their entirety, and voices 
criticisms concerning the study of memory in 
terms of association frequency or conditioning, 
which are, I think, irrelevant (p. 37) and in- 
correct. Rapport’s emphasis upon the careful 
study of subtests within a battery, upon the re- 
lation of subtests to each other and their mean- 
ing for the patient, is certainly to be preferred 
to the notion that the psychologist’s work is 
done when he reports a single over-all score. 
But this is not an original contention and has 
long been standard clinical practice. Criticism 
of Stanford—Binet (p. 43) on the ground of lack 
of homogeneity in the functions measured 
misses the purpose of this examination. As it 
was intended primarily for children over the 
elementary-school ages, not for mental cases, 
the authors of Stanford-Binet deliberately chose 
a wide sampling to tasks which might con- 
ceivably provide a good measure of general ab- 
stract intelligence. Several specific comments 
concerning the treatment of data and the con- 
clusions reached from a study of the tests in 
Volume I are offered as follows: (1) Graphs 
are poorly drawn and are difficult to read; the 
one on page 56 is incorrectly labeled. It is often 
hard to follow the discussion from a study of 
these diagrams. (2) Measures of scatter around 
various reference points, means or scores, are 
often legitimate but such indices are hard to 
interpret especially when samples are small. 
The author appreciates the difficulty in com- 
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paring test scores which are not expressed in 
equal units, but seems to think that scatter in- 
dices in some way eliminate the problem of dif- 
ferent reference points. The scatter measures 
presented are useful enough as rough indicators 
of differences among the clinical groups, but I 
feel sure that they have little or no validity in 
the individual case and are certainly not directly 
comparable from person to person. (3) What 
the author calls modified mean scatter is the 
same as mean scatter except for a constant 
multiplier. (4) The subtests of the Bellevue 
Scale contain from 10 to 42 items. Indices based 
upon scores or averages from such short tests 
are open to a high degree of unreliability. I 
doubt if the author is justified in using the 
t-test or chi-square with such shaky primary 
data. Moreover, findings obtained from a single 
standardization group provide hypotheses not 
final results. Hence repetition of the study upon 
another group should certainly be carried out 
in the interests of validity. (5) The efficiency 
scores of the Babcock Test involve the “fallacy 
of subtraction” so often illustrated in reaction- 
time experiments. When test scores are corre- 
lated one does not “remove” the influence of a 
given score by subtracting it from a second 
score. Nor can the difference between two 
scores ordinarily be compared with another dif- 
ference when scores fall at widely separated 
points on the scale. * Rapaport’s treatment of 
the Rorschach test is objective and oftentimes 
critical. He presents a good description and 
analysis of projective techniques in general, 
their advantages and disadvantages, and the im- 
portance of “structure.” In the reviewer's opin- 
ion (he is by no means an expert on Ror- 
schach), Rapaport’s treatment of the Rorschach 
should aid greatly in taking the magic out of 
this really useful instrument. Rapaport’s pur- 
pose is to investigate the characteristic thinking 
revealed by the Rorschach—the individual's 
"style" as shown in his perceptive and associa- 
tive processes, rather than to obtain detailed 
diagnostic information. He rightly stresses the 
point that the Rorschach yields evidence of the 
way in which the patient organizes and struc- 
tures his thinking. Rapaport reports a great 
deal of data and makes a real attempt to vali- 
date the signs and indicators obtained from the 
Rorschach record in terms of clinical diagnoses. 
* Several comments may be offered on the treat- 
ment of the tests in Volume II. (1) It seems 
to me that Rapaport has supplied quite satis- 


factory evidence of the agreement of the major 
scoring categories in the Rorschach with clini- 
cal opinion. I am doubtful, however, whether 
the more subjectively scored responses as, for 
example, the “Da,” “Dr,” and color responses, 
have been shown to reveal what the Rorschach 
tester thinks they reveal. (2) More work 
should certainly be done on the rationale of 
the scoring techniques in Rorschach. Validation 
of Rorschach responses in terms of psychiatric 
diagnosis is good as far as it goes. But since 
such criteria are at best open to question and 
to the influence of suggestion and mutual in- 
teraction, scoring categories should be validated 
against longtime predictions. If people who 
demonstrate certain constellations of responses 
later become schizophrenic, depressive, or para- 
noid, validation of responses would be stronger 
than it is when the only criterion is the psy- 
chiatric diagnosis or clinical opinion. (3) Rapa- 
port’s remarks concerning “IQ intelligence test- 
ers” are irrelevant and beside the point. The 
intelligence test measures the individual's actual 
performance when faced by standard problems. 
The Rorschach, on the other hand, is not a test 
at all but is a subtle psychiatric interview from 
which are drawn various inferences concerning 
the individual’s way of thinking, his attitudes 
toward life, etc. Nothing is gained by belittling 
the one instrument through comparison with 
the other. (4) The percentage occurrence of 
Rorschach signs (pp. 141-149) strikes me as 
of doubtful value considering the small differ- 
ences and the sizes of the samples used. (5) I 
am very skeptical of the evidence which the 
author presents on page 154 regarding the 
meaning of the Rorschach responses. Also it 
seems to me that "anxiety" is frequently used 
as a catch-all explanation when the meaning of 
the term is by no means clear from the con- 
text. Three general criticisms of these two 
books strike me as pertinent. In the first place, 
the control group is not a "control" as this term 
is customarily understood. Members of the 
control group were male adults in good physical 
health, and it is difficult to see how their re- 
sponses have much relevance when compared 
with those of clinical cases differing in age and 
consisting of members of both sexes. In the 
second place, I think the books contain much 
material which could readily be eliminated or 
condensed. Both books are written in an awk- 
ward and periphrastic style. If completely rewrit- 
ten in brief and concise English, their contents 
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could be better comprehended and evaluated. 
Finally, perhaps the most important general 
criticism grows out of the author's bias in favor 
of the clinical as opposed to the scientific atti- 
tude toward test results. We would all agree 
that the clinician should be as sensitive as pos- 
sible to his test data, and should, in his inter- 
pretations, free himself from the rigid precision 
demanded by the objective test. By the same 
token, however, the clinician should differen- 
tiate carefully between objective facts and their 
loose clinical interpretation. For should he dis- 
card objective measurement in favor of clinical 
insight or intuition, his work—however valu- 
able—can never be a part of that body of knowl- 
edge which we call science. * 

J Appl Psychol 30:662-3 D '46. Howard F. 
Hunt. * 'This research encompasses much cru- 
cial but essentially unexplored or controversy 
ridden territory in clinical psychology. The 
authors are both free and ingenious in propos- 
ing and provisionally substantiating hypotheses 
in their development and discussion of test 
rationales. These hypotheses and rationales are 
so numerous and extensive that they cannot be 
discussed adequately in this brief review, how- 
ever, Prolixity in the text, lack of illustrative 
clarity in the graphs, and occasional reliance on 
verbal argument when statistical argument 
might have been more lucid and forceful weak- 
ens the presentation at times. In some instances 
verbal recapitulation of statistical tables unduly 
lengthens the text. Limitation of the statistical 
argument largely to the “t” and “Chi Squared" 
tests of significance may have reduced some- 
what the fecundity and practical utility of the 
analysis, possibly forcing the authors to resort, 
for purposes of argument, to the less rigorous 
clinical impressions and verbal analyses more 
often than was strictly necessary. The authors 
present no summarizing statistics on the dif- 
ferential diagnostic accuracy of the battery as a 
whole. One of their aims was “to show how the 
...tests...were welded in (their) clinical work 
into a single diagnostic tool." This omission is 
thus glaring and disappointing, the more so be- 
cause of the crucial character of such a sum- 
mary. These volumes possess redeeming fea- 
tures which outweigh their defects, however. 
The wealth of clinical information they contain 
as well as other stimulating speculations and 
informative discussions concerning test valida- 
tion, problems encountered in the clinical use 
of the tests, and possible psychopathology un- 
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derlying various forms of impairment in test 
performance should make them worthwhile aids ~ 
to clinical psychologists. Both the clinical and the 
heuristic value of these books more than justifies 
their inclusion in class reading lists and clinic 
libraries, though they can hardly be considered 
as text material for other than advanced 
courses. 

J Clin Psychol 2:301-2 Jl '46. * Very prob- 
ably the publication of this book will mark the —.— 
opening of a new and enlightened era of psy- 
chometric testing. For the first time, psycho- 
logical tests are used not only to obtain a quan- - 
titative index of some mental function but also 
to systematically explore the total personality. 
Beginning with the obvious, but long over- - 
looked, fact that every single response and every 
part of every response of the subject in the test- 7 
ing situation may have diagnostic significance, — 
it is contended that the psychologist should 
evaluate every response which in any manner 
deviates from the conventional in order to de- © 
termine its relation to the total personality 
organization of the subject. Further understand- 
ing is obtained from comparison of successes and 
failures on a given type of test item, the rela- 
tionship of scores on different subtests, the rela- 
tionship of all verbal to all performance scores, 
and the comparison of data from tests of intelli- 
gence with other tests such as the Rorschach. 
Test results are not considered in isolation but in 
relation to postulated dynamic psychopathologi- 
cal processes. The data are treated with great 
originality in the effort to formulate significant — 
relationships between the different tests of the ~ 
battery. Of particular value are the well-written E 
discussions of the rationale of administration and 
statistical analyses of the subtests comprising the — 
battery. The discussion of each subtest is con- 
cluded with a summary of relationships which 
have diagnostic value in differentiating clinical, — 
groups. Many of the postulated relationships — 
are extremely interesting from theoretical view- 
points but only time will tell whether subse- 
quent investigations using larger samples of 
population will confirm their validity. The 
reader should not allow himself to be discour- 
aged at first by the seemingly formidable quanti- 
ties of statistical data and theoretical detail. The 
author has succeeded remarkably well in pre- 
senting a difficult subject in readable style. In 
spite of the relatively high cost of these vol- 
umes, they constitute a major contribution to 
psychological literature and should have a place — 
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in thé library of every one interested in the 
clinical applications of psychology. 

J Consult Psychol 10:285-6 S-O '46. Morris 
Krugman. * Rapaport’s two volumes....will 
probably be instrumental in setting a new vogue 
in clinical practice and research, and in influ- 
encing the publications that will appear in the 
psychological journals for the next few years 
at least. Diagnostic Psychological Testing is 
essentially a research report on clinical data col- 
lected by means of seven psychological tests on 
217 patients at the Menninger Clinic and 54 
“control” subjects. This, in itself, would not be 
remarkable, since there are numerous research 
reports of equal or greater magnitude. Using 
the data as a springboard, Rapaport has em- 
ployed a combination of unusual clinical insight, 
acute analytical treatment, and profound theo- 


‘retical reasoning, to synthesize a diagnostic ap- 


proach to the study of pathological conditions 
that is thoroughly convincing. * Actually, the 
treatment of the seven tests in this book com- 
prises seven independent revised manuals. The 
question could be raised whether it was neces- 
sary to reproduce in such great detail well known 
material about such tests as the Bellevue or 
the Rorschach. * in no other single volume can 
we find so complete a treatment of the use and 
interpretation of concept formation tests, or of 
the T.A.T. Interesting and important as Rapa- 
port’s “rationale and nosological considerations” 
are, (Rapaport is apparently very fond of these 
terms) the major contributions of this work lie 
in the diagnostic criteria for some of the major 
clinical entities, and in the general methodology 
presented. These contributions should lead to 
many developments and refinements that are 
bound to follow when others take up Rapa- 
port’s lead. They overshadow the weaknesses of 
the book to such an extent that one feels reluc- 
tant to mention them, particularly since the 
author is well aware of them and has called 
attention to most of them himself. He makes it 
clear, for example, that organics, psychopaths, 
feeble-minded, the young, the old, and quite a 
few other groups, are not included in the study. 
In spite of this admission on Rapaport’s part, 
however, these omissions constitute a major 
weakness, since the diagnostic signs and factors 
So carefully worked out for the seven tests may 
break down if some of these groups are in- 
cluded. Another weakness lies in the type of 

control” group employed. Here again, the 
author noted the shortcoming, but it is never- 


theless a fact that the highway patrolman of 
Kansas can, by no stretch of the imagination, 
be considered a group representative of Ameri- 
can adults, and comparison of special groups 
with this type of “control” group is meaning- 
less, The organization of the books is so complex, 
that it detracts seriously from their usefulness. 
This is particularly true of the first volume. 
The most detailed data and the most com- 
prehensive conclusions are presented in running 
form without any difference of accent or em- 
phasis, with involved tables and graphs added in 
such a manner that they tend to obscure rather 
than clarify the point under discussion. The use- 
fulness of the two volumes would have been 
considerably enhanced if one of them had been 
written as a manual for the seven tests, and had 
included the rationale, methods, results in gen- 
eral terms, and conclusions, while the other vol- 
ume had been devoted to the supporting data. 
This would have satisfied both the clinician and 
the research worker. Rapaport's earlier volume 
(Manual of Diagnostic Psychological Testing, 
Josiah Macy Jr. Foundation, 1944) is written 
in this manner, and, were it not for the fact 
that it is a limited edition and devoted only to 
the tests of intelligence and concept formation, 
would probably have greater influence on clini- 
cal practice than the larger volumes. There are 
minor weaknesses throughout the work that will 
not be dwelt upon; only an example or two will 
be mentioned. One of these is the impression 
frequently given that the author is the origi- 
nator of some well-known concepts in psychol- 
ogy. Perhaps Rapaport did not intend to leave 
the impression that the theory of projective 
techniques was unknown until he wrote about 
them, but he did not make it clear that Law- 
rence Frank and H. A. Murray wrote about 
them several years earlier. Perhaps, too, he did 
not intend to say that scatter analysis (really 
pattern analysis) was unknown until he intro- 
duced the method, but he did leave that impres- 
sion. Then, there are occasional inconsistencies. 
In discussing the Rorschach, Rapaport makes a 
strong plea for using it as originally outlined by 
Rorschach, and for keeping the scoring cate- 
gories as few and as simple as possible. He then 
proceeds to devise numerous new scoring cate- 
gories. The new items proposed by Rapaport 
probably total more than those found in all the 
volumes of the Rorschach Research Exchange. 
With all its faults, major and minor, Diagnostic 
Psychological Testing is nevertheless destined 
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to become a classic in clinical psychology. It is 
a sample of what has long been needed in the 
clinical field. If psychologists will pursue the 
methods outlined in it, and improve upon them, 
as they inevitably will, they will be more likely 
to become clinicians and diagnosticians rather 
than mental testers and I.Q. finders. Rapaport 
did not originate the methods of diagnostic test- 
ing. Many others have made contributions of 
varying magnitude in this area. He did, how- 
ever, successfully crystallize the major clinical 
developments of the past twenty-five years with 
a unified method that can be used universally. 

J Genetic Psychol 69:223-30 D ’46. C. H. 
Patterson. * essential for all clinical psycholo- 
gists * the most extensive and exhaustive study 
yet reported of diagnosing by means of psycho- 
logical tests. Some of the material was published 
earlier as a Manual of Diagnostic Psychological 
Testing by the Josiah Macy Foundation, for 
distribution to psychiatrists and psychologists in 
the Armed Forces, where it proved useful in 
the clinical psychology programs. The present 
volumes amplify and provide the statistical jus- 
tification for the qualitative results reported 
earlier. * The subjects utilized in the research 
included 217 clinical cases and 54 control cases. 
The clinical subjects were divided into 20 dif- 
ferent diagnostic categories, the number of cases 
in each ranging from 6 to 20. * the smallness of 
the groups were compensated for by their se- 
lection, leading to differences great enough to 
reach statistical significance * Unfortunately, 
the control group is not comparable to the clini- 
cal groups in terms of socio-economic or cul- 
tural factors, Rapaport recognizes that the use 
of this group will meet with criticism (Vol. I, 
p. vi), but attempts to justify it by questioning 
the meaning and possibility of obtaining a 
matched control group, asking what variables 
should be used for matching the groups. He 
concludes that “our best procedure seemed to 
be to take a homogeneous group of more or less 
well-functioning subjects, and explore what 
they are like clinically” (Vol. I, p. vi). How- 
ever, this appears to have been the result of 
necessity rather than choice, and obscures the 
interpretation of many of the results, particu- 
larly on those tests which are influenced by so- 
cial and cultural factors (e.g., the concept for- 
mation tests). * Summarization is frequent 
within the discussion of a test, as well as at the 
conclusion of the analysis of each test, the latter 

being a bringing together of the results under 


diagnostic entities. These chapter summaries 
suffer, however, from an inadequate discussion 
of differential diagnosis. In some cases the main 
diagnosis is assumed, and a differentiation be- 
tween acuteness, chronicity, and deteriorated 
types is discussed. But if it is not known, for 
example, that a patient is an unclassified schizo- 
phrenic rather than a paranoid schizophrenic, it 
might be impossible to determine whether he is 
an acute unclassified or a deteriorated paranoid 
schiz, since both may be characterized by an 
essentially similar test performance. A final 
chapter of 614 pages discusses too briefly the 
problem of diagnosing with a battery of tests, 
but although there are cross references through- 
out the work indicating the supplementary value 
of various tests, there is no bringing together of 
the results of all of the tests in sufficient detail. 
An organized tabular summary of results, by 
test and by diagnostic grouping, would be of 
great value to the clinician. Also, case studies 
of subjects representing the various nosological 
categories to whom the entire battery of tests 
had been given would be valuable to illustrate 
how the tests supplement and complement each 
other in differential diagnosis. * the chapter 
dealing with the Wechsler-Bellevue Scale... 
[is the] longest chapter in the work (347 pages). 
The analysis of this test is based primarily upon 
a study of various measures of scatter. Many of 
these measures are very similar; for example, 
mean scatter (the difference between a subtest 
score and the average of all verbal or perform- 
ance tests) and modified mean scatter (the dif- 
ference between the subtest score and the aver- 
age of all other verbal or performance tests). 
The use of the latter only accentuates the dif- 
ference and its significance. Complete analysis 
of both these measures is therefore superfluous. 
Although as pointed out by the authors, scatter 
is independent (or relatively independent) of 
IQ, this is not true of other diagnostic measures 
which are absolute rather than relative. One 
such measure which is used in the analysis of 
the Bellevue Scale is the percentage of cases in 
each group with high and low scores on a sub- 
test. In the discussion of differences in these 
percentages among the diagnostic groupings, no 
attention is given to the influence of total score 
or IQ levels of the groups, which vary from 91 
to 119, a difference which is certainly signifi- 
cant. (The reviewer has estimated, though not 
computed the degree of significance; although 
individual and mean JQ’s are given, no stand- 
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ard deviations are listed in the table in the 
index). Although JQ bears some relationship to 
nosological groupings, this relationship is small, 
and it is unlikely that such small samples as 
used in this study would all be representative 
groupings in IQ. * The original technique and 
scoring of Rorschach rather than the modifica- 
tions of Beck or Klopfer were used * Testing 
was begun without the usual explanations, 
though after the first card brief instructions 
were given. The inquiry was conducted after 
each card, without the card being seen by the 
subject insofar as this was possible, and was 
supposedly brief and limited. Rapaport main- 
tains that with the experienced examiner exten- 
sive inquiry is not necessary. However, it ap- 
pears to the reviewer that the assuming of the 
area or determinant by the examiner is danger- 
ous, since it is surprising, particularly with clini- 
cal patients, how frequently a response refers to 
an unusual area, or involves an unusual or un- 
suspected determinant. Rapaport is of course 
aware of this, and as a matter of fact appears 
to conduct as thorough an inquiry as anyone 
else. Also, the introduction of other refinements 
in scoring, such as two four-fold classifications 
of F responses, introduces the need for inquiry 
frequently. It would appear that Rapaport's 
system of scoring is actually no simpler than 
that of Beck, or of Klopfer. The main saving in 
time seems to be in Rapaport's rejection of the 
practice of testing the limits. * Volume I suf- 
fers from an overloading of tables and graphs. 
This is true particularly of the section on the 
Wechsler-Bellevue. It is a result of what the 
authors admit is an overemphasis of statistical 
procedures on this test. The tables and graphs 
could be improved for easier reading. The data 
would be more quickly apprehended if some 
means of making the more significant figures 
stand out, such as the use of darker or larger 
print, italics, or underlining, were used. Many 
minor criticisms could be made, such as asking 
why instead of a statement such as “apparently 
these cases with high efficiency on Comprehen- 
sion have other verbal scores which are consid- 
erably impaired” (Vol. I, p. 126), the facts 
were not checked, since only three subjects were 
involved. There are occasional minor disagree- 
ments in figures between tables, or between a 
table and the text. These may be typographical 
errors, however, several of which occur through- 
out the volumes. These minor defects are hardly 
worth mentioning in a work which contributes 


so much to the field of clinical psychology, both 
to the basic problem of diagnostic testing—the 
formulation of the rationale of the tests and the 
interpretation of results in terms of clinical be- 
havior—and to the practical aspects of clinical 
work, in its practical, empirical results, The 
analysis of the data, and the organization and 
interpretation of the results, represent a tre- 
mendous amount of time and effort, to» say 
nothing of the years required to collect the data. 
The study is by no means complete, as recog- 
nized by the authors themselves. The authors 
are also aware of its defects—the small number 
of cases, the nature of the control group, the 
lack of other diagnostic groups such as organics, 
psychopaths, etc., and the incompleteness of the 
statistical analysis on certain tests. The main 
defect which might have been remedied is the 
lack of a sufficiently detailed integrating sum- 
mary of the findings for all tests for the various 
diagnostic groups as an aid in differential diag- 
nosis. * The results are clear enough so that 
an experienced clinician by the use of one or 
more of these tests can correctly diagnose a 
relatively clear case. However, the crucial test 
of diagnostic psychological testing comes in be- 
ing able to diagnose the perplexing, “impure,” 
borderline cases, at least to the extent of direct- 
ing the observation of the psychiatrist, or to 
confirm or deny his tentative diagnosis, and 
lead him to a diagnosis which he recognizes as 
consistent with the patient’s history and be- 
havior. It is in just these cases that the psychi- 
atrist needs assistance; he does not need test 
results to diagnose a classical type of disorder. 
The ability of a test to diagnose clear cases is 
not helpful to the psychiatrist, and is not likely 
to lead to recognition by psychiatry of the value 
of psychological tests. Investigations such as 
that represented in the volumes being reviewed, 
however, dealing with the relatively clear case, 
are basic. It is from knowledge of the character- 
istics of the test performance of these cases that 
the unclear signs and trends in more difficult 
cases can be interpreted. It goes without saying 
that in the group from which the cases used in 
this study were selected many difficult cases 
were met in which the practical use of the tests 
was undoubtedly important. Psychological diag- 
nosis is thus not a cut and dried, mechanical, 
quantitative process, performed by the use of 
statistical tables. Nor is it a purely intuitive one. 
It becomes a process of weighing and compar- 
ing the qualitative and quantitative perform- 
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ances of the patient in the light of findings such 
as those provided by this study, and reaching a 
final judgment which is the synthesis of a com- 
plex analysis of data. Skill can come only from 
experience with the tests, and from clinical 
knowledge of and experience with the various 
nosological categories, not from study and read- 
ing alone. The multitude of findings in an in- 
vestigation such as this is too great to be com- 
mitted to memory. The number of factors to be 
considered in diagnosing a single case is large. 
Except in rare cases (that is, where routine 
testing of all patients is not followed), a diag- 
nosis cannot be made immediately following 
the administration of the test, but may require 
considerable analysis and thought. Psychologi- 
cal testing for purposes of diagnosis is thus not 
the routine process of testing for an JQ or other 
test score, where several tests can be adminis- 
tered and scored in a day. These volumes are a 
monumental work in the field of diagnostic psy- 
chological testing. “Clinical psychologists are 
often under the misconception that to do a bet- 
ter job they need more testing tools; but what 
needs improvement is the training of the clini- 
cal psychologists and the exploration of testing 
tools now at their disposal” (Vol. II, p. 89). 
This latter area of need is considerably ad- 
vanced by the present investigation. As indi- 
cated at the beginning of this review, no clini- 
cal psychologist can afford to be without these 
volumes. 

Psychoanalytic Q 16:108-13 Ja’47. Solomon 
Machover. The sheer volume of this ponderous 
work....strikes the reader with an initial impact 
that, to put it mildly, is slightly staggering. The 
quality, rescued from a tiresome and gratuitous 
exhaustiveness of presentation and from more 
serious errors of procedure which would have 
damned a lesser work, remains impressive. For 
this book exemplifies, contributes to, and to 
some extent indicates the future of much of the 
relatively progressive and effective current prac- 
tice in diagnostic psychological testing. Since 
the publication in 1927 of Wells's Mental Tests 
in Clinical Practice, clinical psychology has 
taken long strides in the broadening of its diag- 
nostic scope, in the perfection of more searching 
test methods and in the fuller utilization of test 
results as indirect expressive and projective 
manifestations of important but not readily ac- 
cessible ideational content and especially of per- 
sonality structure and behavior dynamics. A 
good deal of this progress is competently if dif- 


fusely reflected in this book which may, there- 
fore, be regarded as bringing Wells’s book up 
to date, and which is offered as an aid to clini- 
cal psychologists who work in cooperation with 
psychiatrists, or who themselves desire the com- 
prehensive and systematic information which 
tests can give as adjuncts to their therapeutic 
work. * The classifications [of patients] are 
somewhat unconventional (e.g., coarctated pre- 
schizophrenia) having been determined in part 
by....the perfectly legitimate consideration of the 
diagnostic areas within which the tests used 
might be expected, a priori, to yield effective 
differentiations. It is quite definitely a virtue, 
far too rare in research of this type, that the 
criteria defining each nosological group are 
quite clearly and specifically stated, that a high 
level of homogeneity as to psychopathological 
status was maintained, and that effective recog- 
nition is made of the importance of precisely 
characterizing the kinds of ‘normality’ repre- 
sented in any ‘normal’ control group. * The 
reviewer does not doubt the fundamental sound- 
ness of Rapaport’s general approach with its 
emphasis on holistic, projective and dynamic 
orientations. On the basis of his own experience 
he is able to confirm many of the specific find- 
ings. It is all the more unfortunate that, despite 
a many-sided and almost forbiddingly detailed 
statistical treatment, Rapaport fails to make a 
clean demonstration of his case. The foremost 
shortcoming is his cavalier dismissal of the need 
for a more carefully matched control group. 
That age, cultural background and general level 
of intellectual performance have an important 
bearing on the organization of mental abilities, 
and of other aspects of psychic structure, has 
been attested repeatedly in the literature. Con- 
sequently, the differences in these attributes be- 
tween the clinical group as a whole and the 
control group, and among the various clinical 
subgroups themselves must cast a shadow on 
the validity of the statistical findings no matter 
how cogent the rationale which supports and 
explains these findings. This failure to control 
extraneous variables is clearly not accidental for 
it accords with the authors’ expressed opinion, 
notwithstanding occasional lip-service to the 
contrary, that deviation in performance, at least 
on the predominantly cognitive tests, from some 
dead level of uniformity defined by general 
population averages means impaired function- 
ing, and is correlated not only with the fact but 
also with the degree of maladjustment. Actu- 
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ally, it would be no violation of scientific deter- 
minism, nor of the projective hypothesis, to 
recognize that quite wide variations in intra- 
individual pattern may occur independently of 
psychopathological involvement, as the ‘normal’ 
outcome of individual development and corre- 
lated with the infinite richness of more-or-less 
successful modes of adaptation. Even if the sta- 
tistical demonstrations were on a sounder basis 
the emphasis they receive, in table after figure 
after table (which incidentally, are often ob- 
scurely captioned and which must try the pa- 
tience of readers who are not already trained in 
statistics and in the tests under consideration), 
creates a serious potential danger. It is not diffi- 
cult to foresee a deluge of inept, preposterous, 
unbidden psychopathological diagnoses at the 
hands of unseasoned psychological examiners 
who will blissfully take their authority from the 
impressive statistics of Rapaport’s innumerable 
tables and the curves of his many graphs. The 
statistics of groups, with respect to one variable 
at a time, serve adequately to test the validity 
of general relational trends. They cannot make 
a diagnosis in the individual case unless there 
is complete absence of overlap between any one 
group and all other groups with respect to one 
or more variables. The danger of misleading 
statistics is compounded by the fact that a num- 
ber of quantitatively important clinical groups 
were systematically excluded from the investi- 
gation. While the exclusion of cases of organic 
pathology, of psychopathic personalities, alco- 
holics and others who might obscure the mean- 
ing of the results can be justified by the re- 
searcher’s fiat to limit his investigation as he 
pleases, it cannot serve the aim of improving 
diagnostic practice if any of the excluded groups 
ever come into differential-diagnostic conflict, 
as most assuredly they do, with any of the 
groups investigated. Again, while restriction to 
clear-cut cases legitimately serves the validation 
of theory, it cannot be of much practical help 
to conclude after six hours of testing (and how 
much time for the study and integration of re- 
sults?) that this hallucinating, grimacing, bi- 
zarrely productive patient with a premorbid 
personality characterized by extreme shyness, 
sensitivity and seclusiveness is indeed a schizo- 
phrenic and not a depressive neurotic. Clearly, 
the emphasis on the statistics of groups with 
reference to isolated variables has been grossly 
misplaced, and what is conspicuously lacking is 
any example of the integration of the findings 


for particular individuals. It was an explicit aim 
of the authors to ‘show how the eight tests 
which are the subject of the investigation re- 
ported here were welded in our clinical work 
into a single diagnostic tool.’ Notwithstanding 
the valuable if telescoped statement of general 
principles in Diagnosing with a Battery of 
Tests, the last chapter of the second volume, 
there is nowhere a clear demonstration of how 
the tests under consideration were actually 
welded together in the clinical work of the Men- 
ninger psychologists into a single diagnostic 
tool in the case of any single subject. Appen- 
dices giving for each case, clinical and control, 
various vital data, specific symptomatology, sub- 
test scores and derived indices for the Bellevue, 
Babcock and Sorting Tests, the detailed Ror- 
schach response summary and, in some in- 
stances, examples of deviant verbalizations in 
response to the Rorschach, may challenge the 
devotedly interested reader to attempt for him- 
self the diagnostic welding together of at least 
some of the tests. To this reviewer the con- 
sistent, often brilliant, effort to analyze the psy- 
chological rationale underlying performance on 
each of the tests and to interpret the clinical 
findings in terms of it is perhaps the most valu- 
able part of the book. In this direction lies prog- 
ress away from the mechanical interpretation of 
test scores and the shaky empiricism of the 
diagnostic ‘sign’ approach. Thus, too, the psy- 
chology of the development and expression of 
perception and thought and the psychology of 
the development and expression of personality 
are brought functionally together to the recipro- 
cal enrichment of each and to the improvement 
of clinical practice. An inevitable consequence, 
very largely to the good, would be a reduction 
of emphasis on nosology and a focusing of the 
importance of striving with tests to achieve a 
comprehensive understanding of the psychology 
of the individual. With a sound psychological 
rationale for his tests the psychologist who 
works in conjunction with psychiatrists has no 
difficulty in justifying his work as offering an 
independent and efficient source of essential 
psychological data obtainable in no other way, 
indispensable for a rounded clinical understand- 
ing of psychopathological status and valuable 
as an aid to therapy. The authors’ discussion of 
psychological rationale is by no means uni- 
formly successful in all tests. In particular, the 
assumptions as to what the subtests of the Bel- 
levue Scale measure seem arbitrary and sur- 
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prisingly narrow and abstract. There seems in- 
sufficiently effective recognition that the routes 
to test solution are multiple and that the opera- 
tion of specific mental functions such as judg- 
ment, attention, concentration, varies with the 
context. It is manifestly hazardous to speak, 
for example, of impairment of judgment on the 
basis of responses to a very limited sampling 
of situations in which judgment may be ex- 
pressed. Similarly, it misses the essential point 
to speak of impaired concentration in the case 
of an arithmetic defect resulting from resistance 
to the coercive, authoritative arbitrariness of 
numbers and number relation. The treatment of 
the dynamics of memory distortion, the consid- 
eration of various types of concept formation 
and the discussion of the rationale of the major 
dimensions of the Rorschach reflect an ex- 
tremely rich experience vigorously, often quite 
originally thought through. The introduction of 
the concept of distance in the analysis of stimu- 
lus-response relationships with special reference 
to the Word-Association and the Rorschach 
Tests, but applicable in other tests as well, 
seems especially fortunate. The analysis of de- 
tailed verbalizations on the Rorschach, hereto- 
fore always considered but not formally classi- 
fied, is sure to prove extremely useful. There 
is so much stimulating discussion of theory, so 
many practical suggestions regarding testing 
technique, such provocative interpretative inno- 
vation and a detailed communication of such a 
wealth of clinical experience, that even the ex- 
perienced clinician cannot fail to benefit from 
a careful reading of the text despite the tedium 
and the grave vulnerability of the statistical 
presentation. The psychiatrist who is interested 
in learning what the clinical psychologist can 
contribute to his work will not get a complete 
picture from this book. But he could do worse 
than read the introduction, the last chapter, and 
the discussion of the rationale of any test factor 
which interests him in between. In the review- 
er's opinion, this work will find its most effec- 
tive use in the hands of competent clinicians 
who can perform the necessary radical surgery 
to make of it an invaluable text for the training 
of graduate students. It is a pity that the work 
could not have been turned out in a single 
wieldy volume without misleading statistics, 
with tighter theoretical discussions, and espe- 
cially with illustrations of the application of a 
battery of tests, *welded into a single diagnostic 
tool', to particular cases. 
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*Rapaport, DAVID; WITH THE COLLABORATION OF Roy 
Scuarer AND Merton Gitt. Manual of Diagnostic 
Psychological Testing: I, Diagnostic Testing of 
Intelligence and Concept Formation. Preíace by 
Karl Menninger. Review Series, Vol. 2, No. 2. New 
York: Josiah Macy, Jr. Foundation, 1944. Pp. xiii, 239. 
Paper, lithotyped. Out of print. * (PA 19:3195) 


Am J Orthopsychiatry 16:736 O '46. Adolf 
G. Woltmann. This brief comment is a refer- 
ence rather than a review because the material 
has since appeared in a two volume edition. The 
original was published during the war years 
and was made available primarily to psycholo- 
gists in the Armed Forces as an aid for diagnos- 
tic refinement. * 
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*Rararonr, Davin, AND SCHAFER, Roy; WITH THE 
COLLABORATION OF Merton GiLL. Manual of Diagnos- 
tic Psychological Testing: II, Diagnostic Testing 
of Personality and Ideational Content. Review 
Series, Vol. 3, No. 1. New York: Josiah Macy, Jr. 
Foundation, 1946. Pp. 105. Paper. $0.75. * (PA 21:814) 


Arch Neurol & Psychiatry 57:275 F ’47. * 
To be highly commended is the approach to the 
formulation of a consistent theory of personality 
revelation through projective technics * This 
discussion (covered largely in the introductory 
passages) should prove of interest to both the 
psychiatrist and the psychologist. For the clini- 
cal psychologist, there is a wealth of specific aid 
in areas such as the differential diagnosis of 
groups and subgroups * So many diverse psy- 
chologic types have been included under the 
general classification of schizophrenic disorders 
that the authors’ suggestion of differentiation 
into a hierarchy of malignancy with reference 
to findings in psychologic tests is particularly 
welcome. After an introductory passage, laud- 
ing the simplicity and ease of Rorschach’s origi- 
nal scoring, it is difficult to accept the many 
complex elaborations and refinements then 
offered by the authors. In view of the already 
existing variations in scoring, the burden of 
proof for further changes must fall heavily on 
the new advocates. While one may be in full 
accord with the statement that “the Rorschach 
is the most potent psychological test extant,” it 
is refreshing to find an evaluation of the limita- 
tions of this technic (as with paranoid condi- 
tions), with positive recommendations for over- 
coming such lacks through a supplementary 
battery. Although the statistical validation is 
still on an insecure foundation, the wealth of 
empiric experience and observation is impres- 
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sive and provocative. This book is highly recom- 
mended. 

J Nerv & Mental Dis 103:538 My 746. S. 
Brecher. * Though even the authors note that 
statements in the manual seem at times to be 
arbitrary, the material it presents should con- 
cern all those who use or are interested in psy- 
chological tests as aids in diagnostic evalu- 
ations. 
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*Raven, Joun C. Controlled Projection: A Stand- 
ard Experimental Procedure. London : H. K. Lewis, 
1944. Pp. 38. 12s. 6d. (New York: Psychological Cor- 
poration. $4.00, postpaid.) * (PA 20:2792) For re- 
views, see 29. 
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*Reavis, WrLLIAM C. Epitor. Evaluating the Work 
of the School. Proceedings of the Ninth Annual Con- 
ference for Administrative Officers of Public and 
Private Schools, Vol. 3. Chicago, Ill.: University of 
DE Press, 1940. Pp. ix, 236. Paper, lithotyped. Out 
of print. * 


J Am Assn Col Reg 16:243-5 Ja "41. J. 
Conrad Seegers. * stimulating and useful vol- 
ume * The papers presented....are printed in 
their entirety as are the questions discussed in 
the several subsequent round-table conferences. 
These questions are invariably interesting, fre- 
quently provocative. Many would provide prof- 
itable material for faculty or staff meetings in 
any public school system or university school 
of education. These papers are more than merely 
interesting. They are practical, concrete, and to 
the point. Moreover, in spite of the fact that 
there are fifteen separate contributors, repre- 
senting four universities or colleges, two outside 
agencies, and five school systems in five differ- 
ent states, there is a remarkable community of 
thought and consistency of purpose running 
through the volume. The tenor is sanely pro- 
gressive, but never ethereal. Problems are ap- 
proached and discussed realistically. * Wright- 
stone’s paper (p. 12) describes many recently 
developed techniques for measuring or evaluat- 
ing progress toward the “so-called ‘intangible’ 
objectives of instruction.” While a more criti- 
cal analysis of these tools would be desirable, 
the section is valuable. Many useful references 
are provided. * Buswell (p. 53) contributes an 
excellent discussion of “effective instruction.” * 
It is a vigorous and..an eminently sensible 
paper. Sound in principle and useful in sugges- 
tion are Morgan’s paper (p. 86) on measuring 
the effectiveness of teachers and Reavis’ discus- 
sion of the self-survey (p. 131). One of the 


most interesting and practical papers is the plan 
proposed for a “program of evaluation for a 
school year for the general improvement of” 
instruction, written by Philip Falk * Falk 
thinks clearly and writes well. There are excel- 
lent treatments of other pertinent problems in 
this volume, but space does not permit individ- 
ual comment. Obviously, a volume of minutes 
is bound to be uneven. All of the contributions 
are not written with equal incisiveness or per- - 
spicacity. But not one of these papers fails to 
challenge the schoolman, not one strays from 
the subject. The book is a distinct contribution 
to current thought, not so much because what 
it says is particularly new, but because the 
analyses are clear, the applications practical; 
because it provides abundant food for thought. 
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XRzavis, WirLiAM. C., AND Coorer, Dan H. Evalua- 
tion of Teacher Merit in City School Systems. 
University of Chicago, Department of Education, Sup- 
plementary Educational Monographs, No. 59. Chicago, 
Ill: University of Chicago Press, January 1945. Pp. 
vi, 138. Paper. $1.50. * 


El Sch J 45:474-5 Ap '45. Virgil M. Rogers. 
* a distinct contribution to educational research 
and to the improvement of teaching * this study 
presents rather convincing proof that, if the 
profession of teaching is to be improved at any- 
thing more than a snail’s pace and if tenure 
legislation is to. become generally accepted 
throughout the country, certain specific prac- 
tices with reference to teacher evaluation must 
become the rule rather than the exception * It 
is most encouraging to find the authors recom- 
mending the “cumulative personnel record sys- 
tem,” in place of any sort of rating scheme, as - 
a means of evaluation of teaching. In the ex- 
amination of various methods employed by 
school systems, two appraisal forms are excep- 
tionally interesting and seem to merit atten- 
tion. An improvement check list used at Grosse 
Pointe, Michigan, indicates qualities of per- 
sonality, teaching proficiency, etc., and provides 
space to indicate the amount of improvement 
needed—"little, some, much.” This scheme is 
an innovation in the use of check lists. The 
Teacher's Service Report in use in Seattle, 
Washington, is prepared by the principal. In 
this report one finds an emphasis on helpful- 
ness to the teacher, rather than on effort to rate 
or specifically to measure educational growth. 
* a considerable amount of evidence leads to 
the conclusion that measurement of specific 


1110 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 918 


teaching ability is most difficult and that the 
validity of such tests is highly questionable. On 
the contrary, the measures seem to be quite 
valid with reference to the rater's personal opin- 
ions. The "cumulative file" is described in some 
detail in the final chapter, and administrators 
will do well to make this chapter the basis of 
discussions by teacher groups and to use it as 
the theme for educational gatherings, as well as 
to analyze it in round-table sessions. The “cu- 
mulative personnel record system" is based on 
the collection of evidence about the teacher. It 
makes no attempt to pass judgment on the work 
of the teacher. The monograph emphasizes the 
important difference between these two pur- 
poses in the development of a cumulative file 
for evaluative purposes. If the records are ade- 
quate, there is sufficient evidence upon which 
to base a judgment as to effectiveness of class- 
room teaching or any other aspect of the edu- 
cative process. * The monograph points out 
that the development of policy and practice in 
evaluation *may represent highly co-operative 
effort by the entire staff." This statement might 
well be strengthened by emphasizing the abso- 
lute necessity for participation by all whose 
work is to be evaluated by the instruments de- 
veloped. This part of the study might have ar- 
gued more forcefully for greater emphasis on 
evaluation as a means of stimulating continuous 
growth, and less for cumulative records of evalu- 
ation for the purpose of promotion, demotion, 
= dismissal, etc. However, the point is well argued 
that such cumulative records in no sense super- 
sede the sound professional judgments of ad- 
ministrative officers in their decisions. No 
: administrator may shirk his responsibility by 
leaving to a faculty committee the making of 
final decisions as to specific action affecting the 
employment status of employees. There is a 
suggestion of paternalism in the proposal to 
“permit the faculty of a school system to devise 
the details” of the evaluation system. Instead, 
let the superintendent and staff, through stimu- 
lating leadership, encourage the faculty to help 
develop a philosophy which demands effective 
evaluative criteria, not only to serve as cumu- 
lative data for administrative appraisal of teach- 
ing, broadly conceived, but also to promote real 
growth in understanding, appreciation, and 
practice on the part of all concerned with the 
educative process. - 
Sch R 53:360-70 Je '45. Leo M. Hauptman. 
* Evaluation devices used by 104 city school 


systems were studied * Five types of rating de- 
vices were identified: check scale, guided-com- 
ment report, characterization report, descrip- 
tive report, and ranking report. More research 
investigation and thoughtful analysis have been 
devoted to check scales than to any other means 
of teacher evaluation. Check scales cannot be 
valid measures of general teaching ability be- 
cause they neither follow accurately nor sample 
adequately an acceptable pattern of general abil- 
ity. Check scales can scarcely be expected to be 
valid in view of the way in which they are usu- 
ally made. Anyone desiring to improve a rating 
scale would have been helped if the authors had 
brought together into one list the desired qual- 
ities of a rating scale. Perhaps this was not to 
be expected from these authors since a very 
good case is presented’ against the use of rat- 
ing scales. * The cumulative personnel record is 
the recommendation of this monograph. The 
justification for the more complicated proce- 
dure of the cumulative record must rest on the 
argument that any less complicated procedure 
will not produce the desired results. This stim- 
ulating and helpful monograph should become a 
guide for every school administrator, school- 
board member, and school supervisor who is 
interested in the improvement of teaching and 
in the welfare of children. 


[ 1110 ] 

*RepMonp, Mary, ann Davies, F. R, J. The Stand- 
ardization of Two Intelligence Tests. New Zea- 
land Council for Educational Research, Educational 
Research Series, No. 14. Wellington, New Zealand: 
Whitcombe & Tombs Ltd., 1940. Pp. xv, 129. 75. 6d. 
(London: Oxford University Press, 1941. 7s. 6d.) 
(PA 15:2841) 


Ed Abstracts 5:374 D "40. Lee J. Cronbach. 
* Valuable as an example of a carefully con- 
trolled and interpreted large-scale research 
study. Conclusions are cautiously presented, and 
the many brief comments on the difficulties in- 
herent in intelligence testing and test standardi- 
zation may well be studied by the student of 
educational research. 


[ 1111 ] 
Reese, THoMas WHELAN. The Application of the 
Theory of Physical Measurement to the Measure- 
ment of Psychological Magnitudes, With Three 
Experimental Examples. American Psychological 
Association, Psychological Monographs, Vol. 55, No. 
3. Whole No. 251 Washington, D. C.: the Association, 
Inc., 1943. Pp. v, 89. Paper. $175. * (PA 18:961) 

[ 1112 ] 
X*REMMERS, H. H., and Gace, N. L. Educational 
Measurement and Evaluation. New York: Harper 
& Brothers, 1943. Pp. xi, 580. $3.50. * (PA 17:3255) 
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J Am Assn Col Reg 19:108 O '43. C. L. 
Murray. * The approach to guidance....is inter- 
esting and in many respects unique. Although 
some sentences are quite long and tedious, the 
book is scholarly and well written. * a, splendid 
contribution to our educational literature. The 
authors are to be commended for their masterly 
production. 

J Consult Psychol 8:52-3 Ja-F '44. Warren 
G. Findley. * A unique feature of this text is 
inclusion of physical development among the 
six aspects of individual development to be 
evaluated. The chapters on what should be 
evaluated and how to evaluate in this area pro- 
vide the classroom teacher with a clear plan for 
continuously evaluating her pupils’ physical 
health and development so that she may be of 
maximum aid in the school health program and 
at the same time obtain information useful in 
guiding her treatment of individual pupils. The 
book abounds in significant information about 
testing and tests as well as in psychological in- 
sights, aptly phrased. * addressed throughout 
to classroom teachers and promises to be highly 
instructive to them * Space might have been 
given to outlining the functions of adminis- 
trators and teachers in total evaluation pro- 
grams. Tt is difficult to understand why, in their 
discussions of evaluation, the authors have 
placed physical development between achieve- 
ment of instructional objectives and mental 
abilities in the series of six aspects to be evalu- 
ated. This unnecessarily separates achievement 
testing and mental ability testing which are in- 
creasingly recognized as highly related if not 
merging into one another. In fact, in the discus- 
sion of general mental abilities in this book, this 
relationship is pointed out and reading and 
arithmetic tests are classified as tests of general 
verbal and mathematical abilities. * 

J Ed Psychol 35:381-3 S '44. Gustav J. 
Froehlich. * well adapted to the needs of those 
teachers and prospective teachers who are de- 
sirous of getting an orientation in the field of 
educational evaluation. It could very well be 
used as a basic text in an introductory course 
in evaluation. * presents an excellent sampling 
of the recent literature. * In spite of the wealth 
of good material....[the book] has, nevertheless, 
two limitations, both due to the ambitious scope 
of the book. Its very comprehensiveness necessi- 
tates a superficial, and perhaps even misleading, 
treatment of some topics. The most serious lim- 
itation is an inadequate treatment of the factors 


affecting test validity and test reliability. For 
example, the authors state: “The emphasis of 
the present discussion on accuracy of measure- 
ment as the crux of the concept of reliability 
follows that of Jackson and Ferguson" (p. 202). 
Yet the thesis of these two gentlemen is not 
given, except as a bibliographical reference. 
Again, on page 206, optimum administration 
time is linked with test reliability ; but the effect 
of administration time on test validity is not 
discussed. A discussion of item analysis is con- 
spicuously absent. It is indeed unfortunate that 
the authors saw fit to devote only nineteen pages 
to the topic of test validity, and only nine pages 
to the topic of test reliability in a book of five 
hundred eighty pages; but it is most unfortu- 
nate that this limited presentation is so given 
that its readers are apt to accept as 'gospel 
truth' many of the exaggerated statements about 
tests found in manuals and in research reports. 
The second limitation of the book—a confusion 
of terms—is academic. This limitation does not 
affect its worthwhileness as an introductory text, 
but is one which concerns the more advanced 
reader. There are instances of meticulous defi- 
nition and differentiation of terms followed, 
however, by a loose use of these same terms in 
subsequent discussions. The most glaring ex- 
ample is the very careful distinction between 
‘measurement’ and ‘evaluation’ (pp. 29-30), 
but a synonymous use of these same terms 
throughout the book. On page 208 the ‘esti- 
mated true score’ is called a ‘true score,’ thus 
giving rise to a confused statement: viz., “One 
method of expressing the reliability of scores 
on a test which has the advantage of being in- 
dependent of the range of talent used in deter- 
mining reliability is the standard error of a true 
score. This statistic tells the range within which 
scores on the same test would be expected to 
fall two-thirds of the time if a very large num- 
ber of the tests, equivalent in all respects, were 
given to the pupil. Thus, if a pupil’s true score 
on a test is 70 and the standard error of this 
true score is 4, then his true score would be 
expected to range from 66 to 74 two-thirds of 
the time if additional forms of the same test 
were given to him.” 

Psychol B 41:132-3 F '44. Howard L. Siple. 
* All too frequently educators have considered 
subject-matter changes to the exclusion of 
changes in total behavior. This book, on the 
other hand, emphasizes the importance of eval- 
uating the whole or “organismic” individual 
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from a guidance point of view. * there is an 
unusually extensive treatment of physical as- 
pects, environment and background, emotional 
and social adjustment and attitudes in addition 
to the traditional content of measurement texts. 
Part I attacks the theoretical problem of “What 
Should Be Evaluated" as a basis for guidance. 
Part II presents practical suggestions on “How 
to Evaluate” each of the various aspects dis- 
cussed in the first section of the book. The 
authors seem much more comfortable and “at 
home” in the second part of the book than in 
the first. Could this be a reflection of their 
preference for psychological results as opposed 
to educational theorizing? One is somewhat re- 
lieved to find a book about measurement which 
does not make a fetish of tests, even though 
many different kinds of tests are briefly de- 
scribed and weighed. Psychologists who have 


lamented the misuse of scientific tools will be » 


pleased to find a text which reveals the major 
limitations of statistical analyses and encourages 
classroom teachers to interpret test results cau- 
tiously. The two chapters entitled “Interpreting 
Scores” are particularly well organized, fairly 
complete and clearly written. Even though the 
reviewer agrees with the authors in considering 
attitudes important for evaluating pupils, it 
seems to him that more space has been devoted 
to attitude measurement than the topic merits. 
Some over-emphasis is, however, justified when 
one considers the number of similar texts which 
have ignored the importance of such measure- 

ments. The book as a whole is a synthesis of 
many materials into an organized approach to 
meaningful evaluation. It may be used by al- 
most every instructor of educational measure- 
ments as a text. In fact, classroom teachers 
could probably use it as a convenient handbook 
for selecting, constructing and administering 
evaluative devices. It is well written, carefully 
indexed and has a satisfactory format. 
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XREMMERS, H. H., anp Gace, N. L. Student Exer- 
cises in Measurement and Evaluation for Educa- 
tion and Guidance. New York: Harper & Brothers, 
1947. Pp. iii, ror. Paper. $1.50. * (PA 22:1387) 
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*ReMMers, H. H., AND Gace, N. L., Eprtors. Two 
Thousand Test Items in American History. La- 
fayette, Ind.: State High School Testing Service 
for Indiana, Division of Educational Reference, Pur- 
due MA cen 1941. Pp. v, 110. Paper. $0.90. * (PA 
I5:2 


Loyola Ed Digest 16:11-2 Ap '41. Austin G. 
Schmidt. Of special value to Indiana teachers, 
this work should prove useful to all teachers 
of American history and to users of tests in 
general It contains two thousand test items 
based on the Indiana course in American his- 
tory. The items are classified under free mul- 
tiple choice, controlled multiple choice, coherent 
groups, matching, historical sequence, true- 
false, completion, allocating powers, grouped 
names, and same-opposite vocabulary. Items 
the statistical validity of which was determined 
experimentally are indicated by asterisks. The 
material is not only useful because of the well- 
constructed test items which it contains, but 
also because of its suggestiveness for those in- 
terested in constructing their own classroom 
tests in the same or in other fields. 


[ 1115 ] 
XRrsnaLL, Ruts. Revision of 294V to 204VA New 
Order of Difficulty. Human Engineering Laboratory, 
Inc, Technical Report No. 375. Boston, Mass.: the 
Laboratory, February 1947. Pp. 30. Paper, mimeo- 
graphed. * 

[ 1116 ] 
*RrsHALL, Ruta. Second Study of Grip, Work- 
sample 185 Conversion Factors and Norms. Human 
Engineering Laboratory, Inc., Technical Report No. 
333. Boston, Mass.: the Laboratory, October 1946. Pp. 
I2. Paper, mimeographed. * 

[ 1117 ] 
*ResHart, Ruts. Third Study of Grip Worksam- 
ple 185. Human Engincering Laboratory, Inc., Tech- 
nical Report No. 370. Boston, Mass.: the Laboratory, 
January 1947. Pp. 13. Paper, mimeographed. * 

[ 1118 ] 
*Ricuarps, T. W. Modern Clinical Psychology. 
New York: McGraw-Hill Book Co., Inc., 1946. Pp. xi, 
331. $3.50. *. (London: McGraw-Hill Publishing Co., 
Ltd. 17s. 6d.) (PA 21:148) 


J Clin Psychol 3:110-1 Ja 47. * Richards 
„intends this book to be a text on understand- 
ing personality conceived in the modern psy- 
chodynamic approach for college juniors. In 
view of current trends in professional educa- 
tion in the direction of restricting specialization 
to graduate years after the student has been ex- 
posed to basic science, this reviewer questions 
how much value such a text would have on 
undergraduate levels where the students are 
completely unprepared for technical discussions 
of Rorschach or TAT material. Two introduc- 
tory chapters present a discussion of clinical 
psychology as a specialty and an orientation to 
clinical method. Three chapters are devoted to 
methods of psychological appraisal, approach 
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to the patient and the significance of the physical 
examination. Diagnostic methods are discussed 
under appraisal of capacity (2 chapters), ap- 
praisal of motivation (2 chapters), and ap- 
praisal of control (3 chapters). Two concluding 
chapters deal with aetiology and techniques of 
readjustment. This arrangement is not opti- 
mum for teaching purposes since it is desirable 
to have psychopathology precede diagnosis and 
therapy. In view of recent interest in psycho- 
therapy and counseling as part of clinical psy- 
chology, it would have been desirable to present 
more detailed discussions of therapeutic meth- 
ods and particularly the theory and practice 
of nondirective methods. Dr. Richards con- 
ceives that inadequate patterns of behavior may 
be arranged in a continuum from hysteria to 
schizophrenia at opposite poles and, with 
neurasthenia, psychasthenia, epileptic behavior, 
paranoia, depressions and excitements as inter- 
mediate stages from the socially oriented to 
the personally elaborated. His dynamic ori- 
entation is that of psychoanalysis documented 
extensively by the latest experimental findings 
where applicable. Numerous case studies are 
included with detailed discussions of psycho- 
metric results, Rorschach responses, Minnesota 
Multiphasic Inventory Items, electroencephalog- 
raphy and other data. One of the values of the 
book is its genuinely clinical orientation. It will 
be of value as a textbook for graduate students 
who have had enough contacts with clinical case 
materials to evaluate the argument, and for 
practicing clinicians. * 


[ 1119 ] 
*RickERS-OvstANKINA, Marra. Rorschach Scoring 
Samples: Compiled From Various Sources for 
Private Circulation, [Revised Edition]. Norton, 
Mass.: the Author, Wheaton College, 1942. Pp. v, 197. 
ues mimeographed. $4.00, postpaid. * For reviews, 
see 89a. 


[ 1120 ] 
*Rinc, Heren. New Four-Point and Six-Point 
Scales, Odd and Even Reliabilities, and Distribu- 
tion Curves, Squares, Worksample 313. Human 
Engineering Laboratory, Inc., Technical Report No. 
180. Boston, Mass.: the Laboratory, October 1945. Pp. 
15. Paper, mimeographed. * 
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XRixstAND, Henry D. A Clinical Method of Grad- 
ing. A revision of Chapter 8 of the author's Construct- 
ing Tests and Grading in Elementary and High School 
Subjects (see 40:B1052). Norman, Okla.: University 
Book Exchange, 1947. Pp. 17. Paper. $0.35. * 


[ 1122 ] 
*Roserts, Jonn R., AND BAUMAN, Mary K. Motor 
Skills Adapted to the Blind: Minnesota Rate of 


Manipulation Test [and] Pennsylvania Bi-Man- 
ual Worksample. Minneapolis, Minn.: Educational 
Test Bureau, 1944. Pp. 8. Paper. $0.50. * 
[ 1123 ] 

*Rozerts, KATHERINE ELLIOTT, AND FLEMING, VIR- 
cinta Van Dyne. Persistence and Change in Per- 
sonality Patterns. Monographs of the Society for 
Research in Child Development, Vol. 8, No. 3, Serial 
No. 36. Washington, D. C.: the Society, National Re- 
search Council, 1943. Pp. vii, 206. Paper, lithotyped. 
$1.50. * (PA 18:2477) 


Brit J Psychol 35:89 My '45. In this mono- 
graph, twenty-five life histories of women ob- 
tained through interviews given by an Advisory 
Service on personal problems are analysed to 
determine how far personality traits and char- 
acteristic attitudes persist throughout life, and 
how far they are modified. Three of the life 
histories are given in full, and others described 
more briefly. A long list of traits and attitudes 
was drawn up and checked for each case by 
independent investigators, as they seemed to 
occur in the life history during three periods of 
life, pre-college, college and post-college. The 
women themselves checked a similar list, but 
the correlations for persistence of traits between 
the two sets of lists was low. It is not clear, 
therefore, that such conclusions as were reached 
could not have been equally well derived di- 
rectly from studying the life history without 
resorting to this elaborate and only pseudo- 
objective procedure. As might be expected, 
some traits were persistent in some cases and 
others in others; some were comparatively per- 
sistent in all cases, others hardly appeared at 
all. There seems to have been rather more per- 
sistence on the whole than fluctuation. Little 
attempt was made, however, to show why cer- 
tain traits were relatively modifiable and in 
what circumstances, beyond the one or two 
rather obvious facts such as that a happy life 
in childhood is likely to lead to stability and 
happiness in later life. The fundamental defect 
of this method of tackling the problem of the 
persistence of traits is, of course, that all evi- 
dence of behaviour characteristics and impor- 
tant modifying influences during the earliest 
years of life is necessarily lacking. 

Ed Res B 24:136-7 My 16 '45. Francis N. 
Maxfield. * a careful analysis * [The conclu- 
sions] do not offer the parent or teacher any 
certain assurance in regard to the persistence 
of any particular trait or attitude in any indi- 
vidual child whom they are interested in devel- 
oping. They do, however, indicate that con- 
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scientious parents and teachers have the odds in 
their favor. 
J Am Med Assn 125:683 Jl 1°44. This mon- 
ograph attempts to answer the question “Are 
human traits fixed, or are they modified by life 
experiences?” The material dealt with consists 
of the life histories of 25 women interviewed in 
the Advisory Service for College Women of the 
Merrill-Palmer School in Detroit. Part I pre- 
sents three of these histories (original “average 
length 90,608 words”) in condensed novelette 
form, but, possibly because the authors them- 
selves did not do the interviewing, the published 
anamneses show little insight into the nuances 
of individual and social development of their 
subjects. Substituted are the usual cliches about 
“feelings of inferiority,” “difficult relationships,” 
“emotional outlets” and other phrases used as 
secondary evaluations of the original biographic 
data. From such histories, nevertheless, various 
“traits” such as “shyness” and "strong ego 
drive” are then further abstracted, coded, tabu- 
lated and analyzed for the factor of persistence," 
which admittedly varied from case to case. De- 
spite the methodologic tenuousness of such pro- 
cedures, the authors then construct tables to 
show that traits such as “seeks help when 
needed” rate only 11 per cent persistent, whereas 
“has a few friends in whom she confides” is 100 
“per cent constant, presumably up to the time 
the subject confided in the interviewer. The 
authors conclude that “there seems to be a 
nuclear personality pattern for every individual 
-...and around this nucleus other traits and atti- 
tudes shift as life proceeds...even the nuclear 
pattern may change when conditions are changed 
radically.” To answer the subsidiary question 
“Do homes matter?” the authors classify their 
subjects into those from “happy” homes (5) 
“unhappy” homes (4) and “medium” (16), 
according to criteria like “gets along with par- 
ents” vs. “thinks mother considers doing things 
for children too much trouble.” The authors 
conclude, again somewhat predictably, that 
homes matter. Similar revelations by tabular 
divination are then related to chapter headings 
such as “Is Personality Related to Marital Hap- 
piness?” “Personality and Religious Values” 
and “Social Adjustment and Personality.” There 
is a fifty-six page appendix of tabular data with- 
out specific or implied qualifications to the effect 
that these “data” really represent the authors’ 
abstracted opinions of the interviewers’ recorded 
opinions of the subjects’ expressed opinions of 
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themselves. Characteristic also is the inclusion 
of several elementary textbooks and three dic- 
tionaries in the bibliography of twenty-nine 
titles, with almost no reference to the rich liter- 
ature of psychoanalytically and clinically ori- 
ented studies in the field. The main defects of 
this monograph are those of the traditional aca- 
demic approach to personality: methodologic 
devotions to questionnaire “tests,” the tabula- 
tion of “traits” or other Vague generalities and 
the statistical derivation of the same pedantries 
that were originally injected into the material. 
Despite the undoubted sincerity of its composi- 
tion, therefore, this monograph is recommended 
only as a documentary case of a rapidly passing 
preoccupation with the trite and the sterile in 
our concepts of personality, 

J Nerv & Mental Dis 104:343 S '46. S. 
Brecher. * Twenty-five histories were analyzed 
for the persistence and fluctuation of personality 
traits and attitudes throughout the life span of 
the individuals. * 

Psychosom Med 7:385-6 N ’45. Jules Mas- 
serman. [Practically the same as the review in 
J Am Med Assn excerpted above.] 

Q R Biol 20:105-6 Mr "45. Theta Wolf. * 
The authors proclaim their belief in a dynamic, 
integrative concept of personality, but their 
procedure results in an atomistic description that 
precludes any such personality. They have cre- 
ated 523 attitudes and 229 traits (sic!) by 
which they pretend to classify personality, and 
then proceed arbitrarily to impose this “sys- 
tem” upon their material. The evidence is dis- 
covered from a case history in which the sub- 
ject herself was the witness for it, and evidence 
applicable to the present study was not asked 
for according to any organized plan. One finds 
it difficult, too, to accept the authors’ distinc- 
tion between their “traits” and “attitudes.” For 
example: if a person “lets children solve their 
own problems,” she exhibits an attitude; if the 
same person “tries to solve problems rather 
than avoid them,” she exhibits a trait. More- 
over, in making distinctions within the traits, 
the authors place in separate categories such 
subjective terms as “excitable,” “explosive,” 
“impulsive.” A kind of behavior, then, might be 
labelled “explosive” at one “life stage” and very 
conceivably be called “impulsive” at another 
stage, with results indicating fluctuation in pat- 
tern; a different labelling would, on the other 
hand, have shown persistence. Furthermore, if 
this kind of behavior did not happen to appear 
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again in the case history—at another life stage 
—the results were treated as though the be- 
havior did not occur, although it is clear that it 
simply may not have been reported. This fact 
irreparably and unaccountably affects the re- 
sults. Again, in the case histories, a false pic- 
ture of persistence or of fluctuation is given 
because the setting of the behavior—or its 
meaning to the individual—had to be ignored. 
A person who may always be “sensitive” in a 
given social situation might conceivably “per- 
sist” in this behavior throughout the three 
stages, but in accordance with the plan in this 
study, because she was sensitive in situations x 
and y, and not sensitive in situations a and c, 
the “trait” is tabulated as fluctuating. These 
conscientious authors very evidently wished to 
draw some important educational lessons from 
this study. It is regrettable that so much honest 
work was spent upon such an unproductive 
project. The material might have been studied 
from the point of view of integrated person- 
ality. A fair sampling of the case histories, 
analysed and briefed in a non-statistical man- 
ner, could have given evidence both of the per- 
sistence and modifiability of human behavior 
that was colorful, dramatic, and useful. On the 
other hand, it is abortive to attempt to force 
into statistical units material collected without 
method, without defined aim or uniformity, and 
replete with gaps that, if filled, might have 
changed the results altogether. 

Sch R 52:438-9 S '44. Morris E. Eson. 
Progressive educators have been interested for 
a number of years in the possibility of education 
toward the development of desirable personality 
patterns. In order to consider such a possibility, 
certain questions must be answered objectively. 
For example, do personality patterns change 
with a changing environment? Which traits are 
more modifiable and which are less? Are there 
any interrelations between various traits and 
attitudes? In this study ambitious attempts are 
made to answer these questions and many more. 
* In the second part of the monograph numeri- 
cal results and a detailed analysis of the data 
are given. These add very little to the first part 
of the monograph. Since the number of cases 
are few, in some instances only five, these data 
seem worth while as indication of trends rather 
than of quantitative differences. Some sixty-five 
pages of appendixes are presented. In one ap- 
pendix are listed 229 traits. The meaning of 
many of these is not clear; for example, such 


general terms as "adaptable," “co-operative,” 
“discriminating,” “flippant,” “resigned,” “vari- 
able,” and many others are listed without ade- 
quate operational definitions. The same criticism 
is true of the 523 attitudes, which are not clearly 
differentiated from the traits. One general criti- 
cism to be made is that the authors have at- 
tempted a short cut between a longitudinal and 
a cross-sectional study. The value of such a 
study may be questionéd because the data ob- 
tained from an interviewee concerning past his- 
tory will be influenced by the present emotional 
state. The monograph is of value in showing 
how an old though laborious technique, namely, 
case-history analysis, can be applied to the prob- 
lem of personality study. It may prove of some 
value in establishing the validity of personality 
schedules and inventories, if we assume that the 
interview produces only valid material. 


[ 1124 ] 
*Ropixsox, Francis P., AND HarL, WitLrAM E, Con- 
cerning Reading Readiness Tests. Bulletin of the 
Ohio Conference on Reading, No. 3. Columbus, Ohio: 
Ohio State University Press, March 1942. Pp. iii, 16. 
Paper. $0.25. * (PA 16:3304) 

[ 1125 ] 
XRonrrAiLLE, Henry J. The Development of a Test 
for Reasoning Ability and Its Application to 
Students Majoring in Science, Classics and Com- 
mercial Courses. Catholic University of America, 
Studies in Psychology and Psychiatry, Vol. 6, No. 3. 
Washington, D. C.: Catholic University of America 
Press, 1943. Pp. vii, 64. Paper. $0.75. * (PA 18:1248) 

[ 1126 ] 
Rocers, Cart R. The Clinical Treatment of the 
Problem Child. Boston, Mass.: Houghton Mifflin Co., 
1939. Pp. xiii, 393. $3.75. (London: George Allen & 

nwin Ltd. 15s.) * (PA 13:5971) 


Am J Psychiatry 97:501-2 S *40. Leo Kan- 
ner. In an era when the treatment of children's 
behavior problems is still beset by strife over 
competencies, this reviewing psychiatrist is 
pleased with the opportunity to discuss so ex- 
cellent a contribution coming from a clinical 
psychologist. There was a decided need for a 
summing up of the many and varied therapeu- 
tic methods and "techniques," and the author 
has supplied this need in a comprehensive and 
inoffensively critical manner. Treatment is con- 
ceived broadly and practically as “any planned 
procedures by which professional workers have 
sought to modify the behavior or adjustment of 
the individual child." * One single page of the 
book's 383 pages of text disposes of the. “‘or- 
ganic influences," which unfortunately receive 
no further consideration as being worthy of, or 
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accessible to, treatment. One wishes to ascribe 
this one serious gap of the book to the modesty 
of the author, who, not being a physician, ap- 
parently did not feel sufficiently qualified to 
speak of matters outside his training and ex- 
perience. The other factors that influence be- 
havior “in complex interaction" are taken up 
briefly, sensibly and undogmatically * The 
author’s own “component-factor method of 
diagnosis” presents an 8-point numerical rating 
which may be helpful to those who depend on 
scales and graphs as guides for constructive for- 
mulation. * The third section is devoted to 
“Treatment Through Modifying the Environ- 
ment” by means of work with and through the 
family, the school and recreational agencies 
(clubs, groups and camps). This section offers 
much needed orientation and constitutes a major 
contribution to practical sociology on behalf of 
maladjusted children in maladjusted, but rea- 
sonably modifiable situations. Not since the 
publication of The Child in America by D. S. 
and W. I. Thomas has this reviewer encoun- 
tered such an informative, lucid, and, in spite 
of its relative brevity, encyclopedic survey of 
studies, efforts and facilities. This section alone 
is worth many times the price of the book. * 
The author’s conciliatory good nature does not 
prevent him from maintaining a clear perspec- 
tive and a wholesomely critical attitude through- 
out, even though at times he may seem slightly 
over-awed by certain “scientific” techniques. 
There is a good sizing up of the “qualifica- 
tions of the therapist.” Educative, expressive 
(catharsis, ventilation of conflicts, play therapy), 
and interpretative methods come in for sober 
discussion of their values and shortcomings. 
The author wisely advocates sharing, instead of 
rigid division of labor between the various pro- 
fessional groups * This book should and un- 
doubtedly will find a cherished place in the 
library of every person who is interested in the 
treatment of difficult children. It contains a 
mine of valuable information. It is written in 
a clear and tranquil style. It speaks well for the 
author's wisdom, perspective and modesty. 

J Ed (London) 72:465 O ’40. R. H. Crow- 
ley. * written from a severely practical stand- 
point. The author sets out to provide a much 
fuller answer than can be found probably any- 
where else to the question “What can be done to 
help this child ?" * the subject is not approached, 
in a way in which we are all too familiar, from 
the point of view of the various professional 
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groups involved...largely regardless of the ex- 
tent of the overlapping of the various techniques 
employed by these groups. * The teacher will 
find....a good deal of food for thought in the 
chapter on The School's Part in Changing Be- 
haviour. * He has some interesting and wise 
observations to make on the organization re- 
quired to secure effective treatment of the in- 
dividual child. * There is an interesting section 
dealing with the role of each professional group. 
* Altogether, this book, though written pri- 
marily with a view to helping workers directly 
engaged in the treatment of problem children, 
is full of interest to the teacher in whatever type 
of school and to the educationist generally. 
Psychoanalytic Q 9:559-6r O '40. Lillian 
Malcove. The child guidance approach to the 
study and treatment of the problem child is 
presented....in a sincere and methodical fashion. 
* The contents....should be useful and interest- 
ing to the now large and inclusive group of 
orthopsychiatrists for whom it was written. 
The absence of theoretical formulations, the 
abundance of case illustrations, the clarity with 
which ideas are presented and the bringing to- 
gether of varied opinions and methods of diag- 
nosis and treatment are not inconsiderable 
assets. The most useful contribution is the mid- 
dle third of the book which is devoted to treat- 
ment methods which modify behavior through 
environmental manipulation. This section in- 
cludes methods used to effect changes in parental 
attitudes. The discussion of the entire problem 
of child placement is especially good. Unfortu- 
nately the same praise cannot be extended to 
the first and third parts of the book on diagnosis 
and treatment. The fact that the author is a 
psychologist would in itself tend to influence 
certain aspects of the presentation on diagnosis 
and treatment. But since the author is a psy- 
chologist who is openly critical of and preju- 
diced against psychiatric and psychoanalytic 
theory and practice, these sections contribute 
much less than they otherwise might. * The 
psychiatric interview seeris to have little place 
as it is considered less reliable. Similarly psy- 
chiatric treatment by psychiatrists has very 
limited place. Those who are personally ac- 
quainted with child guidance clinic work must 
know that in the original clinics and in most 
clinics currently, the diagnosis was and is de- 
pendent on all the environmental and individual 
factors, as in Mr. Rogers' work, but addition- 
ally it always took into account a psychiatric 
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examination which was essential for the total 
and final diagnosis. The author's implication 
that the new trend in the diagnosis and treat- 
ment of problem children is toward a change in 
allocation of work among the coóperative work- 
ers with a minimization of the work of the psy- 
chiatrist must be a limited one. The pretty well 
defined division of therapeutic efforts of the 
earlier child guidance days certainly continues 
today. * One rightfully questions the wisdom 
of the new trend which regards each member 
of the working group as potentially equipped to 
do ‘intensive psychotherapy’ with children and 
parents, and ‘the entire staff....as a group of 
therapists, with functions determined by experi- 
ence and aptitude rather than professional label’. 
As further evidence of this ‘progress,’ it is stated 
that in some clinics experienced social workers 
train psychiatrists and also that other clinics 
utilize psychiatrists only for occasional consul- 
tation. And not only is all treatment regarded 
as psychotherapy, but the author refers to atti- 
tude therapy (an intensive case work proce- 
dure introduced by Dr. David M. Levy) as a 
treatment ‘amounting to psychoanalysis.’ All 
this suggests an urgent need to clarify the func- 
tions of the codperating professional persons 
and to differentiate the work of one from the 
other. The function of the psychiatrist to do 
psychotherapy or psychoanalysis is adequately 
clear but evidently the function of the case 
worker, clinical psychologist and others is not 
so uniformly clear. Social workers in more re- 
cent years have done much to clarify their 
function and methods, and certainly as a group 
they would not accept the author’s tendency to 
endow them with the rights of psychotherapists 
whether in diagnosis or treatment. This may be 
equally true of psychologists but the reviewer 
knows less about their delineations of function 
in the field of orthopsychiatry and therefore 
cannot pass judgment on how completely Mr. 
Rogers represents clinical psychologists. It is to 
be hoped that the trend suggested by the book 
is not only a limited one but definitely a transi- 
tional one, for infantile neuroses and their fore- 
runners in the form of defined symptomatic 
behavior require psychiatric skills for under- 
standing (diagnosis) and in most cases for 
treatment. 
For additional reviews, see 40:B1053. 
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*Ronscuacu, HERMANN. Psychodiagnostics: A Di- 
agnostic Test Based on Perception: Including 
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Rorschach’s Paper, “The Application of the Form 
Perception Test” (Published Posthumously by 
Dr. Emil Oberholzer), Second Edition. Transla- 
tion and English edition by Paul Lemkau and Bernard 
Kronenberg. Edited by W. Morgenthaler. Berne, Switz- 
erland: Verlag Hans Huber, 1942. Pp. 238. (New 
York: Grune Stratton, Inc. $575.) * (PA 16:4443, 
title only) For reviews, see 9o. 

[ 1128 ] 


*RosenstEIN, J. L. The Scientific Selection of 
Salesmen. New York: McGraw-Hill Book Co., Inc., 
1944. Pp. xii, 259. $3.50. * 


J Appl Psychol 29:498-9 D ’45. Milton E. 
Hahn. The audience for, and major objective 
of, this book is stated as follows by the author: 
“This book is for sales executives who are long- 
range minded. Its purpose is to describe, to 
explain, and to teach the steps in a scientific 
salesman selection program" (ix). For many 
reasons the publication fails by a wide margin 
to meet either the needs of its intended audi- 
ence or its primary objectives. Throughout the 
book there are evidences of hasty, superficial 
writing. Important concepts are ignored or 
treated so sketchily that they might better have 
been ignored. On the other hand, certain simple 
ideas, such as the computation of an average, 
are heavily labored. Part of the difficulty en- 
countered by the author arises from his attempt 
to oversimplify psychological tools and tech- 
niques which just cannot be so treated for lay 
readers. An example of this is the attempt to 
teach interpretation of complex psychographs 
to laymen in fewer than forty pages (211-244). 
In the section on interviewing, such terms as 
“substitution,” “persecution,” and “projection” 
are introduced with little explanation for naive 
readers. Treatment of tests and testing is en- 
tirely inadequate. * Sections of the book are 
devoted to the application blank, and selection 
of salesmen for small companies. Materials con- 
cerned with the application blank are the 
strongest point in the publication, Even in this 
section much of importance is ignored. Read- 
ers who wish to read further about topics which 
interest them will be irritated by the author’s 
failure to include sources. The names of writers 
and researchers are referred to, but the publica- 
tions and publishers are omitted. In the opin- 
ion of the reviewer, this is a poor book on a 
topic which is deserving of much better treat- 
ment. 

[ 1129 ] i 
*RosENzwEIG, SAUL; Bunpas, Lourene E.; LUMRY, 


Ketty; AND Davrpsow, Heren W. An Elementary 
Syllabus of Psychological Tests. Reprinted from 
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Journal of Psychology, Vol. 18, 1944. Pittsburgh 13, 
Pa.: Saul Rosenzweig, 3811 O'Hara Street, 1944. Pp. 
9-40. Paper. 25 or more copies, $0.25 each. * (PA 19: 
272) 

[ 1130 ] 
*Ross, C. C. Chapter Exercises and Tests to Ac- 
company Measurement in Today's Schools, Sec- 
ond Edition. New York: Prentice-Hall, Inc, 1947. 
Pp. vi, 74. Paper. $1.25. * 

[ 1131 ] 


X*Ross, C. C. Measurement in Today's Schools. 
New York: Prentice-Hall, Inc., 1941. Pp. xviii, 597. * 
(PA 15:3172) For revised edition, see 1132. 


Am J Psychol 55:303-4 Ap '42. Robert Y. 
Walker. In the presentation of this topic the 
author has been successful in avoiding the nor- 
mal dullness of such a subject. There is a re- 
freshing manner of discussing the various prob- 
lems of testing with no attempt to be dogmatic. 
This book is definitely written for the student 
who is new to this field and who is seeking an 
adequate survey of the major problems and 
methods of the subject. * It is unfortunate that 
the author does not include statistical methods 
for small sampling as much of the testing in 
Schools will involve small sample groups. * a 
rather extended presentation of remedial read- 
ing is included. The book follows the present 
trend in education in placing the emphasis on 
remedial reading and does not cover other fields 
of difficulty adequately. * 

B Nat Assn Sec Sch Prin 25:191-2 My "41. 
* should be useful to teachers and administrators 
regardless of subject or level of instruction * 

Clearing House 16:56-7 S '4r. E(arl) R. 
G(abler). Measurement is at last beginning to 
recognize its limitations, along with its con- 
tributions. In this book Ross sets forth quite 
adequately the basic considerations in measure- 
ment, a philosophy of education, and a state- 
ment of desirable educational outcomes. His 
style of writing is easy, interesting, and more 
like popular writing—quite a relief from the 
generally formal and pedantic style usually 
found in professional books. * Highlights of 
the book: (1) Unusually good presentation of 
all aspects of test construction, with special 
reference to use by the classroom teacher. (2) 
Probably the best documented book in the field, 
tapping a variety of sources outside the imme- 
diate field of education. (3) At the end of each 
chapter there appears an objective test cover- 
ing the educational outcomes sought. (4) A 
most interesting and easy style. and (5) A 
recognition of education as something beyond 


factual accomplishment. Some considerations 
omitted: (1) No attempt was made to list and 
analyze the standardized tests available in the 
various subject areas. This is not a defect. (2) 
Importance of pupil participation, the pupil- 
constructed test and the work of Richard D. 
Allen in self-measurement. (3) In the chapter 
on motivation there may be something more 
fundamental and lasting to consider other than 
the results of experiments with the examina- 
tion as a motivating factor. 

Ed Adm & Sup 27:474-5 S '41. A. S. Ed- 
wards. * represents [an attempt]....to present a 
“functional approach to the subject” of meas- 
urement * There is need for such a book and 
the general organization of the subjéct-matter 
appears to be very useful. It is, however, a 
question whether some of the subjects that are 
included are properly relevant. The principles 
of learning, for example, cannot be properly 
handled in a few pages and they are not pri- 
marily problems of measurement; nor does it 
seem appropriate to include discussions upon 
devices for preventing cheating, the use and 
abuse of work books, educational values of final 
examinations, and discussion of remedial teach- 
ing. Also it appears that “satisfactory statement 
of a school’s philosophy” is rather too easily 
assumed to be possible. * A considerable amount 
of space is given to elementary statistics and 
many at least think this is more properly taught 
in a separate course. The historical review of 
tests and testing is in general well done. Wil- 
helm Wundt, whose picture appears, is not gen- 
erally considered as a great contributor in this 
particular field, nor is the Army Alpha "the 
first of a long succession of group tests." The 
Army a and b tests were used for some little 
time before the Alpha was evolved from them. 
In general the historical summary is good, but 
since emphasis is placed upon firsts it is not out 
of place to suggest that the excellent work of 
Miss Clara H. Town might have been men- 
tioned. She reports a first complete translation 
published in America—the translation of the 
IQII revision of the Binet, Simon scale pub- 
lished by Miss Town in 1912. In dealing with 
the special problems of tests, commendable 
work has been done and the reader can find ex- 
cellent handling of the problems that arise in 
connection with their construction, validation, 
etc. The author rightly emphasizes the value 
of scientific procedure and of measurement, and 
raises doubt as to the value of argument from 
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authority. Assuming the correctness of his atti- 
tude on this point, one is somewhat surprised 
to find a large use of quotations from various 
authors although perhaps they are excused on 
the basis of quotations for the purpose of clarifi- 
cation. The latter, however, is hardly accom- 
plished by quotations of several bits of poetry. 
Some statements seem rather too arbitrary. For 
example, the statement that the usual distribu- 
tion of grades by per cents is 7-24-38-24-7. lt 
appears also that in a scientific book in which 
there are many fine scientific statements there 
is altogether too much use of the “pedagogical 
imperative.” There are pages on which hardly 
a statement is made without the use of “must” 
or “should” and in some sections hardly a para- 
graph that does not tell what “should” or 
“must” be done. Excellent indexes are added. 
Many valuable references are given. In general 
the book will be found somewhat different from 
its predecessors and of great value to many 
students. 

J Appl Psychol 25:601-2 O ’41. Amos C. 
Anderson. The author states in his preface that, 
despite progress made in measurement in the 
last quarter century, textbooks have remained 
much the same. In educational measurement 
they have been largely concerned with tests of 
subject matter and have tended to early obso- 
lescence because of the publication of other 
tests. He has sought a functional approach 
through emphasizing problems relating to the 
intelligent use and interpretation of the tools 
of measurement rather than the tools them- 
selves. * In terms of practical value to the class- 
room teacher and the school administrator, the 
writer's approach would seem to be a fruitful 
one. * The three chapters on teacher-made tests 
should be of great help to teachers in preparing 
tests that better meet the purposes for which 
such tests are designed. * The book is thought- 
provoking and covers a wide range of problems. 
Tt is about as interestingly written as such a 
book very well could be. On the other hand, 
the critical reader will probably find himselt 
unable to agree with the writer at many points. 
He will challenge certain statements or points 
of view presented in the book, or find his own 
challenged. Measurement in Today's Schools 
may be characterized as a sane, thoughtful piece 
of work, dealing with the fundamentals of edu- 
cational measurement. Not only will it be valu- 
able as a textbook for the teacher in training 
but it merits a wide reading by teachers, ad- 


ministrators and those laymen who as non- 
professional students of education, endeavor to 
keep themselves informed on current educational 
practices. 
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*Ross, C. C. Measurement in "Today's Schools, 
Second Edition. New York: Prentice-Hall, Inc., 
1947. Pp. xvi, 551. $4.50. * (PA 22:1387) 

[ 1133 ] 
XRourr, Forrest V., Ja. Personnel Testing: A Field 
Check on the Current Status of Personnel Testing 
in One Hundred Organizations. San Francisco, 
Calif.: California Council of Personnel Management, 
October 1946. Pp. 26. Paper, mimeographed. $1.00. * 

[134] | 
*Rucx, Froyp, How to Use Employment Tests. 
Employment Testing Bulletin, No. 1. Los Angeles, 
Calif.: California Test Bureau, 1944. Pp. 15. Paper. 
$0.50. * 

[ 1135 ] 
Ruc, Gites M., AND, SEGEL, Davi». Minimum Es- 
sentials of the Individual Inventory in Guidance. 
United States Department of Interior, Office of Educa- 
tion, Vocational Division Bulletin, No. 202; Occupa- 
tional Information and Guidance, Series No. 2. Wash- 
ington, D. C.: Government Printing Office, 1940. Pp. 
vi, 83. Paper. $0.25. * (PA 14:4266) 

Pub Personnel R 1:65-6 O ’40. J. Kenneth 
Mulligan. Vocational guidance literature all too 
frequently has the fault of verbosity. New in- 
formation and techniques are frequently ob- 
scured by lengthy, sometimes imaginative ex- 
position. This new pamphlet avoids this fault, 
presenting a concise treatment of the specialized 
aspect of guidance that is most concrete, i.e., 
the tools for successful guidance. The relative 
values of various items in the individual inven- 
tory are discussed, including family and cultural 
background, physical and medical history, marks 
in school subjects, extra-curricular activities, 
mental test scores, achievement test scores, in- 
terests, special talents, aptitude scores, and 
anecdotal information. The condensation of this 
material in this pamphlet makes easily available 
to the vocational guidance worker information 
concerning progressive trends, and to the ap- 
prentice in this field, there is provided a bal- 
anced overview of the field. Various aspects of 
the pamphlet should be suggestive to the public 
personnel administrator. To what extent might 
the information available in progressive school 
records be useful to recruitment processes in 
government? * Of more practical and immediate 
importance to the personnel field is the possi- 
bility of adapting certain practices suggested to 
personnel record keeping for purposes of pro- 
motion, transfers, and employee counselling. 
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Counselling on the job may in some instances 
be as valuable as counselling in school, and as 
personnel principles become more intensive in 
their administration, cumulative records of em- 
ployee achievements, test scores, ratings, and 
anecdotal material will become important to 
personnel management. 
For an additional review, see 40:B1055. 
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*Ryans, Davin G. Comparing the Qualifications of 
Teachers. Teacher Selection Papers and Reports, No. 
IL New York: National Committee on Teacher Ex- 
aminations, American Council on Education, June ro, 
1947. Pp. 27. Paper, lithotyped. Gratis. * 


[ 1137 ] 
X*Rvaws, Davin G. The First Step in Guidance: 
Self-Appraisal: A Report of the 1940 Sophomore 
Testing Program. Cooperative Test Service Publica- 
tions in Measurement and Guidance, Series 3, Vol. r, 
No. 1. New York: Cooperative Test Service, January 
1941. Pp. 35. Paper. $o.10. * (PA 15:2382) 


[ 1138 ] 
*Ryans, Davin G. Notes on the Selection of Class- 
room Teachers: The Interview. Teacher Selection 
Papers and Reports, No. 6. New York: National Com- 
mittee on Teacher Examinations, American Council 
a Education, October 9, 1946. Pp. 7. Paper, lithotyped. 
ratis. 


[ 1139 ] 
*Ryans, Davin G. Use of the National Teacher 
Examinations in Colleges and Universities. 
Teacher Selection Papers and Reports, No. 10. New 
York: National Committee on Teacher Examinations, 
American Council on Education, June 3, 1947. Pp. vi 
Paper, lithotyped. Gratis, * 

[ 1140 ] 
*Saetverr, JOSEPH G.; Lewis, Don; AND SEASHORE, 
Cart E. Revision of the Seashore Measures of 
Musical Talent. University of Iowa Studies, New 
Series, No. 338; Series on Aims and Progress of Re- 
search, No. 65. Iowa City, Iowa: University of Iowa 
cay October 1940. Pp. 62. Paper. $0.50. * (PA t5: 
2311 

[ 1141 ] 
XSarrER, Mary D. An Evaluation of Adjustment 
Based Upon the Concept of Security. University 
of Toronto Studies, Child Development Series, No. 18. 
Toronto, Canada: University of Toronto Press, 1940. 
Pp. 72. Paper. $0.65. * (PA 15:2274) 


J Ed (London) 74:139-40 Mr '42. C. M. 
Fleming. * importhnt..as a definite contribu- 
tion to the developing science of the measure- 
ment of personal attributes. A very careful 
account is given of the methods adopted in the 
construction, validation, and standardization of 
the scale, and of the means used for the avoid- 
ance of errors traceable to over-simplification 
or prejudice. Interesting corroboratory evi- 
dence as to the personal history of the subjects 
tested is appended from questionaries and from 


detailed studies of eight distinctive types, When 
material of this kind becomes more widely used 
it should bring fuller understanding of the per- 
sonal consequences of differing types of disci- 
pline at different stages of growth. Evaluation 
of any form of the service of Youth may also 
some day be possible in terms of such scales for 
tolerance, independence, and security. 


[ 1142 ] 
SANDIFORD, PETER; CAMERON, M. A.; Conway, C. B.; 
AND Lone, J. A. Forecasting Teaching Ability. 
University of Toronto, Department of Educational 
Research, Bulletin No. 8. Toronto, Canada: the De- 
partment, 1937. Pp. 93. Paper. $0.50. 


Brit J Ed Psychol 13:56 F ’43. Owing to 
continuous increase of unemployment amongst 
teachers in the Province of Ontario an attempt 
was made in 1934 to find a way of forecasting 
teaching ability and restricting the number of 
teachers in training. This bulletin gives an ac- 
count of two investigations which were made. 
The experimenters consider that both gave dis- 
appointingly negative results, but publish the 
methods and findings to indicate to other work- 
ers in the same field the difficulties of the prob- 
lem and the unfruitfulness of the methods used. 
They hope a positive solution may be found, 
and indeed in their own findings here and there 
are some positive factors of importance. There 
is a useful bibliography. 

For additional reviews, see 38:B471 and 
40:B 1056. 

[ 1143 ] 
X*SaNronp, R. Nevitt; ADKINS, MARGARET M.; Mr- 
LER, R. BRETNEY; Coss, ELIZABETH A.; AND OTHERS. 
Physique, Personality and Scholarship: A Co- 
operative Study of School Children, Monographs 
of the Society for Research in Child Development, 
Vol. 8, No. 1, Serial No. 34. Washington, D. C.: the 


Society, National Research Council, 1943. Pp. ix, 705. 
Paper, lithotyped. Out of print. * (PA 17:4319) 


J Ed Psychol 35:255-6 Ap '44. David V. 
Tiedeman. In recent years psychologists have 
become interested in the investigation of the 
organism as a whole, that is, in the study and 
understanding of the complete interplay and 
organization of the many factors determining 
behavior. * This monograph is a reflection of 
the trend. Tt presents the results of a three-year 
exploratory study of physiological, personality 
and environmental factors. In the three-year 
period, forty-eight children ranging in age from 
five to fourteen years were studied intensively. 
Repeated testing of forty-three of these subjects 
permitted longitudinal studies of behavior as 
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well as the usual cross-sectional studies. The 
few subjects that were studied and the authors' 
dubious use of correlation coefficients based on 
the grouping of subjects in the extremes of the 
distributions tend to vitiate the findings of this 
investigation. However, the authors are aware 
of the homogeneous character of their small 
population and are careful in limiting the nature 
of the generalizations drawn from their data. 
As a pioneer publication in the study of the 
whole child this monograph should serve to 
guide and crystallize the thoughts of the in- 
creasing number of investigators who are begin- 
ning to think in terms of the whole personality. 
Moreover, it contributes to the methodology of 
experimentation with the whole personality by 
introducing a technique, a so-called syndrome 
technique, that the authors have found to be of 
practical use. The authors are courageous in 
attempting a task as huge as the one they set 
for themselves. Although they failed to find 
many of the answers they were seeking, they 
have made a start along a path that will prob- 
ably become increasingly well used, and knowl- 
edge of their studies and results should aid 
future investigators in eliminating blind alleys 
before proceeding with their experiments. 

Q R Biol 19:177-8 Je '44. The authors of 
this monograph present here the results of a 
three-year growth study in which almost every 
conceivable variable of physique, health, family 
background, social adjustment, personality, and 
Scholarship has been investigated. The prob- 
lems involved, in constructing a useful sum- 
mary from the multitudinous data accumulated 
in such studies, are enormous, and the need 
which Sanford explicitly recognizes for avoiding 
preconceptions in breaking such new ground 
makes the problem of selectivity doubly diffi- 
cult. Omitting, selecting, simplifying, and ra- 
tionalizing all offer dangers. It seems likely, 
however, that the reader could have been treated 
better than these authors have treated him. 
Physique, Personality, and Scholarship is one 
of the most unreadable monographs that has 
come to the attention of this reviewer. Part of 
the difficulty lies in mechanical factors. The 
book is printed in mimeotype, has been badly 
proofread (it even contains a misspelling of one 
of the authors’ names), and one finds no evi- 
dence of attention to literary style. But the 
difficulty goes much deeper. Sanford writes in 
his summary that his group not only bit off more 
than it could chew, but that it "included things 


that were better rejected, things indigestible 
and things too poor in vitamins." No one can 
censure the researchers for failing to know in 
advance what leads would prove profitable, but 
deciding to stuff their readers with the un- 
profitable and indigestible, after it has been 
identified, is surely misguided. It has resulted 
here in smothering a large number of interest- 
ing observations and discoveries under an ava- 
lanche of graphs, calculations, correlation coeffi- 
cients, and special classifications that are, for 
the most part, of little value. On the positive 
side, Sanford and his colleagues must be cred- 
ited with a useful and thorough application of 
Murray's Thematic Apperception Test. Their 
studies have clearly advanced the usefulness of 
this tool, though it must be admitted that some 
readers will still boggle at the apparent lack of 
objectivity in certain aspects of the method. 
The authors use Murray's system of needs and 
press (the latter term being gratuitously and 
objectionably given the same spelling in both 
singular and plural) and show that the system 
has sufficient usefulness to repay its careful 
study. For the rest, time alone can tell what 
value this monograph has as a reference work. 
The reviewer suspects that it will suffer from 
the same deficiencies which have been men- 
tioned, but psychologists may welcome the thor- 
ough survey, by definitions and examples, of 
Murray's needs. One may hope, moreover, that 
certain hypothetical syndromes, relating these 
needs to each other and to intellectual variables 
will stimulate research. 


[ 1144 ] 
*Sarcent, Heren. An Experimental Application 
of Projective Principles to a Paper and Pencil 
Personality Test. American Psvchological Associa- 
tion, Psychological Monographs, Vol. 57, No. 5, Whole 
No. 265. Washington, D. C.: the Association, Inc., 
1944. Pp. 57. Paper. $1.75. (PA 19:1728) 


[ 1145 ] 
*Scnarr, PrisciLLA. The Effect of Crossed Domi- 
nance on Accounting Aptitude. Human Engineer- 
ing Laboratory, Inc., Technical Report No. 246. Boston, 
Mass.: the Laboratory, February 1946. Pp. 3. Paper, 
mimeographed. * 


[ 1146 ] 
*ScioxEtL, Frep J. Diagnosis of Individual Diffi- 
culties in Arithmetic, Second Edition. Edinburgh, 
Scotland: Oliver & Boyd Ltd., 1942. Pp. xi, 115. 4s. 6d. 
* (Toronto, Canada: Clarke, Irwin & Co., Ltd. $0.90.) 
POE ARDEN of the first edition, see 38:B474 and 40: 
1058. 


[ 1147 ] 
*Scuroever, Extvor Marte. On Measurement of 
Motor Skills: An Approach Through a Statistical 
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ca Pren, mas eye Pope. fang.” (PA 
19:2170) 
Am J Psychiatry 102:286-7 S '45. Knight 
Dunlap. This monograph bears the sub-title: 
"An Approach through a Statistical Analysis of 
Archery Scores," but it is never made quite 
clear what is being approached. The presenta- 
tion of archery scores and their comparison are 
all that is contained in the report. * The author 
seems to assume that readers are experts in the 
field of statistical methods; but since many psy- 
chologists are not experts in that field, and 
probably fewer psychiatrists are expert statisti- 
cians, it would have helped if the technicalities 
represented by the terms and formulae had been 
explained more fully. * The data are presented 
as group averages, with standard measures of 
variation; but presentation of the ranges of 
variation for the groups would have been help- 
ful, since no measure of variation tells the whole 
story. Improvement in scores was measured in 
two ways; for each range (distance), the aver- 
age scores for rounds 1 and 2 were 
with average for rounds 6 and 7, and the aver- 
age for rounds 6 and 7, and the average for 
rounds 1I and 12, The average scores of the 
first four rounds were also compared with the 
averages for the second four and the last four. 
These comparisons, however, are of group 
Scores, and there is no reason for thinking that 
an "average archer" is any less fictional than an 
"average rat." Tabulation of improvement by 
individuals from series to series would have 
been helpful, especially for comparison of learn- 
ing in archery with learning in other fields. 
Psychol B 44:182-3 Mr '47. James M. Lynch. 
* Because of the standardized nature of archery, 
many of the findings of the study—such as, for 
example, that the sum of the best two scores 
estimated individual skill more accurately than 
did the sum of the last two scores of the series 
of lessons, that the Range score on twenty-four 
shots was sufficiently reliable to be a useful test 
of group ability, that the lesson of standard 
length was too short to measure accurately the 
skill of the majority of subjects—will have 
direct application to testing in that field. Taken 
as a whole, however, the results obtained illus- 
trate and emphasize a fact which is of prime 
importance in the interpretation of test scores, 
namely, that a score made at any stated time is 
likely to estimate ability with varying degrees 
of accuracy. For the light it throws on the na- 
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ture of scores, the study should be helpful in 
suggesting procedures to improve the accuracy 
of measurement in other sports activities. The 
methods by which the various types of problems 
undertaken were handled are presented in de- 
tail. Careful definition of terminology and the 
adequate treatment given to the statistical tech- 
niques employed add to the effectiveness of the 
presentation. As an exposition of testing prob- 
lems that are unquestionably fundamental, the 
book is an excellent contribution. 


[ 1148 ] 
*Scorr, Ira D. Manual of Advisement and Guid- 
ance: Prepared in Accordance With the Approved 
Policies of the Veterans Administration. Wash- 
ington, D. C.: Government Printing Office, 1945. Pp. 
ix, 233. $1.25. * (PA 20:2051) 
[ 1149 ] 
*Scorr, M. Grays, AwD Frencnt, Estner, Better 
Teac Through Testing: A Practical Manual 
for the Physical Education Teacher. New York: 
S. Barnes & Co., Inc., 1945. Pp. viii, 247. $3.00. * 
(PA 19:2762) 
[ 1150 ] 
*Secet, Davin. High-School Credit and Diplomas 
Through Examinations and Out-of-School Expe- 
rience. United States Office of Education Bulletin 
1 No. 7. Washingto: D. C.: Government Printing 
, 1946. Pp. vi, 46. Paper. $0.20. . 
[ 1151 ] 
SerLixo, LoweLL Six x. Diagnostic Criminology. Ann 
Arbor, Mich.: Edwards eg E 1935. p^ vii, 
175. Lithotyped. Out of print. * (PA 10:1111) 


J Crim Law & Criminology 26:962 Mr-Ap 
"36. Frederick H. Allen. * A manual of this 
type will be of some help to those students and 
beginners in courts, but the somewhat dog- 
matic manner of presentation may be mislead- 
ing and somewhat harmful to an untrained per- 
son. The trained person will probably find little 
use for this book. There is a small bibliography. 

Social Sci 11:285-6 J1 36. Herman Hausheer. 
* a competently written account of the multitu- 
dinous techniques in use on the part of psychi- 
atric criminologists * No sociologist interested 
in the study and treatment of criminals can well 
afford to ignore this book, because it admirably 
gives expression to the best techniques now in 
use by competent psychiatric criminologists. 
Sociologists probably would welcome a more 
elaborate account of the material presented, giv- 
ing ample factual data of the validity of the 
psychiatric approach to the understanding of 
the offender. * 

For additional reviews, sec 40:B1068.1. 
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ies. e 
Agricultural Experiment Station, Technical Bulletin 
No. 9. Stillwater, Okia. : the Station, April 1940. Pp. 
88. Paper. Gratis. * (PA 15:412) 


Am Sociol R 5:805-6 O '40. Howard R. 
Cottam. This bulletin is among the most impor- 
tant studies of the material aspects of standards 
of living. It represents a refinement of previous 
methods for determining socio-economic status 
from a minimum number of items, With a criti- 
cal examination of previous research as his 
starting point, Sewell constructed and stand- 
ardized an abbreviated scale of socio-economic 
status which is applicable to Oklahoma farm 
families. The major shortcomings of this study 
are not in method or in workmanship but in 
the premises from which it proceeds. The con- 
cept of socio-economic status is said to include : 
“Cultural Possessions ; Effective Income; Mate- 
rial Possessions ; and Participation in the Group 
Activities of the Community,” yet the final 
scale is largely a measure of possessions. No 
items of income are included in the final scale 
and it is tentatively suggested in an appendix 
that “no sacrifice in reliability or validity would 
result from omitting the social participation 
items” (p. 83). It might be asked, then, is the 
concept “socio-economic status” as used in this 
study a valid concept? Even though factor anal- 
ysis of these four components shows that they 
meastire a common factor, the inclusion of social 
participation is questioned on the basis of re- 
sults given on page 84 where it is shown that 
intercorrelations of this factor with the other 
three are r = .366; .245; and .318 respectively. 
Its inclusion might also be questioned from the 
standpoint of logic. A further point might be 
raised concerning the classification of posses- 
sions into “cultural” and “material” categories. 
In addition to the discrepancy with current use 
of the concept “cultural,” there is much diffi- 
culty in assigning items to one category or the 
other. The technique of weighting might be 
questioned on statistical grounds since assump- 
tions of normality are made, and also on logical 
grounds since items to be retained in the scale 
are selected on the basis of their diagnostic 
capacity (the critical ratios). The extent of 
diagnostic capacity is not, however, taken into 
account in final weighting of the retained items, 
but is replaced by the sigma method of scoring 


and ending with logical conclusions. 
by standard 
certain 


Sociol & Social Res 25:194 N-D "go. A great 
deal of careful work has gone into the making 
of this scale. * 


[ 1153 ] 
Sueeny, Loretta Manta. A Study of Preadolescents 
by Means of a Personality Inventory. Washington, 
D. C.: Catholic University of America 1938. Pp. 


x, 76. Paper. $1.50. * (PA 13:5424) 
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piling up of certain kinds of answers eventually 
assumes diagnostic meaning. The reply is sound 
enough; indeed it means merely the application 
of quantitative methods and statistics to a pro- 
cedure we all use all the time. There are, how- 
ever, other difficulties. For example, how often 
is “usually” in the question about usually liking 
school. Not all of these difficulties can be ironed 
out by statistics. But, above all, any single reply 
has meaning only in relation to a whole gamut 
of other responses—indeed to the whole reper- 
toire of responses of which the person is capa- 
ble. For certain specific purposes, the isolated 
reply may be sufficiently indicative, although 
even here we are learning to be cautious; but 
when it comes to personality and maladjust- 
ment, it is quite evident that no summation of 
misunderstandings (for the response taken in 
isolation is likely to be not merely not under- 
stood but actually misunderstood) will yield 
wisdom. The reviewer would therefore deplore 
the use of this inventory for clinical investiga- 
tion and so, but perhaps less emphatically, would 
its maker. Children with deeply intrenched 
problems need a different approach. I should 
object no less, however, to her statement that 
it gives "a ready appraisal of the general per- 
sonality pattern of school children." The chief 
use is to help us single out children for more 
careful attention—children who may be more 
maladjusted than we had suspected from casual 
observation. More adequate norms are needed 
if the Inventory is to be widely used for this 
purpose, but these in time will doubtless be pro- 
vided. Is it not, even so, a double-edged weapon? 
It is bad when a teacher ignores the signs of 
personally maladjustment, but it is worse when 
he finds a "problem child" beneath every curly 
or straight-haired top. Perhaps the only answer 
is that we must learn to use double-edged weap- 
ons, that for too little psychological understand- 
ing we must substitute more. 
For an additional review, see 40:B 1069. 


[ 1154 ] 
Su UTTLEWORTH, FRANK K. The Adolescent Period: 
A Graphic and Pictorial Atlas. Monographs of the 
Society for Research in Child Development, Vol. 3, 
No. 3, Serial No. 16. Washington, D. [o : the Society, 
National Research Council, 1938. Pp. v, 246. Paper, 
lithotyped. Out of print. * (PA 12:5061) 


Am J Orthopsychiatry 12:375 Ap '42. L(aw- 
son) G. L(owrey). * contains about the most 
- comprehensive sets of data yet seen. Invaluable 
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as a statistical background for any sort of re- 
search in this age period. * 
For additional reviews, see 40:B 1071. 


[ 1155 ] 
XSiEnRECHT, ELMER B. Measuring Driver Attitudes. 
Research Contributions to Safety Education, Vol. 2, 
No. 3. New York: Center for Safety Education, New 
York University, 1941. Pp. 29. Paper. $0.45. * (PA 
16:738) 

[ 1156 ] 


*Simmons, KATHERINE. The Brush Foundation 
Study of Child Growth and Development: II, 
Physical Growth and Development. Monographs 
of the Society for Research in Child Development, Vol. 
9, No. r. Washington, D. C.: the Society, National 
Research Council, 1943. Pp. xvii, 87. Paper, lithotyped. 
$1.25. (PA 19:835) 


El Sch J 45:600-1 Je '45. Gustav J. Froeh- 
lich. * contains a summary and a statistical 
analysis of the accumulation of data on physical 
growth and development gathered during the 
period 1931-42. The children included in the 
study numbered 999 and were of North Euro- 
pean stock, above average in economic and edu- 
cational status, and free from gross mental, 
physical, and nutritional defects. The children 
were examined at three, six, nine, and twelve 
months of age, then at six-month intervals until 
and including the fifth birthday, and thereafter 
at yearly intervals on or near the birthday until 
and including the seventeenth birthday. The 
number of examinations ranged from two to 
seventeen per child, averaging nine per child. It 
was necessary to include new individuals each 
year because participation was on a voluntary 
basis. Thus the study is both cross-sectional and 
longitudinal. This factor, however, is not con- 
sidered to be a limitation of the validity of the 
data. * The most important aspect of Miss 
Simmons’ monograph is not her conclusions 
(all of which have been shown or estimated 
previously), but, rather, the thoroughness of 
her treatment. Statistical data are presented in 
great detail. Seventy-eight tables and twenty- 
seven figures are included in the eighty-eight 
pages of the monograph. Interpretations are 
tempered by caution—in a number of instances 
by overcaution—resulting in extremely detailed 
statistics and some repetitious presentation. 
The classroom teacher may be interested in 
Miss Simmons’ conclusions but will find it dif- 
ficult to maintain sufficient interest to follow 
the very detailed tables and text. The research 
worker, on the other hand, will recognize this 
comprehensive body of data as a valuable con- 


933 ] 


BOOKS & REVIEWS: SHUTTLEWORTH 1159 


tribution to the study of child growth and de- 
velopment. 

Q R Biol 20:195 Je '45. Adolph H. Schultz. 
* The data which have here been analyzed 
statistically approach closely to the best that 
can be desired, and hence this report contributes 
unusually reliable cross-sectional age-sex con- 
stants for 25 anthropometric dimensions and 
for assessments of skeletal age during the entire 
period of active growth. All the many detailed 
findings are clearly discussed in the text and 
fully shown by means of tables and graphs. Of 
the conclusions, the following might specially 
interest all those with faith in popular age- 
height-weight charts: "The 'normal' weights 
found in height-weight-age tables approximate 
the truly normal only when height is not com- 
posed excessively of either leg length or trunk 
length and when ratings of body breadth are 
in harmony with ratings of body length." 


[ 1157 ] 
*Sims, VERNER M. A Report on the University of 
Alabama Freshman Testing Program 1927-1941. 
University of Alabama Bulletin, New Series No. 235; 
Studies in Education, No. 3. University, Ala.: Bureau 
of Educational Research, College of Education, the 
University, 1941. Pp. 22. Paper. $0.50. * (PA 17:328) 
[ 1158 ] 
Situ, B. OTHANEL. Logical Aspects of Educa- 
tional Measurement. New York: Columbia Uni- 
versity Press, 1938. Pp. x, 182. $2.50. (London: Ox- 
ford University Press. 12s. 6d.) * (PA 13 :2188) 


Lib Q 9:369-71 J1'39. Alice I. Bryan. * an 
original, timely, and valuable contribution to 
educational science. Although his book may 
find a rather limited audience, since it assumes 
some knowledge of technical developments in 
the field of achievement testing, it is sufficiently 
broad in its theoretical implications to be of 
interest to any intelligent reader who has kept 
abreast of current viewpoints and research in 
learning. The admirable clarity and economy 
with which the author expresses his ideas and 
the concise summaries at the close of each chap- 
ter should make his work intelligible even to the 
reader with little background in logic, mathe- 
matics, and scientific method. * Professor Smith 
hopes that his discussion of the fundamental 
principles of educational measurement will con- 
tribute toward a clarification of some of the 
problems of educational science. His hopes 
should be amply fulfilled. 

Psychoanalytic Q 10:497-8 Jl "41. Elizabeth 
H. Ross. An interesting statement of the special 
problems of this specialized educational field, an 


adequate and thoughtful account is given of the 
foundation and the development of the science 
of measurement. * The style, unfortunately, has 
the flavor of a Ph.D. thesis. The text is clut- 
tered with exaggerated acknowledgments to 
various authorities and is unnecessarily repeti- 
tious. And in spite of the author’s humanizing 
conclusions, the book remains a far better refer- 
ence on the discipline of mathematics than on 
the human beings whose behavior is in question. 

Psychoanalytic R 28:279 Ap '41. * The “self” 
as a “structure of attitudes” is an apt but in- 
complete phrase which the author would sup- 
plement, with a gestalt background. The author 
makes out a good case for his general thesis. 
We cavil a bit, albeit small, that one finds little 
or no reference to holistic conceptions of the 
human animal. 

For additional reviews, see 40:B1076. 


[ 1159 ] 
*Suitn, Evcene R.; Tyrer, RALPH. W.; AND THE 


EVALUATION Starr. Appraising and Recording 
Student Progress. Adventure in American Educa- 
tion, Vol. III. New York: Harper & Brothers, 1942. 
Pp. xxiii, 550. $3.50. (PA 16 :5033) 


Curric J 14:18-21 Ja '43. Cyril O. Houle. 
* In attacking the problem of evaluating the 
objectives of progressive schools, Smith and 
Tyler and their associates formulated a proce- 
dure which involves seven major steps. The 
testing out and validation of this procedure is, 
perhaps, the major contribution of their study. 
* Eight years of hard thought and study have 
gone into this book. Much is in it that is not 
easily mastered and a third and fourth reflective 
reading of certain parts of it will yield produc- 
tive insights to the reader. It is an important 
book, and it will live for a time and have great 
influence. Yet,curiously enough,the very method 
which was used in developing it will shorten 
the period of time in which it will be a valuable 
and primary source and hasten the time when 
it will be esteemed only as a pioneer study. For 
the studies which the book reports enlisted the 
cooperation of many people and spread wide 
the knowledge of what evaluation can be. Al- 
ready, much of what the book says, which 
would have seemed original in 1935, seems al- 
most to be a part of common knowledge in the 
field of education in 1942. This broad base of 
understanding means that there are already 
many workers who are studying along the line 
developed in the book and who are already ex- 
tending and refining the concepts and instru- 
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ments described in it. * The careful and scru- 
pulously logical thinking which is demonstrated 
in this book provides a synthesis for the oppos- 
ing positions of the traditionalist and the pro- 
gressive. And thus, to a large extent, the prob- 
lem which Smith and Tyler set out to attack 
has been solved. 
H Sch J 25:334-5 N-D '42. A. M. Jordan. 
* sets forth not only a philosophy but supplies 
techniques for its implementation. The authors 
point out that in the past attempts at evaluation 
and appraisal have made two egregious errors: 
(1) they have constructed instruments which 
pretty largely measured the more superficial 
aspects of education, and (2) much of the work 
has been done by persons outside of the teach- 
ing staff itself. To correct these errors we must 
construct instruments which will evaluate the 
whole individual in action and we must enlist 
the cooperation of the teachers themselves. The 
reviewer found this volume very tantalizing to 
read. Samples of the appraisal instruments were 
sometimes given and at other times omitted. In 
some instances exact statistical procedures were 
used and in others not. All told this book is an 
exploratory volume which points the way to 
better appraisal and evaluation rather than fur- 
nishing us with records of accomplished facts. 
J Consult Psychol 7:209-10 Jl-Ag 43. War- 
ren G. Findley. * a progress report on the great 
variety of educational and psychological tests 
and other instruments of appraisal developed 
by the Evaluation Staff in the Eight-Year Study 
«to aid the thirty participating secondary 
schools in determining the extent to which they 
have achieved their avowed objectives. The psy- 
choeducational techniques described include so 
many fresh and promising approaches to the 
analysis of personality development, approaches 
based on careful thought proceeding from pro- 
found understanding of developmental psychol- 
ogy, that the volume merits the thorough study 
of all professional psychologists. It will be par- 
ticularly valuable to test constructors. * Applied 
psychologists, especially those in education, will 
be challenged by the attempt to appraise the 
ability of adolescents to apply advanced tech- 
niques of thinking. Distinctions between intelli- 
gence and achievement tend to disappear in 
tests of what may be called "applied intelli- 
gence.” These tests have been designed to re- 
veal ability to interpret data, to apply principles 
of science in specific situations, to judge what 
conclusions follow from particular assumptions 


and by what principles they follow, to locate ' 
and appraise the basic assumptions on which _ 
particular proofs depend, to apply democratic 
beliefs consistently and objectively to a variety 
of social problems, to apply facts and generali- 
zations to social problems, to appreciate basic 
similarities in works of art by the same artist. 
In all instances they yield results that may be 
broken down in a variety of ways to provide 
qualitative descriptions of a student's ways of 
thinking. By ordinary standards of measure- 
ment most of the tests are unreliable, but “ap- 
praising student progress” is here conceived as 
a qualitative task. * Despite the variety of well- 
adapted instruments, the observational tech- 
niques Tyler has developed elsewhere and 
recommends as basic to appraisal of many be- 
haviors are given no place in the account. Health 
habits and physical development in general are 
omitted from appraisal. * An excellent concise 
statement of the philosophy and objectives of 
record-keeping is given. The appraisers have 
recognized more clearly than many the psycho- 
logical effects of testing programs on adoles- 
cents. Appraisal by tests was held to the pur- 
pose of guiding students, Much testing was 
placed at the beginning rather than at the end 

of instructional activities. Comparisons, norms 
and ratings were avoided and behavior descrip- 
tions, in varying degrees of quantification were E 
substituted. The thirty schools were not typical | 
in that they serve an elite. Even the ten public 
high schools were hardly representative of pub- 

lic schools. Nevertheless, psychologists in school | 
systems that have strong testing and guidance ` 
programs may well encourage use of the in- 
struments and practices of appraisal described = 
in this-report. They should welcome these mate- 
rials as a means of furthering through schools 
the guidance of personality development in 
adolescents. : 

Social Ed 8:94-5 F 44. Howard R. Ander- 
son. * Many reviewers have already argued 
that this book should be required reading for 
school administrators, supervisors, curriculum . - 
and research workers, and students in tests and 
measurements, With that opinion this reviewer 
agrees. * Doubtless social studies teachers 
should ponder the suggestions in Chapters I 
and VIII for developing an inclusive and valid 
program of evaluation. * It is interesting to 
speculate on how the behavior patterns of social 
studies teachers who read the chapters men- 
tioned...may be modified. From their reading 
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they should have acquired (1) a better under- 
standing of the purpose and scope of an adequate 
evaluation program, (2) a knowledge of certain 
evaluation instruments developed in the Eight- 
Year Study and the purposes for which they 
can be used, and (3) an appreciation of how 
evaluation data should be used in the guidance 
of pupils. About all this reviewer would predict 
in the way of changed behavior is, however, that 
many teacher-readers will give one or another 
of the P.E.A. tests to their pupils and will make 
better use of the information derived from re- 
sults on these and other tests. And that cer- 
tainly will be to the good ! In general, classroom 
teachers certainly cannot be expected to develop 
evaluation programs of the kind described in 
this volume. Quite obviously they lack the nec- 
essary time, ability, and expert supervision. In 
most cases teachers probably should be advised 
to formulate carefully the objectives sought by 
their school, and to choose from published tests 
and evaluation instruments those which best 
meet their needs. * (1) Some of the tests dis- 
cussed seem complicated (for example, 5.21 
Nature of Proof, 5.31 Analysis of Controver- 
sial Writing, 1.41 Social Problems, and 1.5 
Applying Social Facts and Generalizations to 
Social Problems). At what grade levels can 
these and other P.E.A. tests be used? (2) Be- 
cause the responses expected of pupils who take 
the tests just mentioned are so complicated, one 
wonders what correlation these and other P.E.A. 
tests have with tests of general intelligence. (3) 
Having done some experimental work with so- 
called attitude scales, this reviewer wonders 
what tests 4.21 and 4.31, Beliefs on Social Is- 
sues, really measure. This question is considered 
by the authors (pp. 225 ff.) but their general 
conclusion is that the tests are valid. One may 
wonder whether the fact that pupils react the 
same on the tests and in interviews is conclusive 
(p. 227). And it is difficult to interpret the 
statement (p. 228) that "over 9o per cent of 
the judgments of the teachers coincided with 
the test ratings." What is the precise meaning 
of the term “coincided”? But these "doubts" 
should not keep a teacher from using the 
P.E.A. tests. If he uses them perhaps he can 
answer the questions for himself. 


[ 1160 ] 
*SwirH, Henry Lester, AND Eaton, MERRILL T. 
Analysis of the Proficiency in Silent Reading of 
11,424 Sophomore Pupils in 243 High Schools 
in Indiana. Bulletin of the School of Education, In- 


diana University, Vol. 22, No. 1. Bloomington, Ind. : 


Indiana University Bookstore, January 1946. Pp. 46. 
Paper. $0.50. * A 
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$0.50. * (PA 20:2912) 
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*Smiru, Henry Lester, AND Eaton, Merritt T. A 
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University Bookstore, September 1940. Pp. 68. Paper. 
$0.50. * (PA 15:1070) 
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Spencer, Doucras. Fulcra of Conflict: A New Ap- 
roach to Personality Measurement. Yonkers, 
K. Y.: World Book Co., 1938. Pp. xii, 306. $2.52. * 
(PA 13:1534) 


J Abn & Social Psychol 36:130-1 Ja "41. 
Jerome S. Bruner. * deals with the construction 
of a battery of tests for adolescents designed to 
reveal those areas of behavior in which ground 
for conflict is present * An experimental form 
of the test-battery has been administered and, 
although no direct validation was attempted, 
presumptive evidence of validity in the form of 
a high measure of internal consistency is pre- 
sented. High correlations, however, between 
section scores and total battery scores and be- 
tween individual sections are very questionable 
indices of validity when applied to this type of 
material. Divergence of one's own ideals from 
those of one's father need not indicate, for ex- 
ample, a high general conflict score nor a high 
conflict score on any other section of the bat- 
tery. The conventional "consistency" technic of 
validation is simply not applicable here. Since 
the author reports that he is attempting a val- 
idation by the use of the interview, these ques- 
tionable findings can be temporarily overlooked. 
Work on the test is continuing, and the in- 
genious beginning reported in this book gives 
promise for the development of a useful instru- 
ment for the study of conflict. The reviewer, 
however, is at a loss to understand why, with 
the research still in such an unfinished state, the 
author undertook publication in book form. 

Psychoanalytic Q 11:255-6 Ap ’42. H. E. 
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Chamberlain. * Spencer presents a hypothesis... 
that arbitrary scoring keys similar to those used 
in measuring intelligence are of little value in 
measuring personality. Briefly stated, the hypo- 
thesis is that currently used personality test 
items have no dependable meaning in the ab- 
stract, and that responses to them can be evalu- 
ated only on the basis of their personal meaning 
to the individual subject and that factors in the 
concrete experience of the individual determine 
the personal meaning of such responses. Per- 
sonality conflict is defined as the degree of 
discrepancy and incongruity between the in- 
dividual's report of his own characteristics or 
behaviors and his report on corresponding items 
in relation to the fulcra of conflict; i.e., the 
discrepancy between his behavior and his ideals 
regarding such behavior, his mother's behavior, 
his father's behavior and the behavior of his 
closest associates. The bibliography is adequate, 
the innovations in methodology are justifiable 
and the conclusions are conventionally stated. 
In a way, this effort of the author to bridge the 
gap between enumeration and appraisal by a 
clean-cut forthright hypothesis is indeed reas- 
suring if not actually convincing. He says know- 
ingly: 'In psychological counseling, as in 
measurement, behavior in isolation—in the 
abstract—is a fiction of little value, the para- 
mount significance of behavior being in terms 
of what it means to each subject in his own 
concrete experience." 

Psychoanalytic R 30:478 O '43. A. N. Foxe. 
* The volume already indicates a huge amount 
of work and the further studies may be pa- 
tiently awaited. There is a large bibliography 
and numerous tables. The volume is dedicated 
to Marion E. Kenworthy, an accredited psy- 
choanalyst ; the author more or less depreciates 
psychoanalysis (p. 37) ; one wonders how much 
of this personality work would have been pos- 
sible without Freud's contribution on parent 
child relationships. 

For additional reviews, see 40:B 1081. 
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*Sprer, JuLius. The Hands of Children: An In- 
troduction to Psychochirology. Translated by Vic- 
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Co. Ltd., 1944. Pp. xii, 179. 16s. 6d. (PA 19 :836) 
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Sreap, WirLIAM H.; SHARTLE, CARROLL L.; OTIS, Jay 
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tional Counseling Techniques: Their Develop- 


ment and Application. Published for the Technical 
Board of the Occupational Research Program, United 
States Employment Service. New York: American 
Book Co., 1940. Pp. ix, 273. Out of print. * (PA 14: 
2627, title only) 


Ed Res B 21:111 Ap 15 '42. Ross L, 
Mooney. Occupational counseling techniques 
are not discussed. The Worker-Analysis Sec- 
tion of the Occupational Research Program of 
the United States Employment Office is making 
its report. This is an exposition of the aims, 
procedures, and results of the initial work of 
this group. The techniques discussed are those 
of research and not of counseling. Therefore 
the volume is not of direct use to counselors, 
but is of chief value to research and professional 
workers in the guidance and employment fields 
who are interested in following the develop- 
ments of research in the United States Employ- 
ment Office. * There is a nice perspective 
running through the report and no more is 
claimed for the results than is justified. The 
book is well worth its publication, though 
chiefly for its indirect values in showing a com- 
petent staff hard at work on a challenging job. 
The direct results are not far enough along to 
be of much use except in highly limited areas of 
employment-service activity. A less misleading 
title would have been appreciated. 

J Am Stat Assn 35:713-5 D '40. Harold E. 
Burtt. * interesting book * Readers slightly fa- 
miliar with the field of vocational testing will 
find the book valuable as a survey of conven- 
tional research procedures—almost a manual. 
Those experienced in the field will be interested 
in numerous variations and improvements of 
methodology recounted. * The emphasis obvi- 
ously is on the ultimate use of the tests for 
guidance rather than solely for selection, The 
authors stress the fact that vocational counselors 
should not be allowed to use such tests until 
adequately trained. The reviewer, mindful of 
what happened in the testing field in 1919, 
heartily seconds the motion. Development of 
oral trade tests constituted a considerable part 
of the program. * The scales for rating sales- 
people and interviewers will be of especial in- 
terest. * the phrases do not extend from a high 
to a low rating across the page but are mixed 
so that the rater will have to read them all. 
Why the above form was used rather than the 
conventional graphic scale is not explained in 
the text. At any rate the reliabilities reported 
with this form compare favorably with those of ` 
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other rating scales. * There is an interesting list 
of criterion items for retail sales clerks such as 
sales, quota, bonus, errors in computation, shop- 
ping report grades. The chapter on construction 
of aptitude test items is centered around depart- 
ment store sales persons. It includes a good 
summary of the literature on tests for that par- 
ticular job. Then follows a rather detailed 
description of the process of analyzing the 
data statistically, with the more technical details 
in the appendix. An interesting procedure was 
followed when the weights for tests or items de- 
rived with one sample gave low validity in 
another sample. * The eight appendices give 
some of the technical details and procedures 
mentioned in the text and portions will be of 
particular interest to statistical readers. * The 
book will fill a very useful place. While the de- 
scription of methods is for the most part con- 
ventional it does give a good summary of 
procedure in an up-to-the-minute setting. The 
discussion will be elementary for the more 
technically trained reader but a very good out- 
line for the beginner, In short it could almost 
constitute a manual for a researcher in this field. 
Even the technically trained will find some stim- 
ulating suggestions and original variations of 
conventional procedures. The work would also 
constitute a good reference book for academic 
courses in vocational psychology. However the 
free-lance counselor should not be mislead into 
thinking that it will provide him with a set of 
instruments for his practical work. Unless he 
knows a good deal about tests and has had 
sufficient training in their interpretation he will 
find the book “over his head.” For persons un- 
dergoing training in the U. S. Employment 
Service for counseling positions in the organiza- 
tion the book is admirable. Such persons will 
doubtless call it “the Bible.” 

J Consult Psychol 5:295-6 N-D "41. G. E. 
Bathhurst. This book lies in the field of job 
analysis and measurement techniques rather 
than in the area of occupational counseling 
techniques, in the opinion of this writer. For to 
him counseling techniques are involved pri- 
marily in the process of interpreting data, in- 
formation and other materials to an individual 
in terms of his needs to the end that he may 
make satisfactory choices in the solution of his 
problems. The techniques involved in obtaining 
accurate data and information about jobs and 
about the individual, the subject of this book, 
while essential to vocational counseling, are pre- 
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liminary to it. * The authors are especially 
effective in presenting techniques of testing and 
ways and means of verifying the reliability and 
validity of test results in the measurement of 
certain human characteristics. * The book as a 
whole is a definite contribution to the areas of 
measuring human characteristics that have oc- 
cupational significance and of transplanting 
these characteristics into occupational profi- 
ciency, both potential and real. One misses 
certain theoretical considerations which seem 
essential to the writer as underpinnings to assist 
in evaluating the contribution of the book, to 
aid in orienting it in the literature in the field 
and to clarify the thinking background of those 
working in the vocational field. There are cer- 
tain problems in this area which the writer 
would like to have seen discussed more fully. 
For example, the meaning of trait patterns or 
patterns of human characteristics. What do cer- 
tain of these patterns mean when translated into 
vocational proficiency? This problem is the 
“bottleneck” in the program of the vocational 
counselor. However, the authors have kept 
closer to their data than such a discussion would 
have permitted. For the statistician, the book 
contains in the appendices many tables of 
norms, standard deviations, and other statistical 
symbols. This book is not a text. It was written 
primarily as a contribution to a better under- 
standing of how to measure reliably those 
human characteristics that have vocational sig- 
nificance and how to translate these human 
characteristics into vocational proficiency. How- 
ever, if one is looking for an explanation of 
ways and means of interpreting this translated 
information to the counselee so that he can 
utilize it himself in making his own choices, 
(the heart of vocational counseling,) one will 
be disappointed. The authors do not state but 
apparently imply that if the vocational coun- 
selor knows what certain human characteristics 
mean in terms of occupational or job profi- 
ciency, he will be able to so interpret his in- 
formation to the counselee that the latter will be 
able to use it in making his own choices in the 
solution of his vocational problems. But this 
omission does not necessarly detract from the 
great worthwhileness of the book. Tt could be 
read and studied with profit by the vocational 
counselor in general and those connected with 
federal and state employment service in par- 
ticular. The book is quite readable, straight- 
forward and to the point. It is a book that 
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might well be in the personal library of anyone 
interested in vocational guidance or employ- 
ment work. _ 

Nation's Sch 26:96 O ’4o. Valuable presenta- 
tion of current occupational guidance technics. 
Designed for college classrooms and for guid- 
ance counselors. 

Occupations 19:618 My "41. Eleanor Loeb. 
* The material of this book could perhaps have 
been presented more concisely and effectively 
by one writer rather than by several research 
analysts. However, the research project itself is 
of unquestionable value in pointing the way 
toward further study of occupational counseling 
techniques which can profitably be made by all 
community agencies dealing with the problem 
of occupational guidance and job placement. 

Sch R 48:632-3 O '40. Dean M. Schweick- 
hard. * The subject matter is typical rather 
than comprehensive in that it presents a method 
which might be applied, with modifications, to 
almost any situation involving occupational se- 
lection. The graphs, charts, and tables are 
extremely well prepared ahd are generously 
supplied throughout the entire volume. Looked 
upon as a whole, the book appears as a rather 
unified, running account of the experience of 
the Employment Service in setting up tech- 
niques for the analysis of occupations and the 
analysis of individuals, with the ultimate end 
of matching the appropriate elements of each 
with the other. * This book does not attempt to 
supply counselors with information about occu- 
pations or to give sources where such informa- 
tion can be secured ; rather it indicates lines of 
approach. * will serve not as a busy worker's 
handbook but rather as a. fundamental guide- 
book. It may be described more perfectly as a 
book presenting a technique for developing 
techniques of counseling. It constitutes a good 
foundation of information and reference and 
should be extremely helpful as a guide for 
training counselors or for setting up a program. 

For an additional review, see 40:B 1083. 
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Am J Mental Def 48:180-90 O "43. I. N(ew- 
ton) K(ugelmass). This book brings into focus 


various findings on the broad question of in- 
telligence; it attempts to relate technical issues 
to certain problems in modern life and proves 
most helpful to workers in psychology, educa- 
tion and related fields when baffled by the 
tremendous literature thus far as yet inco- 
ordinated. * 

Am J Psychol 57:105-9 Ja '44. Florence L. 
Goodenough. * The book is not easy to value. 
The style is prolix and the argument, which 
proceeds mainly by analogy, not always easy to 
follow. The task is not simplified by the author's 
apparent attempt to maintain a foothold in 
either camp * To the psychologist, the major 
interest of the book lies....in the more theoreti- 
cal sections dealing with the nature of intelli- 
gence and the principles of measurement. * 
Stoddard presents the following definition. “In- 
telligence is the ability to undertake activities 
that are characterized by (1) difficulty, (2) 
complexity, (3) abstractness, (4) economy, 
(5) adaptiveness to a goal, (6) social value, 
and (7) the emergence of originals, and to 
maintain such activities under conditions that 
demand a concentration of energy and a resist- 
ance to emotional forces." With this descrip- 
tion,....few will take very serious issue, though 
I believe that, with the possible exception of the 
sixth attribute, for which an adequate criterion is 
not éasy to find, a careful consideration of the 
implicit requirements of the third, which has been 
emphasized by Terman, will show that the re- 
mainder of the definition is largely if not wholly 
superfluous. Neither the "difficulty" nor the 
"complexity" of an act is of any intrinsic sig- 
nificance, but it is true that as the level of 
abstractness is increased a task becomes more 
difficult for the average person to perform with 
"economy" or "adaptiveness to a goal." Like- 
wise, thinking in abstract rather than concrete 
terms enables one to deal with more “complex” 
problems since it leads to the substitution of 
single broad symbols for multitudinous narrow 
categories. While I am not quite sure that Stod- 
dard's seventh attribute, "the emergence of 
originals," is completely covered by the process 
of abstract thinking, from Stoddard's statement 
(29) that he has included this attribute “be- 
cause of its special place at the upper end of any 
valid distribution of intelligence," T am inclined 
to think that it is. For at a corresponding level 
of abstract reasoning one is likely to reach a 
point at which the “old” concepts and symbols 
may no longer serve, and when this point is 
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reached, the test of extraordinary ability is 
found, not simply in the emergence of a new 


"way of dealing with the isolated problem at 


hand, but in carrying the process of reasoning 
to a form of solution that embodies the general 
principle at issue, and that will accordingly 


| serve for the more effective solution of all prob- 


lems of the class. I think this is what Dr. Stod- 
dard has in mind but his discussion nowhere 
makes the distinction clear. For surely the mere 
fact that an idea is "new" or "different" (the 


` descriptive terms used by him) does not neces- 


sarily entitle it to top rank in the distribution of 
intelligence. To accept this idea is to place a 
premium upon mere eccentricity. My major ob- 
jection to Stoddard's analysis lies in the fact 
that his seven attributes and his two conditional 
factors are all apparently placed on the same 
level. Nevertheless his description will be of un- 
doubted value to teachers and others with lim- 
ited scientific background for whom a detailed 
definition that makes little demand upon inde- 
pendent thought will often carry greater mean- 
ing than a more concise formulation in which 
many of the specific implications must be filled 
in by the reader. I amr somewhat at a loss to 
follow the reasoning that led to the selection of 
material for the second chapter, entitled The 


"physiological basis. Under this title one would 


expect to find some consideration of neurologi- 
cal development and of general organic func- 
tioning in the normal as well as in the abnormal. 
Except for two or three brief paragraphs on the 
nature of nerve action, however, paragraphs 
which will explain little except to those already 
familiar with the field, and a rather trite discus- 
sion of the *mind-body problem,' the chapter is 
almost entirely devoted to the question of the 
effect of brain lesions upon intelligence. There 
is but casual reference to the animal literature 
from which most of our meager information on 
this topic has been gained. * the author....is of 
the opinion each shift in an obtained measure- 
ment is at least a potential indication of a “true” 
shift in mental ability and in his discussion of 
such changes the implication that they are 
usually meaningful is implicit. That they do 
Mean something no one can doubt, but the fact 
which Stoddard neglects is that IQ shifts of fair 
magnitude sometimes appear when the tests are 
separated by only a few hours of time, especially 
when the subjects are young children. That the 
course of mental growth is not always as even 
and regular as it appeared from the early stud- 


ies when mental testing was new and the results 
of repeated tests were just beginning to ac- 
cumulate has been recognized for many years. 
* But the fact remains that the whole concept 
of IQ constancy remains as before—a question 
of probability. This fact was made very explicit 
by Terman in his original formulation published 
in 1919. In his The Intelligence of School Chil- 
dren (149) he states these probabilities as fol- 
lows: “Roughly speaking, the chances that an 
IQ will either increase as much as 6 points or 
decrease as much as 4 points are 1 and 2; that 
it will either increase as much as 12 points or 
decrease as much as 8 points, 1 in 5; that it will 
either increase as much as 18 points or decrease 
as much as 12 points, I in 20.” In comparing 
these figures with those of other investigators it 
is necessary to remember that they are based 
upon children of school age whose mean IQ was 
not far from 100. More recent investigation has 
shown that high IQ's vary more than low and 
that IQ's obtained for pre-school children are 
far less reliable than those secured at a later 
age. Most of the figures reported by Stoddard 
are based upon children who were both young 
and bright. It should also be remembered that 
even a change of 20 IQ points, while material 
(if “true”) is only a little more than one stand- 
ard deviation of a representative distribution. 
The emphasis which Stoddard places upon IQ 
inconstancy is perhaps a healthful reaction from 
the blind faith in the absolute diagnostic signifi- 
cance of a single mental test that has been 
characteristic of many persons in the field of 
education; a faith that has persisted in spite of 
the continued labors of research psychologists 
to present the facts. But it is regrettable that 
in his zeal to demonstrate the hazards of pre- 
dicting future mental-test standing from present 
findings, Dr. Stoddard is so ready to reify 
change in standing. Particularly is this unfor- 
tunate since the difference between two fallible 
measures; eg. a change in IQ, is obviously 
more fallible than either, since two experimen- 
tal errors instead of one are involved. The pos- 
sible educational and social consequences of 
engendering among those responsible for the 
welfare of children a belief that by appropriate 
forcing procedures the backward can be made 
bright, or of valuing a school or teacher in 
terms of changes in the IQ's of students cannot 
be lightly disregarded. I am also somewhat dis- 
turbed over the author’s insistence that test re- 
sults should be accepted at their face value 
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without reference to the factors that may have 
contributed to the variance in individual cases. 
* [ do not think the argument is valid. The dis- 
tinction between the appraisal of a performance 
and the appraisal of an individual on the basis 
of a performance is vital. No one cares in the 
least how well or how badly the subject may 
have done on the brief series of tasks constitut- 
ing a single mental test except insofar as this 
serves as a basis for estimating his ability to 
solve other tasks that he will encounter in the 
course of his daily living. The tester is much 
more than a “timer”; he is—or should be—a 
highly trained observer, quick to note small be- 
havioral cues and experienced in the interpreta- 
tion and utilization of whatever supplementary 
information can be obtained that may be of 
service in making the diagnostic transition from 
test score to level of ability. To do this is not, 
as Stoddard indicates, a confusion of the issue 
but to fail to do so may lead to gross misunder- 
standing, both of the subject’s present ability 
and, consequently, of any “change” that may 
thereafter occur. In Part V, Intelligence and 
Society, the author presents his views on a wide 
variety of educational and social problems. This 
part of the book is interesting and stimulating 
but it is not closely related to the sections that 
precede it. 

Am J Sociol 49:265-6 N "43. Robert E. L. 
Faris. * The sociologist is more likely to in- 
cline toward Stoddard's side on the issue, not 
merely because of freedom from biological bias, 
but more because he is by virtue of his study 
aware of the complex and subtle interconnec- 
tions between social experience and mentality. 
The more we become familiar with such influ- 
ences, the more hopeless it appears to attempt 
to measure such a thing as innate intelligence 
that is independent of experience. * The work 
is, on the whole, an able, well-organized, and 
in places sparkling exposition of the systematic 
view of the members of the rebellion in educa- 
tional psychology. It is a statement that calls 
for respect, whatever our prejudices on the 
issues may be, and which deserves to be known 
as an important work. 

Am Sociol R 8:618-9 O ’43. Paul R. Farns- 
worth, * The frame of reference is mainly that 
of radical environmentalism, although occasion- 
ally the social philosophy of interactionalism is 
implied. The data of the book are carefully se- 
lected to bolster the author’s theoretical stand. 
A few experiments which point in other direc- 


tions are mentioned and criticized, but the 
major critics are ignored. * the highly contro- 
versial Iowa data have been sharply criticized 
by a number of competent researchers. Their 
criticisms seem to fall into three categories: 
that the experimental procedures of the lowa 
researchers were occasionally unsound, that 
the statistical treatments were often faulty, and 
that serious semantic errors were present. Now 
at this date nothing can be done about the way 
in which the Iowa data were collected. But is it 
too much to ask that the major objections to 
the statistical procedures and the semantic uses 
be at least mentioned? Although Stoddard 
demonstrates that he is aware of the semantic 
errors of certain others he still fails to see that 
if a preschool test and the Stanford Binet are 
measuring almost totally different variables it 
is wrong to give the identical label to what each 
measures. Yet a goodly share of his argument is 
based on just such a semantic absurdity. * With 
Stoddard’s social philosophy, if not with his 
data, the reviewer is in substantial agreement. 
Too long has social science been hamstrung by 
the extreme hereditarians and by pseudoscien- 
tists of the Popenoe sort. The ‘philosophy of 
interactionalism is apparently much closer to 
the truth than is their dogma. Yet, even now, 
definite answers to pressing social problems can 
rarely be given. And, since this is true, is it not 
good sense to ascertain the IQ's of the parents 
of the child one may desire to adopt? Yet Stod- 
dard rules against this. And, although one may 
not relish the wild claims of the eugenists, must 
he blind his eyes, as does Stoddard, to the fact 
that the sterilization of certain defectives al- 
lows them to marry and cate for themselves 
successfully without the crushing responsibilities 
children may bring? It would seem that a belief 
in the prognostic value of IQ’s or in the social 
utility of an occasional sterilization should not 
make one per se a hereditarian. The Meaning 
of Intelligence is, in the main, an interestingly 
written book. Whether or not one agrees with 
its data and its social philosophy one should, 
the reviewer feels, become acquainted with what 
the progressive educators will in all probability 
soon accept as another of their gospels. 

Ed Res B 23:220 N 15’44. Francis N. Max- 
field. Reports of research on the influence of 
environmental factors on psychometric scores 
by investigators associated with the Iowa Child 
Welfare Research Station have led to so much 
controversial discussion that one might expect 
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the former director to give us some light on 
what has come to be called the “Iowa contro- 
versy.” The book is disappointing in this regard, 
and the author’s treatment of most of the other 
subjects suggested by the title is inadequate. * 
For the author of this book to point out the 
limitations of the “Binet method" in psycho- 
metrics, and the mistakes committed by those 
who have been overenthusiastic about psycho- 
metric scores in general, does not go far toward 
establishing the validity of interpretations of 
the Iowa researches, for which, paradoxically 
enough, this method was fundamental. * The 
chapter entitled "The Course of Mental Growth” 
is particularly disappointing. The Gompertz 
formula may not be the last word in growth 
curves, but why are the work of Courtis, or the 
findings of the Harvard growth studies, com- 
pletely ignored in this discussion? In a treat- 
ment that emphasizes gains in psychometric 
scores under a stimulating environment, why 
not discuss more fully what was actually taking 
place and the relation between learning curves 
and curves of mental growth? One does not 
question the value of the Iowa data, even 
though they revolve around what the author 
calls “the thin axis of the I.Q.,” and though 
they have been extravagantly acclaimed. How- 
ever, to obtain psychometric scores on tests 
standardized on populations living under aver- 
age conditions from children living under sub- 
standard conditions, and then to obtain higher 
scores from the same children after they have 
been living under better conditions, can hardly 
be considered surprising or revolutionary. 

J Appl Psychol 27:476-9 O '43. Henry E. 
Garrett. * Stoddard has evidently read widely ; 
but the informed reader will quickly discern 
that the references are highly selective and are 
usually for the purpose of defending a very 
definite point of view. The tone of the book is 
well conveyed by the “definition” of intelligence 
given on page 4 of Chapter I. Stoddard writes: 
“Intelligence is the ability to undertake activi- 
ties that are characterized by (1) difficulty, (2) 
complexity, (3) abstractness, (4) economy, 
(5) adaptiveness, (6) social value, and (7) the 
emergence of originals, and to maintain such 
activities under conditions that demand a con- 
centration of energy and a resistance to emo- 
tional forces.” There is certainly little that one 
could possibly add to this amazing congeries, 
which is, of course, no definition at all but a 
general summary of the 1921 symposium. I am 


quite sure that no tests now available, or likely 
to be available, could possibly encompass Stod- 
dard’s definition. Certainly even its most rabid 
partisan has never believed that the Stanford- 
Binet runs the gamut of human behavior. Ap- 
parently Stoddąrd is none too happy with his 
definition as he tries several times later on in 
the book to pin it down to earth. The most 
intelligible attempt, it seems to me, is on page 
285 where Doll’s statement that intelligence is 
only a part of “mentality” is quoted. * It will 
soon be abundantly clear, even to the casual 
reader, that one of Stoddard's purposes—if not 
his major purpose—is to discredit and demolish 
the IQ. The "inconstancy" of the IQ is the 
recurring theme in Parts I-III. To be sure, 
Point Scales receive a mild chastisement in 
Chapter 5, but the brunt of the assault (Chap- 
ter 4) is against the Stanford-Binet and the 
IQ. The evidence against IQ constancy is skill- 
fully martialed and is often cleverly distorted, 
as will be pointed out later. More important as 
giving the polemic and non-scientific character 
of the book is the frequent use of ridicule, at- 
tempted wit, exhortation, special pleading, and 
specious argument. The following are samples : 
p. 50, “The whole village knows the idiot, but 
academies of science have welcomed men whose 
schemes for differentiating IQ 70 from IQ 170 
were quite worthless"; p. 115, "Point scoring 
should reduce the fanatic attachment that some 
persons have for a fixed quantity that can be 
predicted, subsidized, and inherited—the mod- 
est virtues of the IQ"; p. 258, "the IQ, some 
say, is fixed: what varies is the relationship of 
an individual to it, such variation being a prod- 
uct of invalidity in the test and idiosyncrasy in 
the child—given a perfect test and an ever- 
normal child that IQ would be constant!”; p. 
120, where Thurstone's criticism of the use of 
IQs for adults is made to appear to be a criti- 
cism of IQs in general; p. 110, the futile 
and involved criticism of the vocabulary test as 
a measure of intelligence by the use of false 
premises and questionable logic. It is significant 
that throughout Stoddard's discussion of the 
age-scale and the IQ, almost nothing is said 
concerning the care with which the Stanford- 
Binet was constructed ; the adequacy of its sam- 
pling; the heroic efforts made to insure item 
validity; the careful determination of the PE 
of an IQ through which are computed the in- 
evitable fluctuations to be expected in the IQ; 
the many factors to be considered before a fair 
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estimate of IQ can be obtained or the IQ itself 
evaluated. On the whole, Chapter 4, “Transla- 
tions of Binet Tests" constitutes, I am sure, one 
of the most misleading and unfair treatments of 
the Binet-type test ever written. Stoddard's 
technique in Parts I-III is to. set up a straw 
man and then lustily lambast it. A very con- 
venient straw man is the proposition that the 
IQ is a fixed, invariable measure of native 
ability. I doubt if any experienced psychologist 
today seriously believes this or thinks that IQs 
never change. The IQ is essentially a statistical 
device through which a child's performance on 
a carefully selected array of tests is compared 
with the performance of white American school 
children of his own age, born and reared under 
ordinary conditions of American life. The Stan- 
ford-Binet is one of the few tests so constructed 
that a child's position with reference to his 
group (his IQ) can remain constant within its 
PE of measurement, The really important fact 
then is not the change in IQ itself, but the 
reasons for the change. Does anyone believe 
that the feebleminded child who shows an in- 
crease of 20 points in IQ after several months 
of good food and kind treatment in an institu- 
tion is "really" smarter? Or that the bright 
youngster whose IQ drops from 150 to 122 (Old 
Stanford-Binet) as he reaches his late ’teens 
has "really" deteriorated? Marked changes in 
IQ up or down are not facts either to bemoan 
or rejoice over. Rather such shifts are a chal- 
lenge to the Examiner. Is the change due to 
errors of measurement, to drastic changes in 
the environment, or to changes within the child 
himself? The first two seem the more plausible. 
Certainly studies of intensive practice have 
given little evidence of persisting changes in the 
level of a child's performance. And I think we 
should all agree that one cannot conclude that 
a change has taken place in the child's "true" 
ability until all possible extrinsic causes have 
been investigated and eliminated. Stoddard's 
thesis is that of the Iowa group, namely, that 
intelligence can be increased by training and 
that the IO, in consequence, is necessarily “in- 
constant." The evidence is carefully slanted 
towards “proving” this contention. I shall cite 
one example, to illustrate what I mean by “slant- 
ing.” A frequency polygon taken from P. Cat- 
tell is reproduced on p. 221 showing the changes 
in IQ (Stanford-Binet, 1916) upon retest after 
varying time intervals. The figure shows 3331 
comparisons obtained from about 1200 children 


over a 7-year period. Extreme IQ changes 
range from +50 to —50 points, the bulk of the 
changes, however, falling between -+20 and 
—20. Stoddard remarks, after quoting a state- 
ment from Cattell, that this result “is perhaps 
typical of IQ variations for large samplings 
based on the 1916 Stanford Revision." What 
the reader would not know from this discussion 
are the following facts: (1) Two or more ex- 
aminations were administered to each child, 
the time intervals between tests varying from a 
few months to 7 years. Many children had more 
than two examinations and the practice effect 
must have been considerable (admitted by Cat- 
tell.) (2) Cattell found a tendency for bright 
children to gain in IQ upon retest and for dull 
children to lose. Cattell points out also that in- 
cluded in her data are the records of many 
bright children who were tested several times. 
Both of these conditions would tend to exag- 
gerate IQ changes and automatically lengthen 
the range. (3) There were 60 different examin- 
ers. Stoddard writes that “while Cattell appears 
to feel that tests given by different expert ex- 
aminers will cause a spurious variation in meas- 
urement, there is no evidence for this in her 
report.” On the contrary, the evidence is clear 
(given on p. 616 of Cattell’s report) that the 
“personal equation” of the examiner makes a 
difference of 13 IQ points, on the average, and 
may make a difference of 20-25 points. (4) 
Taking the PE of an IQ to be approximately 
4 points, we may expect a change of +16 or 
more IQ points to occur at least once in 100 
times from errors of measurement alone. If to 
test unreliability is added the “personal equa- 
tion” of the examiner, changes of 20 or more 
points in IQ are hardly unexpected. But in 
spite of such variations, it may be noted that 
only 3 per cent of all comparisons yielded IQ 
differences of more than 20 IQ points. In short, 
by a change in emphasis and the presentation 
of further data, one could easily argue for a 
rather remarkable exhibition of IQ constancy 
from Cattell’s data. The same may be said of 
other evidence submitted by Stoddard. Part IV 
deals with heredity and environment. Here a 
rather amazing reversal of position takes place. 
The IQ, Previously demolished, is now rein- 
stated and refurbished, and becomes the means 
of “proving” the all-importance of environment. 
The Iowa studies of Wellman et al. are pre- 
sented to “prove” that a child’s IQ (and pre- 
sumably his intelligence) can be dramatically 
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elevated through nursery school training. The 
work of Skeels and Skodak is cited to “prove” 
that children of low grade or feebleminded par- 
ents when placed at an early age in a good 
foster home exhibit [Qs on the average above 
too (as high as 140). In one group of such 
children, only 4 in a total of 87 had IQs below 
8o. Practically all of the work quoted is out of 
Iowa. Leahy's study of foster children (perhaps 
the best of its kind) is described as “puzzling” ; 
L. S. Hollingworth and H. E. Jones are not 
cited ; “doubt is cast" upon the Minnesota stud- 
ies of Goodenough and others because of "se- 
lective factors" ;.Burks’ work is omitted in toto, 
as is also Woodworth's recent critical mono- 
graph. Almost nothing is said about the diffi- 
culty of obtaining valid IQs for adults; of get- 
ting fair measures from the mothers of illegiti- 
mate children; and of making certain of the 
“true” fathers of these children. I admit that 
an author is privileged to present his own point 
of view ; and even to select data which will sup- 
port it. But doesn't he also have the obligation 
of giving his readers a fair deal? Shouldn't the 
reader at least be told what criticisms have been 
leveled at the Iowa work by competent psy- 

- chologists? It happens that I agree with McNe- 
mar and Goodenough (whose criticisms of the 
Towa work are nowhere mentioned) that the 
Iowa investigators have committed almost 
every experimental and statistical error in the 
calendar, as well as the unforgivable sin of be- 
ing well-nigh unintelligible in their presenta- 
tion. And I think it is little short of criminal to 
suggest, even by implication, that no matter 
what a child's parentage may be one can count 
on his being "normal"—or even bright—if only 
he has a "good" home. * 

J Consult Psychol 7:302-4 N-D '43. Herbert 
S. Conrad. * The presentation, while not “pop- 
ular," makes unusually smooth and easy reading. 
The basic tenet....is that "intelligence" has been 
poorly defined and inadequately measured. The 
basic thesis and underlying principle is that a 
favorable environment exerts upon intelligence 
(properly defined) a marked and permanent 
influence. * intelligence tests typically aim to 
distinguish between capacity and acquired abil- 
ity—or, in Stoddard’s terminology, between 
“potential intelligence” and “delivered mental 
power.” This distinction, according to Stoddard, 
is irrelevant to intelligence-measurement, and 
generally best dispensed with. * the reviewer 
would like to suggest that both types of meas- 


urement—of "native" or "potential" intelli- 
gence, and of "functional" or "status" intelli- 
gence—have their place in scientific thought 
and practical application. A. further suggestion 
is that the correlation between adequate meas- 
ures of these two kinds of intelligence, among 
children who have had opportunities for contact 
with the customary educational influences, may 
be higher than Stoddard implies. * With regard 
to the heredity-environment issue, Stoddard 
stands squarely in the environmentalist camp: 
(1) “Nobody knows what any child could do 
if presented with facts and principles in an 
orderly, ascending sequence with full attention 
to his motivational needs" (p. 9). (2) “Retest- 
ing will show that any IQ obtained at any age 
may (a) remain constant; (b) move steadily 
upward; (c) move steadily downward; or (d) 
fluctuate. The IQ derived from a single mental 
lesting gives no clew whatever as to the course 
of future events" (p. 384). (Italics as in the 
original). In comment on the first quotation, 
let us point out that nobody “knows” anything 
in the social sciences (particularly when an “if” 
situation is introduced)—but we can estimate 
probabilities within rather vague limits of error. 
Most students of the subjects of heredity and 
environment, we judge, would question Stod- 
dard's optimistic estimate of the possible influ- 
ence of environmental stimulation upon basic 
(or even functional) intelligence. The second 
quotation cited above is puzzling to us. Appar- 
ently Stoddard means that, unless possible en- 
vironmental changes are taken into account, the 
IQ suffers in predictive value. For possibly 90 
per cent of the population or more, environ- 
mental changes pertinent to intelligence are 
usually not very marked ; at any rate, the correla- 
tion between retests of intelligence in ordinary 
samples is certainly greater than the italicized 
part of the quotation above seems to imply. * 
Stoddard's treatment seems, in general, over- 
loaded with studies from the Iowa group of 
researchers, and biased against studies which 
lead to an emphasis on the role of heredity. 
Conspicuously lacking is any comment on the 
critical reviews of McNemar, or the critical 
summary by Woodworth. In the presentation of 
nursery-school studies, the work of Jones and 
Jorgensen is excluded, though at least one other 
study of equal (if not lesser) importance is in- 
cluded. The presentation of data based on chil- 
dren in orphan asylums omits the important 
English study by Lawrence. Leahy's study of 
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foster children is included, but less completely 
than it deserves, and with an unjustified criti- 
cism of its basic rationale. The literature on 
heredity-environment is, of course, extremely 
large, so that occasionally an important study 
is likely to be neglected. Most of the omissions 
noted above, however, were doubtless deliber- 
ate. * One implication of intelligence-test re- 
sults, according to Stoddard, is that youths 
eighteen years or over should be given the right 
to vote. In this connection it is chastening to 
recall that, had the youth of Germany exerted 
greater political influence, Hitler would have 
risen to power sooner than he did. * Evidently 
with some emotion, Stoddard speaks for the 
"age-old right, the right to have children" (p. 
462). No one, of course, will deny that "age- 
old" character of the process, though some may 
doubt that it is always "right." Stoddard objects 
that the effects of a feasible program of sterili- 
zation of the feeble-minded would be both small 
and slow; while this may be granted, it should 
be added that the effects are fairly permanent, 
and without cost of maintenance. Stoddard's 
responsibilities as Commissioner of Education 
of the State of New York make his chapter on 
Educational Implications especially interesting. 
Unfortunately, however, the treatment is too 
brief and general to provide much basis for 
understanding or serious comment. * One of 
the best features of Stoddard's book remains 
to be mentioned: this is the generous, humani- 
tarian spirit which pervades the volume, and 
the strong sense of social consciousness. Stod- 
dard realizes that educational goals “may re- 
quire economic and social reconstruction” (p. 
434) ; he urges that "scientists....can save them- 
selves only by stimulating, demanding, and 
fighting for the kind of social organization that 
lets ideas live" (p. 311); he appreciates “the 
deadly dichotomy between the men who know 
and the men who control” (p. 300); he de- 
plores “the tragic deterioration that follows the 
wet finger to the wind and the quick ear to the 
ground” (p. 311); and he recognizes that 
“genius flourishes in a subtle confluence of 
what a nervous system has to offer and a so- 
ciety is willing to accept.” No one, in our judg- 
ment, could reasonably object to Stoddard’s 
general orientation and goals; everyone, we 
think—whether hereditarian, environmentalist, 
or neutral in belief—could proudly join in the 
fight for the social and educational gains which 
Stoddard evidently has at heart. 


J Ed Psychol 34:186-90 Mr '43. Benjamin 
R. Simpson. It is unfortunate for the advance- 
ment of science when differences in point of 
view or in methods of interpretation lead to 
heated controversy rather than to codperative 
effort. The much publicized reports of the Iowa 
investigators and their students, with their ex- 
travagant claims of the extent to which mental 
exercise brought about by a stimulating environ- | 
ment can raise intellectual ability, exemplify the 
point in a striking way. Burks, in two analyses 
which are admirable both for their accuracy. 
and their tolerant attitude, has made a laudable 
suggestion to the effect that.the controversy 
over the Iowa reports may well have its termi- 
nation in an attempt to appraise both sides of - 
the case, with a view to clearing up reasons for | 
differences in interpretation, with admission of . 
known errors, thus paving the way for coóper- 
ative effort. * If such a salvage project is the 
hope, there is little encouragement to be found 
in this recent broadcasting of the original Iowa. 
interpretations, which constitute the essence of 
Stoddard’s book on The Meaning of Intelli- 
gence. Doubtless the large majority of educa- 
tional psychologists would agree that the scien- | 
tific movement in education of the present 
century has made a very material and lasting | 
contribution to the interpretation of intellectual 
ability, and to the improvement of learning ad- 
justments in the schools of the nation. * Judged ` 
from the standpoint of its contribution to the 
cumulative store of sound insight to this area, — 
Stoddard's book is much more likely to add fog 
and detriment to the problem than illumination. — 
The general attitude is that of casual attack 
upon the best that has been built up over am. 
extended period of years in intelligence testing. 
It would seem that there are some educational 
theorists who would like to be as unhampered — 
in their educational interpretations as the child. 
was expected to be in the ‘child-centered school" 
of a decade or more ago. Adherence to sound 
use of tests and to sound statistical methods 
of interpretation is apparently found to be too 
restricting. The method used by Stoddard with 
a view to securing the desired freedom is two 
fold; first, that of flooding the field with ex- 
tensive quotations from the Iowa studies, as 
though the erroneous interpretations contained 
in them had never been pointed out at all; and, 
second, that of casting casual discredit upon the — 
use of the most carefully constructed and au- 
thentic tools of measurement available. For ex- 
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ample, McNemar, whose analysis of the star- 
tling fallacies involved in the Iowa reports has 
been the most extensive and adequate of all of 
the many critics, partly because he had the advan- 
tage of access to some of the original Iowa data, 
is not mentioned at all. Likewise, the wealth of 
investigations which refute the Iowa conclu- 
sions is hardly mentioned at all, or is errone- 
ously interpreted. The conclusions of the Iowa 
studies are cited under conditions which make 
critical consideration by the reader of the valid- 
ity of their interpretations practically impos- 
sible. In this way sanction is given to the use of 
methods of research in this difficult area which 
ignore many of the most important safeguards 
of valid inference of intellectual ability. Infant 
tests are accepted as valid though the careful 
work of Bayley, Anderson, and others has 
shown that they are not valid for the purposes 
for which the Iowa investigators use them. It 
is little wonder that the Iowa investigators find 
reason for discounting the value of the IQ 
when they so flagrantly disregard the factors 
which have been laboriously built up to safe- 
guard its claim to validity and usefulness. Nor 
does Stoddard qualify as a positive contributor 
in his attempt to point out unrealized limitations 
of intelligence tests when used as prescribed. * 
Stoddard's attack is characterized by extensive 
quotations from critics whose opinions are not 
based upon adequate experimental data, or upon 
recognized expertness in this specialized field. 
One who is interested in adequacy of statistical 
and psychological analysis of classical studies 
of the nature-nurture problem, would do well to 
compare Stoddard’s casual critical comment 
upon such well-known investigations as those 
by Burks and by Leahy, with the masterly and 
detailed analysis of the same studies by Shuttle- 
worth. The contrast is astonishing and most 
illuminating. The nature-nurture problem is one 
which calls for the use of data suited to its pur- 
Pose, and often for highly technical statistical 
analysis. These safeguards are conspicuously 
lacking in the data utilized so extensively and 
so ill-advisedly by Stoddard. A crucial problem 
which should be dealt with in the field covered 
by Stoddard is the explanation of ‘why’ mental 
exercise as stimulated in a good environment 
increases the general intellectual ability, in terms 
of established psychological principles. No defi- 
nite analysis of the nature of the mental exercise 
brought about, of what is gained by the exer- 
cise, or of the reason for the alleged breadth 


and startling significance of the transfer of the 
improvement brought about, is to be found, 
other than the vaguest generalities. The inter- 
pretations are altogether out of line with the 
extensive work which has been reported upon 
the transfer of training. There is little doubt 
that, in the long run, the elusive fallacies which 
are hidden from the view of the casual and un- 
critical reader will be revealed. Yet it is well 
to remember that it took at least fifteen years 
for the fallacies in the experiments of John B. 
Watson upon the significance of early environ- 
ment to come to light in all their superficiality. 
There are significant similarities in the appeal 
of Stoddard and that of Watson. Both brought 
to bear the weight of professional prestige. Both 
write in readable style with a popular appeal 
which capitalized upon popular prejudice. Both 
made interpretations altogether unwarranted by 
the experimental evidence. A typical illustration 
of such unwarranted interpretation appears in 
Stoddard's admission: “It is painfully clear that 
some children do not improve, and that other 
children, under apparently excellent environ- 
mental conditions, may continue to deteriorate.” 
This admission is in accordance with the facts, 
but it is Stoddard’s interpretation of these facts 
which is at fault, which is that it is the excel- 
lent environment which alone is responsible for 
the improvement of those who improve, but not 
for the deterioration of those who show an 
equally marked decline. Surely those governed 
in their thinking by the ordinary principles of 
logic would infer and search for the operation 
of factors other than the influence of the stimu- 
lating environment, which might account for 
either the rise or the decline. Moreover, Stod- 
dard seriously clouds the nature-nurture prob- 
lem by his implication that the issue hinges 
upon the alleged ‘constancy of the 1Q.’ The 
leading contributors do not and have not de- 
fended an absolute IQ constancy. The issue is 
rather the extent to which identifiable differ- 
ences in environmental stimulation cause marked 
change in actual intellectual ability. It is most 
unfortunate that the Iowa contributors fail to 
control, or to take into actual account in their 
interpretations, the possible influence of the 
many factors in addition to that of the stimula- 
tion of mental exercise, when they make their 
extravagant claims with regard to the extent to 
which high intellectual ability can be developed 
almost irrespective of biological inheritance. It 
is still more unfortunate that Stoddard lends his 
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influence to the promulgation of such unwar- 
ranted and harmful claims. 

J Higher Ed 14:451-3 N ’43. H. H. God- 
dard. * The reader meets numerous surprises 
—of both kinds. Many conclusions, generaliza- 
tions, and statements of accepted truths are 
clearly and forcibly stated in well-chosen words. 
Other pages arouse the more or less vague feel- 
ing that "all is grist that comes to his mill." 
Still again, there are times when one feels that 
he is being treated to a bit of sophistry. The 
data furnished are more easily and naturally 
interpreted in quite a different way. Of course 
there are slips of the pen and lapses of thought ; 
but who can be entirely free of these! When 
the author says “A child of seven could attain 
an LQ. of 150 by passing all tests up through 
fourteen years” the blunder is obvious. Lapses 
of thought are not so obvious. For instance, he 
says: “The Iowa workers....feel that, over the 
years the Stanford revisions have offered not 
very reliable measurements of functions not 
very close to intelligence.” His entire discus- 
sion of tests leaves one with the definite feeling 
that they are not very reliable. Yet when he 
comes to the much discussed Iowa claims that 
the intelligence quotient can be modified by im- 
proving the environment, he accepts without 
question all the Binet test results (pages 340 
et seq.). Indeed he goes further. On page 341 
he says: "These figures lend validity to the 
authors' conclusion that "Differences in educa- 
tional and social environment produce undeni- 
able differences in intelligence and scholastic 
achievement as measured by our tests.” All 
this in face of the fact that in the immediately 
preceding chapter he has, with the help of Jen- 
nings, settled the nature-nurture problem by 
the now generally accepted view that they can- 
not be separated. * What the Iowa studies dem- 
onstrate—as it seems to the reviewer—is that 
during the plastic pre-school age, the organism 
requires more and better stimulation than it 
frequently gets or than it needs later, just as 
it requires special food and more careful han- 
dling. This, it would seem, is a matter of far 
more importance than the question of whether 
a hypothetical intelligence quotient can be 
changed. * Perhaps we have all been wrong in 
applying the intelligence quotient to pre-school 
children at all; as well try to apply it to pre- 
natal life! Tt is evident that there is still much 
to be learned about the intelligence quotient 
and intelligence. What one misses in the book 


is a consideration not of the tests but of the 
testers. Most examiners are careful that the 
child is comfortable and in a proper state of 
mind while being tested. Otherwise the results 
are worthless as a measure of intelligence. Is 
the state of mind of the examiner any less im- 
portant? It has been pointed out that there are 
individuals who are temperamentally unfit to 
become mental testers. It might equally well be 
shown that most of us are at times in mental 
conditions that render our results equally value- 
less. Someone has said that we have our preju- 
dices always with us. Apparently the only thing 
we can do about it is to admit our prejudices 
and allow for them. The poet Horace said that 
men easily believe what they wish to believe. 
Today wishful thinking is recognized as the 
rule—to which there are, fortunately, some ex- 
ceptions. It would seem that the author has cov- 
ered every other point. * 

Occupational Psychol 19:39-42 Ja '45. Cyril 
Burt. * The book.... [the author] claims, "brings 
into focus various research findings on the 
broad question of intelligence", In point of fact, 
it would seem rather to consist of a systematized 
exposition of the psychological concepts, the 
democratic faith and the practical aims that 
have inspired the numerous studies on child 
welfare that have emanated under his direction 
from Iowa—an exposition supported more by 
copious extracts and quotations than by a scien- 
tific analysis of facts. * His definition of the 
concept of intelligence is unusual, not to say 
cumbersome * it does not include the common 
implication that ‘intelligence’, as a technical 
term in current psychology, denotes innate 
‘ability’, or rather the innate component in the 
ability to undertake the activities described. * 
The problem of heredity versus environment he 
virtually identifies with the problem of the con- 
stant I.Q., regardless of the fact that those who 
have believed in the influence of innate mental 
factors have been the first to admit that an 1.Q., 
as obtained by current procedures, may be pro- 
foundly affected by schooling, home environ- 
ment, and other non-intellectual factors. His 
review of “Studies based on the 1.Q.” forms 
the longest chapter in the book. But it is almost 
wholly confined to inquiries supporting his 
standpoint (chiefly researches undertaken at 
Iowa); and the numerous researches refuting 
the sweeping contentions of the Iowa school are 
scarcely referred to. * Unfortunately, the major- 
ity of those who read and accept these popular 
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7 ‘claims can gather no notion of the pitfalls be- 


setting such inquiries, and of the highly tech- 


- nical planning and analysis required to circum- 


vent them. * The experimental and statistical 


"techniques evolved by Banks, Shuttleworth and 


others to deal with the complexity of the prob- 


- Jem are never mentioned. McNemar's full and 


extensive examination of the numerous falla- 


cies involved in the Iowa reports themselves 


(especially valuable because McNemar had ac- 
cess to the original data) is passed over in 
silence. However, the more critical reader will 
probably feel that even Dr. Stoddard's own 
account does not carry complete conviction. If, 
as he has maintained, the I.Q. is so profoundly 
influenced by environment, are we justified in 
accepting it as an indication of the inheritable 
intelligence of the mothers? If so, how are we 
to explain the high educational level reached 
by these mothers at school? (Dr. Stoddard sug- 
gests promotion by age, but this scarcely seems 
adequate; and in any case the I.O. based on the 
1916 Stanford revision is notoriously unreliable 
for adults.) Moreover, in a country like Amer- 
ica and in a rural state like Iowa, is it safe to 
infer the low intelligence of the fathers from 
their low moral and occupational level? Can we 
suppose that the alleged fathers were always the 
real fathers? Moreover, of about 30 per cent. 
of the fathers nothing seems to have been 
known: is it fair to assume that these were also 
of the same low intellectual grade? (Closer in- 
quiries have shown that illegitimate children 
whose paternity remains undisclosed are apt to 
be the offspring of fathers from higher social 
levels.) Dr. Stoddard concludes with a ‘demo- 
cratic programme’ for post-war re-organization 
of education and society. His practical proposals 
he sets out as ‘implications’ of his psychological 
theories. But most of his ideals would be read- 
ily endorsed even by those educationists and psy- 
chologists whom he has so vigorously attacked. 
The essential problem is to determine how far 
such aims are really practicable or (in Dr. Stod- 
dard’s own words) “just how far will the 
hereditary limits imposed on human beings per- 
mit us to go?” Since he himself defines intelli- 
gence in terms which exclude innate or heredi- 
tary limitations, it seems obvious that his line 
of approach can never give a clear-cut reply to 
the question. Indeed, his answer is optimistically 
vague: “Of this we can be sure, the possibilities 
for human development are far greater than our 
ancestors supposed”. Well-meaning enthusiasts 


who think that every type of school and every 
type of vocation should be freely thrown open 
to all, and that any “healthy infant” can be 
educated to become “any type of specialist”, 
will welcome and eagerly cite this book. The 
more experienced, whether school-miasters, vo- 
cational psychologists, or research-workers in 
the field of education, will fear that Dr. Stod- 
dard’s breezy optimism may itself lead to the 
waste of far “more man-hours”, on the part of 
both teachers and taught, than have ever been 
expended by the cautious and critical investi- 
gators whose efforts he so airily dismisses. 
Without for one moment wishing to bar any 
worth-while attempt at enlarging educational or 
social opportunities, they will still hold to the 
“standard doctrine” that, although heredity is 
not everything, nevertheless “it has in general 
at least one-third the influence of environment 
and in setting an upper limit may occasionally 
be all-important”. 

Psychiatry 7:98 F'44. M. F. Ashley Montagu. 
A better book than this on the meaning of in- 
telligence can well be imagined, but it has not 
yet been written. The reader interested in this 
subject may find all that he needs in the present 
volume * Stoddard deals with every aspect of 
the subject, a subject which is a very large and 
complex one; but this has often meant that he 
has had to treat many of those aspects quite 
briefly. Dr. Stoddard has brought together an 
immense amount of material and presented it 
in an easily accessible form. He has digested 
this material, and analyzed it critically. His 
judgments are, on the whole, sound, although 
not as well informed as they would have been 
had he been able to add an anthropological 
dimension to his horizons. 

Sch R 52:122-4 F '44. G. T. Buswell. * A 
contribution to the classification of thinking 
about the nature of education * While the book 
will appeal especially to educational psycholo- 
gists, it deals with matters that are important 
for both school administrators and teachers. * the 
author discusses in some detail seven attributes 
which he considers essential to any concept 
of intelligence. This part of the book pro- 
vides an analysis much needed by those teach- 
ers who are using intelligence tests without ever 
raising the question of just what is the nature 
of the thing that they are measuring. * the 
author’s....contention that “while maturation is 
a legitimate concept, it differs only in degree 
from the abstract learning which is deemed in- 
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telligent" (p. 72) seems to the reviewer not to 
cover the case fully but rather to confuse one's 
thinking on both maturation and learning. The 
statement that “the lines of maturation are com- 
paratively well laid down, while learning has 
great scope and flexibility” (p. 73), leads to a 
much more productive line of thinking. It leads 
to the very significant implication that “in 
American education we have yet to learn to 
sacrifice large amounts of casual information in 
order to generate thinking, planning, and pro- 
ducing at high levels” (p. 74). * Part IV... 
deals with the highly controversial issue of 
heredity and environment. The issues dealt with 
here have already been so widely discussed, 
particularly the Iowa data which Stoddard 
quotes and attempts to explain, that there is 
little hope of agreement among readers in re- 
spect to this part of the book. While the re- 
viewer is unable to agree with the main position 
taken by the author, he must agree that the 
issue is one which is of importance to the school 
and that school people should be aware of the 
nature of the controversy and the factual data 
on which it rests. Part V, entitled “Intelligence 
and Society,” deals entirely with the educational 
and social implications of the earlier sections. 
This discussion is exceptionally meaty and stim- 
wlating and is worth serious study by every 
educator. * Any discussion of the meaning of 
intelligence is bound to run afoul of differences 
in basic points of view. Readers of Stoddard’s 
book are likely to overemphasize the treatment 
of the nature-nurture controversy. However, 
the book is not designed as a defense of one 
position on this issue but is rather a clarifying 
overview of the entire subject of intelligence 
and its measurement. It is one of the important 
books of the year. 

Sch & Soc 60:188-9 S 16 '44. H. B. Reed. 
This book should have been entitled “The New 
Meaning of Intelligence” for that describes the 
net result of it to the reader. Not only does Dr. 
Stoddard give a new meaning to intelligence 
but he also makes the reader see the numerous 
shortcomings of current tests of intelligence, 
throws into the discard many concepts about 
intelligence that have been held for several dec- 
ades, and tells us what an up-to-date intelli- 
gence test should be. * Some hitherto widely 
held ideas that Stoddard proves to be false are 
the following: the IO is constant, an IO below 
70 means feeblemindedness, an IQ above 140 
means genius, intelligence comes to maturity at 


15 or 16. In support of the falsity of these state- 
ments Stoddard reviews not only the Iowa 
studies but many others independent of that 
influence. In regard to the relative influence 
of nature and nurture on the IQ, Stoddard's 
position is that nature permits a wide shift in 
the IQ, setting limitations and potentialities. 
Environmental forces may be either positive 
or negative and produce differential effects at 
different age ranges or under different circum- 
stances. * an important contribution * indis- 
pensable to psychologists and research workers 
in the field of mental tests * [The book] has a 
rather heavy technical vocabulary, and this no 
doubt will make the number of its readers fewer 
than those who should know what is in it or 
who would like to know. 


[ 1169 ] 

XSrRANG, Rutu. Exploration in Reading Patterns. 
Chicago, Ill.: University of Chicago Press, 1942. Pp. 
ix, 172. $2.00. * (London : Cambridge University Press, 
1943. 12s. 6d.) (PA 17:975) 

El Sch J 43:434-5 Mr '43. Helen Crossen. 
* Strang's study..is particularly timely. The 
purpose of the study is to find out how reading 
affects our daily lives. What kinds of books and 
articles do people prefer to read? Why? What 
ideas are they able to grasp? How much can 
they remember? The author believes that each 
person's reading choices and abilities are the 
result of many complex factors—age, sex, intel- 
ligence, vocabulary development, vocational in- 
terests and necessities, etc. The interaction of . 
these many factors influences greatly the total 
picture of any individual’s reading desires and 
powers. This total reading situation, so condi- 
tioned by a multitude of influences, the author 
calls the “reading pattern.” * An important 
aspect of the study is the use of the case-study 
technique, which included tests of reading speed 
and comprehension and of vocabulary, as well 
as detailed questions about the subject’s own 
reading processes and interests. A major test 
item consisted of three fairly long selections: 
the first being rather easy; the second, some- 
what more difficult; and the third, a scientific 
article which most persons found difficult. Each 
person interviewed read each selection and then 
responded freely in writing to the question : 
“What did the author say?” An objective-type 
test on each selection was also given. As a 
means of finding out the reading likes and dis- 
likes of individuals, questions were asked by the 
interviewer. In addition, seventy-five “titles” of 
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magazine articles, each with a brief descriptive 
statement, were presented. The subject indi- 
cated the type of article that he would like to 
read, would like to avoid, or was uncertain 
about. Miss Strang, in her concluding state- 
ments, admits that the sampling of cases used 
is neither large enough nor representative 
enough to justify generalizations and that the 
suggested hypotheses must be tested further. It 
seems clear, however, from the wide variety of 
responses on all aspects of the reading process 
analyzed, that each individual presents a unique 
“pattern” of reading tastes and abilities. Specific 
comments were found to be colored generously 
by the feeling of the readers. * The chief values 
of this study are to be found (1) in the explora- 
tion of the testing, case-study method of inves- 
tigating reading interests and tastes and (2) in 
the suggestions made concerning the nature of 
the reading habits, interests, and abilities of 
adolescents and adults which might profitably 
be subjected to further investigation. Teachers 
interested in making a careful and thoughtful 
analysis of the reading abilities and interests of 
pupils may find helpful suggestions in the tech- 
niques and the materials reported. 


[ 1170 ] 
*Srranc, RuTH. Reporting to Parents. Practical 
Suggestions for Teaching, No. 10. Edited by Hollis L. 
Caswell. New York: Bureau of Publications, Teachers 
College, Columbia University, 1947. Pp. xi, 105. Paper. 
$0.95, postpaid. * 
[171] 


XSrRoNG, Epwarp K., Jr. Vocational Interests of 
Men and Women. Stanford, Calif.: Stanford Uni- 
versity Press, 1943. Pp. xxix, 746. $6.50. * (London: 
Oxford University Press. 40s.) (PA 18:313) For re- 
views, see 652. 

[ 1172 ] 


*Stuart, Hamorp C.; Hit, PENELOPE; AND SHAW, 
Constance. Studies From the Center for Research 
in Child Health and Development, School of Pub- 
lic Health, Harvard University: III, The Growth 
of Bone, Muscle and Overlying Tissues as Re- 
vealed by Studies of Roentgenograms of the Leg 
Area: Description of a Method of Evaluating 
These Tissues, Standards Derived From a Series 
of Normal Children Examined Periodically From 
Birth to Seven Years and Examples of the Growth 
of Individual Children. Monographs of the Society 
for Research in Child Development, Vol. 5, No. 3, 
ien Pus 3t Mti D. C.: the Society, Na- 
ional Research Council, 1940. vii, 2r T, 
lithotyped. Out of print. * x Sipa 


[ 1173 ] 
*Sruit, Dewey B., Eprrog. Personnel Research and 
est Development in the Bureau of Naval Per- 
sonnel, Prepared by the Staff, Test and Research Sec- 
tion, in cooperation with N.D.R.C. Project N-106 and 
Ne College din Eroa Board. Princeton, 
+ J.: Princeton Universi: ress, 1947. Pp. xxiv, 51 
$7.50. * (PA 22:877) d sci X 
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*SuLzMAN, Joun H.; Cook, EnLswogrH B.; AND 
Bartiett, New R. Comparative Study of Measures 
of Heterophoria. Progress Report No. 3 on Bureau of 
Medicine and Surgery Research Project No. X-493 
(Av-263-p), "Comparison of Various Screening De- 
vices With Standard Medical Procedures." New Lon- 
don, Conn.: Medical Research Department, U. S. Sub- 
marine Base, February 22, 1946. Pp. ii, 39. Paper, 
mimeographed. Gratis, * 


[ 1175 ] 


*SurtzMAN, Jonn H.; FARNswomrH, Dean; Cook, 
ErtswonrH B.; Bartiett, New R.; AND KINDRED, 
Mary I. Comparison of Various Screening De- 
vices With Standard Medical Visual Procedure. 
Progress Report No. 1 on Bureau of Medicine and 
Surgery Research Project No. X- Av-263-p). 
New London, Conn.: Medical Research De; ent, 
U. S. Submarine Base, November 22, 1945. Pp. iv, 50. 
Paper, mimcographed. Gratis. * 
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*Sutzman, Jon H.; Cook, ErtswogrH B.; AND 
BamrtLETT, Nem. R. Visual Acuity Measurements 
With Three Commercial Screening Devices. 


Progress Report No. 2 on Bureau of Medicine and 
Surgery Research Project X-493 (Av-263-p), "Com- 
ison of Various Screening Devices With Standard 


edical Procedures" New London, Conn.: Medical 

Research rtment, U. S. Submarine Base, Febru- 

ary 7, 1946. Pp. vi, 43. Paper, mimeographed. Gratis. * 
[ 1177 ] 


Surer, Donato E. Avocational Interest Patterns: 
A Study in the Psychology of Avocations. Stan- 
ford, Calif.: Stanford University Press, 1940. Pp. xiv, 
148. $2.25. * (London: Oxford University Press. 105. 
6d.) (PA 14:4756) 


Am Sociol R 6:614-6 Ag '41. Paul H. Sheats. 
* The problem is clearly stated, the research 
methods employed are fully described, and, as- 
suming faith in the statistical legerdemain which 
passes for scientific method in the measurement 
of attitudes and interests, the conclusions are 
defensible. * It is learned....that “the avocation 
a person chooses depends on his needs in a given 
situation.” Aid to the guidance worker in de- 
termining these needs will have to come from 
other sources—non-statistical, perhaps. 

Char & Pers 11:177-8 D '42. William 
McGehee. Super has opened an interesting and 
challenging field of investigation in his Avoca- 
tional Interests Patterns. Guidance counselors 
and others working with problems of vocational 
orientation have tacitly assumed that avocational 
interests are indices to vocational aims. Super 
has outlined an objective method of testing this 
assumption. He also has indicated a method of 
investigating avocational interests other than 
by the armchair method. * The primary criti- 
cism of Super’s research is his very small num- 
ber of subjects of each avocational group. Super 
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acknowledges this limitation, but his conclusion 
that "certain differences in the background of 
the avocational samples were probably due not 
to sampling but to genuine interavocational dif- 
ferences" (p. 124) needs substantiation from 
further research. Investigators of both avoca- 
tional and vocational interests will find the au- 
thor's review of the literature on interests as 
well as his suggestions for further research very 
helpful. 

J Abn & Social Psychol 36:612-3 O "41. 
Thomas M. Harris. Super's book is both a 
speculative work on the significance of avoca- 
tions and a statistical report on the application 
of the technique used in the development of the 
scales for the Strong Vocational Interest Blank 
to the study of the avocational interests of four 
groups of hobbyists—model engineers, instru- 
mental musicians, amateur photographers, and 
stamp collectors. He found, not very surpris- 
ingly, that the interests of avocational engineers 
are similar to those of Strong's engineers, that 
amateur musicians have interests like those of 
professional musicians, and that the interests of 
hobbyists in photography resemble those of per- 
sons engaged in "photographic, technical, and 
artistic occupations." Scoring keys were devel- 
oped for each of the avocational groups; the 
keys turned out to be of variable validity and 
the one for stamp collectors is described as 
"probably not valid." However, "the interest 
patterns of stamp collectors appeared to be very 
much like those of hobbyists in general." Only 
members of organized hobby groups are present 
in Dr. Super's sample. It would be difficult to 
quote statistics, but individualistic hobbyists are 
probably more numerous than those who get to- 
gether, write a constitution, and hold regular 
meetings. The author assumes, rather archly, 
that the organized and unorganized hobbyists 
belong to the same population. Many would 
doubt it and the burden of proof rests with Dr. 
Super. The more philosophical parts of the book 
—particularly the introductory chapters and the 
section on the psychological role of avocations— 
are rich in insight and contain many observa- 
tions of interest to clinical workers. Those con- 
clusions based directly on Dr. Super's somewhat 
elegantly presented experimental data, however, 
seem rather trivial. The hope that his specific 
findings will be concretely useful to guidance 
workers seems ill-founded and based on a curi- 
ously schematic view of guidance. A great deal 
more needs to be known about the psychological 


meaning of scores obtained with such inven- 
tories before unreserved confidence can be 
placed in new scales, however ingeniously de- 
rived. The possibilities for group differentiation 
through the use of Strong's blank and the 
Cowdery-Strong-Kelley scoring technique are 
probably endless. Interest blank research has 
become, therefore, relatively less a gamble than 
many types of research and is attractive to the 
investigator looking for tried, true, and tested 
methods. The statistical significance of new 
Scales is easy to determine; their psychological 
significance is much harder to put one's finger 
on. The distinction is vital, although it is often 
glossed over or ignored. Until an adequate ra- 
tionale for the scientific (i.e., statistical) study 
of interests has been mapped, the haphazard 
recourse to exact quantitative analysis seems 
more evasive than creative. 

J Ed Res 34:705-6 My '41. E. G. William- 
son. * As the author states, this preliminary 
investigation opens up a very promising field 
for further investigation. The small standardiz- 
ing groups and even smaller validating groups 
make the results tentative even though the 
author points out that Strong's early vocational 
groups were small. It is to be hoped that this 
highly promising investigation will stimulate 
other workers to make scientific investigations. 
At the present time the field of leisure time 
activities and guidance is bogged down with an 
uncritical and even naive understanding of 
avocational interests. The uncritical use of 
questionnaires and interviewing methods to de- 
termine avocational interests as a basis for 
guidance reminds one of the early methods used 
in vocational guidance. It is to be hoped that 
the application of scientific methods in the field 
of avocational interests will produce as fruitful 
results as it has in the diagnostic phases of 
vocational guidance. 

Occupations 19:467 Mr '41. Lysle W. Croft. 
* a contribution to a better understanding of 
avocations and paves the way for more intensive 
studies in the future. The book delves into the 
following problems: Do men engaged in a given 
avocation have a characteristic pattern of inter- 
ests, just as men in a given occupation do? Are 
the avocational interests of adolescents similar 
to those of adults? How are avocations related 
to vocations? Are they similar in nature and 
content, and does the major avocation tend to 
resemble the usual occupation? * In the devel- 
opment of the study the author presented an 
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individualized theory of avocation in which he 
says that "avocations are chosen according to 
present needs of an individual in a given situa- 
tion and on the basis of the possible ways in 
which that individual can meet those needs in 
that situation." In the reviewer's opinion, pre- 
caution should be taken in accepting this theory 
in that the number of illustrations were limited, 
and there is some doubt as to whether those 
interest patterns are purely avocational. How- 
ever a further investigation with a wider variety 
of avocations might do much to justify the con- 
clusions set forth by Dr. Super. An appendix 
with a bibliography containing both the book 
and periodical references presents a splendid 
source of information on vocational interests. 
Also included is a copy of an interesting blank 
devised by the author and employed in this 
study. This is indeed a worthy handbook for 
those interested in counseling as it raises the 
question of the value of avocational interest pat- 
terns in counseling. It contributes to a better 
understanding of avocational interest patterns 
and prepares the way for further research in 
this field. 

Psychol B 37:650-1 O '40. Stuart M. Stoke. 
* The results are suggestive, even if inconclu- 
sive in many respects. * the author concludes, 
with some reservations, that avocational pat- 
terns do exist and can be determined, although 
this is not equally true for all avocations * The 
author wisely discards both the theory of bal- 
ance and that of contribution as inadequate to 
explain his findings, and suggests, instead, an 
“individualized theory" whereby “avocations 
are chosen according to the present needs of an 
individual in a given situation, and on the basis 
of the possible ways in which that individual 
can meet those needs in that situation." The 
above theory would seem to imply a temporary 
and opportunistic basis for avocations which 
would preclude the relative permanence that 
must be predicated if the interest blank is to 
have much significance as a research instru- 
ment. * The question of whether the avocational 
interests of adolescents are similar to those of 
adults can scarcely be answered with this re- 
search technique. All one can discover is 
whether adolescents and adults following the 
same hobbies show similarities of response on 
an interest blank. This is what the author really 
did. The field of research is relatively new, and 
methods are still to be sought. One must admit 
that the author has succeeded in turning up a 


number of interesting leads by his methods, 
even if one is disposed to be critical of the 
instrument and the validity of some of his con- 
clusions. 

Sch R 49:74-5 Ja ^41. Frank J. Kobler. * 
An interesting chapter on the theory of voca- 
tions and avocations enhances the value of the 
book. * The reviewer is impressed with the 
author's hesitancy about the adequacy of his 
sample and is of the opinion that these doubts 
and conflicts are well founded. The sampling is 
subject to serious questioning for any general- 
ization beyond the immediate data. For exam- 
ple, the Y.M.C.A. group used in the vocational- 
avocational validation is as distinct a social and 
vocational group as are Mannheim's "intel- 
lectuals,” Steinbeck's “Okies,” or Hooton's 
“firemen.” With respect to his major sample of 
adults and adolescents, the reviewer, before 
attempting to make a fair estimate of the sam- 
ple's validity, would like not only a more inti- 
mate knowledge of the clubs and members that 
were selected but also a knowledge of those that 
were not included in the study because of re- 
strictions of one kind or another. In addition, 
the reviewer is inclined to question the broad 
conclusions and generalizations somewhat gen- 
erously deduced from the exceedingly small 
number of cases. While the reference to Fisher's 
sampling work on small numbers may be justi- 
fied in certain instances, the reviewer is inclined 
to doubt its applicability to data of this kind. 
In any event an observance of Yule's cautions 
with regard to small numbers in a sampling 
might materially alter the author's notion of the 
significance attachable to the critical ratios ob- 
tained from his data. The reviewer believes 
that the great need of research workers in the 
field of avocations, which Super is pioneering, 
is a more adequate and incisive logical analysis 
of the ideas involved, with a view to establishing 
usable and meaningful scientific concepts. No 
amount of manipulation of data can compensate 
for the absence of a satisfactory theoretical 
content. 

Sociol & Social Res 25:187-8 N-D "4o. 
E(mory) S. B(ogardus). * Two theories of 
avocational interest are discussed, and a third 
one is advanced. The “balance” theory, which 
suggests that avocations contribute most to per- 
sonality when they represent activities widely 
different from the person's vocational life, is 
judged inadequate. But the basis of this judg- 
ment is a study of three and perhaps four avo- 
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cations which are admittedly closely related to 
the vocations of the persons involved and hence 
do not represent the balance theory. If the 200 
persons studied had had avocations that really 
represented the balance theory, the results 
might have been entirely different. Whether or 
not the balance theory is sound cannot be 
decided until studies have been made of people 
who have utilized it. The “contributory” theory, 
that avocations are supplementary in values to 
vocations, is also found wanting, but probably 
further experimentation is needed before this 
theory is discarded. The author suggests the 
"individualized" theory, which may also be re- 
ceived as a conclusion: “Ayocations are chosen 
according to the present needs of an individual 
in a given situation, and on the basis of the 
possible ways in which that individual can meet 
those needs in that situation." But do individ- 
uals usually "choose" their avocations, and do 
they examine their needs; then do they canvass 
possible related avocations, and finally do they 
select the avocation that particularly fits their 
own situations best? Throughout this interest- 
ing and valuable study the term avocation is 
used broadly. If its meaning were sharpened 
and if it were distinguished clearly from hob- 
bies, the results would be more worth while. 
For an additional review, see 40:B 1089. 


[ 1178 ] 
*SuncENT, Lours Vincent, The Use of Aptitude 
Tests in the Selection of Radio Tube Mounters. 
American Psychological Association, Psychological 
Monographs, Vol. 61, No. 2, Whole No. 283. Washing- 
ton, D. C.: the Association, Inc., 1947. Pp. v, 40. Paper. 
$1.00, * 

[ 1179 ] 
XSUTCLIFFE, A., AND Canntam, J. W. Selection for 
Secondary Education Without a Written Exam- 
ination. London: John Murray, 1945. Pp. viii, 88. 
Paper. 3s. 6d. * 

[ 1180 ] 
XSwiNEFORD, FRANCES, AND HorziNGER, KARL " A 
Study in Factor Analysis: The Reliability of Bi- 
Factors and Their Relation to Other Measures. 
University of Chicago, Department of Education, Sup- 
lementary Educational Monographs, No. 53. Chicago, 
I.: University of Chicago Press, January 1942. Pp. 
xi, 88. Paper, fithotyped. $1.00, * (PA 16:3809) 


Ed Adm & Sup 28:394-5 My "42. A. S. 
Edwards. * The work is well done and those 
who are interested in the application of a valu- 
able mathematical tool will find an important 
contribution in this report. 
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*TeLtscHer, Herry O. Handwriting: The Key to 
Successful Living. New York: G. P. Putnam's Sons, 
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1942. Pp. ix, 278. * For latest edition, see 1182. (PA 
16:3656) 


Am J Orthopsychiatry 13:173 Ja’43. Lee R. 
Steiner. * Mr. Teltscher has evolved the solu- 
tion to all the world’s ills through his new 
science—How to Know Yourself, Child Guid- 
ance, Compatible Marriage, Vocational Apti- 
tudes, Business Abilities, How to Choose 
Friends, Instructions for the Physician, and 
even, How to Readjust Your Personality 
Through Handwriting. He has dipped into 
criminology and can make diagnoses for the 
court as to whether, according to the hand- 
writing, the accused was emotionally capable 
of committing the crime! There is no indica- 
tion of how he formulated his “science” for the 
basis of his conclusions. Mr. Teltscher may 
know what he is talking about. The reviewer 
doesn’t. 


[ 1182 ] 
*TrLrSCHER, Herry O. Handwriting: The Key to 
Successful Living, Second Edition. New York: G. 
P. Putnam's Sons, 1946. Pp. xiii, 301. $300. * (PA 21: 


3485) 

[ 1183 ] 
XTERMAN, Lewis M. Psychological Approaches to 
the Biography of Genius. Eugenics ociety, Occa- 
sional Papers on Eugenics, No. 4. London: Hamish 
Hamilton Medical Books, 1947. Pp. 24. Paper. 15. 6d, * 

[ 1184 ] 
*THomas, Lawrence G. Mental Tests as Instru- 
ments of Science. American Psychological Associa- 
tion, Psychological Monographs, Vol. 54, No. 3, Whole 
No. 245. Washington, D. e. the Association, Inc., 1942. 
Pp. iii, 87. Paper. $1.90, * (PA 17:1381) 

[ 1185 ] 
TnoMsoN, Gonrrey H. An Analysis of Performance 
Test Scores of a Representative Group of Scottish 
Children. Publications of the Scottish Council for Re- 
search in Education, [No.] 16, International Examina- 
tions M London: University of London Press, 
Ltd., 1940. Pp. vii, 58. 5s. * (PA 14:5775) 

Brit J Psychol 31:356 Ap "41. * The statisti- 
cal reader will no doubt find this monograph of 
great interest; but the more general psychologi- 
cal reader would have liked further informa- 
tion about the psychological nature of the Per- 
formance tests and the abilities they measure. 

J Ed (London) 73:514 N "41. * The testees 
were all given a Binet Test and eight perform- 
ance tests. * Prof. Godfrey Thomson and his 
helpers undertook the extensive task of analys- 
ing all these results, and right well have they 
done the work. * The main conclusions are: 
(a) That Cube Construction correlates best 
with Binet I. Q. (.55). If Cube Construction, 
Healy's Picture Completion and Koh's Block 
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Design are combined, the correlation with 
Binet rises to almost 0.7. This is, I think, a 
highly significant result for psychologists and 
one that should quieten somewhat those who 
are continually questioning the value of a verbal 
test as a test of general intelligence as compared 
with non-verbal tests. (b) Using Thurstone’s 
Centroid Method of Analysis, Prof. Thomson 
finds that the results give evidence of two 
factors—the general factor, possibly, but not 
necessarily to be identified with “g”, and the 
rhythm or speed factor linking tests 1, 2 and 3. 
On this last conclusion readers should consult 
Prof. Burt’s interesting detailed consideration 
of the factor analysis and its interpretation 
given in The British Journal of Educational 
Psychology, November, 1940. 

J Ed Psychol 32:77 Ja "41. Miles A. Tinker. 
* The statistical analyses appear adequate. 
Nevertheless, as happens occasionally in statis- 
tical studies, the psychological implications re- 
ceive slight and inadequate attention. 

For an additional review, see 40:B1097.t. 


[ 1186 ] 


*Tiowsox, Goprrey, The Trend of National In- 
telligence: The Galton Lecture, 1946. With a Sym- 
posium in 1947 by Alexander Carr-Saunders, Cyril 
Burt, Lionel Penrose, and Godfrey Thomson. Eugenics 
Society, Occasional Papers on Eugenics, No. 3. Lon- 
don: Hamish Hamilton Medical Books, 1947. Pp. 35. 
Paper. 2s. * 
[ 1187 ] 


*TuonNprKE, Rogert L, Editor, Research Problems 
and Techniques. Army Air Forces Aviation Psychol- 
ogy Program Research Reports, Report No. 3. Wash- 
ington, D. C.: Government Printing Office, 1947. Pp. 
viii, 163. $1,00. * (PA 21:3721) 


J Consult Psychol 11:281-2 S-O '47. Of the 
series of AAF research reports, Thorndike's 
volume on problems and methods may hold the 
widest interest for non-military psychologists. 
It presents a summary account and a critical 
evaluation of each principal area involved in 
research on the selection and classification of 
personnel. Topics include job analysis, inven- 
tion and refinement of test forms, determining 
criteria, validity of single tests, composite apti- 
tude scores, reliability determination, correla- 
tional analysis, and control of error variance. 
There is a brief chapter on problems encoun- 
tered in training experiments. The book is not 
only a report of a program, but a potentially 
valuable reference book and textbook on re- 
search methods. 


[ 1188 ] . 
XTnuunsroNg, L. L. A Factorial Study of Percep- 
tion. Chicago, Ill.: University of eum Press, 1944. 
Pp. vi, 158. $2.50. (London: Cambridge University 
Press, 1945. 155.) (PA 19:890) 

[ 1189 ] 
XTuunsroxg, L, L., AND THURSTONE, THELMA GWINN. 
Factorial Studies of Intelligence. Psychometric 
Society, Psychometric Monograph No. 2. Chicago, Ill, : 
University of Chicago Press, 1941. Pp. v, 94. Paper. 
$1.50. * (London: Cambridge University Press. 9s.) 


(PA 15:3743) 


Loyola Ed Digest 17:8 Ja '42. Austin G. 
Schmidt. * The work is one that can be justly 
appraised only by statisticians of the highest 
type. 
Q R Biol 16:510 D '41. * This investigation 
indicates broad possibilities for the use of factor 
analysis in related fields, 


[ 1190 ] 
XTirrIN, Josern, Industrial Bay chloe ys New York: 
Prentice-Hall, Inc., 1942. Pp. xvi, 386. * For latest edi- 
tion, see 1191. (PA 16:4513) 


Am J Psychol 56:148-9 Ja '43. George W. 
Hartmann. This volume is a product of the 
young and vigorous Purdue school of applied 
psychology. * [Tiffin’s] book is rich in indirect 
evidence that the altered political relations cre- 
ated by that strange complex, the New Deal, 
have changed the framework within which in- 
dustrial psychology operates—and yet it is sur- 
prising how little attention the author pays to 
employer-employee conflict, problems of union- 
ism, and allied issues in the social psychology 
of the contemporary workingman's life. The 
first third of the book is devoted to employment 
testing, illustrated by attractive bits of local 
color from Indiana and Illinois establishments, 
and characterized by a little more enthusiasm 
for the Taylor-Russell predictive tables than 
these statistical aids to personnel policies prob- 
ably warrant. A long and technically highly 
competent chapter on vision indicates, among 
other things, that proficiency in close work is 
usually inversely related to scores on a distance 
acuity measure. This is added support for the 
elementary concept that normative excellence in 
any trait variable does not alone confer job 
competence. Unfortunately, Tiffin makes less of 
occupational profiles and the delicate individ- 
ualized "resonance" of a worker with his work 
than their intrinsic importance for personality 
wholeness seems to justify. * As in most texts 
in this field, the chapter on learning and train- 
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ing is paradoxically weak, despite the growing 
significance of this form of "practical" educa- 
tion. The section on inspection is a good tech- 
nical monograph, but it seems a little out of 
keeping with the rest of the volume. The 
treatment of accidents and safety, however, is 
exceptionally good—perhaps the best brief psy- 
chological discussion in the entire literature of 
this topic. A short final chapter is devoted to 
"attitudes and morale," an area that is probably 
the very heart of all industrial psychology ; but 
one would not guess this from Tiffin’s distribu- 
tion of emphasis. Despite a marked unevenness 
in performance and the absence of broad per- 
spectives, this volume does contain a mass of 
new and important material which advances the 
subject beyond the point where Viteles left it 
a decade ago. Much solid craftsmanship has 
gone into its construction. * An engineer with 
some factory background will probably gain 
more from it than most psychologists. To the 
undergraduate, however, it offers an appropriate 
introduction to a large and vital area of re- 
search in human relations—a professional serv- 
ice of no small merit in the eyes of those who 
expect industrial psychology to contribute as 
much to man's happiness and achievement as 
educational and clinical psychology at their best 
seem to have done. 

J Am Med Assn 120:1430 D 26 '42. * a 
rather academic work * It gives a fairly good 
general picture of the methods of testing em- 
ployees in order to place the right shaped peg 
in the appropriate hole. * Of most importance 
to industrial physicians, however, are the chap- 
ters on visual problems of industry and on 
work, fatigue and efficiency. The material cov- 
ered in the chapter on vision shows that many 
of the visual standards previously conceived as 
being accurate by the ophthalmologist fluctuate 
and need to be dealt with by directing the eye 
specialist's attention directly to the job in hand 
rather than to the general correction of visual 
errors. The effect of age, specific job require- 
ments, eye wear, classifications and retest of 
vision are among some of the interesting topics 
dealt with in this chapter. The atmosphere of 
the book is distinctly peacetime: stress is laid 
on selecting the best worker for the best job 
and on the readjustment of employees who are 
deficient to begin with in order to make them 
superior workers, whereas the wartime attitude 
of industrial psychology is to develop placement 
procedures which will make the hitherto inade- 


quate employee a useful contributor to produc- 
tion. 

J Appl Psychol 26:862-3 D "42. Charles C. 
Gibbons. * The presentation, in the language of 
industry, is sound but not academic. * There is 
a valuable discussion of the several factors 
which operate to determine the functional value 
of a testing program to the employment man- 
ager. There is a timely emphasis upon the use 
of tests for the proper placement as well as the 
selection of employees. * Three chapters are 
devoted to a rather orthodox description of 
various kinds of tests. This section would be 
more helpful if it maintained a more critical 
attitude toward the validity of the tests. The 
section of the appendix on statistical procedures 
will be of limited value because it is too con- 
densed to be useful to the uninitiated and is 
not comprehensive enough to be useful to a 
person with statistical training. Much of the 
material is new and highly valuable. The out- 
standing sections deal with industrial inspec- 
tion, visual problems of industry, merit ratings, 
industrial morale, and accidents. The treatment 
throughout is comprehensive but definitely 
readable. The material contained in this book is 
a real contribution to both industry and psy- 
chology. Psychologists and personnel adminis- 
trators will derive from this book valuable 
information regarding the industrial applica- 
tions of psychology. 

J Consult Psychol 7:208-9 Jl-Ag ' 43. Harold 
E. Burtt. * suitable for students with some 
background in psychology and statistics. It is 
reasonably satisfactory for the general reader 
except for certain places where the discussion 
becomes technical, especially from a statistical 
standpoint. * extensive discussion of psychologi- 
cal tests with a commendable emphasis on 
placement rather than selection exclusively. * 
The author wisely emphasizes the attitude and 
personality of the tester or industrial psychol- 
ogist. Several chapters describe various employ- 
ment tests, giving validity in many cases. This 
section is well annotated. The discussion of 
personality tests, however, is rather meager. It 
is unfortunate that considerable space is given 
to one such test that is somewhat in disrepute. 
It would be well to indicate some of the trends 
in the development of more objective tests in 
this field, even though they are still pretty much 
in the experimental stage. “Visual Problems of 
Industry” are discussed in a long chapter. It 
describes some commercial apparatus and pre- 
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sents various studies of industrial production as 
related to visual tests, such as, acuity or stereo- 
vision. It is not always clear whether the tests 
discussed are applied when the subject already 
has his vision corrected or whether they take 
him without glasses. Toward the end of the 
chapter there is commendable emphasis on get- 
ting professional help where it is needed. The 
reader gets somewhat the impression that a man 
might be rejected for a job on visual grounds 
although, with adequate correction, he could do 
the job perfectly well. Furthermore, lighting is 
given only casual mention. The content of the 
chapter is entirely sound but seems somewhat 
out of proportion to the rest of the book. * 
there is a list of suggested topics for a foreman's 
course and a rather interesting test which might 
be used to check on the results of such a course. 
* The chapter on fatigue misses the difference 
between production curves for a fatiguing job 
and a monotonous one. There is a very brief 
treatment of illumination, ventilation, and noise, 
which overlooks some rather significant litera- 
ture on the last of these. Rest pauses, and 
motion studies are mentioned, but this chapter, 
in view of the tremendous amount of material 
available, is little more than an outline. There is 
a helpful list of items used by numerous com- 
panies in merit rating systems. * The discussion 
includes some technical points that are applica- 
ble to the procedure of rating in general. It 
would be well to point out that these same pro- 
cedures and principles are equally applicable to 
employment purposes. The chapter on industrial 
inspection is one that is not ordinarily found in 
such books and is of special interest. * The 
chapter on accidents presents some new data on 
accident proneness. * There is a good discus- 
sion of the development of attitude scales for 
measuring morale. Various factors affecting 
morale are mentioned briefly, such as super- 
vision, working conditions and wages. Con- 
spicuous by its absence is the very interesting 
morale work done at the Hawthorne Plant of 
the Western Electric Company. The appendix 
includes the detailed Taylor-Russell tables 
for interpreting test validity. * The tables 
should be particularly helpful to industrial psy- 
chologists who are not fortunate enough to have 
a reprint from the Journal of Applied Psychol- 
ogy. Practically, the reviewer's only adverse 
criticism is in the matter of emphasis. The 
author, like most of us when we have done 
original work of our own which fits into the 


discussion, is inclined to present the findings in 
considerable detail. Tiffin seems to lean in this 
direction a little more than the average. The 
discussion of visual problems, industrial inspec- 
tion and accident proneness are cases in point. 
However, it is worth while to have this mate- 
rial available in the literature, although it does 
somewhat distort the balance of the book. Places 
where more complete treatment would be de- 
sirable are personality tests, fatigue, monotony, 
illumination and morale. * The general treat- 
ment is systematic, well annotated, well illus- 
trated with diagrams and tables and the style is 
straightforward and to the point. Teachers of 
industrial psychology courses can supplement 
the thinner portions by additional material. The 
general reader will certainly carry away the 
impression that industrial psychology is a scien- 
tific field which needs no apology and which is 
already contributing materially to increased 
efficiency. 

Personnel J 21:298-300 F ^43. Everett Van 
Every. * The chapter on Industrial Merit Rat- 
ings casts little worth and some doubt as to 
whether ratings are a fair tool for management 
and worker unless it can be determined what 
factors influence these ratings. The Chapters 
on Industrial Inspection, Accidents & Safety 
and Attitudes & Morale are particularly timely 
for war plants, with many suggestions and gen- 
eral policies that safety and training men should 
check carefully. Much of the material in these 
chapters, especially in merit rating and place- 
ment tests appears to be offered here for the 
first time. The last section of the book on 
Elementary Statistical Procedures will be very 
helpful to the personnel man who missed this 
highly important field of study. You may be 
assured that statistics will play an important 
part in future personnel work and the qualified 
personnel executive should learn at least the 
rudiments of statistical procedures. * 

War Med 3:686 Je "43. One does not have 
to be an industrialist or a psychologist to recog- 
nize the value and excellence of this book by 
Professor Tiffin. The author has a reputation 
for scientific skill in his profession, as well as 
an unusual ability to impart information to his 
readers. Business executives, industrial rela- 
tions and personnel directors and industrial 
physicians, engineers, nurses and safety super- 
visors, as well as students and research workers 
in the fields of applied psychology and voca- 
tional guidance, should profit greatly from read- 
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ing this book. Those industrial organizations 
which cannot add psychologists to their staffs at 
least should add this book to their shelves! * 
much of this book is based on previously un- 
published experimental research. From the 
standpoint of the physician, one might wish that 
a separate chapter on health problems had been 
included, although this undoubtedly would have 
taken the author too far into the fields of 
psychiatry and mental hygiene, which in them- 
selves are of increasing significance and impor- 
tance in industry. The former subject, however, 
is approached in the chapter on Tests of Per- 
sonality and Interest, while the latter is implied 
in the chapter on Attitudes and Morale. The 
chapters on Visual Problems in Industry and 
Work, Fatigue and Efficiency have various 
health aspects and should be of particular value 
to industrial physicians. * Tiffin's work should 
be of inestimable service to industrial manage- 
ment. 


[ 1191 ] 
*Tirriw, Josern. Industrial Psychology, Second 
Edition. New York: Prentice-Hall, Inc., 1947. Pp. xxi, 
553. Trade edition, $5.35; textbook edition, $4.00. * 
(PA 22:505) 


[ 1192 ] 
XTOMKINS, SILVAN S.; WITH THE COLLABORATION OF 
ELIZABETH J. Tomkins. The Thematic Appercep- 
tion Test: The Theory and Technique of Inter- 
etation, New York: Grane & Stratton, Inc., 1947. 
D. xi, 297. $5.50. * (PA 22:1658) For reviews, see 
104a. 


[ 1193 ] 
*Toors, Hersert A. The Evolution of the Ohio 
State University Psychological Examination. 
Ohio College Association Bulletin, No. 113. Columbus, 
Ohio: the Association, Ohio State University, March 
20, 1939. Pp. 2287-2311.2. Paper, mimeographed. $0.15. 


[ 1194 ] 
XToncERsON, THEODORE L. Studying Children: Di- 
agnostic and Remedial Procedures in Teaching. 
New York: Dryden Press, Inc., 1947. Pp. x, 230. $2.75. 
* (PA 21:4656) For reviews, see 105. 


[ 1195 ] 
*Travers, R. M. W. A Study in Judging Opinions 
of Groups. Archives of Psychology, No. 266. Wash. 
ington, D. C.: American Psychological Association, 
Pide nE 1041. Pp. 73. Paper. $1.25. * (PA 16: 
2787 


[ 1196 ] 
*Trawick, MACELDIN. Trait-Consistency in Per- 
sonality: A Differential Investigation. Archives of 
Psychology, No. 248. Washington, D. C.: American 
Psychological Association, Inc., April 1940. Pp. 45. 
Paper. $0.80. (PA 14:5075) 


[ 1197 ] 


*Traxcer, ARTHUR E, Functions of the Educational 
Records Bureau in Comparable Measurement. 


Educational Records Supplementary Bulletin H. New 
York: Educational Records Bureau, September 1943. 
Pp. iii, 29. Paper. $025. * (PA 18:316) 

[ 1198 ] 
*Traxter, ArTHUR E. How to Use Cumulative 
Records: The Manual for the SRA Cumulative 
Record for Junior and Senior High Schools. Chi- 
cago, Ill.: Science Research Associates, 1947. Pp. 40. 
Paper. $0.50. * (PA 22:852) 

[ 1199 ] 


X*TRAxLER, AnrHUR E. The Nature and Use of 
Reading Tests. Educational Records Bulletin, No. 34. 
New York: Educational Records Bureau, 1941. Pp. 64. 
Paper. $1.00. * (PA 16:1217) 


Ed Adm & Sup 28:159-60 F '42. Ruth 
Strang. In this concise bulletin Dr. Traxler, 
with his usual thoroughness, accuracy, and good 
judgment based on knowledge of research in the 
field, has answered four recurring questions 
about tests: What are the best current reading 
tests available and what are the advantages and 
disadvantages of each? How may reading tests 
be profitably used in diagnosis and remedial 
instruction? What is the relationship of 
Scores on three reading tests to scores on the 
Kuhlmann-Anderson Intelligence Tests? What 
materials are available for use in remedial, 
corrective, and developmental reading pro- 
grams? This pamphlet is eminently practical. 
It gives specific answers to specific questions. 
For those who wish to build up a theoretical 
background for understanding the nature and 
use of reading tests, Traxler provides in the last 
three pages a selected bibliography. 

El Sch J 42:327-8 Ja '42. G. T. Buswell. 
[Editorial] * Traxler's bulletin will enable 
teachers to save many hours of searching for 
suitable reading tests. It also makes possible a 
more discriminating selection of tests for par- 
ticular purposes. 

H Sch J 25:237 My "42. W. D. Perry. * In 
Section II there is description, analysis of cur- 
rent reading tests, and a discussion of diagnostic 
tests vs. survey tests. * Section III....discusses 
the use of reading tests in diagnosis and 
remedial instruction....under four main head- 
ings: (1) analysis of the reading achievement 
of groups and of individuals, ( 2) selection of 
pupils for special help in reading, (3) study and 
corrective teaching of individual pupils, and 
(4) appraisal of remedial instruction. The dis- 
cussion is too concrete to be of any considerable 
help. The discussion is treated through the use 
of actual test scores for various grades in indi- 
vidual pupils but this treatment is not compre- 
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hensive enough for real understanding. * Section 
V includes a listing of materials available for 
use in remedial, corrective and developmental 
reading. The reading materials listed in this sec- 
tion should be of concrete help to the classroom 
teacher. * The bulletin is short, compact, and is 
so organized that it should be easy to use. Sec- 
tions II and V have considerable helpful mate- 
rial for the elementary, secondary or college 
teacher of reading. 

Loyola Ed Digest 17:9 Mr '42. Austin G. 
Schmidt. The Educational Records Bureau en- 
trusted to Dr. Traxler the task of....describing 
and appraising reading tests with diagnostic 
features and suggesting methods for using such 
tests. Dr. Traxler discharged his task with his 
usual competency, and the result is a bulletin 
that compresses the essential information into 
small compass. 


[ 1200 ] 
XTnaxLER, ArtHuR E. Techniques of Guidance: 
Tests, Records, and Counseling in a Guidance 
Program. New York: Harper & Brothers, 1945. Pp. 
xv, 394. $4.50. * (PA 19:3187) 

Clearing House 20:378-80 F '46. E(arl) R. 
G(abler). * well written and well organized * 
The most considerable contributions of the book 
are the specimen record forms, especially those 
dealing with appraisal of personal qualities, and 
the detailed descriptions of suitable tools for 
guidance purposes. Every teacher who has guid- 
ance responsibilities should read Dr. Traxler's 
book. Other teachers will find it helpful in 
understanding the aim and scope of the guid- 
ance program. 

J Appl Psychol 29:499-501 D '45. F. S. 
Beers. The first thirty or forty pages of this 
compact volume may give the reader, sophisti- 
cated or otherwise, some blank misgivings. But 
if he persists, his patience will be richly re- 
warded. Few books that have come to my 
attention in recent years have been more appro- 
priately titled or more effective in their pro- 
gression. From a conventional beginning that 
toys with. definition, staff organization, and 
costs, the science of guidance gradually becomes 
the art of bringing into conjunction to their 
mutual advantage the potentialities of pupils 
and the resources of school and community. 
School and community resources are treated 
rather generally but with ample reference to 
sources of information. Understandably the cur- 
riculum, which is the school's chief formalized 
resource, is neither described nor challenged; 


but the growth of the extracurriculum is given 
subtle but effective emphasis as a promising ad- 
junct to readin’, writin’, and 'rithmetic. * Tests 
as instruments of fact finding, however, are 
given the largest amount of space and the most 
exhaustive treatment. Aptitude as a concept is 
intelligently discussed, and likewise the fact of 
individual differences; but without any attempt 
at explaining the why. A selected but reason- 
ably exhaustive list of tests carefully and fully 
annotated is attached, including titles, authors, 
publishers, research information and references. 
An even more extensive array of achievement 
tests is described with equal fidelity to fact. * 
Planning and administering a testing program 
and scoring, organizing and reporting test re- 
sults are fully and effectively treated with con- 
crete suggestions drawn from actual practice 
buttressing the general discussion. * With in- 
finite patience Dr. Traxler describes the 
instructional uses of tests. He elaborates them 
so fully and with such reasoned precision that 
he may feel, as many readers will, that he has 
argued his case perceptibly beyond the point of 
diminishing returns. Administrative uses of test 
results, and by inference, their possible effect 
upon the curriculum, are it appears "outside 
the scope" of his book. Within the conventional 
setting of school resources, counseling uses of 
test results are recommended to identify indi- 
vidual weaknesses of pupils, to discover special 
abilities, and as a basis for conferences on 
behavior adjustments, course sequences, voca- 
tional and educational planning. But tests are 
only one element in the total guidance program. 
* Some eighty pages of text and sample forms 
are devoted to the cumulative record and re- 
porting devices and to dependence on facts of 
status and progress for immediate or ultimate 
attainment of desirable educational or career 
objectives. The ideal is admirable and convinc- 
ingly presented as within the realm of the 
attainable. * Useful, practical suggestions are 
given for filing and accessibility of materials. * 
lists of references, not exhaustive, but well 
chosen, have been appended to each chapter; 
and the final chapter consists of reading re- 
sources for counselors. For those who have 
the urge to learn their students as well as teach 
them, Dr. Traxler's volume holds out great 
promise. For those who are professionals in 
the field of guidance, the volume is an excellent 
handbook. 

J Clin Psychol 2:402 O ’46. The author... 


1200 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 958 


has assembled and organized from a holistic 
viewpoint the most comprehensive presentation 
of modern guidance techniques known to this 
reviewer. All types of tests, records and coun- 
seling methods are described in detail well 
documented by the latest objective studies and 
comprehensive bibliographies with each chap- 
ter. The presentation is objective, concise and 
well outlined. It may be recommended without 
qualification to clinical psychologists and others 
working in the guidance field. 
J Consult Psychol 9:307-8 N-D '45. Warren 
G. Findley. * based largely upon articles and 
bulletins prepared by the author in his capacity 
as associate director of the Educational Records 
Bureau. These materials, which were designed 
originally to guide the so-called independent 
schools that use the services of the Bureau, 
have been assembled, revised and synthesized. * 
The subtitle “Tests, Records and Counseling in 
a Guidance Program" gives a good index to the 
contents and purpose of the volume. * This 
book may well be recommended to guidance 
counselors as a clear statement of the possibili- 
ties of using tests and records in counseling. It 
does not pretend to go further than description 
of these techniques and discussion of their uses. 
Beyond this, it explicitly points out the limita- 
tions of such techniques and of counseling 
based upon them. Counselors are advised to 
consult psychologists and psychiatrists on minor 
problems of adjustment and to refer cases of 
more serious adjustment. The illustrative case 
study is wisely restricted to remedial reading. 
The discussion of cumulative records and the 
large number of full-size, fold-in sample forms 
are admirably calculated to be helpful. * It 
may be questioned whether the 54 pages given 
to briefly annotated bibliographies of tests and 
record forms is justified in view of the avail- 
ability of “The ro4o Mental Measurements 
Yearbook." Further question might be raised as 
to the advisability of giving the space required 
for the discussion and bibliography of person- 
ality tests since all that is finally proposed is 
use of responses to individual questions as leads 
in interviewing. One of the excellent features of 
the book is its carefully chosen and well- 
organized list of 200 reading resources for 
counselors, which includes titles related to child 
psychology and mental hygiene as well as many 
in the narrowlv specific fields of educational 
and vocational guidance. A total of 61 titles are 
starred as especially recommended and might 


well be considered a well-rounded minimum 
library for guidance counselors. * The volume 
seems particularly well designed to serve as a 
textbook in a course bearing the same title. The 
well-planned framework and full bibliographies 
form a basis on which the instructor might 
build at several points to fit the backgrounds 
and needs of various classes and individuals 
seeking this training for guidance counseling. 

J Ed Psychol 36:567-8 D '45. Henry C. 
Mills. * an excellent book masquerading under 
an inaccurate title. Nor is the subtitle, Tests, 
Records and Counseling in a Guidance Pro- 
gram, any more satisfactory. Any text which 
devotes but one chapter of six pages to oppor- 
tunities for young people, another of twenty-two 
pages to kinds of information needed, and still 
another of approximately the same length to 
case-study procedures in guidance can hardly 
lay claim to so ambitious a title. What Traxler 
has done basically is to present a wealth of 
material on the use of the cumulative record in 
guidance. What should enter into such a record, 
how it should be recorded and used, and the 
values to be expected from its use are all ade- 
quately presented with considerable emphasis 
on the activities of the Educational Records 
Bureau. Indeed, the author specifically states in 
his Foreword (p. xiii) that the book consists of 
an “organization, revision and synthesis" of 
materials written for the Bureau’s member 
schools “together with other material written 
specifically for it.” * the author considers at 
some length the areas in which testing should 
be undertaken, including useful lists of tests in 
each area, steps in planning and administering 
a test program, and recording and using the 
results. This constitutes the core of the book 
and is very well presented with a wealth of 
illustrative material. It is followed by inade- 
quate treatments of the case study, the réle of 
the teacher in guidance, the follow-up program, 
guidance in the adjustment of individuals and 
reading resources for counselors. * Throughout 
the book the author emphasizes....the key place 
played by the cumulative record which is con- 
ceived as the heart of the process of guidance. 
In fact, the book has overtones almost implying 
that the record is the important thing and the 
child merely happens to be a vehicle for it. 
While such an emphasis may have been desira- 
ble some years ago, the need for it has perhaps 
largely disappeared. In spite of this, the book is 
an excellent, objective treatment of a limited 
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aspect of the field of guidance and as such de- 
serves both a better title and careful reading by 
counselors. 

Occupational Psychol 20:200 O '46. Alec 
Rodger. This is an interesting book. It is, in 
many ways, a very good book indeed. But it has 
hardly anything to do with 'techniques of guid- 
ance’. It is a compendium of information about 
how to collect information by means of tests 
and school records. Or perhaps we should say 
that it is a compendium of information about 
how such things are done in American schools. 
It is better to put it this way because—fortu- 
nately—nobody here is likely to be seriously 
tempted to follow their example. Mr. Traxler 
might next try his hand at telling us how 
American teachers find the time, and develop 
the inclination, to fill up forms of the kind he 
so lucidly explains. But although the title is 
thoroughly misleading, and although it assumes 
that the reader is test-minded and form-minded 
to a degree that many of us would think perni- 
cious, it will be useful to people who can make 
responsible use of its descriptions of published 
tests and 'personality' scales and of its enormous 
mass of references to books and articles on edu- 
cational and vocational guidance. These refer- 
ences are almost exclusively American, but 
Mr. Traxler has been generous enough to in- 
clude the names of two British psychologists in 
his list of approximately 800 authors. 

Occupations 24:53 O ’45. Arthur J. Jones. 
This compact volume might well be called “The 
Counselor’s Handbook,” it is so full of practical 
help to all who are engaged in the guidance 
service. It meets a long-felt need for a brief, 
comprehensive, practical guide through the 
maze of tests and other instruments now offered 
as panaceas, or, more modestly, as aids in 
guidance. * The title, “Techniques of Guid- 
ance,” is somewhat misleading for many phases 
or techniques are either omitted entirely or 
touched upon only lightly. The subtitle more 
adequately describes the purpose and the treat- 
ment—"Tests, Records, and Counseling in a 
Guidance Program.” Even this leads one to 
expect much greater emphasis upon counseling 
techniques than is found. Techniques of secur- 
ing information about occupational and educa- 
tional opportunities are barely mentioned and 
topics on follow-up, characteristics and training 
of counselors are very sketchy. The outstanding 
contribution of this book is contained in Chap- 
ters III to XIV, which offer a really masterly 
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assembly of data and suggestions regarding in- 
struments that are useful in securing informa- 
tion about individuals and in recording and 
using them. So helpful is this treatment that one 
might wish that the author had greatly reduced 
or omitted entirely much of the treatment not 
related to such topics and given more space to 
the techniques concerning the individual inven- 
tory. The descriptions of tests and other instru- 
ments, while necessarily brief, are clear, concise 
and very informatory. The assembly of types of 
records and their use is also noteworthy. The 
author shows a refreshingly critical attitude 
toward certain common uses made of test mate- 
rial and is very discriminating in his recom- 
mendations. He also stresses the need for care 
in the interpretation of records and of keeping 
constantly in mind their real purpose. * This 
book should be a must for every vocational 
counselor and should be in the professional li- 
brary of every school. 

Psychol B 43:183-4 Mr ’46. Merrill Roff. 
* Apart from a brief discussion of occupational 
opportunities early in the book and an equally 
brief treatment of “adjustment” problems at the 
end, the primary concern is with problems of 
distribution or placement of students within the 
educational system. The book is organized 
around the individual cumulative record, which 
is to include information obtained periodically 
for each student in....ten areas * Information 
in the last five areas involves the use of tests 
and observational procedures, and it is here that 
a detailed explanation of methods of obtaining 
and applying such information is needed. A 
minimal explanation of the applicability of a 
measure in a guidance situation should include 
a statement of the validity of the measure in 
predicting scholastic, vocational, or other per- 
formance, so presented that a statistically un- 
sophisticated person could grasp the actuarial 
nature of the problem. This we are nowhere 
given. The chapter on the appraisal of aptitudes 
describes or mentions a number of standard 
tests of intelligence or general academic apti- 
tude suitable for use with high school students. 
A carefully prepared annotated list of thirteen 
such tests....is presented * For only one of the 
thirteen tests is validity in relation to scholastic 
performance mentioned at all. Since tests of this 
type do have substantial validity for many pur- 
poses, and considerably less validity for many 
others, it would seem that a more intensive 
consideration and explanation of this subject 
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trend in foreman training. The authors have 
taken a pioneering step toward seeing the 
teacher selection and education program as a 
whole when they describe testing laboratory, 
the orientation program, and self-analysis as 
leading to clinics in reading, writing and speech, 
and when they show how strengths and how 
weaknesses revealed by teachers in the field 
might work back to improve the training pro- 
gram. By no means the least contribution is the 
illustration by the study itself of how research 
may become a working part of a training pro- 
gram when it is carried on by those who are 
to use its results. One weakness in the book 
from the psychologist’s point of view should not 
be attributed to the authors. It comes from the 
absence of sound investigations in the schools 
and colleges. No institution seems to have car- 
ried on a thorough evaluation program from a 
research point of view. This handicap is most 
evident in the lack of objective evidence regard- 
ing the.value of the instruments. The validity 
and usefulness of these devices depends on their 
discriminating power and on the extent to 
which they increase learning. But institutions 
should at least be given credit for starting to 
evaluate. The whole field of evaluation and se- 
lection in education would be improved if suc- 
cess were defined in other than ambiguous and 
contradictory terms (16); if the instruments 
that seek to measure the intangibles were 
checked more carefully as to validity and relia- 
bility as suggested by the authors (166—167) ; if 
earlier work in psychology were digested and 
brought to bear on the new study; if the most 
important problems could be cooperatively de- 
fined and studied by several institutions ; and if 
the research could be more systematic in its 
attack on the myriad of problems involved. 
Nonetheless, by providing opportunity for 
those responsible for the selection, training and 
appointment of teachers to work on the prob- 
lem, the American Council on Education through 
its specialists has exerted tremendous influence 
on these colleges and on others who will use the 
new suggestions and insights. 

Social Ed 9:186-7 Ap '45. Howard R. An- 
derson. * The authors have provided a wealth 
of practical suggestions for cooperative planning 
in delimiting objectives, for developing evalua- 
tion instruments, and for interpreting and using 
constructively information which has been ac- 
quired. Although many of the situations de- 
scribed by Professors Troyer and Pace are not 


located in our field, the procedures described 
can readily be adapted to the needs of social 
studies teachers concerned with evaluating out- 
comes other than those which can be measured 
by paper and pencil tests. Social studies teach- 
ers will want to read carefully Chapter IV on 
"General Education," to learn how objectives 
are determined and to appreciate the variety of 
approaches used in evaluating outcomes. The 
discussion of case-study techniques, anecdotal 
records, and observations in Chapter V, “Pro- 
fessional Education," also is helpful. The 
thoughtful analysis of teacher-pupil relation- 
ships and their implications for effective learn- 
ing in Chapter VIII, *Growth in Service," puts 
that chapter on a "must" list of professional 
reading. Of course, social studies teachers who 
are interested in teacher education will want to 
read the entire book. The purpose of this re- 
view, however, seeks only to identify material 
which is especially useful in planning an evalu- 
ation program in the field of social studies. 
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Am J Psychol 54:154-5 Ja '41. Anne Anas- 
tasi, * a fairly elementary book * The construc- 
tion and use of tests, as well as the critical in- 
terpretation of test scores and statistical data 
are given detailed consideration. Approximately 
one-third of the book is devoted to an intensive 
discussion of school marks and examinations * 
the book overlaps the fields of statistics, mental 
testing, and educational psychology, its boun- 
daries being dictated by the needs of a par- 
ticular group of individuals rather than by tra- 
ditional compartmentalization of facts. The 
author has succeeded admirably in combining 
simplicity of expression and a completely prac- 
tical point of view with scientific accuracy and 
a sound conceptual basis. The book is up-to- 
date in its factual background, and critical and 


cautious in pointing out implications. * If the 
data of psychology are to find immediate prac- 
tical application, it would seem that this type 
of book is a much more effective means of ac- 
complishing this end than those books which 
rely on extraneous and spurious motivation to 
attract the largest number of readers, by arous- 
ing grandiose expectations or by appealing to 
the desire to look at pictures and read jokes. 

Brit J Ed Psychol 10:255 N ’4o. Dr. Vernon 
has written an extremely useful book which 
will find a place on the shelves of the libraries 
of all psychological laboratories, of many re- 
search workers, mental testers, teachers, doc- 
tors, clinic and social workers, and psycholo- 
gists; and it will not remain on the shelves, for 
it will be continually referred to. The author 
has found, as have so many others, that most 
of the standard textbooks on statistical meth- 
ods are too difficult for many students of psy- 
chology. He has therefore dealt with the essen- 
tials of these methods in the earlier portions of 
this book, and has illustrated the principles by 
reference to many psychological and educational 
problems. He has succeeded, to a marked de- 
gree, in making plain much that often remains 
obscure in those text-books mainly concerned 
with technical or mathematical details. In the 
later parts of the book Dr. Vernon deals lucidly 
with the principles of test construction and 
application, with the interpretations of tests and 
examinations, with the new-type examinations, 
and with many recent improvements in examin- 
ing. Readers will find the classified list of edu- 
cational and intelligence tests of use. The criti- 
cal comments on mental tests and the many 
hints to testers contain much timely advice, 
attention to which would prevent a great deal 
of the uninformed criticism against some and 
the extravagant claims made for mental testing 
to-day. An extensive bibliography completes the 
book. 

Brit J Psychol 31:183-4 O ’40. R(obert) H. 
T(houless). * an admirable book * provides an 
interesting and clearly written account of the 
use of statistical methods, of mental testing, and 
of examining. The account of statistical meth- 
ods should be valuable to those students who 
find Fisher too hard, but who want an account 
of the matter which is practical and sound, and 
who also prefer one that is illustrated from the 
field of éducational psychology. * I should have 
preferred to see a fuller account of the method 
of testing significance in small samples. * ability 
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to deal adequately with small samples is a useful 
part of the equipment of every psychological 
Statistician. In experimental psychology, it is, 
of course, essential. Much space is rightly de- 
voted to the correlation coefficient * Vernon 
also very properly points out that adherence to 
the correlation coefficient commonly leads to 
neglect of more appropriate methods of dealing 
with particular problems, such methods, for 
example, as the analysis of variance. It might 
have been better to have given a fuller descrip- 
tion of the analysis of variance, not only be- 
cause it is a method which should very often 
be used, but because it is the expression of a 
way of thinking which is essential to the proper 
understanding of the use of correlation coeffi- 
cients. Much of the blind error in the use of 
correlation coefficients in the past (such as 
Winfield’s determination of the correlation be- 
tween the intelligence coefficients of pairs of 
twins from a single orphanage) would have 
been avoided if investigators had realized that, 
in calculating a correlation coefficient, one is 
comparing variances, There is an interesting 
and useful chapter on factor analysis. In deal- 
ing with the difficult problem of multiple factor 
analysis, the author has adopted the wise course 
of explaining what multiple factor analysis is 
getting at without giving instructions in the 
actual methods of carrying it out. I do not think 
that there is anywhere else where this is done 
for the student. If he wishes to learn the actual 
techniques, he will be in a much better position 
for doing so after he has understood Dr. Ver- 
non’s chapter. Dr. Vernon argues again the 
case against the traditional examination as an 
adequate measuring instrument, It might have 
been better to omit much of the anecdotal evi- 
dence, such as the story of the examiner who 
wrote for his own guidance a model set of an- 
swers which were marked variably and some- 
times severely by his colleagues. It is difficult to 
know how far such selected evidence is truly 
representative. Even the more respectable evi- 
dence of correlation coefficients may often be 
symptomatic of bad examining rather than of 
a necessary defect in the examination. Inde- 
pendent marking of university examinations by 
competent examiners is found to give consid- 
erably higher correlations than those ordinarily 
published. On the other hand, it may also be 
pointed out that the matter may be worse than 
correlation coefficients indicate since these indi- 
cate only the reliability of independent examin- 


ers' marks, not their validity. Unless papers are 
independently marked from typed copies, we 
cannot know how much of such agreement as 
there is may be due to both examiners being 
influenced by handwriting and neatness rather 
than by the candidates' knowledge of the sub- 
ject. The main practical problem for the ex- 
aminer is to use such techniques as will reduce 
the unreliability of his marking as much as 
possible. On this point, Dr. Vernon gives prac- 
tical guidance which would much improve mark- 
ing if it were followed. Towards the end of the 
book, there is a new-type examination on its 
subject-matter which the reader will find it 
amusing to carry out himself. There is a key 
to right answers at the end. Perhaps in a future 
edition the author will add norms. This is a thor- 
oughly useful book which fills a gap of which 
teachers of educational psychology have been 
long conscious. 

Eug R 32:93-4 O '40. Evelyn Lawrence. * 
written primarily for teachers, psychologists 
and doctors, and has the very great merit of 
being the work of a teacher who really under- 
stands the difficulties of his pupils—one of the 
rarest attributes among teachers of mathe- 
matics * Vernon makes each stage perfectly 
clear as he goes along, and gives worked exam- 
ples of the problems he is discussing, without 
however falling into the error of insulting the 
intelligence of his readers. * Teachers will read 
with great interest, and probably some dismay, 
what the author has to say about school marks. 
* It is to be hoped that this book will be widely 
read. Much injustice is done to examinees 
through the sheer inefficiency of examiners, and 
a thorough study of the arguments presented 
should at least form a starting point for im- 
provement both in obtaining and interpreting 
measurements of abilities. 

J Ed (London) 72:304 Ag '40. For some 
time there has been a demand for a book of this 
kind for use in Training Colleges; Dr. Vernon's 
book admirably fills the gap. In a simple and 
direct style, with telling illustration, and with 
the authority of experience * To all teachers 
who propose to construct and to use mental 
tests, or wish to make the most intelligent use 
of school examinations, this book cannot be too 
highly recommended as a first text-book. A 
valuable list of available tests and standardized 
scales, with particulars of age-limits, time, pub- 
lisher and price, is included. 

Occupations 20:475-6 Mr '42. Forrest H. 
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Kirkpatrick. * begins with useful chapters on 
the essentials of statistical procedure in psychol- 
ogy, but its main object is to outline the prin- 
ciples of test construction and administration, 
and to consider questions which arise in the 
interpretation of test results. * Of particular 
interest to counselors are the chapters entitled 
“Analysis of Abilities,” “Mental Tests,” and 
*Hints to Testers.” Some of the problems dis- 
cussed in these—e.g., concerning the nature of 
factors—are important ones which have been 
neglected by other authors; probably it would 
not be too much to say that a number of them 
have been recognized only by psychologists who 
spend a very considerable proportion of their 
time dealing with group and individual tests. It 
is doubtful whether Dr. Vernon’s rather scat- 
tered references to the place of incentives in 
testing can be regarded as adequate. It is ap- 
parent that he recognizes the importance of the 
matter, but it would be possible for the casual 
student of psychology to get the impression that 
Dr. Vernon subscribes to the common view 
that group testing is easy. His remark (p. 187) 
that “group tests are easier (than ordinary tests) 
to procure, to apply, and to score, and they 
demand much less training and experience on 
the part of the teacher” is, despite the caution- 
ary comment which follows it, an unfortunate 
one. Group testing is often fraught with even 
greater dangers than individual testing. It is 
also somewhat disappointing to note that he 
does not deal in an entirely satisfactory way 
with the problem of defining the term “ability.” 
Much of the present-day confusion about the 
use of psychological tests could be dispelled by 
a resolute attack on it by competent people. The 
bibliography in the book seems good, but it 
should not have omitted mention of Bingham’s 
Aptitudes and Aptitude Testing. And it is sur- 
prising to find that, while Oakley’s and Mac- 
rae's Handbook of Vocational Guidance is in- 
cluded in the list, Macrae’s Talents and Tem- 
peraments is not, The two books seem to be 
closely associated when one thinks of vocational 
guidance in Great Britain. 

Personnel Adm 4:16 Mr '42. Forrest H. 
Kirkpatrick. [Practically the same as his re- 
view in Occupations excerpted above.] 

Personnel J 20:228 D "41. Forrest H. Kirk- 
Patrick. [Practically the same as his review in 
Occupations excerpted above.] 

Psychol B 37:748-9 N ’40. Quinn McNemar. 
It can be said that only about one-third of this 


volume is devoted specifically to its title. The 
other two-thirds is about equally divided be- 
tween statistical methods and a discussion of 
school examinations. The attempt to include so 
much within one book has, despite concise writ- 
ing, led to a somewhat superficial treatment of 
some of the topics. This has, without doubt, 
diminished the usefulness of the book to those 
for whom it is primarily intended: teachers and 
examiners, psychologists as scientists, and doc- 
tors and psychologists as clinicians. The re- 
viewer has no quarrel with any part of the book 
which is devoted to statistical techniques except 
the sketchiness. Any hundred-page exposition 
which begins with the most elemental concepts 
and progresses through sampling, goodness of 
fit, linear and curvilinear correlation, to, and 
including, partial and multiple correlation, is 
apt to be so sketchy as to be practically worth- 
less. The beginner in statistics will find the dis- 
cussion inadequate, and the more sophisticated 
will not find it useful. The chapters dealing with 
ordinary mental tests and problems related 
thereto are somewhat more adequate. The author 
rightfully stresses the necessity for a more cau- 
tious and critical use of tests, and presents some 
of the criticisms which have been raised against 
the 1937 Revised Stanford—Binet. In these chap- 
ters one readily notices the influence of Spear- 
man. It would seem that the author has more 
faith in the symbol g than in the general intel- 
ligence measured by the Binet type of test. This 
leaning towards Spearman and g and an as- 
sumption, upon which a criticism is based, that 
the 1937 Stanford Revision “measures an un- 
analyzed hotch-potch of abilities" (p. 193) 

seems amusing when it is recalled that Spear- 

man is on record as saying that g is measurable 
by any hotch-potch collection of tests (Psy- 
chologies of 1930, pp. 343-344). Then on page 
195 the reader is told that the author "would 
agree with Spearman, Cattell [R. B.], and Kent 
that the New Stanford Revision is a distress- 
ingly unscientific instrument owing to the hetero- 
geneity of its content." One wonders why Kent, 
whose main thesis seems to be that the 1937 
Revision is not flexible enough for clinical 
work, should be cited in this connection. As to 
the supposed heterogeneity of the Stanford- 
.Binet, it might be remarked that the recent 
Psychometrika paper of Wright and the far 
more extensive unpublished analyses of the re- 
viewer agree in revealing a surprisingly (to 
some) large common factor in the Binet. The 
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last part of Vernon's book is an excellent, con- 
cise discussion of various types of school ex- 
aminations exclusive of standardized achieve- 
ment tests. The reviewer is of the opinion that 
any instructor, whether at the secondary or 
college level, who is having examination worries 
will find the last four chapters very worth 
while. 

Scottish Ed J 23:594 Ag 30 '40. This singu- 
larly able book...is a welcome addition to the 
growing volume of treatises relating to statistics 
and mental measurement. So far as the re- 
viewer is aware, it is the only British text-book 
which surveys the whole field. Dr. Vernon’s 
book is almost unique, and must long remain a 
competent guide. * The first and longer section 
(Chapters I.-VII.) is concerned with statisti- 
cal methods. The field is covered ably and 
lucidly. No one need be deterred by the treat- 
ment herein given * Terms are clearly ex- 
plained and formulae are in every case illus- 
trated by example. The reviewer would specially 
commend Chapter IV. (Interpretation of School 
Marks and Test Scores) and Chapter V. (Reli- 
ability and the Theory of Sampling). Chapter 
VIII. (Analysis of Abilities) forms a section 
by itself. It deals briefly but clearly with the 
rather difficult subject of factorial analysis, * 
includes a particularly helpful list of tests * 
The final section, comprising four excellent 
chapters, discusses the question of examinations 
—their function, their reliability and unrelia- 
bility, their validity or lack of it, the relative 
merits of the new-type and essay-type, the con- 
struction of the new-type, improvements in ex- 
amining. Altogether a stimulating discussion, in 
which Dr. Vernon is at his very best. A valu- 
able bibliography (120 items) of books or ar- 
ticles recommended for further reading is given. 

For additional reviews, see 40:B1115. 


[ 1220 ] 
XVIRENE, Encar Paur. A Factor Analysis of the 
Scholastic Aptitude Test. Philadelphia, Pa.: Uni- 
versity of Pennsylvania; 1940. Pp. vii, 36. Paper. * 


[ 1221 ] 


Wane, CHARLES K. A. An Annotated Bibliography 
of Mental Tests and Scales, Volume I. Publica- 
tions of the College of Education, Catholic University 
of Peking, No. 1. Peiping, China: Catholic University 
Press, 1939. Pp. vi, 725. * (PA 14:3789) 


Am J Psychiatry 97:990-1 Ja '41. C. R. 
Myers. Although many mental test bibliogra- 
phies of various sorts are now available, there 
is certainly none which is at once so ambitious 


in its scope and so meticulous in its documenta- 
tion as this descriptive catalogue * To those 
who are not familiar with the ever-increasing 
literature on mental tests, this lengthy bibliog- 
raphy may prove both surprising and dismay- 
ing. To those who are familiar with this field, 
the book should prove a useful index for quick 
reference to half-forgotten test devices as well 
as a source of concise, pertinent information on 
test material not previously noted. It is, of 
course, of special interest that this product of 
long and painstaking research should come from 
what we have grown accustomed to think of as 
“war-torn China.” 

Am J Psychol 55:150-1 Ja '42. John G. 
Darley. * A glaring omission is the absence of 
reference to the very large body of standardized 
academic subject-matter achievement tests avail- 
able in the literature. Furthermore, the treat- 
ment of all tests is non-selective, with too little 
information on the utility, reliability, and valid- 
ity of the tests. Almost 600 tests are mentioned 
under the heading of mental capacity; only 
about 60 of these have appeared since 1930. 
The author has unintentionally provided an in- 
teresting historical summary of published tests, 
but he has been inadequately critical of this 
history in his notations. There is also an adher- 
ence, in his classificatory system, to a “faculty” 
approach to mental life that is difficult to main- 
tain in present-day theoretical discussions. The 
volume cannot compare with Buros’ mental 
measurement yearbook series, which is a far 
more authoritative, critical, and selective treat- 
ment of up-to-date testing devices. Dr. Wang 
has, however, documented a period in the his- 
tory of the testing movement which should be 
interesting and nostalgic reading for today’s 
psychologists. 

Brit J Psychol 31:184 O '40. More than 
1770 references are given and their contents 
briefly described. * Whether the preparation of 
this kind of bibliography is genuinely worth the 
labour that must be lavished upon it is perhaps 
doubtful, but it is certain that Professor Wang 
and his collaborators have accomplished the job 
itself in an exceedingly satisfactory manner. 
The annotations are brief but clear, and where 
they cannot be firsthand this is indicated. Any- 
body turning over the pages of this book will 
probably get a very strong impression that re- 
search workers in this field have been more 
anxious to invent tests than to do anything else, 
and also that the number of tests is altogether 
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out of proportion to their originality. It is prob- 
able that the bibliography will be most useful 
to the student who wishes to get a quick spe- 
cialized survey and then to follow this up by 
study of original papers in a more limited field. 

Ed 61:253 D '40. Guy M. Wilson. A total of 
1776 testing and measuring instruments are 
listed and classified. As the author recognizes, 
the classification categories are not always clear 
cut; they do overlap and conflict, but cross 
references aid the reader who might reasonably 
expect a test to be placed under a different 
category. The 1776 mental tests and scales are 
classified broadly into (1) measures of mental 
capacity, (2) measures of personality and char- 
acter, and (3) measures of vocational aptitudes 
and abilities. * an invaluable reference for li- 
braries, professors of education, teachers and 
administrators. There is an attempt at evalua- 
tion of each test through annotations following 
biographical listing; in many cases these anno- 
tations are uncritical quotations or summaries 
from the descriptive circulars of the test mak- 
ers, in some cases they represent keen personal 
judgment. 

J Ed Res 35:51-2 S ’41. T. L. Torgerson. 
* consists of an annotated bibliography of 1776 
titles classified as: tests of Mental Capacity, 
tests of Personality and Character, tests of Vo- 
cational Aptitudes and Abilities. The annota- 
tions include author, date of publication, norms, 
forms, purpose, content, reliability, validity, 
price, publisher and selected references. The 
information is unfortunately incomplete for most 
of the tests especially with respect to reliability 
and validity. Many of the tests are described 
very briefly by statements two or three lines in 
length. Other tests have one or two page anno- 
tations. The inclusion of a Directory of Pub- 
lishers is a valuable addition to the volume. * 
Achievement tests are not included in the vol- 
ume, The author does not attempt to evaluate 
the tests listed. The annotations do however in 
most instances provide valuable descriptive 
material which is a decided improvement upon 
a bibliography which contains a mere list of 
tests. The addition in many instances of peri- 
odical references makes it possible for the con- 
sumer to secure additional information about 
the tests from the standpoint of critical evalu- 
ation, uses, construction and standardization. 
The author states, “The list is especially com- 
plete for tests published in America or appear- 
ing in American Journals.” This is undoubt- 


edly an overstatement. The reviewer, however, 
accepts the fact that the volume does present a 
very useful and comprehensive annotated bibli- 
ography of tests and scales in the fields sam- 
pled. Many of the tests listed are obsolete or 
out of print. The author justifies their inclusion 
as a matter of “scientific record.” Some of the 
omissions and inconsistencies noted can be at- 
tributed to the fact that more information about 
a test is not available. In spite of the shortcom- 
ings listed the reviewer feels that the volume 
makes a distinct contribution as a useful refer- 
ence in the field of psychological testing. 
Sociol & Social Res 25:97 S-O 40. This is 
a remarkable list of 1,776 tests and scales. A 
brief description ranging from 100 words to 
175 words accompanies each test or scale. * 
This volume will prove to be exceedingly useful. 
For an additional review, see 1223. 


[ 1222 ] 


*Wanc, CuarLes K. A. An Annotated Bibliography 
of Mental Tests and Scales, Volume II. Publica- 
tions of the College of Education, Catholic University 
of Peking, No. 1. Peiping, China: Catholic University 
Press, 1940. Pp. viii, 698. * (PA 15:4426) 


Am J Psychiatry 98:785 Mr ’42. A. Jean 
Brown. This second volume....is a bibliography 
including “all tests and other devices which 
would ordinarily be regarded as belonging or 
related to the field of educational measurement.” 
Volume I covers a broader field in mental tests, 
having included means of measuring: I. Mental 
Capacity; 2. Personality and Character; 3. Vo- 
cational Aptitudes and Abilities. * an exhaustive 
and yet condensed guide to tests in educational 
measurement. This bibliography is well classi- 
fied and includes chapters on the following: (1) 
Composite Tests and Test Batteries ; (2) Arith- 
metic; (3) English; (4) Fine Arts; (5) For- 
eign Languages; (6) Handwriting; (7) Mental 
Hygiene; (8) Home Economics; (9) Mathe- 
matics; (10) Science; (11) Psychology and 
Education; (12) Reading; (13) Social Science; 
(14) Spelling. There is also an index of authors 
and subjects. Many of these chapters are sub- 
divided with the result that it is a ready refer- 
ence. As in the first volume each test is anno- 
tated giving name of test, name of author, date 
of publication, price, type of subject for which 
test is intended, including age and grade. Tests 
range from the kindergarten to the college grad- 
uate level. In an attempt to do justice to the 
work one might comment on one topic, “Read- 
ing,” chapter 12, pages 416-482. This chapter 
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includes a descriptive practical note on 219 
reading tests. Since reading disabilities are 
among the problems occurring in children who 
are referred to child guidance clinics, this re- 
view is invaluable to clinical psychologists, in 
the selection of practical tests. These include 
diagnostic, remedial word-recognition and 
word-pronunciation tests which are applicable 
in this field. On the other hand, innumerable 
resources are to be found in silent and oral 
reading and achievement tests, of assistance to 
teachers and educators for school placement. 
These range from elementary to advanced 
grades. This volume is invaluable as a constant 
reference to educators and clinical psychologists 
interested in the field of educational measure- 
ment. 

Am J Psychol 55:611 O 42. John G. Darley. 
* Using the first 94 pages as a sample, it ap- 
pears that 61% of the references are to tests 
published in the decade 1925-35, 27% in the 
years 1900-25, and only 12% in the years 
1935-40. Thus the greater majority of the tests 
cited are outmoded in current testing programs. 
Furthermore, incomplete and inadequate data 
are presented on the reliability, validity, and 
norms for the various tests. This is a partial 
reflection of the actual absence of such data in 
the earlier literature of test-development, but it 
still precludes the use of the bibliography in 
selecting tests. In the light of these two weak- 
nesses—inadequate annotations and lack of re- 
cent materials—the volume is far less useful 
than such sources as Buros’ mental measure- 
ment series, the periodic Reviews of Educa- 
tional Research, or the recent Encyclopedia of 
Educational Research. As was true of Volume 
I, however, Wang has sketched an interesting 
history of the testing movement in his exhaus- 
tive listing of test names. 

For an additional review, see 1223. 


[ 1223 ] 


Wano, CnanLES K, A. An Annotated Bibliograph: 

of Mental Tests and Scales, Volumes I and II. 
Publications of the College of Education, Catholic 
University of Peking, No. 1. Peiping, China: Catholic 
University Press, 1939, 1040. Pp. vi, 725; viii, 698. * 
$11.25 per set of two volumes. (PA 14:3789, 1 5:4426) 


Brit J Ed Psychol 12:136 Je ’42. This sub- 
stantial work...is a product of ten years’ accu- 
mulation of material, and the author is to be 
congratulated on his patient labours and schol- 
arly achievement, even if, as he himself mod- 
estly says, he has failed in his ideal of present- 


ing exhaustive lists of “all instruments of 
mental measurements available in English up to 
1939.” * The volumes reflect a much more com- 
plete familiarity with American writers than 
British, but in spite of that they constitute a 
very useful work of reference. 

For additional reviews, see 1221 and 1222. 


[ 1224 ] 
*Wan Department. The Army Individual Test 
(AIT-1). War Department Pamphlet, No. 12-12. 
Washington, D. C.: Government Printing Office, April 
1945. Pp. ro. Paper. $0.35. * (PA 21:629) 

. [ 1225 ] 

*War Department. Army Trade Screening Tests. 
War Department Pamphlet, No. 12-13. Washington, 
D. C.: Government Printing Office, July 1945. Pp. 11. 
Paper. $0.35. * (PA 21:589) 

[ 1226 ] 
X*WaR DEPARTMENT. Personnel Classification 
Tests. War Department Technical Manual, TM 12- 
260. Supersedes TM 12-260, 31 December, 1942. Wash- 
ington, D. C.: Government Printing Office, April 1946. 
Paper. $0.35. * (PA 17 :2469) 

[ 1227 ] 
*Wan Department. Personnel Efficiency Reports. 
Army Regulations, AR 600-185, 27 May 1947. Super- 
sedes AR 600-185, 15 November 1944, including C1, 31 
October 1946 and Section 1, WD Circular 28, 1947. 
Washington, D. C.: Government Printing Office, May 
27, 1947. Pp. 24. Paper. $0.10. * 

[ 1228 ] 
*Watxins, Joun GoopucH. Objective Measure- 
ment of Instrumental Performance. Columbia Uni- 
versity, Teachers College, Contributions to Education, 
No. 860. James L. Mursell, faculty sponsor. New York: 
Bureau of Publications, the College, 1942. Pp. x, 98. 
$1.60. * (PA 17:334) 

Ed Res B 22:110-1 Ap 14 '43. M. Emett 
Wilson. The title of this dissertation is mislead- 
ing since the only instrument dealt with is the 
trumpet (or cornet). Furthermore, it fails to 
provide a criterion, as it purports to do, against 
which prognostic tests may be evaluated: there 
is no justification for assuming that the specific 
difficulties selected by cornet teachers as indi- 
cations of a student’s progress are more valid 
than the gross evaluations which have been 
used as criteria heretofore. Even the interpre- 
tation that there is negative acceleration in sight 
reading and acquisition of technique-is not jus- 
tified, for there is no assurance that the early 
exercises in the tests differ in difficulty to the 
same degree as the later ones. Moreover, al- 
though the reliability of the test is high in the 
hands of one investigator, the evidence is that 
it would be low when administered by different 
experimenters. The difficult field of testing in 
the ephemeral art of music is a graveyard of 


969 ] BOOKS & REVIEWS: WANG 


1232 


misdirected enthusiasms. Most studies have 
been valuable in showing what not to do. Such 
reports should be circulated among the inves- 
tigators but should not be published or allowed 
to come into the hands of uncritical music edu- 
cators. 

J Appl Psychol 27:369-72 Ag ^43. Abraham 
Pepinsky. * It would have been desirable to in- 
dicate in the title of the book the fact that the 
test is limited to the playing on the cornet. * 
The reasons for the selection of the musical 
instrument to be used in the investigation were 
rather naive. Instruments with extreme range 
were excluded as well as the Violin and Trom- 
bone, which would introduce another element 
of subjective judgment involving correct pitch. 
Yet, the musician would be very apt to consider 
fine pitch discrimination a very important fac- 
tor, even in an instrument with valves, like the 
cornet. It must be admitted that the adjudi- 
cator, as well as the candidate being tested, 
might find this element of pitch a trying one. 
The “psychological intonation” resulting from 
“tendency tones” in a melodic line are admit- 
tedly highly subjective, but are truly meaning- 
ful. It is regrettable that the pitch factor was 
eliminated. The construction of the perform- 
ance test for the cornet was handled very care- 
fully and purposefully. * there were found to be 
a number of controversial points in the scoring 
system. One rule especially provokes the musi- 
cian’s good-natured smile, namely, that “Pauses 
between measures, no matter how long, do not 
constitute errors. However, do not inform the 
student of this fact.” Another bit of instruction : 
“Tones badly out of tune were not counted as 
errors if the correct partial and not an entirely 
different tone was sounded.” * K. L. Bean is 
quoted....that a Gestalt is involved in the read- 
ing of music and that an organismic interpreta- 
tion is desirable when experimenting with melo- 
dies.” It seems almost unbelievable that this 
same thought is not applied to the measurement 
of performance by the adjudicator. No provision 
is made for it whatsoever. * The overview and 
summary offer some pertinent suggestions: (1) 
Standardized grading of music would be valu- 
able. (2) Teachers spend too little time devel- 
oping sight-reading ability. * The bibliography 
«iS a valuable addition. 

J Ed Psychol 33:307-8 My '42. Louis Ches- 
lock. There has been a definite need for testing 
instrumental performance in music. Most per- 
formance tests (rarely using complete melo- 


dies) have been confined heretofore chiefly to 
measuring sight singing. Watkins's very excel- 
lent approach to his subject resulted in a series 
of tests based on a survey and analysis of many 
widely known cornet methods. * The scoring 
unit was each measure instead of each note. 
One defect with this method (although other- 
wise preferable) is that if the subject is un- 
nerved by an error at the end of a measure the 
play-over into the next measure might result in 
two errors counted, whereas the same errors 
embodied within the bar lines would result in 
only one error counted. A list of typical errors 
should have been included and analyzed. Hav- 
ing had the advantage of Wheelwright's fine 
study of Perceptibility and Spacing of Music 
Symbols it seems a pity that the manuscripts 
used (they are reproduced in the Appendix) 
did not more clearly differentiate the “naturals” 
from the “sharps.” Also, the bars for 8th notes 
and 16th notes might have been thicker, and 
some of the notes better spaced and combined. 
In the main, however, the work is very thor- 
ough, and is a genuine contribution to educa- 
tion in the field of music research. 

J Ed Res 37:375 Ja '44. Alvin W. Ahrens. 
* a most valuable study * The author has 
shown that objective measurement of musical 
performance is possible and his methods may be 
found valuable in the building of other research 
instruments in this field. The tests themselves 
should prove useful measures of achievement 
for the research worker in music education, for 
standardizing the grading of music, and for the 
cornet teacher. 


[ 1229 ] 
XWarsox, K. Brantley. The Nature and Measure- 
ment of Musical Meanings. American Psychological 
Association, Psychological "Monographs, Vol. 54, No. 
2, Whole No. 244. Washington, D. C.: the Associa- 
tion, Inc., 1943. Pp. iii, 43. Paper. $1.00. * (PA 17 :248) 


[ 1230 ] 
xWzssrER, Enwanp C. Put Yourself To the Test: 
A Manual of Vocational Self-Guidance. New York: 
Harper & Brothers, 1941. Pp. xi, 129. Out of print. * 


[ 1231 ] 
Wecuster, Davi. The Measurement of Adult In- 
telligence. Baltimore, Md.: Williams & Wilkins Co., 
1939. Pp. ix, 229. (London: Bailliére, Tindall, and 
Cox.) * (PA 13:5389) For latest edition, see 1233. For 
reviews, see 200, 40:1420, and 40:B1121. 


[ 1232 ] 
x*Wrcnusteg, Davi. The Measurement of Adult - 
Intelligence, Second Edition. Baltimore, Md.: Wil- 
liams & Wilkins Co., 1941. Pp. xi, 248. (London : Bail- 
lière, Tindall, and Cox, 1942.) * For latest edition, see 
1233. (PA 16:1239) For reviews, see 300. 
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[ 1233 ] 
*Wzcnsrzn, Davrp. The Measurement of Adult In- 
telligence, Third Edition. Baltimore, Md.: Williams 
& Wilkins Co., 1944. Pp. vii, 258. $2.50. * (PA 19:815) 
For reviews, see 298 and 301. 

[ 1234 ] 
WELLMAN, Betu L. The Intelligence of Preschool 
Children as Measured by the Merrill-Palmer 
Scale of Performance Tests. University of Iowa 
Studies, Studies in Child Welfare, Vol. 15, No. 3, New 
Series, No. 361. Iowa City, Iowa: University of Iowa 
Press, October 1938. Pp. rso. Cloth, $1.35; paper, 
$1.00. (PA 13:1730) 


J Abn & Social Psychol 35:457-62 Jl '40. 
Barbara S. Burks. The material in this study is 
not all directed to the problem of environmen- 
tal influences on test scores. There are sections 
on practice effects, on correlations between origi- 
nal Merrill-Palmer scores and retest scores, 
and between Merrill-Palmer and Binet scores. 
Much of the discussion is made highly cumber- 
some through the separate treatment of scores 
stated first in sigmas, then in IQ’s, then in per- 
centiles, with results so disparate that the direc- 
tion of average change on retest is in some in- 
stances positive by one scoring technique and 
negative by another (Table ro). This fact indi- 
cates, of course, that the norms for the test are 
in need of thorough overhauling, but it does 
not preclude the comparison of groups with 
respect to changes in scores providing proper 
controls and precautions are exercised. The 
question of major interest is whether, and how 
much, attendance in the Iowa Preschools influ- 
ences performance on the Merrill-Palmer. In 
the original monograph Wellman reports a 
gain from fall to spring testing (corresponding 
to the period of Preschool attendance) in ex- 
cess of the gain from spring to fall, of roughly 
5 IQ points on the Merrill-Palmer, and 8 IQ 
points on the Binet (Table 11). Reworking the 
Merrill-Palmer data according to certain rea- 
sonable assumptions, McNemar finds that the 
difference for the one clear-cut subgroup is only 
1.56 times its standard error (corresponding to 
P of about .06). Wellman’s subsequent rework- 
ing of the same data by means of the t test 
emerges with a value of P between .1 and .2. 
(She does not give enough material to enable 
one to judge as to the correctness of her use 
of the ¢ test.) The reviewer believes that if the 
vaguely defined and overlapping subgroups 
could be further unravelled, the evidence would 
be better than either the McNemar or Wellman 
reworking of data indicate, since Wellman’s 
Table 11 contains data that seem too consistent 


to attribute reasonably to chance. Although 
McNemar’s comments on statistical formulae 
and the fundamental conditions of sampling are 
incontrovertible, and Wellman’s use of overlap- 
ping subgroups means that the differences, with 
their calculated sampling errors, are not com- 
pletely independent, the fact remains that a 
large number of tests were given. It seems 
hardly warranted to ignore them as evidence 
simply for want of a valid critical-ratio formula. 
The temptation is, actually, to accept the results 
on their face value, because it would be so sur- 
prising if a test of the Merrill-Palmer perform- 
ance type were not significantly influenced by 
Preschool attendance. Yet we cannot overlook 
the fact that the more verbal Binet scores seem 
to show even more change between fall and 
spring testing than do the Merrill-Palmer—and 
that other studies have indicated the possibility 
of a similar fall-spring, spring-fall mental 
growth pattern in children not attending school. 
The apparently positive relation between Mer- 
rill-Palmer scores and attendance prior to first 
test (p. 67) adds no decisive data because the 
ages of the children are not controlled in this 
comparison. The question under discussion, 
then, seems not to be answered with any finality 
by the data in this monograph. 
For additional reviews, see 40:B1123. 


[ 1235 ] 
X*WrLLMAN, BETH L., AND McCanotess, Bovp R. 
Factors Associated With Binet IQ Changes of 
Preschool Children. American Psychological Asso- 
ciation, Psychological Monographs, Vol. 60, No. 2, 
Whole No. 278. Washington, D. C.: the Association, 
Inc., 1946. Pp. iii, 29. Paper. $075. * (PA 22:1103) 

[ 1236 ] 
*WELLs, F. L., AND RUESCH, JUnGEN. Mental Exam- 
iners' Handbook. New York: Psychological Corpora- 
tion, 1942. Pp. v, 122, Paper, spiral bound. * For 
revised edition, see 1237. (PA 16:4206) 


Am J Psychiatry 100:155 Jl ’43. Elizabeth 
Gifford Bull. * A good deal of the material has 
not beer standardized, and Successful use pre- 
Supposes intensive personal experience, * The 
book is an attempt to bring to the psychiatrist 
an understanding of the psychometrician's tech- 
nique in evaluating the intellectual resources of 
his patients. In doing so, the authors have sac- 
rificed most of the exactness to which intelli- 
gence tests have now attained. The book would 
have been of greater value if well established 
norms had been included with each set of tests. 
The range of difficulty of the material is defi- 
cient at the upper levels. It is sometimes useful 
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to be able to demonstrate superior intelligence. 

J Nerv & Mental Dis 97:725-6 Je ^43. * very 
useful handbook for mental examiners * After 
extensive personal experience the examiner 
usually comes to the acceptance of resultant 
“subjective norms.” The main demands remain 
for him proper organization of the content of 
the tests and convenient methods of presenta- 
tion. Attention to these factors has been the 
authors’ first concern. In this respect their work 
offers advances over many of the presently 
available handbooks. Their material has been 
arranged, from the standpoint of clinical psy- 
chometrics, so as to help toward better stand- 
ardization of technique for the given observa- 
tional range. * A useful feature is the inclusion 
of reduced facsimiles of letter-size cue-sheets 
which may serve as individual record forms for 
any and all the given test procedures. * 


[ 1237 ] 


*Wetts, F. L., AND Ruescu, Jurcen. Mental Exam- 
iners' Handbook, Revised Edition. New York: 
Psychological Corporation, 1945. Pp. vii, 211. Paper, 
spiral bound. $5.00, postpaid. * 


J Appl Psychol 30:293 Je '46. Paul E. Meehl. 
* intended primarily for aid in the examination 
of "psychiatric" patients, that is to say, patients 
whose behavior deviates from the normal much 
more than is found in the practice of most 
readers of the Journal of Applied Psychology. * 
The present edition is an improvement...., espe- 
cially with respect to the presence of norms. 
It is particularly useful in the training of medi- 
cal students just beginning the study of neuro- 
psychiatry, furnishing a more or less objective 
set of tasks by which they may assess the pa- 
tient's functioning without actually requiring a 
thorough study of psychometrics. In some re- 
spects the work might be criticized for lending 
an impression that certain determinations are 
relatively simple and easy, such as the listing of 
Murray's "needs" in the same way as one lists 
the varieties of orientation. Some psychologists 
might object to the giving of “mental age" values 
on certain of the tests for similar reasons. On 
the whole, this Handbook fills an important 
place between rigorous psychometrics and all 
that implies, and the almost wholly unstandard- 
ized and subjective “mental status” examination 
-of clinical psychiatry. 


[ 1238 ] 
*WELTMAN, Naomi, AND Remmers, H. H. Pupils’, 
Parents’ and Teachers’ Attitudes—Similarities 
and Differences. Purdue University, Division of Ed- 


ucational Reference, Studies in Higher Education, 
[No.] 56; Further Studies in Attitudes, Series 9. La- 
fayette, Ind.: the Division, September 1946. Pp. 50. 
Paper. $0.75. * (P4 21:555) 

[ 1239 ] 
*Wickert, FREDERIC, EDITOR. Psychological Re- 
search on Problems of Redistribution. Army Air 
Forces Aviation Psychology Program Research Re- 
ports, Report No. 14. Washington, D. C.: Government 
Printing Office, 1947. Pp. vii, 208. $1.25. * 

[ 1240 ] 
*WicHtwick, M. IRENE. Vocational Interest Pat- 
terns: A Developmental Study of a Group of Col- 
lege Women, Columbia University, Teachers College, 
Contributions to Education, No. 900. Ruth Strang, 
faculty sponsor. New York: Bureau of Publications, 
the College, 1945. Pp. vii, 231. Out of print. * (PA 19: 
3190) 

J Higher Ed 18:53-4 Ja’47. Isabel K. Wal- 
lace. Teachers of personnel work will welcome 
this excellent description of vocational counsel- 
ing in an eastern women’s college of 750 stu- 
dents. * the most significant aspects of the data 
did not lend themselves to elaborate statistical 
treatment, states the author. Consequently, the 
book is pleasantly readable but meager in statis- 
tics. Students will find here a valuable resumé 
of the literature published in the United States 
from 1924 to 1943 concerning the measurement 
of interest as an aid to vocational guidance. * 
Students of personnel work will also find mate- 
rial for building a foundation for vocational 
counseling in the forty case histories by which 
patterns of vocational interest are depicted. * 
the group studied is not a well-selected sample 
of graduates of women’s colleges, because all 
but 6 of the 115 were living in their own homes, 
a fact which undoubtedly had an influence on 
the kinds of jobs they secured. In other words, 
they did not necessarily seek the jobs which they 
most desired but took jobs which would enable 
them to live at home. * Two significant conclu- 
sions stand out from the comparison of interests 
in college and jobs after college. First, there was 
greater agreement between choices of occupa- 
tions in college and jobs obtained after grad- 
uation than between scores on the Strong 
Vocational Interest Blank and jobs. This out- 
come is undoubtedly due to the fact that the 
Strong test is scored for only 17 vocations and 
and thus its usefulness is limited. The second 
significant conclusion related to job satisfaction. 
After four years out of college, there were four 
times as many satisfied as dissatisfied young 
women, when the job was related to expressed 
vocational interest, and four times as many 
dissatisfied as satisfied, when the job was not 
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related to expressed vocational interest (page 
94). Six well-stated implications for counseling 
include the counselor's need for comprehensive 
knowledge of her college and faculty and of the 
multiple phases of the student's personality, 
with emphasis on emotional factors of that per- 
sonality. The counselor should encourage and 
help the student to incorporate her strong long- 
standing interest into her vocational plans, for 
it appears from this study that individuals with 
such persistent strong interests show the great- 
est satisfaction in their chosen work. From the 
study of forty individual patterns, all reported 
in this volume, Miss Wightwick formulates 
four hypotheses, the two most important of 
which may be stated as follows: First, a genu- 
ine vocational interest is indicated in early 
initiation without environmental pressure. Ac- 
cording to a second hypothesis, this interest 
may be deflected by family pressure, financial 
pressure, financial need, a single personality 
lack, or limited job opportunity. There are 
many more conclusions regarding factors affect- 
ing vocational choices and interest patterns, 
which the reader is urged to find for himself. 
In so doing, he will be stimulated to utilize his 
own material for the benefit of other counselors 
of college women. 

Occupations 25:133-7 N ' 46. Barbara Mayer 
Kirchheimer. * of interest and importance both 
to investigators in the field of vocational inter- 
est, and to practicing counselors * A wealth of 
carefully gathered data is presented, including 
material from college cumulative records, apti- 
tude, achievement and interest tests, ratings, 
and interviews. * With so thorough and de- 
tailed a study of such a small group as this, it 
is regrettable that it necessarily was a some- 
what special group. An adequate description of 
the college is not given, and without knowledge 
of the curriculum, etc., it is difficult to evaluate 
this particular sampling. * a review of studies 
on the measurement of interest and on factors 
related to vocational choice...offers, in simple 
and condensed form, a survey which should be 
useful to the counselor unacquainted with the 
psychological literature in this field, Of use, 
also, to the counselor is the Interview Schedule 
for Vocational and Avocational Interest His- 
tories which provides a thorough coverage of 
pertinent information, and might well serve as 
an aid in directing such an inquiry or in elicit- 
ing material upon which to base a judgment of 
interest. Of greatest value to counselors, how- 


ever, is the latter half of the book, a series of 
forty-five individual case studies, grouped ac- 
cording to the subject’s college major. Dr. 
Wightwick knows well and is warmly inter- 
ested in, each of her girls, and does a remark- 
able job of telling each story from the single 
approach of the developing vocational interest, 
Although one misses a part of the picture—the 
inner emotional attitudes and experiences which 
are interwoven with the espousal or denial of 
either a genuine or an artificial interest, never- 
theless, Dr. Wightwick has made a real con- 
tribution in delineating all objective ascertain- 
able facts involved in the determination of 
vocational choice and vocational activity. 
Counselors of girls, whether they be in high 
schools, colleges, or in public or private agen- 
cies may well add to their store of knowledge 
by substituting Dr. Wightwick’s studies for 
the seldom achieved opportunity of first-hand 
controlled observation over so long a period. 
Dr. Wightwick correctly concludes that her 
most significant data on the vocational interests 
of women were secured from the examination 
of individual interest patterns. The interest pat- 
tern emerges as the ball upon which the coun- 
selor’s eye should be focused. The entire study 
becomes a tribute to the effectiveness of the 
Strong Interest Blank for Women. Even 80, 
we might question that all possible diagnostic 
interpretation of the Strong had, in this study, 
been derived. The extent to which general clin- 
cial experience in the use of the Strong has 
been substantiated by this study will reinforce 
counselors’ acceptance of their own impressions 
in using the Strong. Heartening also is the 
degree of stability of measured vocational in- 
terest, and the demonstrated existence in 
women of a continued core of interest, which 
may be at least partially measured. An impor- 
tant conclusion is that neither expressed nor 
measured interest should be used exclusively in 
vocational guidance, for a multitude of other 
factors within and without the person affect 
her vocational choice and adjustment, and must 
be considered in guiding her. This study’s 
primary value lies in its highlighting of the 
role that basic natural interest plays in voca- 
tional choice, and in the occupational and avo- 
cational adjustment of women. 
[ 1241 ] 
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Worr, Rareu ROBINSON, JR. Differential Fore- 
casts of Achievement and Their Use in Educa- 
tional Counseling. American Psychological Associa- 
tion, Psychological Monographs, Vol. 51, No. 1, Whole 
No. 227. Washington, D. C. : the Association, Inc., 1939. 
Pp. v, 53. Paper. $075. * (PA 13:6506) 


Occupations 19:628 My "41. Walter W. 
Cook. In this study an attempt is made to 
determine the differential aptitudes of students 
for verbal and quantitative curricula at Yale 
University. The procedure involves, first, the 
determination that a differential exists between 
verbal and quantitative aptitude, and second, 
the determination of the validity of differential 
prediction as revealed by achievement in the 
freshman year and in the major curricula of 
the junior and senior years. * One may con- 
clude from this study that (1) differential fore- 
casts have some value at the freshman level but 
practically none for predicting success in major 
fields, (2) current measures of achievement at 
Yale appear relatively unreliable, and (3) the 
present admission policy of Yale discriminates 
against students with quantitative ability in 
excess of verbal ability. 

For an additional review, see 40:B1132. 


[ 1243 ] 
X*Worrr, Cuartorte, The Human Hand. Preface by 
William Stevenson. New York: Alfred A. Knopf, Inc., 
1942. Pp. xvii, 198, vi, 26 pl. $3.00. * (PA 17 :3467) 


Am J Psychiatry 100:850 My "44. T(heo- 
phile) R(aphael). * Approaching her task in 
a very direct, systematic manner, the author re- 
views the literature, describes the form, struc- 
ture, and characteristics of the hand in adequate 
and specific detail and relates data from this 
level to findings respecting actual personality 
traits and makeup. These correlations are both 
striking and extensive, enabling the differentia- 
tion of six main hand-personality types with 
fourteen possible two-type combinations. The 
book is well illustrated and a clear, comprehen- 
sive depiction is given of the procedure worked 
out by the author for hand examination and 
interpretation. Also, there is included a helpful 
series of five case examples and a carefully 
selected bibliography. Dr. Wolff is to be con- 
gratulated for the well organized and stimulat- 
ing presentation of a subject definitely meriting 
further and intensive study, with possibility of 
really significant results. This book is decidedly 
worth reading and is strongly recommended to 
all those, psychiatrists and others, seriously and 


scientifically interested in human beings and 
problems of personality. 

Ini J Psycho-Analysis 24:85 pis 1 & 2 743. 
Marjory Brierley. Dr. Charlotte Wolff believes 
that hand-reading can provide a reliable method 
of testing temperament and character. This be- 
lief is the result of intensive study of the human 
hand, normal and abnormal, and of the hands 
and feet of anthropoid apes and monkeys. As a 
result of these researches, she has been led to 
infer correlations between hand-characters, 
physiological conditions and personality trends. 
In this book she supplies a psychophysiological 
theory to account for these correlations and a 
description of the practical methods of hand- 
interpretation she has worked out. The whole 
field covered requires much fuller investigation 
by independent workers before Dr. Wolff's con- 
clusions can be regarded as valid. 

J Am Med Assn 122:1051 Ag 7 '43. This 
purports to be a scientific answer to the age 
long question Is there anything in palmistry? 
So far as the text goes the reviewer does not 
feel that an answer has been given. Some evi- 
dence is given to indicate that certain types of 
hands are found more commonly in the case of 
the insane than in the case of the sane. Perhaps 
the most satisfying evidence is to be found in 
the statement made by Dr. William Stevenson, 
assistant director at the Institute of Experi- 
mental Psychology at Oxford, to the effect that 
Dr. Wolff was able to give good readings of 
character when she was able to see only hands, 
thrust by the owner through holes in a heavy 
curtain. Naturally it is only evidence of this 
type that can have any value. Better yet per- 
haps would be estimates of the personalities 
made solely from prints of hands submitted 
through umpires of such standing in the scien- 
tific world that their conclusions would be re- 
spected. About all this book has done for the 
reviewer is to supply him with information 
about types of hands. There isn't much on 
actual correlations between types of hands and 
types of men and women. It would seem that 
years of research should have enabled Dr. 
Wolff to bring forth more evidence than she 
presents in this book. Actually, if scientifically 
trained men and women are ever to be con- 
vinced that there is something in palmistry the 
proof will have to come from disinterested 
scientists who will see how many hits and, more 
important, how many misses are registered by 
a good palmist who has to base reports on an 
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analysis of scores of hand prints identified only 
by numbers. A one page report on such a study 
would have immensely more value than this 
book has. 

J Nerv & Mental Dis 101:192 F "45. K. 
Goldstein. In the first place, this book has noth- 
ing in common with the usual procedures in 
hand-reading of the palmists. There is no mys- 
tic interpretation. It is a careful study of a 
great number of general and individual peculi- 
arities which the human hands show as to form, 
the relation of its parts to each other, the physi- 
cal qualities, the nails, the lines, the differences 
of the right from the left hand, etc. From these 
different points of view the hands of normal 
persons, of children and adults and neurotics, 
Schizophrenics and idiots are compared. The 
characteristics are related to findings of the 
respective actual personality traits. From this 
body of facts and immense material, rules for 
the interpretation of individual hands are de- 
rived. Detailed description of the procedure in 
cases is presented in a way which may allow 
those not acquainted with this matter to make 
a record or at least to learn the method of 
recording. The conclusions to which the author 
comes, in spite of using careful scientific pro- 
cedures, do not simply evaluate the facts in a 
purely quantitative way. It is a kind of holistic 
approach which does not preclude intuition. I 
think the author is right in saying: *Only those 
who do possess intuition are the real artists, 
the good doctors, and the most profound psy- 
chologists." Use of intuition is not dangerous 
so long as it is controlled by an extraordinary 
amount of knowledge and critique. The pro- 
cedure of Dr. Wolff is, of course, original but 
she had predecessors, and she uses the literature 
in a careful way, particularly the work of the 
little known Carl G. Carus. This book is excel- 
lent so long as it gives facts and interpretation 
based on them. The theoretic explanation shows 
sometimes that the author speaks here not so 
much from first-hand experience. I think that 
Dr. Wolff is to be congratulated on her pre- 
sentation of a subject which is little known and 
until now was obscured by mystical specula- 
tion. This book can be highly recommended to 
psychologists and psychiatrists. Use of this 
method in psychiatry may be very helpful for 
clarification of such an unsolved problem. 

Psychoanalytic R 31:115-6 Ja "44. A. N. 
Foxe. * represents a definite increase in knowl- 
edge of the hand in relation to the body. That 


constitutional formations should manifest them- 
selves in every organ of the body is not surpris- — 
ing. Thus the presence of Herberden’s nodes is 
well known to the profession. To the medical 
profession the study of the hand will be impor- 
tant for the most part only in that it gives — 
clues to disease disturbing the whole body. 
Modern medicine relies more upon mechanical 
instruments and less on the powers of direct 
observation. Clinical observation of a part or 
use of a machine is resorted to only if it gives 
information otherwise unobtainable. Perhaps it ^ 
would be asking too much to demand such hard 
practicality of one interested in the hand, In- 
terpretations of personality from anatomic | 
studies are even more hazardous inasmuch as ^ 
the gap from birth to adulthood is usually un- 
known and often defies revelation; too much 
therefore is apt to be attributed to heredity and 
again one is up against the unknown when it | 
becomes possible to speculate at will. The re- 


viewer would say that although the author 


makes a contribution of some new factual | 
knowledge, that she has not yet completely 
emancipated herself from the superstitious or 
perhaps compulsively determined interpretations 
of the past. It is hoped that she will continue 
with her studies. 


[ 1244 ] 
XWorrr, Werner. The Expression of Personality: 
Experimental Depth Psychology. New York: Har- 
per & Brothers, 1943. Pp. xiv, 334. $3.50. (PA 17 :3467) 


Am J Mental Def 48:191-2 O '43. Heinz 
Werner. * Wolff describes..a number of ex- 
pressive methods ingeniously devised, and 
presents many significant results obtained 
through the use of these methods * [an] inter- 
esting method of analysis of dynamic personality. 
traits is a verbal presentation of a conflicting 
life situation; the subject is requested to solve 
the conflict in one of two opposing directions * 
In another chapter, cleverly devised association 
techniques are described; they appear particu- 
larly useful for analysis of the emotional aspects 
of personality. There is also a discussion of the 
Possibilities of a multi-dimensional diagnosis of 
personality through the composite ratings of 
forms of expressions of an individual by a 
group of observers. It seems to the reviewer 
that some of the methods—with necessary alter- 
ations—could be advantageously employed for 
the study of personality traits of the mentally 
deficient. The chapters are rather loosely con- 
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nected and unequal in scope and in scientific 
rigor. On the whole, the author’s work presents 
a novel and significant approach to the psychol- 
ogy of personality ; it appears particularly valu- 
able in view of the dearth of methods suitable 
for objective study of the fundamental problems 
of individuality. 

Am J Psychiatry 100:576-7 Ja ’44. Arnold 
Gesell. * self-confrontation is the central theme 
of the volume. Here the experimenter attempts 
to come to grips with what he calls “the inner 
personality.” Here are the ultimate depths of 
depth psychology; for the self-judge does not 
always recognize his self and his unconscious 
dynamisms reveal themselves in the manner in 
which he reacts to the self-confrontation. Some- 
times, of course, the self-judge recognizes him- 
self; but Wolff places more weight on the 
“unconscious self-judgments,” on the unrecog- 
nized auto-evaluations which give evidence of 
internal tensions, and the dualities of person- 
ality. He is interested in “latent” traits rather 
than the “manifest” traits which are elicited by 
ordinary clinical methods. Toward this end, he 
constructed what he calls “the crossroads test.” 
* the methods are at present in a cumbersome, 
exploratory stage of development. But the 
author wishes to sink a very deep plummet. 
By the experimental technique of self-confron- 
tation, he hopes to determine the general rela- 
tionships of the tendencies in the depths of 
personality. This is his Gestalt approach to the 
ancient problem of psychic constitution. The 
study of the actual organization and life history 
of personality will require supplementary con- 
cepts and additional labors, which are promised 
in forthcoming reports. Nevertheless, the clos- 
ing section of the present volume deals with 
the psychiatric task of personality diagnosis, 
and maintains that experimental development 
of depth psychology will afford a firmer basis 
for characterological skill. 

Am J Psychol 58:147-9 Ja’45. Max Meenes. 
Believing that psychoanalysis is not taken seri- 
ously by experimental psychologists and that 
tests of personality are superficial, Wolff at- 
tempts to explore the dynamics of personality 
as a whole by means of an ‘experimental’ ap- 
proach to depth-psychology. He argues that 
photographs of faces, profiles and hands, re- 
cordings of the voice, handwriting, style of 
retelling a story, and the like, are consistent 
expressions of personality, and he proposes to 
show that acceptable descriptions can be de- 


rived from them. His proof of consistency is 
the ability of ‘judges’ to pair beyond chance- 
expectancy three voices with three handwritings 
of the same persons, three profiles with three 
pairs of hands, and so on. * The general pro- 
cedure is laborious. It first calls for photographs 
and other records of a variety of things assumed 
to be ‘expressive.’ The results are then pre- 
sented to ‘judges’ (of doubtful training and 
competence), summarized, and interpreted by 
the investigator to bring out correlations. The 
correlations are translated into ‘traits,’ ‘degrees 
of importance,’ ‘contradictions,’ and so on, in 
accord with a doctrine of personality and its 
modes of ‘expression.’ It can hardly be said that 
the validity of the method has been established 
or its conclusions made persuasive, save where 
the critic starts with the author’s assumptions 
and follows his ‘logic.’ It is doubtful whether— 
on the practical side—the procedure would 
yield facts of sufficient value to warrant its use 
even in the clinic. But of greatest importance to 
the descriptive psychologist is the fact that the 
book well illustrates a frequent but illicit use 
of the terms ‘experiment’ and ‘experimental’ to 
describe a method which combines loosely con- 
trolled exercises of the testing sort with free 
interpretation under speculative and doubtful 
assumptions. 

J Consult Psychol 8:125 Mr-Ap’44. Percival 
M. Symonds. * the first book in English by a 
young and gifted psychologist who has recently 
come to this country as a refugee from Europe. 
* He has been a pioneer since 1925 in research 
in “experimental depth psychology,” better 
known in this country as studies in expressive 
movement. * The present volume is a review 
of Wolff’s eighteen years of experimentation 
with judgments of personality as expressed in 
voice, handwriting, gait, the profile, hands, and 
facial expression. A considerable part of the 
book is concerned with experiments on an in- 
dividual’s ability to recognize and judge his 
own personality from various products and ex- 
pressive movements. * Although any separate 
experiment may lack statistical conclusiveness, 
there is a consistency in the results of several 
experiments which is impressive. He found 
that there is an inhibition in a subject toward 
recognizing his own form of expression, but 
that his judgment of his own unrecognized 
forms of expression are emotionally stimulated 
and he characterizes them either favorably or 
unfavorably, but not neutrally. * To Wolff goes 
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the credit for being the first to employ the 
“matching” method of validating psychological 
experiments. However, the statistician might 
shudder to read (p. 23), “the number of correct 
matchings was one and one half times larger 
than would be expected according to the co- 
efficient of chance expectancy.” “Coefficient of 
chance expectancy” is a percentage to be ex- 
pected by chance. The book is a splendid intro- 
duction of Dr. Wolff to psychologists in this 
country and we may expect that he will be able 
to make further important contributions to “ex- 
perimental depth psychology.” 

J Nerv & Mental Dis 99:109 Ja '44. 
A(rthur) N. F(oxe). * The author has done 
a tremendous amount of work and the volume 
is filled with tables, charts and photographs. 
Numerous data are novel and interesting. * To 
be deplored is the use of the dictaphone without 
telling the subjects that a dictaphone might 
someday be used upon them. It will not be pos- 
sible to decide on a paranoid trend in the future 
if a patient hunts around the office for a dicta- 
phone; he may have read this book. Psychologi- 
cally it is a difficult task to examine the validity 
of methods until the results are more complete 
than those the author has presented in this vol- 
ume; the author's tendency is commendably 

. restrained and there is nothing to do but follow 
suit. The reviewer will patiently await further 
developments in this work. 

Mental Hyg 29:314-6 Ap '45. Frank K. 
Shuttleworth. It is a pleasure to call the atten- 
tion of readers....to this book. It deserves to be 
rated as a classic for the skill of its experimental 
attack on the dynamics of personality. Wolff's 
initial work, beginning in 1925, pioneered in 
the application of the Gestalt point of view. 
While American psychologists were fumbling 
with ratings and aimlessly computing correla- 
tions, Wolff had a theory and a method for the 
study of personality that yielded clear answers 
to many problems. His volume is a "must" 
book for all psychologists who are seriously 
interested in experimental studies of personality 
and its underlying dynamics. Instead of center- 
ing attention on single elements or traits of 
personality, Gestalt psychology is interested in 
the structure of personality, the interplay of 
forces exhibited by the organism as a whole in 
specific situations. Thus, instead of rating domi- 
nance on a ten-point scale, Gestalt psychology 
would point out that dominance may represent 
a compensatory mechanism; that it may appear 


in an individual’s relations to his subordinates, 
but not in his relations to his peers; that an 
individual may dominate others, but be unable 
to make decisions for himself; that ratings of 
dominance by different judges may have differ- 
ent meanings. Instead of studying a single per- 
sonality trait and investigating its distribution 
and correlates in a population, Gestalt theory 
would prefer to study a single personality and 
to understand its dynamics in a series of situa- 
tions. * Among personality studies, Wolff's 
earlier work has been accorded the unique dis- 
tinction by Huntley at Harvard of systematic 
repetition and successful verification. Wolff's 
point of attack is the external manifestation of 
the personality in facial expression, voice, gait, 
handwriting, gestures, rhythmic behavior, and 
so on. Photographs, motion pictures, and sound 
recordings were made. **In his experiments 
Wolff discovered that observers rarely recog- 
nized their own voices even when their atten- 
tion was called to the fact that their own voices 
were included in a series, a finding later verified 
by Huntley. Now comes a critical question: Do 
self-characterizations of a voice sample given 
in the absence of recognition of one’s own voice 
agree with the characterizations of others? 
They do. Moreover—and this is one of the 
most exciting of Wolff's findings—these self- 
characterizations made in the absence of recog- 
nition tend to be more emotional, more pene- 
trating, and more complete. * One must read 
the volume to appreciate its meticulous care for 
detail and scientific control, the great variety, 
flexibility, and appropriateness of its experi- 
mental method, and the richness of the implica- 
tions of its findings for a deeper understanding 
of personality. 

Psychiatric Q 18:527-8 Jl 44. * Dr. Wolff 
is to be complimented upon his scholarly work 
and his industry and it is to be hoped that his 
promise of further contributions along this line 
will not be long delayed. 

Psychoanalytic Q 14:116-8 Ja "45. Carl M. 
Herold. * The author, who appears to be an 
inspired and ingenious experimenter, analyzes 
the following forms of expressions: face, pro- 
file, hands, voice, gait, handwriting, manner of 
retelling narratives, forms of associations, and 
rhythms of various forms of expression. These 
expressions were recorded without the subject's 
knowledge. Later the subjects were confronted 
with several such recordings, amongst them 
their own, and asked to characterize them. 
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These characterizations are, I believe, the 
weakest point in these interesting and stimulat- 
ing studies: they are couched in the vague 
terms of everyday psychological descriptions 
and lack any real precision in marked contrast 
to the experimental approach and methods 
which are employed with utmost precision. 
Thus the whole procedure is comparable to the 
attempts of a master photographer who must 

. work with an uncorrected, distorting camera 
lens. Still, the procedure, described with numer- 
ous illustrations and records, is fascinating 
enough to warrant study. * the author’s experi- 
ments of self-confrontation are a means of ob- 
taining a subject’s complex words which he 
considers the highway into the depth of the 
personality. He does not aim at a definition of 
a person’s individual personality, but tries ‘to 
determine the general relationships of the tend- 
encies in the depth of personality’. The re- 
viewer thinks that this goal is reached much 
more easily and directly by psychoanalytic 
methods. The experiments themselves yield 
very interesting data about a person’s image of 
himself—not only his body image but also his 
psychological image. * Since the soundness of 
our personality depends on the degree of our 
orientation in the world of external objects, 
any false orientation is likely to be reflected in 
a similarly false orientation towards ourselves 
and ‘vice versa. * A study of such false self- 
orientation in comparison with external orien- 
tation might be of diagnostic help similar to 
that of the Rorschach test. In any case, the 
studies are certainly worth while getting ac- 
quainted with, in spite of the doubtfulness of 
their theoretical evaluation. 

Q R Biol 19:74-5 Mr '44. * The astonishing 
and important thing is the difference between 
the objective account of a given subject from 
the subject’s own account of himself, both 
when he was aware that he was speaking of 
himself and when he wasn’t aware of it. The 
outstanding thing was that when the subject 
was describing himself, even when he was un- 
aware of it, he used a different kind of language 
than when he was describing someone else, a 
language rich in emotional terms. This held 
true either when giving himself a more positive 
or a more negative description than other ob- 
servers gave. According to the author this taps 
the hidden dynamic factors in the personality. 
The author has attempted to assess the intra- 
personal tendencies by the construction of what 


he calls a crossroad test, divided into a cross- 
road memory test, crossroad perception test, 
crossroad decision test, crossroad color match- 
ing test. These constitute only different ways of 
getting at the vital elements in the intraper- 
sonal tendencies, and furnish in actual practice 
a certain focussing of these tendencies so that 
personality description may be read off with 
reasonable validity. The book is full of ingen- 
ious test methods according to the Gestalt sys- 
tem. The author’s conclusion that he has 
developed the beginning of an experimental 
depth psychology rests on the analogy between 
the inhibitory effects upon the memory of a 
subject when he perceives and recognizes his 
own functioning and the inhibitory effects on 
the verbal expression as noted in the Jung test, 
for example. This estimate of the author's con- 
cerning his own work would appear to have 
considerable merit and to furnish one of the 
few bridges existing between the subjective 
psychoanalytic method and traditional methods 
of experimental psychology. As such the au- 
thor's work is a very important one, and one 
can subscribe fully to the enthusiasm expressed 
by the editor, Gardner Murphy. 


[ 1245 ] 


X*Worrr, Werner. The Personality of the Pre- 
school Child: The Child's Search for His Self. 
New York: Grune & Stratton, Inc., 1946. iE xvi, 341. 
$5.00. * (London: William Heinemann, Ltd. 1947. 
25s.) (PA 21:1706) 


J Consult Psychol 11:224 Jl-Ag '47. In this 
book, Wolff applies his methods of the analysis 
of expression to young children, with results 
that alternate between the inspiring and the 
exasperating. There can be no quarrel with 
many of his fundamental premises, which have 
often been overlooked by other workers: that 
the child's world is not that of the adult; that ~ 
children’s drawings are often expressions of 
how they see themselves and their world, and 
are not merely “crude,” “uncoordinated,” or 
“immature” drawings, as some adults have seen 
them. These, and many other ideas, are charm- 
ingly illustrated by illustrations and by anec- 
dotes throughout the book. However, as the 
report of research that it purports to be, this 
volume is woefully lacking. Hypotheses are 
made, and techniques are proposed, most freely, 
but none of them are demonstrated to be valid 
by any conceivably acceptable experimental de- 
sign. For example, the validity of personality 
diagnosis from children’s scribblings and draw- 
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ings is established by the sentence, “The author 
himself was very surprised that the blind diag- 
nosis coincided in most cases with the records 
and observations made by the workers for the 
individual children” (p. 207). Validity having 
been established by this statement, the author 
then gives twenty pages of detailed instructions 
for the diagnosis of personality from graphic 
expressions. Workers in child psychology 
should read this book; it contains more hy- 
potheses than they and their graduate students 
can investigate experimentally in a decade. 
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J Am Med Assn r17:2110 D 13 '41. This 
small monograph was undertaken to check the 
findings of another author regarding a sup- 
posed relationship between patients' perform- 
ances before and after insulin and metrazol 
shock therapy. The sorting tests were purely 
psychologic with a statistical analysis. The tests 
used were the Weigl Color Form, the Vigotsky 
and the BRL sorting. One hundred and four- 
teen patients were studied. The conclusions 
were that the sorting tests showed the existence 
of a definite relationship between patients’ per- 
formances before and after Shock therapy. 
There is a good bibliography. This publication 
is recommended to all psychologists. 
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J Bus Ed—The Journal of Business Education. xo is- 
sues (omitting Jl, Ag) ; vol, 23 started S '47; $2 per 
year ; 25¢ per issue; Herbert A. Tonne, editor, New 
York University, N. Y.; Robert C. Trethaway, 
managing editor, 512 Brooks Bldg., Wilkes-Barre, 
Pa.: test, 302, 506; bk, 697, 825, 933, 1008, 1083, 
1201-2, 1206 

T Clin Psychol—Journal of Clinical Psychology. 4 is- 
sues; vol. 4 started Ja '48; $4 per year; $125 per 
issue; Frederick C. Thorne, editor, University of 
Vermont, Burlington, Vt.: bk, 77, 949, 956, 1104, 
1118, 1200 

J Consult Psychol—Journal of Consulting Psychology. 
Published by the American Psychological Associa- 
tion, Inc. 6 issues; vol. 12 started Ja-F '48; $5 per 
wei $1 per issue; Laurance F. Shaffer, editor, 525 

est 120th St., New York 27, N. Y.: test, 19, 50, 107, 
219, 223, 228, 232, 245, 253, 201, 271, 446, 448, 404, 466, 
472, 502, 620, 624, 636, 641, 663, 674; bk, 72, 77-8, 8o, 
82-4, 86, 90, 104, 104a, 277-8, 287-8, 293, 207, 300, 652, 
788, 793, 796, 835-7, 841, er 962, 983-4 999, 1070, 
1091, 1104, 1112, 1159, 1166, 1168, 1187, 1190, 1200, 
1244-5 

Ji Grin Law & Criminology—The Journal of Criminal 
Law and Criminology: Including the American 
Journal of Police Science. 6 issues; vol. 38 started 
My-Je 473 $3.75 per year; $1 per issue; Robert H. 
Gault, editor of the Journal of Criminal Law and 
Criminology, Northwestern University, 357 East 
Chicago Ave., Chicago 11, Ill.; Ordway Hilton, edi- 
tor of the American Journal of Police Science, 15 
Park Row, New York 7, N. Y.: bk, 1018, HSI. 

J Ed (London)—The Journal of Education: A 
Monthly Record and Review With Which Is Incor- 
porated The School World. 12 issues; vol. 80 started 
Ja '48 (no. 942) ; 15s. per year; Is. 8d. per issue; 

Salter Davies, editor, Amen House, Warwick 
Square, London E.C.4, England: test, 235; bk, 270, 
747, 776, 778, 781, 788, 804, 837, 870, 875, 891, 928, 
952-3, 961, 964, 1000, 1002, 1026, 1030, 1082, 1085, 
1126, 1141, 1185, 1219 

J Ed Psychol—The Journal of Educational Psychol- 
ogy: Devoted Primarily to the Scientific Problems 
of Learning and Teaching. 8 issues (omitting Je, JI, 
Ag, S) ; vol. 39 started Ja '48; $6 per year; $1.10 
per issue; H. E. Buchholz, managing editor, 10 East 
Centre St., Baltimore 2, Md.: test, 402; bk, 8o, 648, 
652, 758, 815, 871, 800, 912, 944, 956, 962, 065, 1063, 
1067, 1082, 1112, 1143, 1168, 1185, 1200, 1228 

J Ed Res—Journal of Educational Research. 9 issues 
(omitting Je, Jl, Ag) ; vol. 41 started S '47; $4 per 
year; 6oé per issue; A. S. Barr, chairman of the 


editorial board, University of Wisconsin, Madison 6, 
Wis.: bk, 277, 766, 788, 871, 893, 914, 916, 981, 1030, 
1177, 1221, 1228 3 x 

J Excep Child—Journal of Exceptional Children. 
Official publication of the International Council for 
Exceptional Children, a Department of the Na- 
tional Education Association. 8 issues (omitting Je, 
Ji, Ag, S); vol. 14 started O '47; sms 30¢ 
per issue; Francis E. Lord, editor, Michigan State 
Normal College, Ypsilanti, Mich. : test, 110; bk, 1030. 

J General Psychol—The Journal of General Psychol- 
ogy: Experimental, Theoretical, Clinical and His- 
torical Psychology. 4 issues in 2 volumes; vol. 37 
started Jl '47; vol. 38 started Ja '48; $14 per year; 
$4 per issue; Carl Murchison, editor, 2 Commercial 
St., Provincetown, Mass. : bk, 84, 716 

J Genetic Psychol—The Pedagogical Seminary and 
Journal of Genetic Psychology: Child Behavior, Ani- 
mal Behavior, and Comparative Psychology. 4 issues 
in 2 volumes; vol. 70 started Mr '47 ; vol. 71 started 
S '47; $14 per year; $4 per issue; Carl Murchison, 
editor, 2 Commercial St., Provincetown, Mass.: bk, 
278, 1102, 1104 

J Health & Phys Ed—The Journal of Health and 
Physical Education, Official journal of the American 
Association for Health, Physical Education and 
Recreation. 10 issues (omitting Jl, Ag); vol. 19 
started Ja '48; $3.50 per year; 35¢ per issue; Mary 
Wibel, editor, 1201 16th St, N.W., Washington 6, 
D. C.: bk, 789 

J Higher Ed—The Journal of Higher Education. 
Published by the Bureau of Educational Research, 
Ohio State Universitv. 9 issues (omitting Jl, Ag, S) ; 
vol. 19 started Ja '48; $4 per year; so¢ per issue; 
R. H. Eckelberry, editor, Ohio State Universitv, 
Columbus 10. Ohio: test, 538; bh, 299-300, 778, 798, 
817, 1008, 1168, 1240 

J Med Soc N J—The Journal of the Medical Society 
of New Jersey. Published by the Medical Societv 
of New Jersey. 12 issues; vol. 45 started Ja '48; $3 
per year; 30 per issue; Henry A. Davidson, editor, 
315 West State St, Trenton 8, N. J.: bk, 75 

J Mental Sci—The Journal of Mental Science, Pub- 
lished by authority of the Royal Medico-Psycho- 
logical Association. 4 issues; vol. 93 started Ja '47 
(no. 390) ; 425. per year; ros. 6d. per issue; G. W. 
T. Fleming, editor-in-chief, Barnwood House, 
Gloucester, England: test, 29; bk, 788 
Nerv & Mental Dis—The Journal of Nervous and 
Mental Disease: An Educational Journal of Neuro- 
psychiatry. 12 issues in 2 volumes; vol. 106 started J1 
447; vol. 107 started Je '48; $12.50 per year; $1 per 
issue; Nolan D. C. Lewis, managing editor, 70 Pine 
St, New York s, N. Y.: fest, 110; bb, 28, 78, 81-2, 
277, 788, 800, 851, 875, 986, 1042, 1106, 1123, 1236, 
1243- 

J Personality—Journal of Personality. Title was Char- 
acter and Personality (q. v.) through vol. 13, no. 4, 
Je "45. 4 issues; vol. 16 started S '47; $2 per year; 
5o£ per issue; Karl Zener, editor, Drawer D, College 
Station, Durham, N. C. 

J Social Psychol—The Journal of Social Psychology: 
Political, Racial, and Differential Psychology. 4 is- 
sues in 2 volumes; vol. 25 started F "47; vol. 26 
started Ag '47; $14 per year; $4 per issue; Carl 
Murchison, editor, 2 Commercial St., Provincetown, 
Mass. : bk, 299, 716 

J Speech Disorders—The Journal of Speech Dis- 
orders. Published by the American Speech Correc- 
tion Association. 4 issues; vol, 12 ended D "47; title 
changed to The Journal of Speech and Hearing Dis- 
orders beginning with vol. 1, no. 1, Mr '48; $3.50 
per year; $1 per issue; Wendell Johnson, editor, 
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Psychological and Speech Clinic, State University of 
Iowa, Iowa City, Iowa; Ernest H. Henrikson, review 
editor, University of Minnesota, Minneapolis 14, 
Minn.: bk, 277, 1006, 1094 

Jun Col J—The Junior College Journal. Official Organ 
of the American Association of Junior Colleges. 9 
issues (omitting Je, Jl, Ag) ; vol. 18 started 47; 
$3 per year; 50¢ per issue; Leonard V. Koos, editor, 
University of Chicago, Chicago 37, Ill: bk, 788, 955 


LIB Q—The Library Quarterly: A Journal of Inves- 
tigation and Discussion in the Field of Library 
Science. Published by the University of Chicago. 
4 issues; vol. 18 started Ja "48; $5 per year; $1.50 
per issue; Leon Carnovsky, managing editor, Gradu- 
ate Library School, University of Chicago, Chicago 
37, Ill.: bk, 1158 

Librarian—The Librarian and Book World: The In- 
dependent Professional Journal. 12 issues; vol. 37 
started Ja "48; 12s. per year; Is. per issue; A. hi 
Millard, editor, s Wardrobe Place, London E.C.4, 
England: bk, 788, 1011 

Loyola Ed Digest—Loyola Educational Digest. Dis- 
continued with vol. 19, D '43: bk, 42, 768, 788, 837, 
845, 871, 012, 914, 933, 940, 952, 954, 905-6, 970, 976, 
1064, 1114, 1189, 1199 


MATH GAZETTE—The Mathematical Gazette. Pub- 
lished by the Mathematical Association. Usually 5 
issues ; vol. 32 started F '48 (no. 298) ; 155. per year ; 
4s. per issue; T. A. A. Broadbent, editor, Royal 
Naval College, Greenwich, London S.E.10, England: 
bk, 788, 804 

Math Mag—Mathematics Magazine. Title was Na- 
tional Mathematics Magazine. (q. v.) through vol. 
20, no. 2, N '45. 5 issues (omitting Jl-Ag) ; vol. 21 
started S-O '47; $3 per year; 65¢ per issue; Glenn 
James, managing editor, University of California, 
Los Angeles 24, Calif. 

Mech Eng—M echanical Engineering. Published by the 
American Society of Mechanical Engineers. 12 is- 
sues; vol. 70 started Ja '48; $7 per year; 756 per is- 
sue; George A. Stetson, editor, 29 West 39th St, 
New York 18, N. Y.: test, 639 

Med Digest—M edical Digest. 12 issues ; $1.25 per year; 
U. B. Narvanrao, editor, r Damodar Mansions, Gir- 
gaum, Bombay 4, India: bk, 788 

Med Times—M edical Times: The Journal of the 
American Medical Profession. 12 issues; vol. 76 
started Ja '48; subscription rates on request; Arthur 
C. Jacobson, editor-in-chief, 67 Wall St, New York 
5, N. Y.; Andrew M. Babey, review editor, 1313 Bed- 
ford Ave., Brooklyn 16, N. Y.: bk, 788 

Mental Hyg—Mental Hygiene: Quarterly Magazine 
of the National Committee for Mental Hygiene, Inc. 
4 issues; vol, 32 started Ja '48; $5 per year; $1.25 
per issue; George S. Stevenson, editor, 1790 Broad- 
way, New York 19, N. Y.: bk, 28, 37, 962, 1244 

Mil Surg—The Military Surgeon. Official publication 
of the Association of Military Surgeons of the 


United States. 12 issues in 2 volumes; vol. 101 started ` 


Jl '47; vol. 102 started Ja '48; $6 per year; 50¢ per 
issue; James M. Phalen, editor, Army Medical Mu- 
seum, Washington 25, D. C.: bk, 78 

Mod Lang J—The Modern Language Journal. Pub- 
lished by the National Federation of Modern Lan- 
guage Teachers Association. 8 issues (omitting Je, 
Jl, Ag, S) ; vol. 32 started Ja '48; $2.50 per year; 
4o¢ per issue; William S. Hendrix, mE edi- 
tor, Ohio State University, Columbus 10, Ohio: test, 


211 

Munic Ref Lib Notes—Municipal Reference Library 
Notes. Published by the New York Public Library 
for circulation among the officials and employees of 


the City of New York. 10 issues (omitting Jl, Ag); 
vol. 30 (no. 21) started Ja '48; $2 per year; 20€ 
per issue; Rebecca B. Rankin, editor, Municipal Ref- 
erence Library, Room 2230, Municipal Building, 
New York 7, N. Y.: bk, 788 

Music Ed J—Music Educators Journal: Formerly 
Music Supervisors Journal. Published by the Mu- 
sic Educators National Conference. 6 issues; vol. 
34 started S-O '47; $1.50 per year; 35¢ per issue; 
Clifford V. Buttelman, managing editor, 64 East 
Jackson Blvd., Chicago 4, Ill.: bk, 1043 


N Y TIMES BOOK R—The New York Times Book 
Review. 52 issues; vol. 43 started Ja 4 '48; $3 per 
ear; John K. Hutchens, editor, New York Times, 

imes Square, New York 18, N. Y.: bk, 1042, 1089 

Nat Math Mag—National Mathematics Magazine. Ti- 
tle changed to Mathematics Magazine (q. v.) begin- 
ning with vol. 21, no. 1, S-O '47: bk, 788 

Nation’s Sch—The Nation's Schools: The Magazine 
of Better School Administration, 12 issues in 2 vol- 
umes; vol. 40 started Jl '47; vol. 41 started Ja "48; 
$3 per year; 35¢ per issue; Arthur H. Rice, manag- 
ing editor, 919 North Michigan Ave., Chicago II, 
Ill.: bk, 781, 1072, 1166 

Nature—Nature, 52 issues in 2 volumes; vol. 160 
started Jl 5 '47 (no. 4053); vol. 161 started Ja 3 
'48 (no. 4079) i 90s. year; Is. 6d. per issue; 
A. J. V. e and L. J. F. Brimble, joint editors, 
Macmillan & Co, Ltd, St. Martin's St., London 
W.C2, England: test, 29; bk, 278, 1086 

Nerv Child—The Nervous Child: Quarterly Journal 
of Psychopathology, Psychotherapy, Mental Hy- 
giene, and Guidance of the Child. 4 issues; vol. 7 
started Ja '48; $5 per year; $1.50 per issue; Ernest 
Harms, editor-in-chief, 30 West 58th St., New York 
19, N. Y.: bk, 82, 87, 817 

New Statesman & Nation—The New Statesman and 
Nation: The Week-end Review. 52 issues in 2 yol- 
umes; vol. 34 (no. 852) started JI 5 '47; vol. 35 (no. 
878) started Ja 3 '48; 32s. 6d. per year; 6d. per is- 
sue; the Literary Editor, 10 Great Turnstile, Lon- 
don W.C.r, England: bk, 953 


OCCUPATIONAL MEDO cubalah Medicine. 
Title was War Medicine (q. v.) t rough vol. 8, N-D 
"4s. Published by the American Medical Association. 
12 issues in 2 volumes; vol. 3 started Ja '47; vol. 4 
started J1 '47; $6 per Heec 75¢ per issue; William 
A. Sawyer, editor-in-chief, 343 State St., Rochester 


4 No 
Occupational Psychol—Occupational Psychology. Pub- 
* lished by the National Institute of Industrial Psy- 
chology. 4 issues; vol. 22 started Ja '48; 20s. per 
year; 5s. per issue; Alec Rodger, editor, Aldwych 
House, London W.C.2, England: bk, 78, 293, 300, 
652, 756, 758, 796, 870, 873, 875, 952, 1002, 1082, 1168, 


1200 

Occupations—Occupations: The Vocational Guidance 
Journal. Published by the National Vocational Guid- 
ance Association, Inc. 8 issues (omitting Je, Ji, Ag, 
S); vol. 26 started O '47; $4.50 per year; Oo¢ per 
issue; Harry D. Kitson, editor, 82 Beaver St., New 
York s, N. Y.: test, 633, 635, 640, 655, 686; bk, 82a, 
293, 648, 652, 715, 724, 756, 758, 766, 788, 817, 827-8, 
831, 833, 841, 912, 918, 981-2, 1000, 1008, 1014, 1032, 
1038, 1082, 1166, 1177, 1200, 1219, 1240, 1242 


PERSONNEL—Personnel. Published by the Ameri- 
can Management Association. 6 issues; vol 24 
started Jl 47; $5 per year; $1 per issue; available 
only to members and employees of member com- 

nies; James O. Rice, editor, 330 West 42nd St., 
ew York 18, N. Y.: bk, 841 


Personnel Adm 
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Personnel Adm—Personnel Administration. Official 
publication of the Society for Personnel Administra- 
tion. 6 issues; vol. 10 started S '47; $3.50 per year; 
75€ per issue; O. Glenn Stahl, editor, Room 5654, 
Federal Security Agency, Fourth and Independence 
Ave, S. W., Washington 25, D. C.: bk, 301, 788, 
817, 842, 1219 

Personnel J—Personnel Journal: The Magazine of 
Labor Relations and Personnel Practices. 10 issues 
(omitting Jl, Ag); vol. 27 started Wy '48; $5 per 
year ; 75€ per issue; Edward N. Hay, editor, Swarth- 
more, Pa.: bk, 766, 1082, 1190, 1219 

Prison World—The Prison World: For the Pro- 

ressive Administration of Jails, Prisons and Re- 
ormatories, Official publication of the American 
Prison Association and National Jail Association. 6 
issues; vol. 10 started Ja-F '48; $2 per year; soé 
S issue; Roberts J. Wright, managing editor, 135 

ast 15th St, New York 3, N. Y.: bk, 82, 652, 788, 
949, 1006 ; 

Progressive Ed—Progressive Education, Published by 
the American Education Fellowship. 7 issues (omit- 
ting Je, Jl, Ag, S, D) ; vol. 25 started O '47; $3.75 
per year; 5o¢ per issue; B. Othanel Smith, editor, 
Box 33, University Station, Urbana, Ill: bk, 845, 
955 

Psychiatric Q—The Psychiatric Quarterly. Official sci- 
entific organ of the New York State Department of 
Mental Hygiene. 4 issues; vol. 21 started Ja '47; 


M per year; $1 per issue; Newton J. T. Bigelow, 
editor, Utica State Hospital, Utica, N. Y.: bk, 72, 
75, 78, 80, 84, 90-1, 299-300, 986, 090, 1244 


Psychiatry —Psychiatry: Journal for the Operational 
Statement of Interpersonal Relations. Published by 
the William Alanson White Psychiatric Foundation, 
Inc. 4 issues; vol rr started F '48; $8 per year; 
$2 per issue; Harry Stack Sullivan, editor, 1711 
Rhode Island Ave., Washington 6. D. C.: bk, 28, 
78-0, 82, 84, 87-8, 300, 788, 056, 1168 

Psychoanalytic Q—The Psychoanalytic Quarterly. 4 
issues; vol. 17 started Ja “48; $8 ner year; $2.50 per 
issue; Raymond Gosselin, editor, Room 912, 57 West 
s7th St, New York 19. N. Y.: bk, 28. 75-6, 78, 81, 
84, 87, 270, 200, 760, 788, 956, 962, 986, 1093, 1104, 
1126, 1158, 1164, 1244 

Psychoanalytic R—The Psychoanalytic Review: An 
Educational American Journal of Psychoanalysis 
Devoted to an Understanding and Education of Hu- 
man Behavior. 4 issues; vol. 35 started Ja '48; $7 
per year; $2 per issue; Nolan D. C. Lewis, manag- 
ing editor, 70 Pine St, New York 5, N. Y.: bk, 28, 
769, 986, 1003, 1158, 1164, 1243 

Psychol B—Psychological Bulletin. Published by the 
American Psychological Association, Inc. 6 issues; 
vol. 45 started Ja '48; $7 per year; $1.25 per issue; 
Lyle H. Lanier, editor, Vassar College, Poughkeep- 
sie, N. Y.: bk, 72, 75, 78, 80, 82, 82a, 84, 282, 203, 
301, 652, 788, 807, 887, 910, 912, 942, 049, 956, 962, 
986, 1038, 1091, 1102-3, 1112, 1147, 1177, 1200, 1206, 
1219 

Psychometrika—Psychometrika: A Journal Devoted 
to the Development of Psychology as a Quantita- 
tive Science. Official journal of the Psychometric 
Society. 4 issues ; vol. 13 started Mr ’48; $10 per year 
for current subscriptions (2 copies of each issue) ; 
5 per back volume; $1.25 per back issue (current 
issues not sold) ; Harold Gulliksen, managing editor, 
Box 502, Princeton, N. J.; J. P. Guilford, review 
editor, University of Southern California, Los An- 
geles 7, Calif. : bk, 652, 828, 870 

Psychosom Med—Psychosomatic Medicine: Experi- 
mental and Clinical Studies. Published under edi- 
torial supervision of the American Society for 


Research and Psychosomatic Problems, Inc. 6 is- 
sues; vol. 10 started Ja-F '48; $6.50 per year; $1.50 
per issue; Carl Binger, editor-in-chief, 714 Madison 
Ave., New York 21, N. Y.: bk, 28, 75-6, 78-9, 81, 
277, 1093, 1123 

Pub Personnel R—Public Personnel Review: The 
Quarterly Journal of the Civil Service Assembly 
of the United States and Canada. 4 issues; vol. 9 
started Ja '48; $5 per year; $1.50 per issue; James 
M. Mitchell, editor, Room 456, 1313 East 6oth St., 
Chicago 37, Ill.: bk, 650, 652, 686, 732, 788, 918, 955, 
997, 1008, 1082-3, 1135 


Q R BIOL—The Quarterly Review of Biology. 4 is- 
sues; vol 23 started Mr '48; $5 per year; $1.50 
per issue; B. H. Willier, editor, Department of Bi- 
ology, Johns Hopkins University, Baltimore 18, Md.: 
test, 462, 469; bk, 75, 277, 279, 300-1, 716, 760, 788, 
799-800, 851, 944, 962, 976, 986, 1006, 1042, 1123, 1143, 
1156, 1189, 1244 


RORSCHACH RES EXCH—Rorschach Research 
Exchange. Title changed to Rorschach Research Ex- 
change and Journal of Projective Techniques (q. v.) 
beginning with vol. 11, no. I, '47: bk, 76, 89a. 

Rorschach Res Exch & J Proj Tech—Rorschach Re- 
search Exchange and Journal of Projective Tech- 
niques. Title was Rorschach Research Exchange 
(q. v.) through vol. ro D '46. Published by the 
Rorschach Institute, Inc. 4 issues; vol. 11 started 
S '47; $5 per year; Bruno Klopfer, editor; Camila 
Kemple, executive editor, 35 West 58th St, New 
York 19, N. Y. 

Rural Sociol—Rural Sociology. Official organ of the 
Rural Sociological Society. 4 issues; vol. 32 started 
Mr '47; $3.50 per year; gog per issue; Howard W. 
Beers, editor, University of Kentucky, Lexington, 
Ky.; Selz C. Mayo, managing editor, North Carolina 
State College, Raleigh, N. C.; Otis D. Duncan, re- 
view editor, Oklahoma Agricultural and Mechanical 
College, Stillwater, Okla.: bk, 797 


SCH—The School, Elementary Edition; and The 
School, Secondary Edition: A Magazine for Teach- 
ers and Inspectors. The book review sections are 
identical in content and pagination in both editions. 
Published by the Ontario College of Education, Uni- 
versity of Toronto. 9 issues (omitting Jl, Ag. S) ; 
vol. 36 started O '47; $1.00 per year; 25¢ per issue; 
Adrian Macdonald, editor, 206 Huron St., Toronto 
5, Ontario, Canada: test, 272; bk, 788 

Sch Executive—The School Executive: Including 
School Equipment News. 12 issues; vol. 67 started 
S '47; $3 per year; 35¢ per issue; Walter D. Cock- 
ing, chairman of the board of editors, 470 Fourth 
Ave., New York 16, N. Y.: bk, 788 

Sch R—The School Review: A Journal of Secondary 
Education. Published by the Department of Educa- 
tion, University of Chicago. 10 issues (omitting Jl, 
Ag); vol. 56 started Ja "48; $3 per year; 4o¢ per 
issue; Ralph W. Tyler, chairman of the editorial 
committee, University of Chicago, Chicago 37, Ill.: 
test, 639; bk, 715, 721, 782, 788, 851, 962, 1100, 1123, 
1166, 1168, 1177, 1200 

Sch Sci & Math—School Science and Mathematics: 
A Journal for All Science and Mathematics Teach- 
ers. Published by the Central Association of Science 
and Mathematics Teachers, Inc. 9 issues (omitting 
Ji, Ag, S); vol. 48 started Ja ’48; $3.50 per year; 
40€ per issue ; Glen W, Warner, editor, 7633 Calumet 
Ave., Chicago 19, Ill.: bk, 915, 1008 

Sch & Soc—School and Society. Published by the 
Society for the Advancement of Education, Inc. 52 
issues in 2 volumes; vol. 66 started Jl 5 '47; vol. 
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67 started Ja 3 '48; $7 per year; 20¢ per issue; I. L. 
Kandel, editor, 15 A erdin Ave., New York 23, 
N. Y.: test, 281; bk, 721, 817, 1168 

Schoolmaster—The Schoolmaster and Woman Teach- 
ers Chronicle. The organ of the National Union 
of Teachers. 2d. per issue; vol. 153 (no. 2010) started 
Ja '48; Hamilton House, Hastings St., London 
W.C.1, England: test, 270 

Sci Ed—Science Education. The official organ of the 
National Association for Research in Science Teach- 
ing, the National Council on Elementary Science, 
the Association of Science Teachers of the Middle 
States, 5 issues (F, Mr, Ap, O, D) ; vol. 32 started 
F '48; $2.50 per year; 75¢ per issue; Clarence M. 
Pruitt, editor, Oklahoma Agricultural and Mechani- 
cal College, Stillwater, Okla.: bk, 37, 943, 1062 

Scientific Mo—The Scientific Monthly. Published by 
the American Association for the Advancement of 
Science. 12 issues in 2 volumes; vol. 65 started Jl 
'47; vol. 66 started Ja '48; $7.50 per year; 75¢ per 
issue; F. L. Campbell, editor, 1515 Massachusetts 
Ave., N. W., Washington 5, D. C.: bk, 807 

Scottish Ed J—The Scottish Educational Journal. The 
Official Organ of the Educational Institute of Scot- 
land. s2 issues; vol 31 started Ja 2 23 2d. per 
issue; The Editors, 47 Moray Place, Edinburgh, 
Scotland : bk, 964, 1219 

Social Ed—Social Education. Official journal of the 
National Council for the Social Studies. Published 
by the Council in collaboration with the American 

istorical Association. 8 issues (omitting Je, Jl, 

Ag, S); vol. 12 started Ja '48; $3 per year; 40¢ 
per issue; Lewis Paul Todd, editor, 1201 Sixteenth 
St., Washington 6, D. C.: bk, 890, 1159, 1206 

Social Sci—Social Science. Published for Pi Gamma 
Mu and the Academy of World Economics. 4 issues ; 
vol. 23 started Ja '48; $2 per year; 50¢ per issue; 
Edward W. Carter, assistant editor, University of 
Pennsylvania, Philadelphia, Pa.; Charles W. Shull, 
IW editor, Wayne University, Detroit, Mich.: 

, I151 

Social Service R—The Social Service Review: A 
Quarterly Devoted to the Scientific and Professional 
Interests of Social Work. Published by the Univer- 
sity of Chicago. 4 issues; vol. 22 started Mr '48; 
$6 per year; 5175 per issue; Edith Abbott, editor, 
University of Chicago, Chicago 37, Ill.: bk, 716, 


766, 893 

Social Studies—The Social Studies for Teachers and 
Administrators: Continuing The Historical Outlook. 
8 issues (omitting Je, Jl, Ag, S); vol. 39 started 
Ja '485 $2 per year; 30€ per issue; Arthur C. Bining, 
editor, Ira Kreider, review editor, 809-811 North 
Toth St., Philadelphia 30, Pa. : bk, 890 

Sociol & Social Res—Sociology and Social Research: 
An International Journal. 6 issues; vol. 32 started 
S-O '47; $3.50 per year; 7o¢ per issue; Emory S. 
Bogardus, editor, 3551 University Ave., Los Angeles 
7, Calif. : bk, 37, 716, 942, 1152, 1177, 1221 

Sociometry—Sociometry: A Journal of Inter-Personal 
Relations. 4 issues; vol. IO started F '47; $6 per 
yan $2.50 per issue; J. L. Moreno, editor, P. O. 

ox 311, Beacon, N. Y.: bk, 956, 1074 
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Wilson Lib B 


TEACH COL J—The Teachers College Journal. Pub- 
lished by the Indiana State Teachers College. 6 
issues (omitting F, Ap, Je, Jl, Ag, S); vol 19 
started O '47; gratis; J. Erle Grinnell, editor, the 
Collse; Terre Haute, Ind.: test, 35, 489, 612; bk, 


7 

Teach Col J (Manila)—The Teachers College J ournal. 
4 issues; vol. 3 started JI-S '41; Flora A. Ylagen, 
editor-in-chief, 629 Tanduay, Manila, Philippine 
Islands : bk, 788 3 

Teaching—Teaching: A Quarterly Technical Journal 
for Teachers. 4 issues; vol. 20 started S '47; Rs 3 or 
5s. per year; Re 1 or Is. 6d. per issue; The Editor, 
c/o Oxford University Press, Post Box 31, Bom- 
bay, India: bk, 788, E 

Times Ed Sup—The Times Educational Supplement. 
52 issues; 38th year started S 6 '47 (no. 1688); 
17s. 4d. per year; 3d. per issue; The Editor, The 
Times Publishing Co., Ltd., Printing House Square, 
Parish of St. Ann, Blackfriars, London E.C.4, Eng- 
land: test, 270; bk, 288 

Times Lit Sup—The Times Literary Supplement. 52 
issues; 47th year started Ja 3 '48 (no. 2396) ; 175. 
6d. per year; 3d. per issue; Printing House Square, 
London E.C.4, England: bk, 1 

Training Sch B—The Training School Bulletin, 10 
issues (omitting Jl, Ag); vol. 44 started Mr '47; 
$1 per year; 15¢ per issue; Training School, Vine- 
land, bk, 1093 


U S NAVAL MED B—United States Naval Medical 
Bulletin. Published by the Bureau of Medicine and 
Surgery, Navy Department. 6 issues; vol. 48 started 
Ja-F '48; $1.75 per year; 35¢ per issue; Louis H. 
Roddis, editor, Division of Publications, Bureau of 
Medicine and Surgery, United States Navy, Wash- 
ington, D. C.: bk, 79, 82, 90 

Understanding the Child—Understanding the Child: 
A Magazine for Teachers. Published by the Na- 
tional Committee for Mental Hygiene, Inc. 4 issues 
(Ja, Ap, Je, O) ; vol. 17 started Ja '48; $1 per year; 
25€ per issue; W. Carson Ryan, editor, Box 810, 
Chapel Hill, N. C.: test, 272 

Univ Mich Sch Ed B—The University of Michigan 
School of Education Bulletin. 8 issues (omitting Je, 
Jl, Ag, S) ; vol. 19 started O '47; gratis; J. B. Ed- 
monson, chairman of the editorial board, University 
of Michigan, Ann Arbor, Mich.: bk, 279 

Univ Wash Col Ed Rec—The University of Wash- 
ington College of Education Record. 6 issues (omit- 
ting My, Je, Jl, Ag, S, O) ; vol. 14 started N '47; 
gratis; Francis F. Powers, chairman of the editorial 
board, Bureau of Educational Research and School 
Service of the College of Education, University of 
Washington, Seattle 5, Wash.: bk, rooo 


WAR MED—War Medicine. Title changed to Occu- 
pational Medicine (q. v.) beginning with vol. 1, no. 
1, Ja '46: bk, 76, 78, 82, 1190 

Wilson Lib B—Wilson Library Bulletin. 10 issues 

(omitting Jl, Ag); vol. 22 started S '47; $2 per 

year; 25¢ per issue; Marie D. Loizeaux, editor, 

H. W. Wilson Co., 950 University Ave., New York 

52, N. Y.: bk, 788 
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entirely or in large part to tests; asterisks indicate other publishers of one or more tests listed 

in this volume. Branch office addresses are given for selected test publishers. Numbers preceded 
by "test" refer to tests; numbers preceded by "bk" refer to books. 


*ACORN Publishing Co., Inc., Rockville Centre, 
N. Y.: test, 19, 126, 168, 214-5, 310, 333, 421, 488, 
495, 408, 514, 525, 590, 504, 600, 623, 638, 609 

Agricultural Experiment Station, Oklahoma Agricul- 
tural and Mechanical College, Stillwater, Okla.: bk, 
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George Allen & Unwin Ltd., Ruskin House, 40 Mu- 
seum St., London W.C.1, England: bk, 953, 1126 
American Academy of Ophthalmology and Otolaryn- 
gology, 100 First Ave. Bldg., Rochester, Minn.: bk, 
I 


*American Association of Universit Women, 1634 
First St, N.W., Washington 6, D. Č.: test, 654 

American Book Co., 88 Lexington Ave, New York 
16, N. Y.: bk 934, 1091, 1166 

American Council on Education, 744 Jackson Place, 
Washington 6, D. C.: bk, 718-24, 835, 1206, 1218. 
See also Cooperative Test Service, and National 
Committee on Teacher Examinations. 

American Foundation for the Blind, Inc., 15 West 16th 
St, New York 11, N. Y.: bk, 931 

American Hearing Society [formerly American So- 
ciety for the Hard of Hearing], 817 14th St., N.W., 
Washington 5, D. C. : bk, 892 

American. Management Association, 330 West 42nd 
St., New York 18, N. Y.: bk, 725-32, 842 

*American Optical Co., Southbridge, Mass.: test, 460, 
470, 473 

American Optometric Association, s18 Wilmac Bldg., 
Minneapolis, Minn.: bk, 733 

*American Orthopsychiatric Association, 1790 Broad- 
way, New York ro, N. Y.: test, 108; bk, 109, 754 

American ee Association, Inc., 1515 Massa- 
chusetts Ave., N.W., Washington 5, D. C.: bk, 42, 
738, 772, 784, 786-7, 802, 868-9, 908, 920-1, 985, 995, 
908, 1007, 1013, 1016, 1024, IIII, 1144, 1178, 1184, 
1195-6, 1200, 1220, 1235, 1242, 1252 

American Society for the Hard of Hearing. See Amer- 
ican Hearing Society. 

*American Transit Association, 292 Madison Ave., 
New York 17, N. Y.: test, 606; bk, 978 

Anatolia Press, 200 West 54th St, New York 19, 
N. Y.: bk, rors 

Antioch Press, Yellow Springs, Ohio: bk, 807 


D. Appleton-Century Co., Inc. See D. Appleton-Cen- 
tury-Crofts, Inc. 

D. Appleton-Century-Crofts, Inc. [formerly D. Apple- 
ton-Century Co., Inc. and F. S. Crofts & Co.], 35 
West 32nd St, New York 1, N. Y.: bk, 769, 782, 
962, 983, 1003-4, 1208 - 

Archives of Psychology. See American Psychological 
Association, Ínc. 

Arco Publishing Co., 480 Lexington Ave, New York 
17, N. Y.: bk, 709 

Association for Childhood Education, 1201 16th St., 
N.W., Washington 6, D. C.: bk, 741 


BAILLIERE, Tindall and Cox, 7-8 Henrietta St., 
Covent Garden, London W.C.2, England: bk, 299- 
300, 1231-2 

*Andrew H. Baird, Instrument Maker, 33-39 Lothian 
St., Edinburgh 1, Scotland: test, 290 

*Banks Upshaw & Co., 707 Browder St., Dallas 1, 
Tex.: test, 213 

Barbados, B. W. I. See Department of Education, 
Barbados. 

A. S. Barnes & Co., Inc., 67 West 44th St., New York 
18, N. Y.: bk, 1149 

Barnes & Noble, Inc., 105 Fifth Ave. New York 3, 
N. Y.: bk, 940 

*Bausch & Lomb Optical Co., Rochester 2, N. Y.: 
test, 471 

Beacon House, 101 Park Ave., New York 17, N. Y.: 
bk, 780, 957, 1033-7, 1075 

Birmingham, England. See City of Birmingham Edu- 
cation Group. 

^ Bliss, Jr., P. O. Box 267, Brooklyn, N. Y.: 

Bureau of Business Research, Northeastern Univer- 
sity, Boston 15, Mass.: bk, 977 

*Bureau of Cooperative Research and Field Service, 
School of Education, Indiana University, Blooming- 
ton, Ind.: test, 424 

*Bureau of Educational Measurements, Kansas State 
Teachers College of Emporia, Emporia, Kan.: test, 
7, 40, 92, 112, 139, 149, 153, 164 174, a, m =~ 
315, 326, 351, 380-2, 385, 387, ] ] ; 576, 599, 
601, 610-1, 618-9 5 se 


986 


987 | 


Educational Research, College of Educa- 
aie Oey of Alabama, University, Ala.: bk, 


II ^ 
Bur of Educational Research and Service, M. edical 
College of Virginia, Richmond, Va.: bk, 948 
*Bureau of Educational Research and Service, State 
University of Iowa, Iowa City, Iowa: test, II, 115, 
131, 146, 172, 178, 189, 212, 308-9, 327, 360, 547, 566-7, 
8; 


IRR of Guidance and Records, University of Wis- 
consin, Madison, Wis.: bk, 798, 888 

Bureau of Personnel Research. See Surveys, Inc. 

Bureau of Publications, Extension Division, Univer- 
sity of Colorado, Boulder, Colo.: bk, 1098 3 

*Bureau of Publications, Teachers College, Columbia 
University, New York 27, N. Y.: test, 53, 82a, 98, 
102, 165, 242-3, 256, 419, 484-7, 506, 509-10, 516, 637 ; 
bk, 715, 780, 815, 845, 887, 890, 907, 919, 927, 938, 
9 963, 965, 970, 980-1, 997, 1006, 1030, 1083, 1170, 
1228, 1240 T 

Bureau of Reference, Research and Statistics, Division 
of Tests and Measurements, New York Public 
Schools, 110 Livingston St, Brooklyn 2, N. Y.: 
bk, 1066-8 

Bureau of School Service, College of Education, Uni- 
versity of Kentucky, Lexington, Ky.: bk, 777 


CALIFORNIA Institute of Technology. See Indus- 
trial Relations Section, California Institute of Tech- 
nology. 

California Personnel. Management Association, 442 
Flood Bldg., San Francisco 2, Calif.: bk, 1133 

*California Test Bureau, 5916 Hollywood Blvd., Los 
Angeles 28, Calif.; 110 South Dickinson St., Madi- 
son 3, Wis.; 206 Bridge St., New Cumberland, Pa. : 
test, 15-6, 26, 57, 59, 157, 222-3, 345, 347, a 479, 
490, 517, 536, 631, 643, 661, 674, 680-2, 690; bk, 1134 

Cambridge University Press, Bentley House, 200 
Euston Road, London N.W.1, England: bk, 776, 
778, 785, 804, 1169, 1188-9 

Carnegie Foundation for the Advancement of Teach- 
ing, 552 Fifth Aye., New York 18, N. Y.: bk, 708 

*Catholic University of America Press, 620 Michigan 
Ave, N.E., Washington 17, D. C.: test, 285; bk, 
775, 848, 881, 880, 946, 968, 979, roo, 1009, 1040, 
1073, 1084, 1125, 1153 

Catholic University Press, Peiping, China: bk, 1221-3 

XCenter for Psychological Service, George Washing- 
ton University, Washington 6, D. C.: test, 52, 96, 
101, 170, 238, 400, 405, 699, 705 

*Center for Safety Education, Division of General 
XR TUE S Hades pui , 8 Fifth Ave., 

ew York 11, N. Y.: test, ; bk, 712, 1155 

*E. D. Chapple Co., Inc., 230 Coats St., Boston 10, 
Mass.: test, 688 

City of Birmingham Education Committee, Birming- 
ham, England: bk, 714 

Civil Service Assembly of the United States and Can- 
ada, 1313 East 6oth St., Chicago 37. Ill.: bk, Sor 

Clarke, Irwin & Co., Lid., 480-486 University Ave., 

* Toronto 2, Canada: bk, 870, 878. 1146 

College Book Co., 1836 North High St., Columbus, 
Ohio: test, 460, 474 

College Entrance Examination Board, Box 592, 
Princeton, N. J.: bk, 808-14, 1076 
olumbia University. See Bureau of Publications. 
Teachers College; Columbia University Press; and 

ans o Press. 
olumbia University Press, 29060 Broadway, New York 
27, N. Y.: bk, 1158 i Á 
ommittee on the Teaching of Botany in American 
Colleges and Universities, c/o Ernest L. Stover. 
Em Illinois State College, Charleston, Ill.: bk, 
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Evans Brothers 


Commonwealth Fund, 41 East 57th St, New York 22, 


. Y.: bk, 740 ae rie 
Connecticut. ‘See Division of Administration. 
*Cooperative Book Store, Corvallis, Ore. : test, 1 
Cooperative Test Division. See Cooperative Test Serv- 


ice. 4 

Cooperative Test Service. Now the Cooperative Test 
Division, Educational Testing Service, 15 Amster- 
dam Ave., New York 23, N. Y.: test, 2-5, 20, 120-2, 
129-30, 141-2, 144-5, 160, 180-5, 190-3, 199-205, 208-10, 
303-5, 307, 316, 321-5, 343, 354-5, 357-9, 303-4, 371, 
373-8, 401, 407-16, 439-40, 497, 526-9, 535, 548, 550-4, 
550-9, 561-5, 570-1, 573-4, 578-86, 591-2, 596-7, 604-9, 
614, 616-7; bk, 819-20, 1137 

*Cowan Research Project, Salina, Kan.: test, 30 

F. S. Crofts & Co. See D. Appleton-Century-Crofts, 


Inc. 
Current Book Co. Now A. A. Wyn, Inc, 23 West 
47th St., New York 19, N. Y.: bk, 706 


DARTNELL Corporation, 4660 Ravenswood Ave., 
Chicago 40, Ill. : bk, 829-31 
XH por Press, 105 East 24th St, New York 10, 


eren 1025 

J. M. Dent & Sons, Ltd., Aldine House, 10-13 Bedford 
St., Strand, London W.C.2, England: bk, 954 

Dre of Education, Barbados, British West In- 

ies: bk, 930 
Department of Educational Research, Ontario College 
of Education, University of Toronto, Toronto 5, 
tario, Canada: bk, 952, 1142. See also Vocational 
Guidance Centre. 
Department of Public Instruction, State of Indiana, 
ndianapolis, Ind. : bk, 849-50, 884 

Division of Administration, Connecticut State Depart- 
ment of Education, Hartford, Conn.: bk, 824 

*Division of Applied Psychology, Purdue University, 
Lafayette, Ind.: test, 314, 

*Division of Educational Re erence, Purdue Univer- 
sity, Lafayette, Ind.: test, 46, 134, 137, 143; bk, 791, 
832, 920, 966, 1207, 1238 

Doubleday & Co., Inc., 14 West 49th St, New York 
20, N. Y.: bk, 779 

y M. Drake, Kent State University, Kent, Ohio: 


k, 843 
Dryden Press, Inc., 386 Fourth Ave, New York 16, 
N. Y.: bk, 1064, 1194 
*Israel Dvorine, 2328 Eutaw Place, Baltimore 17, Md.: 
test, 461a, 462 


*EDUCATIONAL Records Bureau, 437 West soth 
St., New York 19, N. Y.: test, 154-5, 233, 336, 481, 
541, 622; bk, 852-67, 1197, 1199, 1201-2 

*Educational Test Bureau: Educational Test Publish- 
ers, Inc, 720 Washington Ave., S.E., Minneapolis 
14, Minn.; 3433 Walnut St, Philadelphia 4, Pa.; 
2106 Pierce Ave., Nashville, Tenn. : test, 6, 107, 128, 
133, 140, 232, 236, 251, 286, 320, 328, 403. 448, 472, 
$0, 520, 522, 530-2, 538, 624, 641, 655, 663-5, 684; 

Educational Test Publishers, Inc. See Educational 
Test Bureau. 

*Educational Testing Service, 2% Chambers St., 
Princeton, N. J.: test, 217-8. See also College En- 
trance Examination Board, Cooperative Test Serv- 
ice, Graduate Record Office, and National Commit- 
tee on Teacher Examinations. 

Edwards Brothers, Inc., 300 John St, Ann Arbor, 
Mich.: bk, 1151 

*Enaineers Northwest, 100 Metropolitan Life Bldg., 
Minneapolis I, Minn. : fest, 651 

Essex Education Committee, Chelmsford, Essex, Eng- 

end. » 872 
vans Brothers Lid. Montague House, Russell 
Square, London W.C.1, England: test, 235 


Family 
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FAMILY Service Association of America. See Fam- 
ily Welfare Association of America. 2 
Family Welfare Association of America. Now Family 
Service Association of America, 122 East 22nd St., 
New York 1o, N. Y.: bk, 893 

*Carl Fischer, Inc., 56 Cooper Square, New York 3, 
N. Y.: test, 176 4 

*Foster & Stewart Publishing Corporation, 210 Elli- 
cott St., Buffalo 3, N. Y.: test, 477 

Wilfred Funk, Inc., 227 East 44th St., New York 17, 
N. Y.: bk, 1210 


GEORGIA University Press, Athens, Ga.: bk, 752 

Sterling K. Gernet, Pacific Union College, Angwen, 
Calif.: bk, 897 

*Robert Gibson & Sons (Glasgow), Ltd., 45 Queen 
St., Glasgow C.1, Scotland: test, 123, 227, 257, 340 

*Ginn & Co., Statler Bldg., Boston 17, Mass.: test, 
163 : 

Government Printing Office, Washington 25, D. C.: 
bk, 765, 793, 816, 818, 827, 836, 839, 847, 877, 882, 905, 
917, 939, 959, 967, 982, 984, 1029, 1135, 1148, 1150, 
1187, 1211-3, 1224-7, 1239 

Graduate Record Office, 15 Amsterdam Ave, New 
York 23, N. Y.: bk, 1215-7 CHA 

Graduate School of Education, Harvard University, 
Cambridge, Mass.: bk, 834 

*Graybar Electric Co., 420 Lexington Ave., New York 
17, N. Y.: test, 475 

XC. A. Gregory Co., 345 Calhoun St., Cincinnati, Ohio : 
test, 349, 483 

*Grune & Stratton, Inc., 381 Fourth Ave., New York 
16, N. Y.: test, 73, 100; bk, 75-9, 81, 90-1, 104a, 
750-1, 770-1, 924, 1127, 1192, 1245, 1254 

Gry, oe e 220 Montgomery St., Highland Park, 

+1 bk, 7| 


XE. M. HALE & Co., 320 South Borstow St., Eau 

Claire, Wis.: test, 22, 34, 138, 330, 337-8, 362, 478, 
55, 569, 577, 580 

*Hamish Hamilton Ltd., 90 Great Russell St., London 
W.C.1, England: fest, 278; bk, 790, 900. See also 
Hamish Hamilton Medical Books. 

Hamish Hamilton Medical Books, 90 Great Russell 
St, London W.C.1, England: bk, 1183, 1186. See 
also Hamish Hamilton Ltd. 

*Harper & Brothers, 49 East 33rd St., New York 16, 
N. Y.; 90 Great Russell St., London W.C.r, Eng- 
land: test, 276; bk, 270, 766, 898, 901-3, 918, 999. 
1112-3, 1159. 1200, 1230, 1244 

*George G. Harrap & Co. Ltd., 182 High Holborn, 
London W.C.1, England: test, 93, 220, 203, 352; 
bk, 84:5; 796, 973-4, 1012 

Harvard University. See Graduate School of Educa- 
tion; and Harvard University Press. 

*Harvard University Press, 38 Quincy St., Cambridge 
38, Mass.: test, 103 

William Heinemann Ltd., o9 Great Russell St, Lon- 
don W.C.r, England: bk, 1245 

Hellman, Williams & Co., 5 Beekman St., New York 
7, N. Y.: bk, 750 M 

*Robert W. Henderson, University Personnel Service, 
University of Kentucky, Lexington, Ky.: fest, 113, 


252 
His Majesty's Stationery Office, Adastial House, 


Kingsway, London W.C.2, England: bk, ort 
Paul B. Hoeber, Inc., 49 East 33rd St, New York 
16, N. Y.: bk, 277-8, , 1093-4 


Henry Holt and Co., 257 Fourth Ave., New York 10, 
N. Y.: bk, 871 `% 

*Houghton Mifflin Co., 2 Park St., Boston 7, Mass. ; 
432 Fourth Ave., New York 16, N. Y.; 2500 Prairie 
Ave., Chicago 16, Ill.; 500 Howard St, San Fran- 


cisco 5, Calif.; 715 Browder St, Dallas, Tex.; 39 
Harris St, Atlanta 3, Ga.: test, 10, 23, 99, 116-7, 
, 224, 292-3, 334, 339, 492, 494, 501, 519, 545; 

bk, 1012, 1126, 1241 

Hans Huber, Verlag, Berne, Switzerland: bk, 90-1, 
1127, 12 

Human Engineering Laboratory, Inc., 347 Beacon 
St. Boston 16, Mass.: bk, 737, 739, 744-5, 753, 760, 
767, 846, 876, 879, 886, 896, 922, 945, 958, 971-2, 
987-93, 1044, 1065, 1077-81, 1096-7, 1099, 1115-7, 1120, 
1145, 1203 

*Humm Personnel Service, 1219 West 12th St., Los 
Angeles 15, Calif.: test, 48 

Bruce Humphries, Inc., 30 Winchester St., Boston 16, 
Mass.: bk, 1017 


INDIANA, See Department of Public Instruction. 

Indiana University Bookstore, Box 46, Bloomington, 
Ind.: bk, 773-4, 883, 950-1, 1160-2. See also Bureau 
of Cooperative Research and Service. , 

Industrial Relations Section, California Institute of 
Technology, Pasadena, Calif. : bk, 1250 

Industrial Relations Section, Department of Econom- 
ics and Social Institutions, Princeton University, 
Princeton, N. J.: bk, 756-7 

*Institute of Applied Art, Ltd., Institute Bldg., Ed- 
monton, Alberta, Canada: test, 162, 268 

*Institute of Living, 200 Retreat Ave., Hartford 2, 
Conn.: test, 95 

*International Business Machines Corporation, 590 
Madison Ave., New York 22, N. Y.: test, 307 

*International Textbook Co., Scranton 9, Pa.: test, 
173; bk, 1072 


*JOSEPH JASTAK, Delaware State Hospital, Farn- 
hurst, Del.: test, 21 

*Jewish Education Committee of New York, Inc., 
1776 Broadway, New York 19, N. Y.: test, 105-8, 
452a, 453-4 

Journal Press, 2 Commercial St., Provincetown, Mass. : 
bk, 104, 797, 803, 935, 1045, 1087 

*Clifford E. Jurgensen, Minneapolis Gas Light Co., 
Minneapolis, Minn.: test, 58 


KANSAS State Teachers College of Emporia, Em- 
poria, Kan.: bk, 783, 1019. See also Bureau of Edu- 
cational Measurements. 

Kentucky. See Bureau of School Service; and Ken- 
tticky Cooperative Testing Service. 

*Kentucky Cooperative Testing Service, University of 
Kentucky, Lexington 29, Ky.: test, 132, 234, 311 

GHEE View Co., Meadville, Pa.: test, 467; bk, 
701-3 

Re Frederick Kilander, 25 North Edison St., 
Arl ington, Va. : test, 425 

King’s Crown Press, 2960 Broadway, New York 27, 
N. Y.: bk, 986, 1147 

Alfred A. K: opt Inc., 501 Madison Ave., New York 
22, N. Y.: bk, 1243 


*LAFAYETTE Printing Co., 511-517 Ferry St., La- 


fayette, Ind.: test, 496 
*Lakue Printing Co., 906 Baltimore Ave, Kansas 


SAT Mo.: test, 65 

*H. K. Lewis & Co. Ltd., 136 Gower St, London 
.W.C.r, England: fest, 29, 239, 258; bk, 1107 

Life Office Management Association, 110 East 42nd 
St, New York 17, N. Y.: bk, 994 

Longmans, Green and Co., Inc., $5 Fifth Ave, New 
York 3, N. Y.: bk, 913-5, 956, 1043 

*Loyola University Press, 3441 North Ashland Ave., 
Chicago 13, Ill.: test, 455 

Lynchburg State Colony. See Steward's Office. 
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Prentice-Hall 


*McGRAW-HILL Book Co., Inc, 333 West 42nd 
St., New York 18, N. Y.: test, 24; bk, 743, 781, 841, 
1008, 1032, 1038, 1118, 1128, 1163 

McGraw-Hill Publishing Co., Ltd., Aldwych House, 
Aldwych, London W.C.2, England: bk, 781, 841, 
1032, 1038, 1118 

*McKnight & McKnight, 109-111 West Market St, 
Bloomington, Ill.: test, 593, 599, 635, 695 

Macmillan & Co. Ltd., 10 St. Martin's St., Leicester 
Square, London W.C.2, England: bk, 33, 904, 928, 


1000, 1089 

*Macmillan Co., 60 Fifth Ave., New York 11, N. Y.: 
test, 32, 36, 276; bk, 33, 37, 280, 746, 778, 785, 895, 
904, 926, 928, 1000, io, Hi 

Josiah Macy, Jr. Foundation, 563 Park Ave, New 
York 21, Ñ. Y.: bk, 1105-6 

*Management Service Co., 3136 North 24th St., Phila- 
delphia, Pa.: fest, 51, 221, 389, 625, 644, 656-7, 673, 
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* Marietta A ei Co., Marietta, Ohio: fest, 276 

*Walter W. Marseille, 2145 California St., San Fran- 
cisco r5, Calif.: test, 73 

Paul Maslow, 16 Court St, Brooklyn 2, N. Y.: bk, 
86-8, 1020-2 

Medical College of Virginia. See Bureau of Educa- 
tional Research and Service, Medical College of 
Virginia. 

Medical Research Department, United States Subma- 
rine Base, New London, Conn.: bk, 1174-6 

Melbourne University Press, Swanson St., Carlton, 
Melbourne N.3, Victoria, Australia: bk, 894, 1085 

BPEL. ui ellenbruch, 253 Rodes Ave., Lexington, Ky.: 
test, 662 

*Methuen & Co. Ltd., 36 Essex St., Strand, London 

W.C2, England: test, 283; bk, 279, 891, 901, 1082, 
1214 

Metropolitan Life Insurance Co., 1 Madison Ave., 
New York 10, N. Y.: bk, 1028 

C. V. Mosby Co., 3207 Washington Blvd., St. Louis 3. 
Mo.: bk, 821-2 

*Munsell Color Co. Inc., 10 East Franklin St., Balti- 
more, Md.: test, 465 

John Murray, 50a Albemarle St., London W.1, Eng- 
land: bk, 1179 


NATIONAL Association of Teachers of Educational 
Measurements, Fairmont State College, Fairmont, 
W. Va.: bk, 1046. See also National Council on 
Measurements Used in Education. 

National Committee on Teacher Examinations, 15 
RE Ave, New York 23, N. Y.: bk, 1136, 
1138-9 

National Council for the Social Studies. National 
Education Association, 1201 16th St., N.W., Wash- 
ington 6, D. C.: bk, 734-6, 1039 

National Council on Measurements Used in Educa- 
tion, Fairmont State College, Fairmont, W. Va.: 
bk, 1047-9. See also National Association of Teach- 
ers of Educational Measurements. 

National Education Association, 1201 16th St.. N.W.. 
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National Institute of Vocational Research. See Sur- 
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379, 384, 391, 394, 396 ‘ 

National Opinion Research Center, Interview Depart- 
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16, N. Y.: bk, 912 £ 
*Offce of Educational Research, Rochester Institute 
of Technology, Rochester 8, N. Y.: test, 171 
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Education, Columbus 15, Ohio: test, 14, 63, 125, 127, 
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bk, 1026-7. Oxford University Press, 114 Fifth Ave., 
New York 11, N. Y.: bk, 1042, 1158 
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School of Education, University of Michigan, Ann 
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School of Education, University of South Dakota, 
Vermillion, S. D.: bk, 748 
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Cincinnati 2, Ohio: bk, Bas, 933 

*George Spache, Greeley School, Chappaqua, N. Y.: 
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441; bk, 1114 

State University of Iowa. See Bureau of Educational 
Research and Service; and University of Iowa 


ress. 

*Steck Co., Austin, Tex.: test, 9, 50, 

H. C. Steinmetz, 3518 Union St., San 
bk, 1167 

*Verne Steward & Associates, P.O. Box 226, South 
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*Syracuse University Press, 920 Irving Ave., Syracuse 
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796, 973-5, 1164 
A. A. Wyn, Inc. See Current Book Co. 
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Application of Measurement to Physical Education 
Programs in Secondary Schools, 806 

Application of Psychological Tests to_the Selection, 
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Appraisal of Anecdotal Records, 919 

Appraisal of Growth in Reading, 1066 

Appraisal of the Social Development of Children at a 
Summer Camp, 1074 2 

Appraising and Recording Student Progress, 1159 

Aptitude Test for Nursing: George Washington Uni- 
versity Series of Nursing Tests, 

Aptitudes and the Languages, 1077 

Arithmetic Reasoning, 331 

Arithmetic Test for Prospective Nurses: George 
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Basic Skills, Test D, New Edition, 10, 334 

Basic Employment Test, 221 

Basic Interest Questionnaire: 
Vocation or Avocation, 633 

Basic Language Skills: Iowa Every-Pupil Tests of 
Basic Skills, Test C, New Edition, 10, 116 

Basic Skills in Arithmetic Test, 335 

Basis of Pertinence: A Study of the Test Performance 
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Psychological Aids in the Selection of Workers, 728 

UU Approaches to the Biography of Genius, 
IL 


Psychological Corporation General Clerical Test, 630 

Esychological Program in AAF Convalescent Hospi- 
tals, 765 

Psychological Rescarch in the Theaters of War, 984 

Psychological Research on Bombardier Training, 967 

Psychological Research on Equipment Design, 877 

Psychological Research on Flexible Gunnery Training, 


939 

Psychological Research on Navigator Training, 793 

Psychological Research on Operational Training in 
the Continental Air Forces, 818 

Psychological Research on Pilot Training, 1029 

Psychological Research on Problems of Redistribu- 
tion, 1239 

UNIO Research on Radar Observer Training. 
H 

Psychological Testing, 1043 

Psychological Tests and Their Uses, 1056-8 

Psychology for Business and Industry, Second Edi- 
tion, 1032 


1003 ] INDEX OF TITLES 


Selected. Items 


Psychology of Early Growth: Including Norms of 
Infant Behavior and a Method of Genetic Analy- 
sis, 280, 904 2 

Psychology of Human Differences, 1208 

Psychology of the Interview, 1082 

Public School Attainment Tests for High School En- 
trance: Examinations of Abilities in Reading, Eng- 
lish, and Mathematics, 17 

pupil Personnel, Guidance, and Counseling, 1059 
upils; Parents’ and Teachers’ Attitudes—Similari- 
ties and Differences, 1238 

Purdue Industrial Mathematics Test: Purdue Person- 
nel Tests, 314 

Purdue Industrial Training Classification Test, 675 

Purdue Mechanical Adaptability Test, 676 

Purdue Pegboard, 666 

Purdue Personnel Tests, 314 

Purdue Reading Test, 496 

Purdue Teacher’s Examination: How I Teach, 403 

Purdue Test for Electricians, 701 

Purdue Test for Machinists and Machine Operators, 


702 
Put Yourself To The Test: A Manual of Vocational 
,Self-Guidance, 1230 


QUALITATIVE Performance in the Maze Test, 
1090 

Questionnaire on Emotional Adjustments: Informa- 
tional Blank EA, Provisional Form, 50 


RASMUSSEN General Mathematics Test, 315 

Rating and Training Executives and Employees, 729 

Rating Form for Use of Interviewers and Oral Ex- 
aminers, 691 

Rating of Supervisors, 1250 

Reading and Interpretation of Literature and Literary 
Acquaintance: Examination in English : High School 
Level, Book I, 130 

Reading and Literary Acquaintance; Examination in 
English: College Level, Book I, 129 

Reading Aptitude Tests, 51 

Reading Comprehension: E English Test: 
Lower and Higher Levels, Tests Cr and C2, 120, 


497 

Reading Comprehension Test for Prospective Nurses: 
George Washington University Series of Nursing 
Tests, 699 

Reading Comprehension Test: National Achievement 
Tests, 498 

Reading Readiness Test, 520 

Reading Scales in History, 530 

Reading Scales in Literature, 531 

Reading Scales in Science, 532 

Reading Speed and Comprehension : Ohio Senior Sur- 
vey Tests, 524 r 

Reading Speed Test: National Achievement Test, 525 

Record for Reading Diagnosis, 512 

Records, Analysis, and Test Procedures, 839 

Records and Reports: Trends in Making and Using 
Them, 741 

Recreation Inquiry, 70 

Relation of Pupil Achievement Gain to Certain Per- 
sonal and Environmental Elements, 

Relation of Wiggly Block to Black Cube, 993 

Relationship Between Certain Psychological Tests and 
Shorthand Achievement, 1083 

Relationship Between Content of an Adult Intelligence 
Test and Intelligence Test Score as a Function of 


ge, 980 
Relationship Between Tonal Memory and Pitch Dis- 
crimination, oor 
Relationship Between Variations in Silent Reading 
Ability and Mental Ability, 711 


Relationship of Vocabulary and Wiggly Block Scores 
of Seven Test Administrators, 992 

Reliability of Mental Tests, 875 

Religion Essentials Test, 455 x k . 

Remedial Reading: The Diagnosis and Correction of 
Reading Difficulties at the College Level, 1204 

Renfrow Survey Tests of Mathematical Skills and 
Concepts, 349 = - 

Report of a Subcommittee on the Selection of Chil- ^ 

ren for Secondary Education, 872 

Report on Defective Colour Vision in Industry, 1086 

Report on the First Six Tests in English Composition: 
With Sample Answers From the Tests of April 
and June 1944, 1076 

Report on the University of Alabama Freshman Test- 
ing Program 1927-1941, 1157 

Reporting to Parents, 1170 

Research Problems and Techniques, 1187 

Revised Alpha Examination, Forms 5 and 7, 220. 

Revised Alpha Examination Form 6, Short Form, 220 

Revised Beta Examination, 259 > 

Revised Minnesota Paper Formboard Test, 677 

Revised Stanford-Binet Scale, 292 

Revision of Number Reasoning Wks. 436 B to C, 


1097 j 
Revision of the Babcock Examination for Measuring 
Efficiency of Mental Functioning, 71 
Revision of the Seashore Measures of Musical Talent, 


1140 P 

Revision of the Stanford-Binet Scale: An Analysis 
of the Standardization Data, 293, 1012 2 

Revision of 294V to 294V A New Order of Difficulty, 
IIIS 

Revision of Verbal Inductive Test (273 B to 273 BA) 
With Indices jf all Items, 1044 

Rigg Poetry Judgment Test, 146 

Rorschach, 73 3 

Rorschach Method of Personality Diagnosis, 73 

Rorschach Psychology, 87, 1021 

Rorschach Responses of Successful and Unsuccess- 
ful Students, 1016 f 

Rorschach Scoring Samples: Compiled From Various 
Sources for Private Circulation, [Revised Edition], 

1110 

Rorschach Technique: A Manual for a Projective 
Method of Personality Diagnosis, 84-5, 973-4 

Rorschach Technique: 1946 Supplement, 83, 975 

Rorschach Theory, 88, 1022 

Rorschach's Test: I, Basic Processes, 75, 77, 750 

Rorschach's Test: II, A Variety of Personality Pic- 
tures, 76, 77, 751 


SALES Questionnaire, 703 
Scale of Nonverbal Mental Ability, 260 
Scenario on Word Straggled 95 A D Word Number 


83, 87 

Schonell Diagnostic Arithmetic Tests, 350 

Schonell Diagnostic English Tests, 135 

Schonell Reading Tests, 499 

Schrammel-Gorbutt Personality Adjustment Scale, 92 

Schrammel-Gray High School and College Reading 
Test, 500 

Schrammel-Otterstrom Arithmetic Test, 351 

Scientific Selection of Salesmen, 1128 

Scovill Classification Test, 220 

Seashore-Bennett Stenographic Proficiency Tests: A 
Standard Recorded Stenographic Work-sample, 386 

Seashore Measures of Musical Talents, Revised Edi- 
tion, 177 

Second Study of Grip, Worksample 185 Conversion 
Factors and Norms, 1116 

Second Yearbook: National Council on Education, 
Proceedings of San Francisco Meeting. 1042, 1047 

Selected Items for the Testing of Study Skills, 1039 


Selected Test 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 1004 


Selected Test Items in American History, Revised 
Edition, 734 

Selected Test Items in American History, Second Re- 
vised Edition, 735 y 4 

Selected Test Items in World History, Revised Edi- 
tion, 736 

Selection and Development of Foremen and Workers, 


730 
Selection and Employment of Operating Personnel, 


978 
Selection for Secondary Education, 1002 , 
Selection for Secondary Education Without a Written 
Examination, 1179 S 
Selection of Students for Vocational Training, 882 
Selective Art Aptitude Test, Second Edition, 173 
Selective Vocabulary Test, 93 E 
Self-Knowledge, Through Handwriting, 954 
Self-Rating Scale for Leadership Qualifications, 692 
Ss Scale for School Custodians, 1019 
Self-Rating Scale for the Elementary Teacher, 783 
Sense of Humor Test, Second Edition, 94 
Seven Quichiy Admimstered Tests of Physical Capac- 
ity: And Their Use in Detecting Physical Incapacity 
for Motor Activity in Men and Boys, 789 
Shemwell-Whitcraft Bookkeeping Test, 387 r 
FEES English Test: Clapp-Young Self-Marking 
ests, I3! 
Shipley-Hartford Retreat Scale for Measuring Intel- 
lectual Impairment, 95 
Shipley-Institute of Living Scale for Measuring Intel- 
lectual Impairment, 95 : 
OW) Test: State High School Tests for Indiana, 
3 


Siebrecht Attitude Scale, 458 

Silent Reading Comprehension: Iowa Every-Pupil 
Tests of Basic Skills, Test A, 10, 501 

Small Parts Dexterity Test, 667 | 

Social Intelligence Test: George Washington Univer- 
sity Series, 96 

Social Personality Inventory for College Women, 97 

Social Studies Test: National Achievement Tests, 594 

Sociality in Preadolescent Boys, 927 

Sociometric Control Studies of Grouping and Re- 
grouting: With Reference to Authoritative and 

emocratic Methods of Grouping, 1035 

Sociometric Measurement of Social Configurations 
Based on Deviation From Chance, 1036 

Sociometry and the Cultural Order, 1034 

Sociometry of Leadership: Based on the Differentia- 
tion of Psychegroup and Sociogroup, 957 

Some Principles of Examining, With Aids for Con- 
sulting Examiners, 752 

Sophomore and Freshman Testing Program in the 
Accredited High Schools of Michigan, 1939, 1248 

Sophomore and Freshman Testing Program in the 
Accredited High Schools of Michigan, 1942, 1249 

Sorting Tests in Relation to Drug Therapy in Schizo- 
phrenia, 1253 

Southend Attainment Test in Mechanical Arithmetic, 


352 

Space Relations : Differential Aptitude Tests, 620 

Spanish Training: Iowa Placement Examinations, 212 

Spatial Orientation: Guilford-Zimmerman Aptitude 
Survey, Part 5, 621 

Spatial Visualization: Guilford-Zimmerman Aptitude 
Survey, Part 6, 621 

Special Place Examinations: The Report of an In- 
vestigation by a Joint Committee of the West Rid- 
ing Teachers’ Association and the West Yorkshire 
County Association of the National Union of Teach- 
ers, of the Merits and Defects of Some Forms of 
Special Place Examination, 961 

Special Reading Test: Ohio Senior Survey Tests, 539 

Specific Interest Inventory, 634 


Specimen Questions from U. S. Civil Service Exami- 
nations, 1211 

Speech Attitude Scale, 151 

Speech Experience Inventory, 152 

Square Pegs in Square Holes, 779 

SRA Clerical Aptitudes, 388a 

SRA Dictation Skills, 388b 

SRA Lan; e Skills, 388c 

SRA Mechanical Aptitudes, 677a 

SRA Non-Verbal Classification Form, 261 G 

SRA Primary Mental Abilities, 225 

SRA Reading Record, 502 LN 

SRA Tests of Primary Mental Abilities for Ages 5 


and 6, 264 

SRA Typing Skills, 388d 

SRA Verbal Classification Form, 262 

Standard Error on Worksample No. Three, 767 

Standard Examination for Transit Employees: 
{American Transit Association Tests], 696 

Standard Profile Chart, 448 

Standardization of Two Intelligence Tests, 1110 

Standardized Tests and Educational Practice, 768 

Stanford Achievement Test, 18, 503, 595 

Stanford Achievement Test [Reading], 503 

Stanford Achievement Test: Social Studies Tests, 505 

Stanford-Binet Scale, Revised, 202 

Stanford Educational Aptitude Test, 404 

State High School Tests for Indiana, 134, 137, 143, 
365-7, 370, 388, 395, 420, 427-37, 441 . g 

Statistical Study of Certain Diagnostic Personality 
Traits of College Men, 889 

Statistical Study of Leadership. Among College 
Women, 848 

Stencil Design Test, 295 

Stenogauge, 389 

Stenographic Aptitude Test, 390 

Stenographic Test: United-NOMA Business Entrance 
"Tests, 391 

Stenquist Mechanical Aptitude Tests, 678 

Stenquist Mechanical Assembling Test, 679 

Stevens Reading Readiness Test, 521 

Steward Selection System, 704 

Stone-Webster Test in Beginning Reading, 504 

[Strong] Vocational Interest Blank for Men, Revised, 


647 

[Strong] Vocational Interest Blank for Women, Re- 
vised, 649 

Structural Visualization, 1079 

Student Exercises in Measurement and Evaluation for 
Education and Guidance, 1113 

Student Personalities as Instructors See Them, 791 

Student Questionnaire, 98 

Studies From the Center for Research in Child Health 
and Development, School of Public Health. Harvard 
University: III, The Growth of Bone, Muscle and 
Overlying Tissues as Revealed by Studies of Roent- 
genograms of the Leg Area: Description of a 
Method of Evaluating These Tissues, Standards 
Derived From a Series of Normal Children Exam- 
ined Periodically From Birth to Seven Years and 
Examples of the Growth of Individual Children, 


1172 

Studies on the Reliability of Tests, 952 

Study Guide on Evaluation: Suggestions for Faculty 
Committees and Other Groups Studying Evaluation 
in Nursing, 1010 

Study-Habits Inventory, Revised Edition, 1941. 540 

Study in Factor Analysis: The Reliability of Bi-Fac- 
tors and Their Relation to Other Measures, 1180 

Study in Judging Opinions of Groups, 1195 

Study of Abstract Imagination Worksample 329 and 
Revision from Form A to Form B, 753 

Study of Critical Reading Comprehension in the In- 
termediate Grades, 890 


100] INDEX OF TITLES Š 


Trusler-Arnett 


Study of Factors Influencing the General Development 
of the Child During Pre-School Years by Means.of 
Record Forms, 776 

Study of Preadolescents by Means of a Personality 
Inventory, 1153 

Study of Proportion Appraisal Worksample 235 
Form B, 1099 

Study of the English Usage, Spelling, and Vocabulary 
of ee Graduate Students at Indiana University, 

* 1162 

Study of the Intercorrelations of Personality Traits 
Among a Group of Novices in Religious Communi- 
ties, 1084 

Study of Values: A Scale for Measuring the Domi- 
nant Interests of Personality, 99 

Study of Visual Imagination Worksample 307, Forms 
AA and AB Leading to Form AC, 745 

Study of Worksample 268 as a Preliminary Step To- 
ward the Development of New Age Norms, 739 

Studying Children: Diagnostic and Remedial Proce- 
dures in Teaching, 1194 

Studying Children in School, Second Edition, 743 

Studying the Individual School Child: A Manual of 
Guidance, 871 

Siogettionr for the Use of New-Type Tests in India, 
102 

Suggestions for the Use of New-Type Tests in India, 
Second Edition, 1027 

Summary of Manual and Mechanical Ability Tests, 
Preliminary Form, 758 

Supplementary Guide for Scoring the Revised Stan- 
ford-Binet Intelligence Scale, Form L, 1068 

Supplementary Guide for the Revised Stanford- 
Binet Scale (Form L), 294, 1088 

Survey of Abilities of High School Seniors in 
Twenty-One Counties of Indiana, 849 

Survey of Company Morale: Job Satisfaction Blank 
No. 12, 693 

Survey of Mechanical Insight, 680 

Survey of Object Visualization, 681 

Survey of Space Relations Ability, 682 

Survey of Study Habits, Experimental Edition, 541 

Survey of the Language Arts Achievement of Sixth 
Grade Children in 18 Counties and Six Cities in In- 
diana, 850 

Survey of Working Speed and Accuracy, 631 

Survey Test in American Literature, 147 

Survey Test in Elementary Algebra, 328 

Survey Test in English Literature, 148 

Survey Test in United States History, 613 

Survey Test of Vocabulary, 167 

Syllabus of Audiometric Procedures in the Adminis- 
tration of a Program for the Conservation of Hear- 
ing of School Children, 1069 

Systematic Approach to the Construction and Evalu- 
ation of Tests of Ability, 995 

Szondi Test, 100 


TATE Economic Geography Test, 601 

Teaching and Testing English, 747 

Teaching Aptitude Test: George Washington Uni- 
versity Series, 405 

Techniques in Reading Comprehension for Junior- 
Senior High School: Every Pupil Test, 505 

Techniques of Guidance: Tests, Records, and Counsel- 
ing in a Guidance Program, 1200 

Telling What I Do, Alpha Form: Detroit Adjustment 
Inventory, 31 

Tenth Educational Conference, New York City, Octo- 
ber 30and 31,1941: Under the Auspices of the Educa- 
tional Records Bureau, the Cooperative Test Service, 
the Committee on Measurement and Guidance of 
the American Council on Education, and the Com- 


mission on the Relation of School and College of the 
Progressive Education Association, 719 
Terman-McNemar Test of Mental Ability, an 
Terman-Merrill Intelligence Scale in Scotland, 969 
Terman Test of Mental Ability, 263 3 
Test for Ability to Sell: George Washington Univer- 
sity Series, 705 
Test of Color Blindness, 474 1 d f 
Test of Etiquette: George Washington University 
Series, 101 3 
Test of General Proficiency in the Field of Mathe- 
matics: Cooperative General Achievement Tests, 
Revised Series, Test 3, 3, 316 
Test of General Proficiency in the Feld of Natural 
Sciences: Cooperative General Achievement Tests, 
Revised Series, Test 2, 3, 548 3 a 
Test of General Proficiency in the Field of Social 
Studies: Cooperative General Achievement Tests, 
Revised Series, Test 1, 3, 596 - 
Test of Knowledge of Social Usage, 1942 Edition, 102 
Test of Mathematical Fundamentals for Grades 7-12, 


317 : 
Test of Mechanical Comprehension, 683 
Test of Practical Judgment, 694 
Test of Reading Ability for , 533 
Test of Study Skills, 542 
Test on Arithmetic Meanings and Vocabulary, 353 
Test Yourself! A Handbook of Self-Analysis Based 

on Modern Psychology, Second Edition, 759 
Test Yourself for a War Job, 1241 á 
Testing and Counseling in the High-School Guidance 

Program, 828 
Testing Results in the Infant School, 891 2 
Tests and Measurements in Business Education, ih 
Tests and Measurements in Health and Physical - 

cation, Second Edition, 1003 
Tests for High School Entrants, 19 
Tests for Primary Mental Abilities, 225 
Tests in Selecting Salesmen, 831 
Tests of General Educational Development, 20, 121-2, 

144-5, 307, 526-0 
"Tests of Human Growth and Development, 406 
Tests of Primary Mental Abilities for Ages 5 and 6, 

264 
Thematic Aperception Test, 103 
Thematic Apperception Test: The Theory and Tech- 

nique of Interpretation, 104a, 1192 
Third Study of Grip Worksample 185, 1117 
Third Yearbook of the National Council on Measure- 

ments Used in Education, 1048 
Thorndike-Lorge Reading Test for Grades 7 to 9, 506 
Three Specialized Projects in Measuring Accounting 

Aptitude, 846 
Thurstone Emplovment Tests, 632 
Thurstone Examination in Clerical Work: Thurstone 

Employment Tests, 632 
Thurstone Interest Schedule, 646 
Thurstone Scoring Board, 398 
Thurstone Test of Mental Alertness, 265 
[Thurstone] Vocational Interest Schedule, 653 
Time and the Mind: Personal Tempo—The Key to 

Normal and Pathological Mental Conditions, 72, 742 
Time Appreciation Test, 266 
Time as a Measure of Inter-Personal Relations, 1037 
Too Many Aptitude Woman, 1080 
Tools of Written English: State High School Tests 

for Indiana, 137 
[Torgerson's Inventories and Record Forms], 105 
Trait-Consistency in Personality: A Differential In- 

vestigation, 1196 
Trend of National Intelligence: The Galton Lecture, 

1946, 1186 
Trusler-Arnett Health Knowledge Test, 426 


- Turse-Durost 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 100 


Turse-Durost Shorthand Achievement Test (Gregg), 


392 

Turse Shorthand Aptitude Test, 393 K 

Twenty-Eighth Annual Conference on Educational 
Measurements: Held at Indiana University, April 
29, 1041, 950 : 

Twenty-Ninth Annual Conference on Educational 
Measurements: Held at Indiana University, April 
28, 1942; and A Study of the Achievement of Fresh- 
man Students Holding County Scholarships, 951 

Twenty Years of Merit Rating, 1926-1946: Selected, 
Annotated and Classified References From Indus- 
try, Business, Government, and Education, Experi- 
ience and Research, 1014 cM 

Two Thousand Test Items in American History, 1114 

Typewriting Test: State High School Tests for In- 
diana, 395 

Typing Test: United-NOMA Business Entrance Tests, 
394 


UNEVEN Distribution of the Effects of Specific Fac- 


tors, 941 

aie NOMS SUED Entrance Tests, 368-9, 
384, 301, 304, 39 

Unsolved Business Problems, 1081 č 

Use of Aptitude Tests in the Selection of Radio Tube 
Mounters, 1178 

Use of Library and Study Materials, 543 

Use of Personal Documents in History, Anthropology, 
and Sociology, 910 

Use of Personal Documents in Psychological Science, 


379, 


716 

Use of Test Results in Diagnosis in the Tool Subjects, 
1201 

Use of Tests and Rating Devices in the Appraisal of 
Personality, Revised Edition, 1202 

Use of the National Teacher Examinations in Colleges 
and Universities, 1139 

Utilizing Human Talent: Armed Services Selection 
and Classification Procedures, 835 


[VALENTINE] Intelligence Tests for Young Chil- 
ren, 283 

Validation Study of Worksample 307, Form AC, 945 

Value of Psychology in Selecting Salesmen, 731 

Value of Vocational Tests as Aids to Choice of Em- 
ployment, 714 

Values of Psychology in Industrial Management, 732 

Van Alstyne Picture Vocabulary Test for Preschool 
Children, 206 

Variability as a Function of Ability and Its Relation 
to Personality and Interests, 784 

Verbal Comprehension: Guilford-Zimmerman Apti- 
tude Survey, Part 1, 621 

‘ay Reasoning: Differential Aptitude Tests, 620 


V.G.C. Anecdotal Record Form, 449 
V.G.C. Cumulative Record Folder, 450 
V. Intelligence Indicator, 267 


V.G.C. Interview Record Form, 451 

V.G.C. SIRE Adjustment Indicator or Inven- 
tory, 1 

V.G.C, Student Information Form, Revised, 452 

Vineland Social Maturity Scale, 107 

Visual Acuity Measurements With Three Commercial 
Screening Devices, 1176 

Visual Motor Gestalt Test, 108 

Visual Motor Gestalt Test and Its Clinical Use, 109, 


754 
Visual Retention Test for Clinical Use, 207 
Vocabulary Test: National Achievement Tests, 168 


Vocational Aptitude Examination, Type E-A, Revised 
1946, 695 N 

Vocational Aptitude Tests for the Blind, 932 

Vocational Interest Blank for Men, Revised, 647 

Vocational Interest Blank for Women, Revised, 649 

Vocational Interest Patterns: A Developmental Study 
of a Group of College Women, 1240 

Vocational Interest Schedule, 653 

Vocational Interest Test for College Women, 654 

Vocational Interests and Job Orientation: A Tene 
Year Review, 650, - 

Vocational Interests 
toried by the Strong and Manson Blanks, 981 

Vocational Interests of Men and Women, 652, 1171 

Vocational Inventory, 655 

Vocational Personalities: An Application of the Ror- 
schach Group Method, 82a, 963 

Votaw Algebra Test, 329 


WAR-TIME Role of the College Sophomore: A Re- 


port on the 1942 Sophomore Testing Program, 819 
Washburne Social-Adjustment Inventory, Thaspic 
Edition, 110 * 
Washington, Indiana, Testing Program: An Experi- 

ment in Educational Measurement in an Average- 
Sized City School System, 884 
Watson-Glaser Tests of Critical Thinking, 544 
Wechsler-Bellevue Intelligence Scale, 298 
Wechsler Memory Scale, 302 
Weigl-Goldstein-Scheerer Color Form Sorting Test, 
I 


4 

Weitzman's Inventory of Social Behavior, 111 

Wellesley Spelling Scale, 157 

[Wesman] Personnel Classification Test, 253 

Western Electric Audiometer, 475 

What the Testing Program in the Schools of Nursing 
Has Taught Us, 948 

Whipple’s High-School and College Reading Test, 507 

Who's Who in My Group?, 63 

Wide Range Achievement Test: Reading, Spelling, 
Arithmetic From Kindergarten to College, 1946 Edi- 
tion, 2r 

Wide Range Vocabulary Test, 169 

Wiedefeld-Walther Geography Test, 602 

Williams Primary Reading Test, 508 

Willis-Smith Advanced Mental Test, 268 

Wilson Scales of Stability and Instability, 112 

Wilson Tests of Religious Aptitude, 456 ' 

Wisconsin Achievement Tests: Chemistry, 569 

Wisconsin Algebra Test, 330 

Wisconsin Biology Test, 555 

Wisconsin Composite Achievement Test, 22 

Wisconsin General Science Test, 577 

Wisconsin Geometry Test, 362 

Wisconsin Language Usage Test, 138 

Wisconsin Physics Test, 589 

Wonderlic Personnel Test, 269 

Word Dexterity Test, 170 

Work-Book in Educational Measurements and Evalu- 
ation, 913 


792 j 
of High School Girls: As Inven- | 


Work Book in Measurements of Human Behavior, E 


843 

Work Preference Inventorv, 113 

Work-Study Skills: Towa Every-Pupil Tests of Basic 
Skills Test B, New Edition, ro, sas 

World Geography Test: Dominion Tests, 603 


YALE Educational Aptitude Tests, 622 
Your Vocational Guide to the Ideal Job: Self Tests 
That Reveal Your Special Abilities, 1210 


Index of Names 


IEEE E 


References are to test and book entries. The name of each person who contributed original re- 
views to this volume appears in large and small capitals and is followed by his position and 

* address in italic type. The abbreviations and numbers following the names may be interpreted 
thus: "rev, 68" indicates an original review of test 68; "test, 73" indicates authorship of test 73; 

P “bk, 896" indicates authorship of book 896; "exc, 112" indicates an excerpted review of test (or 
book) 112; "ref, 126(34)" indicates authorship of reference 34 for test 126; "cross ref, 347” indi- 

cates that a cross reference to the name is mentioned under test (or book) 347; “other, 466" indi- 

cates that the name is mentioned in the entry for test (or book) 466 in a manner not classifiable 


under any of the above categories. 


A., P. M.: exc, 714 

Aamodt, Geneva P.: ref, 250(22) 
Abbott, E. Carlton: bk, 711 
Abbott, Walter D.: ref, 73(294), 


95(6) 

Abel, Theodora M.: ref, 73(359, 
410-2), 103(46), 271 (28) 

Abell, E. L.: exc, 35, 788; cross ref, 
175 

AnRELSON, Harop H., Associate 
Professor of Education and Di- 
rector, Educational Clinic, The 
City College of New York, New 
York, New York: rev, 30, 105 

Abercrombie, Stanley: bk, 712 

Abraham, Herbert J.: ref, 617(1) 

Abramson, Harold A.: bk, 713, ref, 
60(14-5,33) 

Abt, Lawrence Edwin: ref, 73(551) 

Achard, Francis H.: ref, 647 (119) 

Achauer, M. P.: ref, 649(21) 

Achilles, P. S.: test, 446; ref, 23 
(24), 220(49) 

Ackland, Neil William: ref, 177 


75 

y Austin J.: ref, 73(478, 
94. 

hobs Clifford R.: test, 64; ref, 


~~ 44(5), 64(2, 7) 


Adams, Eris L.: ref, 64(5a) 

Adams, F. J.: test, 50; ref, 217(75), 
292(187) 

Adams, Georgia Sachs: test, 16 

Adams, R. H.: test, 230 

Adams, William Michael: ref, 11 
(6-7), 131(14-5), 308(13, 16), 
489(17) 

Addison, Eleanor M.: ref, 217 (80) 

Adjutant General's Office: ref, 
219(5, 14) 


ApKINS, Dorotny C. Chief, Test 
Development Unit, U. S. Civil 
Service Commission, Washing- 
ton, D. C.: rev, 609, 612, 690; ref, 
99(56), 217(134), 225(17-8), 
236 (18-9), 535(13), 640(3-4) 

Adkins, Margaret M.: bh, 1143; 
ref, xo3 (35), 292 (279) 
dkins, S. D.: cross ref, 204 

Adler, Daniel Leslie: ref, 202 (166) 

Acarp, FmEDERICK B. Associate 
Professor of Modern Languages, 
Cornell University, Ithaca, New 
York: rev, 208, 211, 213 

Ahrens, Alvin W.: exc, 1228 

Aita, John A. : ref, 49(10), 73(552), 
95(21), 298(107) 

Albrecht, Ruth: ref, 302(2) 

Alden, Elizabeth : ref, 26(9) 

Aldrich, C. Knight: ref, 27(15) 

Aldrich, Cecelia G.: ref, 276(6) 

Alexander, Frances S.: ref, 298 
(108) 

Alexander, W. P.: test, 270 

Alexandra, M. : ref, 26(15) 

Allan, Mary Elizabeth: ref, 292 
(280) 

Allen, E. Patricia: bk, 714 

Allen, Frederick H.: exc, 1151 

Allen, James E.: ref, 613(4) 

Allen, Mark K.: ref, 292(240) 

Allen, Mildred M.: ref, 236(22, 


33-4, 37) 
Allen, Richard D.: test, 13, 345a, 


Alien, Robert M.: ref, ao& 109) 

en, Robert M.: ref, 208(1 

Allen, Wendell C.: bk, 715 S 

Allport, Floyd H. : test, 23 

Allport, Gordon W.: test, 23, 99; 
bk, 716; ref, 99(2-4) 


1007 


Alper, Karen: ref, 73(289) 
Alper, Thelma G.: fest, 157; ref, 


157(1) 

Alschuler, Rose H.: bk, 717 

Altable, Jose Peinado : ref, 73(553) 

Alteneder, Louise E.: ref, 647 (89, 
92), 649 (11, 16), 677 (12, 16) 

Altus, William D.: ref, 60(16), 
292 (336), 298 (59, 80) 
lusow, Frank Telespore: ref, 
150(4) À 

Amatruda, Catherine Strunk; test, 
276; bk, 276-80, 898-901, 904; ref, 
276(11, 13, 17, 20, 22) 

Amen, Elisabeth W.: ref, 103(11) 

American Council on Éducation: 
bk, 718-24 

American Management Associa- 
tion: bk, 725-32 

American Nurses' Association, Re- 
search Department: ref, 640(36) 

American Optical Company: ref, 
470(5) s s 

A un Optometric Association : 

» 733 

Amerson, Vera M. Manager, 
Walsh School of Business Sci- 
ence, Miami Beach, Florida: rev, 
369, 380 

Ames, Louise Bates: test, 276; bk, 
279, 901, 903; ref, 276(20, 23-7) 

Ames, Viola: ref, 250(57) 

Amin, D. L.: ref, 270(4) 

Amoss, Harry: test, 272 

ANASTASI, ANNE, Associate Pro- 
fessor of Psychology, Fordham 
University, New York, New 
York: rev, 216, T 265; exc, 
300, 1219; cross ref, 217, 263, 275 

Anderson, Amos C.: exc, 912, 1131 


Anderson 


THE THIRD MENTAL MEASUREMENTS YEARBOOK 


[ 1008 


Anderson, Edward E.: ref, 217 
(98), 292 (241), 208(20) 
Anderson, Elizabeth : test, 428, 434 
Anderson, Hedwin C.: ref, 647 
(138) D 
ANDERSON, Howard R., Specialist 
for Social Sciences, U. S. Office 
of Education, Washington, D. C.: 
rev, 590, 597, 607; bk, 734-6; exc, 
1159, 1206 
ANDERSON, Irvine H., Professor of 
Education, University of Michi- 
gan, Ann Arbor, Michigan: rev, 
514, 518-9, 521; ref, 163(3), 470 
(12-3, 17), 489(10), 507(3); 
cross ref, 467 
ANDERSON, JAMES M., Instructor in 
Clinical Psychology, University 
of Rochester, Rochester, New 
ork: rev, 295 
Anderson, John E.: ref, 220(8) 
Anderson, L. Dewey : test, 664, 671; 
ref, 270 (16), 285(6), 292 (167), 
6711, 3), 678(5), 679(3) 
Anderson, Phyllis M.: bk, 737 
Anderson, Rose G.: test, 236; exc, 
871, 947 ; ref, 60(33), 99(23), 236 
(40), 661 (40), 677 (56), 683 (18 
Anderson, Roy N.: exc, 831; ref, 
626(1), 627(30), 629(2) 
Anderson, Sarah Ferrall: ref, 217 
(98), 293 (241), 298 (20) 
Anderson, W. E.: ref, 26(20) 
Andrew, Dorothy M. : test, 627 
ae Margaret E.: ref, 649 
22 


Anpruss, Harvey A, President, 
State Teachers College, Blooms- 
burg, Pennsylvania: rev, 368, 373 

. Angell, Robert: bk, 910 

Annen, Ida: ref, 161(4) 
Apfeldorf, Max: ref, 73(360, 403) 
Arango, Carlos Martinez. See Mar- 

tinez-Arango, Carlos. 

Arensberg, Conrad M.: bk, 797 

Arkola, A.: ref, 49(4) 

Arluck, Edward W.: ref, 73(164, 
186, 234) 

Armitage, Stewart G.: bk, 738; 
ref, 41(28), 4 g 10), 73(481), 
95(17, 21), 2 


G 10 
Arthur, Grace: test, 271, 295; bk, 
G3, 24a, 25), 
205 (1-2) 


Asher, E. J.: test, 132, 234, 311 

ASHFORD, THEODORE A., Assistant 
Professor of Chemistry in the 
College, The University of Chi- 
cago, Chicago, Illinois; and 
Chairman, Committee on Exami- 
nations and Tests, American 


Chemical Society: rev, 566-8; 
test, 557, 559; ref, 557 (7), 565 (1) 

Association for Childhood Educa- 
tion: bk, 741 

Atwell, Charles R.: test, 169, 220; 
ref, 169(1), 220(44, 54) 

Austin, Ágnes Sutton: ref, 467(22, 


29) 
Avakian, Sonia A.: ref, 49(5) 
Averill, Lawrence A.: ref, 120(22) 
Ayer, Freperic L., Graduate Stu- 
dent, Teachers College, Columbia 


University, New York, New 
York: rev, 10; cross ref, 116, 
334, 501, 545 


Ayers, Arthur W.: bk, 761-3; ref, 
467 (23, 23a, 24, 29a, 34) 


BABCOCK, Harriet: test, 71; bk, 
72, 742; ref, 71(17-8), 661(17) 

Babcock, Virginia F.: test, 550 

Bacon, E. K.: test, 563 

Baggett, Miriam P.; ref, 49(11), 
298(111) 

Bahner, C. T.: test, 558 

Bailey, Albert Ernest: exc, 1000; 
ref, 223(8), 236(29), 249(1), 
250(50), 255 (13), 263(46) 

Bailey, Edna W.: bk, 743 

Baird, Sallyann: bk, 744-5 

Baker, A. B.: ref, 60(9) 

Baker, Clyde William: ref, 220(55) 

Baker, Elizabeth : ref, 73(165, 187, 
207, 324) 

Baker, Harry J.: tes!, 31-2, 274-5, 
626, 668, 697; bk, 33, 746; ref, 
32 (1), 668(1) 

Baker, W. M.: test, 559 

Baldwin, Alfred L.: exc, 299 

Baldwin, Ellsworth F.: ref, 677 


(41 
Baldwin, G. Bernard : test, 232, 624, 


I 

Balinsky, Benjamin: ref, 73(361, 
391, 413), 250(35), 208(3, 6, 18) 

Balken, Eva Ruth: ref, 103(3, 6-7, 
IS, 25-6 $ 47 

Ball, Fred J.: re ; 225(19) 

Ball, Rachel Stutsman: exc, 282; 
cross ref, 286 

x Philip Boswood: bk, 747, 


Banker, Howard J.: ef 247(1) 
Barber, Dixie: ref, 9(2), 18(12) 
Barden, Harold E.: ref, 678(9) 
Barker, Clarence, Jr.: ref, 40(1) 
Barnabas, Bentley: ref, 250(63), 
647 (159), 683(12) 
Barnard, Bernice: ref, 176(4) 
Barnes, Melvin W.: ref,217 (114-5) 
Barnette, W. Leslie, Jr.: ref, 24 
(14), 131 (11), 647(141) 
Baron, Samuel: ref, 73(355) 
Barr, A. S., Professor of Educa- 
tion, The University of Wiscon- 
sin, Madison, Wisconsin: rev, 


400, goes 
Barr, F. E.: ref, 689(1) a 
Barre, Marguerite F.: ref, 663(0) 
Barrera, S. Eugene: ref, 73(166, 
211-2, 214) 
Barrett, Dorothy M.: ref, oo(50), 
172(19), 390(1), 393(3), 627 
(38), 649 (39), 661 (36), 677 (48) 


Barrett, E. R.: test, 139 

Barry, Cora Miner : ref, 640(19) 

Bartelme, Phyllis: ref, 270(8) 

Barthelmess, Harriet M.: 
242(2) ; cross ref, 10 

Bartlett, J. F.: test, 558 


ref, 


Bartlett, Neil R.: bk, 1174-6; ref, — 


460(4-5, 7), 467(40-1, 43), 471 — 
(10, 20-1, 28) 
Bartlett, Russell S.: test, 581 
Bass, Hyman: ref, 250(8) 
Bassett, Dorothy : ref, 107(23) 
Bates, Justine: ref, 664(24), 677 


(37) 

Bathhurst, G. E.: exc, 1166 

Batson, William H.: bk, 748 

Bauer, W. W.: exc, 419 

Bauman, Mary K.: bk, 1122; ref, 
663 (15, 21), 665(1-2) 

Baurer, Herbert T.: ref, 678(12) 

Baxter, Brent, Director of Per- 
sonnel Research, Chesapeake and 
Ohio Railway Company, Cleve- 
iene a rev, 221, 252; ref, 
250(45, 51 

Bayles, tiai E.: cross ref, 581 

BavLEY, Nancy, Lecturer in Psy- 
chology and Research Associate, 
Institute of Child Welfare, Uni- 
versity of California, Berkeley, 
California: rev, 276, 285; exc, 
287; cross ref, 286 

Beach, Frank A.: test, 174 

Beales, Ben B.: ref, 48(42) 

Beaman, Gale C.: ref, 217(81) 

Bean, Kenneth L.: ref, 177(70), 
292 (200) 

Bear, Ropert Murray, Professor 
of Psychology and Director o; 
the Reading Clinic, Dartmout 
College, Hanover, New Hamp- 
shire: rev, 497, 509; test, 481} 
ref, 587(2) ; cross ref, 120, 470 

Beasley, Ronald: test, 3, 596 

Beattie, H. R.: test, 317 

Bechtel, Jessie: test, 127 

Bechtoldt, Harold: test, 598 

Beck, Maurice P.: ref, 24(9) 

Beck, RotANpD L., Professor of 
Education and Director of the 
Demonstration School, Central 
State College, Edmond, Okla- 
homa: rev, 6, 9; test, 17, 495, 5143 
ref, 244(16) 

Beck, Samuel J.: bk, 74-7, 749-51; 
exc, 78, 84; ref, Bush 167, 
237-8, 205-7, 362-4, 380, 400, 414-5, 
532) ; cross ref, 106 

Becker, Helen R.: exc, 721 

Beckham, Albert Sidney: ref, 176 
(16), 177(94) 

Beper, RatPH C. Professor of 
Educational Psychology and 
Measurements, The University of 
Nebraska, Lincoln, Nebraska: 
rev, 67; ref, 649(17, 19) 

Beer, Ethel S.: exc, 279 

Beers, F. S.: bk, 752; exc, 919, 
1200; cross ref, 4 

Beers, Lester D.: test, 581 

Beeson, M. F.: ref, 202 (137) 

Beier, Delton Clifford: ref, 4(10) 

Bell, Clarissa C.: exc, 92! 

Bell, Eleanor S.: ref, 99(51) 


& 


1009 ] 


INDEX OF NAMES 


Booth 


Bell, Hugh M.: test, 106; ref, 60 


(16) 

Bell, Reginald C.: bk, 983 

Bellak, Leopold: ref, 103(16, 39, 
79), 292(201) 

Bellinger, Lois: test, 206 

uw ium M.: e 
788, 965; ref, 225(32), 647 (103, 

115), 661(22, 25), 663(6), 664 
(14), 677 (17) ; cross ref, 255 

Bemis, Howard Albert: ref, 460(6) 

Bender, Evelyn: bk, 752 

Bender, ipe a na bk, 
109, 754; ref, 108 (1-2, 4-5 

Bender, Ralph E.: test, 68; ref, 68 
(1-2) 

Benge, Eugene J.: test, 51, 221, 389, 
625, 644, 656-7, 673, 692, 703; bk, 
755; ref, 703(1) 

Benjamin, Hazel C.: bk, 756-7 

Benkart, Joseph M.: ref, 419(2) 

Bennett, ALBERT A., Professor of 
Mathematics, Brown University, 
Providence, Rhode Island: rev, 
304, 323 

Bennett, C. W.: test, 556, 563 

Bennett, Grorce K., President, 
The Psychological Corporation, 
New York, New York: rev, 645, 
wo test, 386, 390, 446, 620, 659, 

3; bk, 758; ref, 61(7), 159(2), 
220(68), 259 (5, 9), 627 39), 658 
(3) 859 (12), S60 (12), 1 (26), 

3(8), 664(19), 668(2), 671 

(1o), 672(3), 677 20), 678017), 

79(10), 683(2-3, 5, 18) 

Bennett, Georgia: ref, 103(12, 17) 

Bennett, Margaret E.: bk, 1050 

Bennett, Mary Woods: ref, 263 
(44, 49) 

Benson, C. E.: ref, 220(19) 

Benton, ArtHur L., Associate 
Professor of Psychology, Uni- 
versity of Louisville School of 
Medicine, Louisville, Kentucky: 
rev, 29, 60, 103; test, 207; exc, 
72; ref, 41(8), 60(17, 34), 73 

188, 416), 95(3), 284(6), 202 

202), 297 (1), 298(7-8) 
ERDIE RarPH F., Director, Stu- 
dent Counseling Bureau and As- 
sociate Professor of Psychology, 
University of Minnesota, Minne- 
apolis, Minnesota: rev, 639-40; 
ref, 60(35), 99(36), 120(12), 
217(109), 647(120, 142-3, 150-1) 

Berens, Conrad: ref, 470(3-4) 

Bera, Harry D., Assistant Profes- 
sor, Board of Examinasions, 
ees College, ea 

ing, Michigan: rev, 593, 506; 
ue 592, 604; bk, 734-5; cross 
rej,3 

Berg, Irwin August: ref, 120(23), 
217 (139), 640(47), 699(6) 

a Arthur: ref, 292(176, 

42-3. 

Bergman, Paul: exc, 1104 

Bergmann, Martin S.: ref, 73 (417, 
429, 491, 554), 103 (64) 


5 
Berman, Abraham B.: 
(210) raham B.: ref, 292 


Berman, Isabel R.: ref, 642(4) 


EC ee w ref, 163(5), 

244(17), 507 (2 

Bernard, William : bk, 759 

Bernreuter, RoBERT G., Professor 
of Psychology, The Pennsyl- 
vania State College, State Col- 
lege, Pennsylvania; and Techni- 
cal Director, The Klein Institute 
for Aptitude Testing, Inc., New 
York, New York: rev, 703; ref, 
225(11, 26) 

Bernstein, Esther: bk, 760 

Berrien, F. K.: ref, 470(9) 

Berson, P. M.: ref, 236(20) 

Bettelheim, Bruno: ref, 103(80) 

Betts, Emmett A.: bk, 761-3; ref, 
467(1, 3, 11, 22, 23, 23a, 28, 29, 


29a) 

Betz, WiLLIAM, Specialist in Math- 
ematics, Public Schools, Roches- 
ter, New York: rev, 306, 310, 344 

Beynon, Lisabeth : test, 556 

Biber, Barbara: ref, 73(258) 

Bickel, Charles L.: test, 561; cross 
ref, 561 

Bidlake, Lawrence A. : ref, 647 (78) 

Bienstock, Sylvia F.: ref, r 
(24-5), 177(95) 

Biesheuvel, S.: bk, 

Bigelow, R. Barry: ref, 73(168) 

Bijou, Sidney W. : test, 21; bk, 765; 
ref, 271(23), 292 

Billeter, Paul Edward: ref, 219(4 

Billhartz, William H., Jr.: ref, 1 


(13) 
Billig, Otto: ref, 73(239, 298-9) 
Brits, Marton A., Assistant Sec- 
retary, Aetna Life Insurance 
Company, Hartford, Connecti- 
cut: rev, 215-6, 221, 246, 623, 638; 
ref, 48(21a), 220(20, 48), 647 


(104) 
Bilsky, H. B.: ref, 24(17) 
Bingham, C. W.: ref, 177(58) 
Bingham, Walter Van Dyke: test, 
691; bk, 766; ref, 220(32), 637 
(5), 642 (5), 661 (13, 15), 671(6), 
o 678(10), 689(4), 691 
1-2 
Bininger, Mary Lee: ref, 249(4), 
1(1 


Binion, Harriet Seay: fest, 514; 
ref, 514 (1) 

Binnewies, E. R.: test, 556 

Birchenough, C.: exc, 776, 778, 870, 


952, 1002 

Bird, Kenneth L.: ref, 566(10) 

Birnberg, Vita K.: ref, 60(18) 

Bishop, Elizabeth L.: bk, 743 

Bittel, Jane: bk, 

Bittner, Reicn H., Director of 
Personnel Research, Owens-Illi- 
nois Glass Company, Toledo, 
Ohio: rev, 669, 680 

Brxier, HAROLD H. Director of 
Research and Guidance, Public 
Schools, Atlanta, Georgia: rev, 
158, 162, 164, 268; test, 13, 147-8, 
156, 345a, 490a 

Blackburn, J. M.: cross ref, 270 

Blacklock, Clara: exc, 272 

BracksroNE, E. G., Professor of 
Business Education, The Univer- 
sity of Southern California, Los 


Angeles, California: rev, 378, 
382-3, 394 
Blackwell, A. C.: test, 558 
Blair, Glenn Myers: ref, 26(16), 
73(482-3), 250(52) 
Blair, Norman J.: test, 4 
Blakemore, Arline Mance: ref, 220 
(66), 250(53-4), 627(27, 32) 
Blanchard, Phyllis: exc, 1091 
Blankstein, Samuel S. : ref, 467 (12) 
ORIS Jean: ref, 292(202), 298 


C 

Biber M.: ref, 73(149) 

Bliss, Elisha F., Jr.: bk, 768 

Block, Irma S.: ref, 73(258 

Block, Virginia Lee: test, 

BLommers, PauL, Acting Dean of 
the College of Education and As- 
sistant Professor of Education, 
The State University of Iowa, 
Iowa City, Iowa: rev, 326, 530; 
test, 12; ref, 489(18) 

Broom, BENJAMIN S., College Ex- 
aminer, Board of Examinations, 
The University of Chicago, Chi- 
cago, Illinois: rev, 4, 17,19; exc, 
1008; ref, 397 (20) 

Blos, Peter: bk, 769; exc, 962 

Bluett, Charles G.: ref, 48(42), 
647 (156) ! 

BruM, Mitton L., Assistant Pro- 
fessor of Psychology, The City 
College of New York, New York, 
New York: rev, 657, 660, 664; 
He aha 627(21), 661(30), 


3 (7 

Boatright, J. A.: bk, 752 

Bochner, Ruth : bk, 78-9, 770-1; ref, 
73( 240, 418) 

Bodman, Frank: ref, 107(55) 

Boehm, Alice E. : ref, 208 (110) 

Boertman, C. Stewart: fest, 610 

Bogardus, Emory S.: exc, 37, 716, 
1177 

Bolanovich, D. J.: ref, 269(5), 640 
(20-1) 

Bolles, Mary Marjorie: bk, 772; 
ref, 27 (4), 41(1, 3-4) 

Bolt, Richard A.: exc, 279 

Bolton, Euri Belle: ref, 217(70) 

Bolton, Floyd B.: ref, 223(14), 249 
(9), 263(50) 

Bond, Elden A. : ref, 292 (161) 

Bond, Eva: ref, 467(16), 510(2), 
515(2), 520(3) 

Bond, : ref, 467(6, 16), 510 
(2), 515(2), 520(3) 

Bone, Paul Lee: ref, 10(6) 

M come Alfred M.: ref, 284 


) 
Bonney, Merl E.: ref, 26(6, 8) 
Book, William F. : ref, 5oz (1) 
Booxer, Ivan A., Assistant Direc- 
tor, Research Division, National 
Education Association, Washing- 
ton, D. C.: rev, 505, 532, 534; 
test, 481; cross ref, 13, 15, 489, 


490a 
Bookwalter, Carolyn W.: test, 424; 
bk, 774; ref, 424(4) 
Bookwalter, e W. j test, 424; bk, 
773-4; exc, 789; ref, 424(1-4) 
Bond Gotthard: exc, 8t; ref, 73 
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Borcher, Raymond, test, 556 

Bornin, Epwaro S., Director, Stu- 
dent Counseling Center, and As- 
sociate Professor of Psychology, 
The State College of Washing- 
ton, Pullman, Washington: rev, 
638, 643, 655; ref, 640(48), 647 


es Hery: test, 698; ref, 698 


Borow, 

a) 
Bortin, Aaron W. : ref, 60(19) 
Bosselman, Beulah : ref, 24(10) 
Botsford, Frances R. : test, 367, 388, 


395 
Boulger, Catherine : ref, 295 (2) 
Bowden, A. O.: ref, 99(12) 
Bower, Libbie B.: bk, 775; ref, 176 
(28), 177 (98) 
Bowley, Agatha H.: bk, 776 
Bowlus, D. E. : ref, 73(555) 
qanm um M.: bk, 78-9, 770-1; 
ref, 73(418) 
Boyd, George Robert: bk, 777 
Boyd, Paul P.: ref, 613(5) 
Boynton, Paul L.: ref, 73(300), 
78(13) 
Braddock, A. P.: exc. 1030 
BraprorD, E. J. G., Senior Lecturer 
in Education, The University of 
Sheffield, Sheffield, England: rev, 
95, 266 ; ref, 95 (14), 258 (11) 
Bradford, Jeanne M.: test, 3-4, 596 
He Mary Edith: ref, 20(4), 


528(1) —— 

Bradley, William Arthur, Jr.: ref, 
217(116) 

Bradway, Katherine Preston: ref, 
73 (485), 107 (9-12, 19), 292(172, 
282-3, 299, 317-8) 

Brainard, Paul P.: test, 634 

Brainard, Ralph T.: test, 634 

Brandenburg, G. C.: ref, 220(6) 

Brannan, Christine V.: test, 220 

Branom, M. E.: test, 6 

Braverman, Xt ref, 103(79) 

Brayfield, Arthur H. : exc, 640, 655; 
ref, 647 (121) 

Brecher, Sylvia: exc, 851, 1106, 
1123; ref, 208(82) 

Breese, Fay H.: ref, 225(575) 

Bregman, Elsie O. : test, 220 

Brenke, W. C., Professor of 
Mathematics, Emeritus, The Uni- 
versity of Nebraska, Lincoln, 
Nebraska: rev, 321 

Brenman, Margaret: ref, 73(301), 
103 (18) 

Brereton, J. L.: bk, 778 

Brewer, John M.: ref, 635(11), 
640(14), 647 (145) 1 

BREWINGTON, ANN, Associate Pro- 
fessor of Business Education, 
School of Business, The Univer- 
sity of Chicago, Chicago, Illinois: 
rev, 386, 301 

Brickman, William W.: exc, 761 

Bridges, J. W. : ref, 220(9) 

Brierley, Marjory: exc, 1243 

Briggs, Leslie : ref, 653(10) 

Bright, Charles J.: ref, 60(36) 

Brill, Isidore: ref, 60(19) 

Brillinger, E. Virginia : ref, 470(8) 
Brinegar, Vaughn: ref, 244(4), 566 
(13), 567 (11) 


Brinkman, Albert R.: bk, 825 
Bristol, Radcliffe W.: test, 303 
Britt, Steuart H. : exc, 953 
Broadley, Margaret E.: bk, 779 
Brodman, Keeve : bk, 713 

Brody, M. B.: exc, 29; ref, 71(22- 
3), 292 (203, 244-5) 

BnocpeN, H. E., Research Psychol- 
ogist, Personnel Research Sec- 
tion, The Adjutant General's Of- 
fice, M D. C.: rev, 
178-9, 

Bronfenbrenner, Urie: bk, 780 

Brooks, Bernice R. : ref, 292(246) 

Brooks, Esther: ref, 217(56), 640 


(10) 
Brooks, Fowler D.: bk, 1052; ref, 


250(3) 

Brooks, James J.:ref,107(42) . 

Brooxs, Netson, Instructor in 
French, Westover School, Mid- 
dlebury, Connecticut: rev, 183, 
185; cross ref, 181 

Broom, M. Eustace: bk, 781, 933; 
exc, 788, 912; ref, 250(12), 470 

5 (10); Iu ref, EA ) 
rosin, Henry W.: ref, 73(241 

BROTEMARKLE, R. A., P ca of 
Psychology and Director of The 
Psychological Laboratory and 
Clinic, The University of Penn- 
sylvania, Philadelphia, Pennsyl- 
vania: rev, 45; cross ref, 43-4, 


55 

Broudy, H. S.: ref, 613(6) 

Brower, Daniel: ref, 60(54), 73 
(556), 107 (56 

Brower, Judith F.: ref, 107 (56) 

Brown, A. Jean: exc, 1222 

Brown, Atrrep S., Associate Pro- 
fessor of Chemistry, Colgate 
University, Hamilton, New 
York: rev, 559 

Brown, Andrew W. : exc, 912, 1102; 
ref, 256(2), 202(143) ; cross ref, 


271 
Brown, Clara M.: test, 438-9; bk, 


782 

Brown, Clarence W.: bk, 818; ref, 
276(8) 

Brown, Edwin J.: bk, 783 

Brown, F. T.: ref, 73(349) 

Brown, Fred: ref, 202(178, 247-8, 
284, 300) 

Brown, J. E.: ref, 647(93) 

Brown, J. F.: ref, 27(9), 41(10), 


298(9) 

Brown, John: bk, 822 

Brown, Marion A.: test, 102 

Brown, Max Duane: bk, 784 

Brown, Paul A.: ref, 38(5) 

Brown, Ralph R.: exc, 78; ref, 73 
(242, 302), 292(147), 208 (21) 

Brown, W. Gordon: test, 561 

Brown, William : bk, 785 

Brownell, C. P.: ref, 471(24c) 

Brownell, Clifford L.: test, 421 

BnowNELL, WirLIAM A., Chairman, 
Department of Education, Duke 
University, Durham, North Car- 
olina: rev, 334, 338, 351 ; bk, 1063; 
cross ref, 10, 15, 334, 501, 545 

Brozek, Josef: ref, 225(57a) 

Bruce, H. W.: test,.360 


Bruce, Myrtle: bk, 786; ref, 236 
(23), 271 (20), 292(177) 

BRUECKNER, Leo J., Professor of 
Education, University of Minne- 
sota, Minneapolis, Minnesota: 
rev, 341, 348; test, 6 

Bruner, Jerome S.: exc, 1164; ref, 
73(561), 99(58) 

Brush, Edward N.: ref, 664(15), 
671(9), 677 (19) à 

Brussel, James À.: ref, 73(243-4) 

Brussell, Eleanor S.: test, 

Bryan, Alice I.: exc, 1158; ref, 217 
(60, 99), 677 (27) 

Bryan, Miriam M.: test, 4, 120 

Buck, John N.: test, 47, 56, 66, 266 ; 
ref, 47 (1-3), 56(1), 66(1), 266 


1) 

Buckle, D. F.: ref, 73(303) 

Budd, Nathan: test, 153 

Buhler, Charlotte: ref, 60(29), 73 
(304, 444, 557), 103(53) 

Bull, Elizabeth Gifford : exc, 1236 

Bullis, Glenna E.: bk, 903 

Bunch, C. C.: ref, 475(7) 

Bundas, Lourene E.: bk, 1129 

Bunting, Vidamae: test, 555 

Burchard, Edward M. L.: ref, 73 
(486), 107 (38), 271 (24), 284 (4) 

Burgemeister, Bessie B.: bk, 787; 
ref, 73(419), 99(32), 649(17a) 

Burger, F. E.: ref, 24(15) 

Burgess, Ernest W.: exc, 716 

Burke, Agnes: ref, 515(1), 518 
(12), 520(1 

Burke, Paul J.: test, 3-4, 316, 548, 
561, 581 

Burkhart, Russell: test, 367, 388, 


395 

Burks, Barbara S.: exc, 1234 

Burlingame, C. Charles : ref, 95(4) 

BunNHAM, PauL S., Associate Di- 
rector, Student Appointment Bu- 
reau, Yale University, New 
Haven, Connecticut: rev, 161, 
169, 636; bk, 817; ref, 20(2), 217 
id 225(55), 622(2), 647 (82, 


Buros, Oscar Krisen : bk, 788 
Burpee, Royal Huddleston: bk, 780 
Burr, Emuty T. Director, Guid- 
ance Bureau, Inc., New York, 
New York: rev, 222, 265 
Burr, James B.: exc, 914 
Burr, Mary: ref, 661(11) 
Burt, Cyr, Professor of Psychol- 
ogy, University College, London, 
ngland: rev, 225; bk, 790, 1186; 
exc, 283, 203, 837, 1002, 1082, 
1168; ref, 202(249); cross ref, 
2 
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Burton, Arthur: exc, 801; ref, 60 
(36, 55), 544(3) 

Burton, Theodore M.: test, 558 

Burton, W. H.: exc, 955 

Burtt, Harold E.: exc, 807, 1083, 
1166, 1190; ref, 220(10) 

Buswell, Guy T.: test, 502; exc, 
1168, 1190; cross ref, 470 

Butler, Richard: test, 569 

Buttenwieser, Paul: ref, 647 (85) 

Buynitzky, K. C.: test, 179 

Buzzard, G. A.: test, 601 

Byers, Kenneth : exc, 686 
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Byrd, Oliver Erasmus: fest, 418; 
ref, 418(1) 
Byrns, Ruth K. : ref, 244(3) 


C., B. C.: exc, 788 

Cahill, Gertrude B.: ref, 35(5) | . 

Canoon, G. P., Professor of Sci- 
ence Education, The Ohio State 
University, Columbus, Ohio: rev, 

* 581, 585 

Cahow, Arthur C.: d 244(18) 

Cain, Leo Francis: ref, 225(12) 

Calandra, Alexander: test, 3, 548, 
561-2; ref, 557 (8), 562 (1) 

Caldwell, C. W. : test, 701 

Calkins, E. J.: test, 598 

Callis, Robert : ref, 20(7) 

Cameron, Dale C.: ref, 73(245) 

Cameron, M. A.: bk, 1142 

Cameron, Norman: ref, 27(6) 

Campbell, A. A.: exc, 1062 

Campbell, C. Macfie: bk, 74, 749 

Campbell, Helen S. : ref, 181 (6) 

Campbell, Ronald Kenneth : ref, 647 
X94 

Candee, Beatrice: ref, 73(391), 250 
(46), 292 (148), 627 (21), 663(7) 

Canham, J. W.: bk, 1179 

Cannell, Charles F.: ref, 292(214) 

Canning, Leslie B.: ref, 647 (105-6) 

Cantril, H. : ref, 99(3-4) 

Capwell, Dora F.: ref, 24(19), 60 
(20-1), 107(51-2), 110(9), 271 


29 

Cardall, Alfred J., Jr.: test, 645, 
694; ref, 645 (1-2), 694(1) 

Carl, George P. : test, 273 

Carlson, J. Spenser : test, 689 

Carlson, Jessie J. : ref, 2902(337) 

Carlson, Walter A.: ref, 269(3) 


Carlton, Theodore: ref, 236(30), 
292 (250-2) 

Carmichael, Vernal H.: test, 367, 
388, 305 


Carp, Abraham: ref, 60(61) 
Carpenter, Aileen: bk, 1004 
Carpenter, Helen M.: exc, 890 
Carpenter, June: ref, 97(5) 
Carpenter, M. F.: fest, 115, 131 
Carrison, Doris : ref, 73(477) 
Carroll, Herbert A. : test, 140; exc, 
817; ref, 140(1-4) 

Carroll, John B.: ref, 2(12) 

Carr-Saunders, Alexander: 
1186 

Carter, Dorothae: ref, 471(27) 

Carter Gerald Clayton: bk, 791 

Carter, H. E.: test, 556 

Carter, Hanorp D., Associate Pro- 
fessor of Education, University 
of California, Berkeley, Califor- 
nia: rev, 52, 638, 655; bk, 650, 
792; exc, 652, 986; ref, 647(76, 
105-6, 152), 649(30, 33); cross 
ref, 647, 653, 605 

Carter, Launor F. : bk, 793 

Carter, Thomas M.: exc, 721 

Pe W. R.: bk, 794; ref, 244 

2! 

Cartwricut, W. H., Assistant Pro- 
fessor of Education, Boston Uni- 
versity, Boston, Massachusetts: 
rev, 591, 610, 618 


bk, 


Case, Harry W.: test, 682; ref, 250 


70) 
on William A., Jr.: ref, 60(61) 
Cassingham, C. C.: ref, 292(135) 
Castner, Burton M.: test, 276; bk, 
278-9, 900, 901; ref, 276(17, 20) 
Caswell, Hollis L.: bk, 845, 1170 
CarrELL, Psycue, Clinical Psychol- 
ogist, Lancaster, Pennsylvania: 
rev, 256, 274; test, 281; bk, 282, 
795; ref, 250 (13-4, 17) 281(1-2) ; 
cross ref, 236, 240 
Cattell, Raymond B.: test, 228; bk, 
796; ref, 43(7), 55(10), 228(1, 
4); cross ref, 26, 106, 223, 202 
Cavan, Ruth Shonle: exc, 962 
Cavanaugh, Jean Ogden: ref, 70 


I 

Cavell, Angelo Giavarella : ref, 236 
(21) 

Cawley, Anne Mary: ref, 649(45) 

Center, Stella S.: cross ref, 470 

Chalke, F. C. R.: ref, 73(456, 487) 

Challman, Alan: ref, 292(178) 

Challman, Robert C.: exc, 82; ref, 
73(420) 

Chalmers, J. W.: test, 162 

Chamberlain, Florence: ref, 99(21) 

Chamberlain, H. E.: exc, 1164 

CHAMBERS, E. G., Assistant Direc- 
lor of Research in Industrial 
Psychology, University of Cam- 
bridge, Cambridge, England: rev, 
640, 682 

Chapanis, Alphonse: ref, 236(38), 
250(31) i 

Chapman, Albert E.: exc, 747, 1089 

Chapman, G. H.: test, 559 

Chapman, J. Crosby : test, 522; ref, 
522(1) 

Chapman, Kenneth W.: ref, 73 
(274-5) — 

Chapple, Eliot D.: test, 688; bk, 
797; ref, 688 (1-4) 

Chassell, Marian A.: ref, 292(204) 

Cuauncey, Henry, President, Ed- 
ucational Testing Service, Prince- 
ton, New Jersey: rev, 12 

Chesire, Leone E. : test, 321-2 

Cheslock, Louis : exc, 1228 

Chester, H. M.: bk, 752 

Cheydleur, Frederic D.: bk, 798 

Child, Irvin L.: test, 291; ref, 23 
(28), 24 (11) 

Chipman, Catherine E.: ref, 73 
(488), 103(59), 292 (141) 

Chittenden, E. W.: test, 308-9, 547 

Chittenden, Gertrude E.: bk, 799 

Christensen, Thomas E. : exc, 1200; 
ref, 219(8), 640(37) 
i weg James A.: ref, 103 
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Christiansen, Carole: ref, 60(42), 
73(499) 

Christofferson, H. C.: test, 312 

Church, H. V.: ref, 220(2) 

CnuncHiLL, Eomunp P., Assistant 
Professor of Mathematics, An- 
tioch College, Yellow S, prings, 
Ohio: rev, 308-9, 323 

Cuurcum, Ruta D., Administra- 
tive Assistant, Office of Testing 
and Evaluation, Antioch College, 
Yellow Springs, Ohio: rev, 136 
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Chute, Eloise: ref, 73(358) 

Ciocco, a bk, 800; ref, 475 
(2-3, 5, 

Cs). aD O 
471 (12), 649(34), 2 
ivi ice Assembly of The 
United States and Canada, Com- 
mittee on Oral Tests in Public 
Personnel Selection: bk, 801 

Clapp, Frank L.: test, 117, 339 

Clardy, E. R.: ref, 73(189) 

Clark, Genevieve Y.: ref, 244(2), 
250(1) 

que Hone 2 test, 257 

r! ohn R.: test, 

Clark, Mamie Phipps Án. 8o2; ref, 
225(50) 

Clark, P E.: test, 563 

Clark, Ronald W.: ref, 26(16), 73 


. (483) s 

Clark, Ruth Millburn: bk, 104, 803; 
ref, 103(40, 59a) 

Clark, Stanley: ref, 292(259) 

Clark, Vernon: exc, 79 

Clark, Willis W.: test, 15, 26, 59, 
222-3, 345, 490, 517, 690; ref, 15 
(3), 26(3), 517 (1) 

rke, Daniel F.: ref, 292(205) 

Clarke, E. R.: bk, 804 

Clarke, H. Harrison: bk, 805-6 

Clarke, Helen Jane: ref, 73(480), 
208 (83) 
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Clarke, W. P.: test, 589 

Clausen, John A. : bk, 

CrEETON, GLeN U., Director, Divi- 
sion of Humanistic and Social 
Studies, Carnegie Institute of 
Technology, Pittsburgh, Pennsyl- 
vania: rev, 96, 645, 694; test, 224, 
635, 695-6; bk, 807, 978; ref, 11 
(1), 695(4) 

Clements, Dollie: ref, 217(55) 

Cleveland, Sidney E.: ref, 41(20), 
298(47) 

Clinton, R. J.: test, 158 

Close, Ruth L.: ref, 292 (301) 

Cobb, Elizabeth A.: bk, 1143; ref, 
103 (35), 292 (279) 

Coblentz, Irving: ref, 647 (123) 

Cochran, Grace : test, 178 

Cochrane, Roy : test, 347 

Corer, Cnanrzs N., Associate Pro- 
fessor of Psychology, University 
of Maryland, College Park, 
Maryland: rev, 266, 284 

CorFMAN, WiLLIAM E. Assistant 
in Psychological Foundations and 
Educational Research, Teachers 
College, Columbia University, 
New York, New York: rev, s9 

Coggin, H. T.: test, 556, 558 

Cohen, Betram : ref, 298 (60) 

pe Edith R.: ref, 250(36), 263 

40) 

Couen, Jonn, Lecturer, University 
of Leeds, Leeds, England: rev, 
135, 270, 342, 346; exc, 29 , 

Cohen, Jozef B.: ref, 99(39) 

are. ala, M.: ref, 49(11), 298 

III 

Cole, Gerald: ref, 467 (30) 

Cole, Lawrence W.: test, 226 

Cole, Robert D.: ref, 250(10) 

Coleman, J. H.: ref, 471(6, 8) 


Coleman 


Coleman, William : ref, 103(81) 


206) 

Collins, James H.: ref, 48(19-20, 
21, 31) 

Collins, Mary: test, 290; bk, 844; 
ref, 290(2) 

Collins, Patricia J.: ref, 649(23) 

Combs, Arthur W.: ref, 67(1), 103 
(60-3, 82) Á 

Commins, W. D., Associate Pro- 
fessor of Psychology, The Cath- 
olic University of America, 
Washington, D. C.: rev, 217, 224; 
ref, 242(3), 635(12) ; cross ref, 
220, 223, 627 

Conant, Margaret M. : bk, 815 

Congdon, Nora A.: ref, 67(7), 217 
(101), 635(1-2).; cross ref, 635 

Conger, J. A.: ref, 285(3) 

Connor, William L.: test, 13, 345a, 


490a 
Conover, David M.: ref, 219(6) 
Conrad, Clinton C.: cross ref, 611 
Conran, HrmsERT S. Technical 
Consultant, College Entrance 
Examination Board, Princeton, 
New Jersey: rev, 20; bk, 1058; 
exc, 1168; ref, 220(45) ; cross 
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bk, 878; exc, 781, 788, 837, 964, 
1141 
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Fleming, Edith Endsley: ref, 103 
(6: 

Fleming G: W. T. H.: exc, 788; 
ref, 95(7) 

Flomine bate Van ae bk, 
1123; ref, 292(253, 302 

SEES Cecile White: ref, 96 
(29), 110(14), 250(65), 647 (167) 

Flemming, Edwin G.: ref, 96(29), 

* 110(14), 250(65), 647 (167) 

Flecher, Stevenson W.: test, 305 


Florence Louise, Sister M:: ref, 
217 (140) l; z 
Frory, Caartes D., Rohrer, Hib- 


ler and Replogle, New Y ork, New 
Fork: rev, 245, 251, 269; exc, 
851; ref, 26(9) 

Flottman, Edward Albert: ref, 18 

= (14), 250(34) 

Flynn, Elizabeth: ref, 292(319) 

Foley, Jeanne: bk, 879. 

Font, Marion McKenzie: ref, 95 
(8), 292(181) 

Foran, T. G.: ref, 18(6) at, m 

Ford, C. A.: ref, 99(5) - 

Ford, Mary E. N.: bk, 8o, 880; ref, 
73(368, $00) on G 

Fordyce, Wellington G.: test, 63; 
ref, oM 

Forkner, H. L.: bk, 1083 

Forlano, George: ref, 24(18), 97 
eg 99(28-9, 48), 110(10), 250 


Forrest, Helen de Sales ; bk, 881 
Forster, M. C.: ref, 647(75) 
Forster, Max H. : ref, 73(570) 
FonsvrH, ELAINE, Assistant Pro- 
fessor of Social Studies, New 
+ York State College for Teachers, 
Albany, New York: rev, 599-600; 


test, m 
Fosa, Elizabeth Joan: test, 472; 
ref, 472(1-4) — ^ 
Fosberg, Irving A.: ref, 73(171, 
193, 306) 
Fosselius, Ebba E.: ref, 640(22) 
Foster, Harriet, ref, 473(7) 
Foster, Josephine C.: bk, 934 
Foster, Laurence S.: ref, 217(86), 
557(6), 567(12-3) 
Foust, Judson W.: test, 306; ref, 
“306 (1) ; cross ref, 305 
pelr, hice Mi bk, 882 
owler, Mary Jane: ref, 467 (12 
Fowler, Victor M.: ref, 612 AS ? 
Fox, Charles: cross ref, 124 
Fox, James Harold: test, 400 
Fox, William H.: bk, 883-4; ref, 
18(33), 120(17), 217 (122) 
Foxe, Arthur N.: exc, 78, 277, 769, 
1164, 1243-4 
Euch dr G.: um 885 
randsen, Arden N.: ref, 640(50), 
= 671 (3) eie 
y : ref, 244(4), 566(13), 
irte 566(13) 


Frank, Lawrence K.: bk, 916; ref. 
73(307) 
Frankel, Esther B.: bk, 1075 
Franklin, Joseph Charles: 
298(62) 
ioa QUEM bk, 886 
er, D. 1 2 
[pgs ennedy. See Kennedy. 


ref, 


Fraser, Hugh Russell: ref, 613(9- 
11) 
Frazer, Jane: ref, 467 (24) 
Ec pus A.: ref, 271(34), 
292 (321 
Frease, Forrest : test, 598 , 
FREDERIKSEN, NORMAN, Associate 
Professor of Psychology, Prince- 
ton University; and Research 
Associate, College Entrance Ex- 
amination Board; Princeton, 
New Jersey: rev, 446, 641, 608 
Freed, Herbert: ref, 103(66) 
Freeman, Ellis: ref, 460(1) 
Freeman, F. N.: cross ref, 292 
Freeman, H.: ref, 73(369) 
Freeman, Walter: ref, 73(267) 
French, Esther: bk, 1149 
Frencu, Joun W., Research Asso- 
ciate, College Entrance Examina- 
tion Board, Princeton, New Jer- 
sey: rev, 61-2, 701 
Frencu, Srpney J., Dean of the 
Faculty and Professor of Chem- 
istry, Colgate University, Hamil- 
ton, New York: rev, 557 
French, Thomas M.: exc, 28 
French, Vera V.: exc, 778; ref, 
66(2), 73(560), 99(59), 103(94) 
Fridiana, M.: ref, 486(3) 
Fried, Edrita G.: ref, 73(582) 
Friedline, Cora L.: ref, 292(210) 
Friedman, Bertha B.: bk, 887 
Fritz, Ralph A.: ref, 217 (52) 
Froehlich, Clifford: ref, 397(35), 
655(3) : 
FroenLicH, Gustav J. Assistant 
Director and Assistant Profes- 
sor, Bureau of Institutional Re- 
search, University of Illinois, 
Urbana, Illinois: rev, 10, 12; 
test, 22, 138, 330, 362, 555, 577, 
589; bk, 888; exc, 287, 715, 
914, 1112, 1156; ref, 22(1-2), 
217(82) ; cross ref, 116, 334, Sot, 


545 
Frogner, Ellen: test, 6 
Fromm, E.: ref, 73(241) 
Fruchter, Benjamin : ref, 225(56) 
F. P.: test, 313 


Fry, Glenn A.: 

Frye, Ellis K.: ref, 679(9) 

Fryer, DoucLas H., Associate Pro- 
fessor of Psychology, New York 


University, New York, New 

Hie rev, 685-6, 691; ref, 642 
I 

Fuller, Elizabeth Mechem: ref, 
_4(15), 120(28), 217(133) 

Fulton, James L.: ref, 48(36) 

Funderburg, Joe: ref, 73(575), 

= 103(97) j B:ref,73( 

Funkhouser, James B. : ref, 73(2. 

Furbay, John H.: test, 40 E. 

Furfey, Paul Hanley: ref, 285(1, 
5) ; cross ref, 107 

Furneaux, W. D.: bk, 873; ref, 
43(6), 44(6), ss(9), 73(5s0), 
239(3), 258(19) 

Furness, Edna Lue : fest, 213 

Fuszard, Melvin: test, 555 


GABLER, Earl R.: exc, 26, 294, 
306, 344, 489, 788, 796, 1131, 1200 

Gage, N. L.: test, 137; bk, 1112-4 

Gair, Mollie: ref, 73(340, 370) 

Gallagher, Constance D.: ref, 
473(2) 

Gallagher, J. Roswell: ref, 468(4- 
5), 473(2) 

Gallien, Shelby: fest, 420 

Gambaro, Providence ref, 
107 (48) 

Gannon, Joseph Timothy: bk, 889 

Gans, Roma: bk, 890 

Gansl, Irene: test, 1595 ref, 159(1) 

Gardner, D. E. M.: bk, 891 

Gardner, Warren H.: bk, 892; exe, 


2 

Garheld, Sol L.: ref, 73(495). 
95(22), 103(67), 298(114) 

Garman, Helen R.: test, 

Garretson, Oliver Kelleam: fest, 
637 ; ref, 637 (1, ü 

Garrett, Annette: bk, 893 

Garrett, Henry E., Professor of 
Psychology, Columbia Univer- 
sity, New York, New York: rev, 
59, 223; test, 159; exc, 293, 962, 
1104, 1168; ref, 292(285) ; cross 
ref, 225, 236 

crau iley S.: ref, 224(3), 


Garters Marie. See Evans, Maric 
Garrison. 

Garrison, N. L.: ref, 242(2) 

Garth, Thomas R.: ref, 177(57), 


ae} 

Gaskill, Harold V.: ref, 274(1) 

Gasking, D. A. T. : bk, 894 

Gates, Arthur L: test, 419, 484-7, 
510, 516; bk, 895; ref, 


255(11), 
oid Qe a, sat 
Bt sis(2), see 8). 

Gatien: kaoul: bk, 1249; ref, 218 


mon Frank: test, 1 
Gaudet, Frederick J.: ref, 163(7), 


250(49 
Gearan, John S.: ref, 217(71) ~ 
Gehlmann, Frederick: ref, 647 


129) 
TM end H. E.: ref, 217(77) 
uror A.: ref, 73(426). 
Gelder, Bosch van: bk, 947 
Geleerd, Elisabeth R.: erc, 962 
Gell, Kenneth E.: cross ref, 606 
Gelman, Babette: bk, 896 
Gentry, Curtis G.: test, 655; ref, 
GE) R 837, 
, J. R.: exc, 837, 907, 1091 
Gersericn, J. R., Decor, Bureau 
of Educational Research and 
ervice, and Professor of Edu- 
cation, University of Connecti- 
cut, Storrs, Connecticut: rev, 127, 
sor; bk, 914-5; ref, 11(2), 
It e 131 (7), 178(5), 217(50), 


Gerken, Clayton d'A : test, 689 

Gerlach, Marjorie: ref, 292(169) 
Gernes, Elizabeth: ref, 649(18) 
Gernet, Sterling K.: bk, 


Gerson 
Gerson, Martin J.: ref, 73 (336, 
383) 


Gerstein, Samuel : ref, a 
Gerver, |, M.: ref, 103 (68 
Arnold : test, Da bk, 277- 
^s, 1, 903-4; exc, 1244; ref, 
276 (1-5, 10-1, 13, 17, 20, 22, 26-7) 
s, Louise E.: exc, 756, 7: 
Garsen, Epwin E, Associate 
Professor of Psy sychology, Uni- 
versity of California, Pria 
California: rev, 663, 66 
nm ref, 627 (22, 25-6), a ino). 
661(24, 28), 663(10, 12), 
16).8 O77 21, 30. 
Gibb, : exe, 
ehe Crate C.: exc, 918, 1190 


iie 
m Q H: laha atis) 
gies, OFF ref, US 


Gifford, iie ee 
Gifford, Elizabeth y exc, 788; 
ref, 292( 
Gilbert, G. M. : ref, vn 21, 26) 
Gilbert Jeanne G.: ref, 71(19, 
Gilger, ope eol AS Jr.: ref, 635(5 
Gilkinson, : test, 150; ref, 
24(4, 8, 12), E 3), 151 (2), 
iala, + 177097), 647 (90, 110), 


Gill, Moin: bk, 1102-6; ref, 

27 (13, 17-9), 41 (24, 267), 71(25, 
27, 31-2), 73 (332, 5 x 100(3), 
103(29, 71-3), 292(325), 298(34. 


Gillespie i PH : test, 46 

Gillespie, wh H: exc, 28 

Gillett, Albert N.: bk, 906 

b Annette L.: ‘ref, 292(182, 


12) 
Gill Cliff oo 
Si M 


Gilman Gooddloe: ref, 
ga erich, C. N.: ref, Hp CUN 
ner, John: ref, 
[OE Millar: P test, 444 
S^ dore Vn test, 53, 544; 
ref, 53(1-2, 4), I 
eid es E 647(79) 
Glick, cs N.: ref, 202 (319) 
Glick, W. Per ef, 647 (97) 
exc, 
Goddard, H. H.: exc, 1168 
gorabera, Milton: p 
lfred L. 


Ec William: bk, 907; ri 
73(250, 309-10, 371-2, -8), 
a 48). 292 (30); 208 (10, 48, 


Goldman, George S. 
Goldman, Mi ania pin) 
Goldman, Rosaline: ref, 208 (86-8) 


Goldstein, H. H. H.: ref, 
Goldstein, Kart: Ped 5 D 


908; exc, 28, 1243; ref, 41 (253. 
6, 9, 13, 16-7, 21), 73( , 
ally 429a) 


Goldstein, Nathan N.: ref, II 
Good, Carter V.: PAL o 
871, 803, 916 

ENOUGH, Frorence L., Re- 
search Professor, Institute of 


’ 
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Child Welfare, University of 
Minnesota, Minneapolis, Minne- 
sota: rev, 264; test, 286; bk, 287, 
909; exc, 1168; ref, 286(4), 
292(254) ; cross ref, 292 


os ‘Alice. Whiteman : ref, 
107 (29) 
Goodman, Charles M. : ref, 225(15, 


26, 29, 48-9, 51), 635(7), 640(21), 
661(37, 41-2) 
CUPIT H. R.: test, 701 
rbutt, Dorothy Gale ; test, 92 
Gordon, H. Phoebe: ref, 61(7), 


540(3) 

TEASA Hans C., Special Assistant 
to the Director, Division o, 
Educational Research, Public 
Schools, "p img Pennsyl- 


vania: rev, 571, 573, 576; ref, 
: B), '635(6, 8), 
d E ie ar AA 


(8, 11), 655(2, 4) 
ols? Wem test, 153 
Gorseline, Donald E.: exc, 639 


Gotham, R, E.: rej A (os) 
hber iura ref, 10; 
Gottschalk, Louis: bk, 910 a 


Gottschalk, dn M.: test, Xe 
e TN G.: ref, 60(40, 
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: test, S89 
Gould, RR ref, 2(12), 110(15) 
Grage, D o B.: ref, 217(75), 
Graham, Alra Whitcomb: ref, 
26(21) 
Graham, Frederick B.: /est, 13, 


345a, 490a 
Graham. He Herbert: ref, 73(554) 
Graham, Jessie: exc, 
Grant, Albert: ref, 255(9-10), 
492(1) ; cross ref, 492 
Grant, Harry: ref, 60(41) 
Grassi, Jorsh ub ref, fous 251, 


sky CE 5 p 3, 450) 
Gray, : ref, 177 (58) 
GN : test, 9 


Gray Jean: ref, 217 (98), 292 (241), 


Gray, W. H.: test, 500 

Gray, WILLIAM S, Professor of 
Education, The University of 

hicago, Chicago, Illinois: rev, 


, 403 
Grayson, Harry M.: ref, 60(46), 


uise: exc, 933 
gren Lou Jacob: fey 181 
Greenblatt, Milton : d (86-8) 
Greene, EpwARD B., Chief, Person- 
nel Training, poro Unem- 
ployment Compensation Com- 
mission, Detroit, Michigan; and 
ecturer in Psycholo , Univer- 
sity of Michigan, Ann Arbor, 
Michigan: rev, st, 635; test, 166, 
523; bk, 912; exc, 650, ref, 166 
(3-4), 217(72) ; other, 843 
GREENE, Harry A. Director of the 
Bureau of Educational Research 
and Service amd Professor of 


Education, The State Univer: 
of lowa, lowa City, Iowa: fi 
br ci d test, 10-1, 116, 327, 
Sot, 545; bk, 913-5 
d. cross ref, 15 
Greene, J. E. : ref, 244(14), 399 (3). 
405(6), 540(2) 
Grecne, Paul C.: 

_ (84) , 
Greene, W., Ainsworth : fest, 3, 
Greenly, R. J.: ref, 250(39), 

(a1) 
Greenwald, Isidor : test, 556 
Greenwood, Edward D.: ref, 2847 

_ (18), 298(49) : 
Gregory, R. W.: test, 365 
Gregory, Wilbur S.: test, 636; ref, 

630(1) 
Greider, n: ref, 48(25) 
Greulich, William Walter: bk, 916 
Griffin, Charles . test, 698; ref, 


ref, 177 (88), 217 


698(1) 
Griffiths, Nellie L.: test, 518 
Griffitts, C. H.: ref, 661(9) 
Grime, Herschel E.: ref, 327(3)* 
Grinker, Roy R.: bk, 76, 751; fefe 
73(415, 501) 
Grossman, David : ref, 397 (30) 
GnossxickLE, Foster E., Professor 
s Mathematics and Head of the 
epartment, State Teachers Cole ~ 
lege, Jersey City, New Jersey: 
rev, 337-8, 353; cross ref, 1% 


3458. 
Grotjahn, Martin: exc, 78 
Grove, Victor: bk, 116 


Grove, Wuttam R., Psychologist 
and Director of Child Study 
Service, Phoenix Elementary — 


Schools, Phoenix, Arizona; for- 
merly Professor of Psychol ogy 
and Director of the Division DE 
Psychological Services, The Uni- 
versity of. Pittsburgh: rev, 688, 
662, 671, 679; ref, 660(10- 1) 
Grover, aor C.: test, S15 
Growdon, C. H.: ref, 292 (183, 
213) x 
pu a exc, 1089 EY 
Gruney, test, 482 
Guetzkow, Harold: ref, 225(57a) 


Guller, y S.: ref, 244(1), 250(5), 

2 

GUTER TP J. P., Professor of Psy- 
chology, The University of 


Southern California, Los Ange- 
les, California: rev, 217, 238, 2441 
test, 43-5, SS, 220, 621; bk, 9175 
exc, 912; ref, 43(1-3), §5(1-2, 4), 
177( 9)» 220(61), bat (i); cross 
ref, 106 

Guilford, Ruth B.: ref, 43(1-3), 
§5(1-2) 

Gulde, Carl J.: ref, 60(59) 

Gulliksen, Harold : exc, 1030; cross 
ref, 17 

Gummere, John Flagg: cross ref, 
20. 

GN Ira Curtis : ref, 244(5) 

Gurrey, P.: bk, 92! 

Gumi Miton $.: 
259(8) 5) 

Gustav, Alice: ref 73(496) 

Guttman, Louis : k, 042 


test, 259; ref, 
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HACKBUSCH; Florentine: ref, 
, 103(69) 
Hae? Nellie P.: ref, 18(1) 
Hadfield, A. E.: test, 493 
Hadley, Ernest E.: exc, 788 
Hadley, Loren S.: ref, 70(2) 
` Haeíner, John: test, 614 
Haenisch, Edward L.: test, 559, 
563 

Haggerty, Melvin E.: fest, 491 

Haun, Miton E., Dean of Stu- 
dents and Professor of Psychol- 
ogy, University of California, 
Los Angeles, California: rev, 
697, 704; test, 689; exc, 1128; 
ref, 62(4), 640 (m3 

Haines, Eleanor M.: ref, 217(143) 

Hales, Nora M.: ref, 220(41) 

Hales, W. M.: ref, 632(6) 

Hall, Prudence: fest, 533; ref 533 


(2) 
Hart, W. E., Associate Professor 
of Educational she aad o The 
ES 


University of Nebraska, coln, 

* Nebraska: 480, 528; bk, 

1124; cross ref, 20 

Hallowell, A. Irving: ref, 73(194-5, 
252, 430, 431) 

Halpern, Florence: bk, 78-9, 770- 
HEC E Rar ma dee 19m. 
103(88), 292 (255), 208(23 
alpin, Andrew: ref, 467 (16), 510 
(2), 515(2), 520(3) 

Halsey, George D.: bk, 918 

Halstead, H.: bk, 873; ref, 43(6), 
44(6), 55(9), 73(559), 95(9, 15), 
290 258(13, 18-9), 273(5), 

(66), 664(26) 

Halverson, Henry M.: test, 276; 
bk, 279, 901; ref, 276(20) 

Hamalainen, Arthur E.: bk, 919 

Hamilton, B. A.: test, 611 

Hamilton, E. Louise: ref, 73(312) 

Hamilton, Mildred Eckhardt: bk, 


920 
Hamley, H. R.: bk, 928 
Hamlin, Roy M.: bk, 921 
Hamlin, Talbot F. : bk, 922 
Hampton, Peter : exc, 716 
Hampton, Ruth: bk, 1015 
HANAWALT, Netson G., Associate 
Professor of Psychology, New 
Jersey College for Women, Rut- 
(Ge University, New Brunswick, 
ew Jersey: rev, 92, 97 
Handschin, C. H. : cross ref, 190 
Hanfmann, Eugenia: test, 27; bk, 
28, 923; ref, 27(3, 5, 5a, 7-8, 12), 
41(7), 73196, 561) 
anford, Jeannette: exc, 893 
Hankes, E. J.: test, 651 
E Bert: ref, 672(4), 677 
31 
Hanna, Joseph V.: exc, 766 
ANNA, LAVONE A., Associate 
Professor of Education, San 
Francisco State College, San 
Francisco, California: rev, 6. 12, 
Lan 617; bk, 983; cross ref, 4, 


5 

Hansen, Alice Viola: 

na, CF hi zd dad 
ansen, Frank W.: ref, 

Hanske, Carl F.: EO 


rev, 
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Hardaway, Mathilde: bk, 933 
Harding, D. W.: ref, 258(14) 
Hardy, LeGrand H.: ref, 473(5, 


8-9) 

Harmon, Janet B.: ref, 664(17) 

Harmon, Lindsey R.: test, 251; ref, 
60(23), 664 (17) 

Harper, Bertha Peterson : exc, 648; 
ref, 647 (153), 649(24) 

Harrell, Margaret S.: ref, 219(7) 

Harrett, THoMAS W., Associate 
Professor of Psychology, Uni- 
versity of Illinois, Urbana, Illi- 
nois: rev, 665-6; ref, 96(27), 219 
(7, 9), 225(20, 30), 250(42), 647 
et 661 (16, 19-21), 663(5), 

13), 671(8), 678(11, 15) 

Harriman, Philip Lawrence: erc, 
76, 84, 89; ref, 73(500, 502, 517) 

Harris, ALBERT J., Assistant Pro- 
fessor of Education, The City 
College of New York, New York, 
New ed PM e 511, 

; test, ; ref, 250(33 

HA nester W., Assistant 
Professor of Education, The 
University of Chicago, Chicago, 
Illinois: rev, 139-40, 143 

Harris, Dale B.: exc, 1070 

Harris, Daniel : ref, 99(6, 13) 

Harris, Harold J.: bk, 713 

Harris, John D.: ref, 284(23) 


E pe [p e 84; ref, 
24, 42), 73(499. 
Harris, Robert .: ref, 473(4) 


Harris, Thomas M.: exc, 1177 

Harrison, Mary R.: ref, 217(100) 

Harrison, Ross: ref, 73(313), 103 
(8-9, 28, e 

d xd rge J.: ref, 73(253- 


5 

Harrower [Harrower-Erickson], 
M. R.: test, 73; bk, 81-2, 924-5; 
exc, 75, 78, 84; ref, 73(197-200, 
256-7, 314-9, 374-5, 432-3, 500-1) 

Harrower-Erickson, M. R. See 
Harrower, M. R. 

Hamsm, CHamLES M. Associate 
Professor of Psychology, The 
University of Nebraska, Lincoln, 
Nebraska: rev, 673, 681, 683 

Hart, Ella B.: ref, 36(2) 

Hart, Hornell: fest, 36; bk, 37, 
926; ref, 36(1-2) 

Hartley, Ruth Edith: bk, 927 

Hartman, A. A.: ref, 220(63) 

Temani, s Wi e R 
1190; ref, 99(7, 14), 647 (77 

Hartoch, Anna. See Schachtel, 
Anna Hartoch. 

Hartog, Anna. See Schachtel, Anna 
Hartoch. 


rtoch. 
Hartog, Philip: bk, 928; exc, 961, 
1082 


Hartson, Louis D.: ref, 236(24), 
244(6, 21, 25), 263(45); cross 
rey, 244 

Hartung, Arthur: test, 610 

Hartune, M. L., Associate Profes- 
sor of the Teaching of Mathe- 
matics, The University of Chi- 
eee eri poral rev, 305- 

; test, ; ref, 53s(11 

Hartzell, Mi b. 


ildred D.: ref, 635(9) 


Harvey, Lucy Jean: bk, 929 
Harvey, O. L.: ref, 671(4) 
Hasenbank, Alvin L.: test, 611 
Hass, Gren, Assistant in the Hor- 
ace Mann-Lincoln Institute of 
School Experimentation; Teach- 
ers College, Columbia University, 
New York, New York: rev, 1,6 
Hassler, J. O.: cross ref, 363 
Hastings, Margaret: test, 3, 596 
Hatcher, W. T.: test, 558 
HATHAWAY, STARKE R. Professor 
and Clinical Psychologist, De- 
partment of Psychiatry and Nen- 
rology, University of Minnesota, 
Minneapolis, Minnesota; rev, 24, 
93; ue e ref, 60(1-5, 8-9, 12, 
43, 45, 59a 
Hattwick, LaBerta : bk, 717 
Hauptman, Leo M. : exc, 1109 
Haury, Gertrude: ref, 292 (154) 
Hause, Enid: ref, 242(5) 
Hausheer, Herman: exc, 1151 
Havighurst, Robert J.: exc, 652, 
ga: ref, 161 (13), 225(57b), 271 


(26) 

Hawkins, Charles C. : bk, 712 

Hawkins, G. E.: cross ref, 363 

Hay, Epwarp N., Personnel Of- 
ficer, The Pennsylvania Company 
for Banking and Trusts; and 
President, Edward N. Hay and 
Associates, Inc.; Philadelphia, 
Pennsylvania: rev, 624, 630-1; 
exc, 841, 994; ref, 220(66, 69), 
250(53-4, 59), 627(27, 31-2); 
cross ref, 292 

Hayden, Howard: bk, 930 

Spe Samuel P.: bk, 931-2; ref, 
18(21), 292 (286), 298 (24) 

Haymaker, Webb: exc, 78 

Hayman, Max : ref, 71 (24), 05 (10) 

Haynes, Benjamin R.: bk, 033 

Hayter, Walter Henry: ref, 480(1) - 

Hazard, Lucile: ref, 10(8), 18(22) 

Headley, Neith E.: bk, 934 

Healey, Mary I.: test, 430-1 

Heath, Fred A.: test, 562 

Heaton, Kenneth L.: exc, 788 . - 

Heidbreder, Edna: test, 671; 
ref, 671(3), 678(5), 679(3) 

Heil, Louis M.: bk, 983; exc, 817; 
cross ref, ior Jp N 

Heilman, J. D.: ref, 217 (101) 

Heim, A. W.: exc, 817 

Heller, Eugene: ref, 73(512) 

Heller, Harry : cross ref, 181 

Hellfritzsch, Alvin G.: fest, 330, 


362 
Henderson, Charlotte : test, 370 
Henderson, Ethel M.: ref, 276(12) 
Henderson, Mack T.: ref, 664(24), 
677 (37) 
Henderson, Robert W.: test, 113, 


252 
Hendrick, Ives: exc, 279 
Hendricks, B. Clifford: test, 557 
Hendrickson, Gordon: ref, 220 (40) 
Henmon. V. A. C.: cross ref, 218 
Henys [^ A 1000 
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Hook, J. N.: ao 

Hooper, iun E, Assistant 
Professor of Chemistry, The 
e e MEATUS University, 

ashington, D. C.: rev, 557, 555 

Horse Robert: var 693; ref, 
693(1) 

Horan, Kathryn: ref, 467(16), 510 
(2), 515(2), 520(3) 

Horn, Alice McAnulty: bk, 941 

Horn, Charles C.: test, 171; ref, 
171(1) 

Horn, Daniel: exc, 942 

Horn, Ernest: test, 10, 116, 334, 501, 


545 

Horne, Betty N.: ref, 61 (6) 
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Jorvas, RicHamD HL, Associate 
Professor of Geology, Florida 
State University, Tallahassee, 
Florida: rev, 553, 574 

Jordan, William Leroy: ref, 18(15) 

Jorgensen, A. N.: fest, 489; bk, 
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Laleger, Grace Elizabeth: bk, 981; 
ref, 642(6), 640(25) 

E x Jeannette Carl: ref, 177 

3 

Land, George A.: test, 204 

Lander, Joseph: exc, 716 

Landis, Carney: ref, 27(4), 41 (4 


13) 

Landisberg, Selma: ref, 47(4-5). 
66(3). 71(34), 95(23a), tes (95). 
107 (58), 266(2) 

Ex Herbert A.: cross ref, 166, 


LI 
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Lane, George Gorham: ref, 683 


1 
Tee wanes O'Kelley: ref, 298 
112) Y 
ue Irene Dunn: ref, 250(6) 
Langlie, Theos A. : ref, 115(2-3, 5), 
131 (2-3, 5), 308 (2-3, 5), 309 (2-3, 
5), 566(4-5, 7-8), 567(4-5, 7); 
cross ref, 244 . 
Lanomun, Cnanues R., Director, 
New York Office, Educational 
Testing Service, New York, New 
York: rev, 232, H5: test, 245 
Langsam, Rosalind Streep: ref, 163 
(e m 489 (11), 491(6), 
Yun Gerap V., Assistant 
Director, The Graduate Record 
Examination, New York, New 
York: rev, 117, 128; exc, 914 
Lanpher, Henry Coe: exc, 893 
Lapp, C. J.: test, 547, 582, 587, 608; 
he aoe 308(12), 587(4), 
7 (24 
‘Larson, Arthur Hoff: ref, 67 (12, 
15) 
a P.: ref, 120(23), 
178(7 
Larson, WiLLIAM S., Chairman of 
the Music Education Tapani- 
ment, Eastman School of Music, 
The University of Rochester, 
ME ay York: rev, 175- 
7; ref, 177(77 
Lasaga YF eso, Jose I.: ref, 


roa R.: ref, 107(34), 217 
(sr), 489 (7), 66 (19), 678 (6). 


9(5 

Latham, Darrell E.: bk, 823 

Laton, Anita D.: bk 745 

A ig ilbur: ref, 60 

44), I 

n F Calvin: ref, 242(5), 
255 

DN Austin des : ref, 73(560), 
103 

Lavos, George : ref, 18(31) 

Lawrence, Evelyn: exc, 1219 

Lawsue, C. H., JR., je real of 
Psychology, Purdue University, 
Lafayette, Indiana: rev, 656, 659, 
661; test, 216, ie 675-6; ref, 73 
(570), 216(1-2), 246(3), 672(5), 
675(1-2), 676(1) 

Lawson, Melvyn: exc, 890 

Laycock, Samuel R.: fest, 237; exc, 
1030; ref, 292 (250, 324) 
yman. James W.: ref, 302(260) 

r, Mae: ref, 242(2) 

Leaf, Curtis T. : ref, 11(4), 115(9), 
217 (74), 

Leahy, William: test, 569 

Learned, W. S.: test, 444 

Leavell, U. W.: test, 6; ref, 470(6) 

Leavitt, H. J.: ref, 05(25) 

Leavitt, Harry C.: ref. 73(554) 

Le Bas, Margaret: ref, 73(524) 

Lebeaux, Thelma: ref, 73(360, 
467), 103(57) 

Ledgerwood, Richard: cross ref. 
13, 166, 234, 345a, 490a, 523 

Ledwell, Ruth: test, 555 

Lee, Dorris May: ref, 517(1) 
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Lee, Edwin A.: test, 643 
Lee, Harold L.: ref, 321(1) 
Lee, Hector H.: test, 118-9 
Lee, J. Murray: test, 17, 345, 490, 
517; ref, S17 (1) ; cross ref, 10, 


IS D Werry, Professor of 


Education, The University of 
Southern California, Los 
les, California: rev, 675, 687, 695; 
ay 828, 981 ; ref, 73(557) ; cross 
ref, 15 

Lehman, Ruth T.: ref, 640(24) 

Leitch, Mary : ref, 103(89) 

Leland, Bernice: test, 275 

Lemkau, Paul: bk, 90, 1127; ref, 
73(285) 

Lemmon, Martha L.: exc, 956 

Lemon, Harvey B.: test, 582 

Lennard, Henry Loeblowitz. See 
Loeblowitz-Lennard, Henry. 

Lentz, THeopore F., Director, At- 
titude Research Laboratory, and 
Associate Professor of Educa- 
tion, Washington University, St 
Louis, Missouri: rev, 67, 70; ref, 


653(5 
Mesi dist A.: bk, 
Leonarp, J. Paut, President, San 
Francisco State College, San 
Francisco, California: rev, 120, 
125; bk, 983; cross ref, 15, 497 
et Jules: bk, 759 


Leslie, Louis A.: ref, 136(2) 
Lester, David W.: ref, 284(25) 
Lester, Helene: ref, 647 (130) 
Leuba, Clarence: exc, 956 
Leuenberger, Harold W.: exc, 788 
Leverenz, Carleton W.: ref, 60(7) 
u^ Joseph: ref, 292(296), 298 
Levi, Marjorie B.: ref, 3460 
Levine, Kate Natalie. See Kogan, 
LT CM 
vinson, Harry: fest, 149 
Levy, David M.: exc, 716; ref, 73 
(475. 597) 
Levy. Lydia: test, 71 
Lewinski, Robert J.: ref, 95(19), 
284(13, 16, 21), 208(35-6, 50, 60- 


70) 
* Lewinson, Thea Stein: bk, 986 ; ref, 


73(388) 

Lewis, Aubrey : bk, 873; ref, 43(6), 
44(6), 55(7), 73(550), 230(3), 
258(19) 


I yon bk, 1140; ref, 177(84, 
Tewicrfoln A.: ref, 60(64), 640 


(53) 
Lewis, Nolan D. C.: bk, 086; ref, 
60(86a). 73(440, 578) 
Licht. Marie H.: bÈ, 987-93 
Liefeld. Theodore S. : ref, 177 (68) 
Life Office Management Associa- 
tion: bk, 994 
Likert, Rensis : test, 677 
Lillian, Katz K. : ref, 73(213) 
Limbert, Paul M.: cross ref, 2 
Lincoln, A. L.: test, 154-5, 233 
Lincoln, Edward A. : ref, 273(3) 


Lindemann, Erte : 
Lindner, Robert , 259; exc, 

75, 912, 1103; ref, 73(274-5, 337- 
-6), 259(8) 


ref, 285(2) 


Link, H 
(30), 9700 + 107( 

Linn, Louis E d 73(527 

Lion, Ernest G.: ref, 73(485) 

Lippitt, Ronald : exc, 

Lipsett, Laurence: ref, 640(54) 

Litterick, William S. : test, 303 

Lrrruz, OrreL E., Associate Pro- 
fessor of English, Manchester 
College, North Manchester, In- 
diana: rev, yss 

: ref, 73(465), 103 


Ma: 

(s6), ean 
Livesa T T ul 218(7) 
Loeb, Eleanor : exc, 1166 
Loeblowitz-Lennard, Henry: ref, 
Lone J.: bk f, 627 (19) 

vinger, J.: bk, 995; ref, 
Loken, Robert D. "u 996 
Loman, William ; test, 705; ref, 405 


(3) 

Lomax, Paut S., Professor of Ed- 
ucation and irman, Depart- 
ment of Business Education, 
York University, New York, 
New York: rev, 306 

Lombardi, Maryellan Maher: bk, 
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Long, Elizabeth: ref, 242(2) 

Long, Howard H.: ref, 292(140) 

Lone, Joux A., Director of the De- 
partment of Educational Re- 
search, University of Toronto, 
Toronto, Canada: rev, 324, 328; 
test, 322, 357, 363; bk, 1142; exc, 
788; ref, 242(2) ; cross ref, 249 

Lone, Lovis, Director of Division 
of Testing and Guidance and As- 
sistant Professor, Department of 
Student Life, The City College 
of New York, New York, New 

ork: rev, 70, 111; ref, 647 (157), 

666(1) 

Long, W. F.: ref, 471(22) 

Longacre, Andrew: cross ref, 581 

Longstaft, Howard P.: test, 627 

Longwell S. Geraldine: ref, 107 
(16, 21, 23) 

Lonigan, Mary A.: ref, 217 (141) 

Loomis, H. H. : test, 561 

Loper, James F.: ref, 397(18) 

Lord, Elizabeth E.: exc, 278 

Lorenz, Jennie: ref, 475(4) 

Lorce, Inviwc, Professor of Ed- 
SES 7 [v pene ficer, 
nstitute of Psychological Re- 
search, Teachers College, Colum- 


C.: test, St ref, 23 


University, New York, New 
Vork: rev, 245, 269; test, 165, 
243, 506; bk, 907, 919, 944, 


1055; erc, 72, 300; ref, 220(53), 
250090, db). 308 (10); artes), 
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506(1), 679(8) ; cross ref, 626, McCandless, Boyd R.: bk, 1235; bia University, New York, New 
668 ref, 292 (345) York: rev, 1, 6 

Lorge, Sarah Wolfson: test, 180, Los arthy, T amas J.: bk, 1001; McKenzie, John: bk, 964 
182 » 99(42) McKim, MARGARET 

Lorimer, Margaret W.: ref, 647 Mec AUL, Pain Instructor and Professor of Education, Univer- 


(147) 
Lorr, Maurice: ref, 292(219, 261) 
Loth, N. N.: ref, 55(6), 60(26) 
Lott, Hiram V.: ref, 217(65) 
Lough, Orpha Maust: ref, 60(45, 


65) 

Lourtit, C. M.: Professor of Psy- 
chology and Dean of the Under- 
graduate Division, University of 
Illinois, Galesburg, Illinois: rev, 
59, 107; bk, 998; exc, 80, 282, 
1091 ; ref, 107 (30) ; cross ref, 106 

Love, L. L.: ref, 20(11) 

Loveland, Ruth : exc, 956 

Lovell, Constance: bk, 999; ref, 43 
(4), 44 (3), 5507) 

Lovell, George D.: ref, 161(6, 11) 

Lowell, Frances E.: ref, 292 (220) 

Lowss, Rura, Associate Professor 

Éducation, The West Texas 
State Teachers College, Canyon, 
Texas: rev, 483, 490, 494, 504 

Lowrey, Lawson G.: exc, 75, 78, 
788, 916, 1005, 1094, 1154 

Lowrie, Kathleen Harriet: ref, 244 
(24), 640(11), 647 (131), 689(5) 
owry, Edna: ref, 177( 5) 

Loyes, M. Edmund: ref, 18(6) 

Lozoff, Milton: ref, 27(13), 71 
B 73(332), 100(3), 103 (29), 


208 (34 
Lubin, Ardie: ref, 397 (20) 
Luborsky, L. B.: ref, 43(7), 55 
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Luchins, Abraham S.: ref, 73 
(571), 298(92) 

Lueck, Irving B.: ref, 471 (16) 

Lueck, William R.: test, 319 

Lufkin, H. M.: ref, 220 (24) 

Luke, Brother: ref, 73(276, 339) 

Lumry, Kelly: bk, 1129 

Lundeberg, Olav K.: fest, 194, 211 

Lundin, Robert W.: ref, 175(2), 


177 (100). 

‘Lurie, Louis A.: exc, 82; ref, 107 
(31, 43) 

Lurie, Walter A.: ref, 99(22) 

Lutz, Rose E.: test, 305 


Lynaugh, Mathias: exc, 766 
Lynch, James M.: exc, 1147 


MAAS, Henry S., Research Asso- 
ciate in Group Psychotherapy, 
William Alanson White Psychia- 
tric Foundation, Washington, 
D. C.: rev, 417 

McAnulty, E. Alice: ref, 24(3) 

US Y test, 10, 116, 
wee 
McCall, W. E ref, 61(6), 640(9) ; 
cross "ref, 491 

McCall, William A.: test, 242-3; 
bk, 997, 1000; ref, 242(1-2); 
cross ref, 540 

McCatuister, JAMES M., Director 
of Personnel Service, Herzl 
Branch, Chicago City Junior Col- 
lege, Chicago, Illinois: rev, 491, 
500,.505 


Adviser in the College, The Uni- 
versity of Chicago, Chicago, Illi- 
nois: rev, 488, 500, 506; cross ref, 


478 
McClelland, William: bk, 1002 
McCloskey, Ethel P.: ref, 2092 (223) 
McCloy, Charles Harold: bk, 1003- 


4 
McComb, H. G.: test, 701-2 
McConnell, T. R. : test, 62; bk, 724; 
ref, 62(2) 


McCord, Clinton P.: exc, 299 
McCord, Fletcher: ref, 462(4) 
McCormick, Thomas C.: exc, 912 
McCrory, Walter : test, 569 
McCrum, Muriel: test, 320 
McCuen, Theron L.: ref, 18(4), 


263(34) 
EA T. L.: exc, 282; ref, 292 
311) 

McCurLouca, Constance M., As- 
sistant Professor of ‘Education, 
San Francisco State College, San 
Francisco, California: rev, 492, 
501; test, 128, 481; ref, 600(5) ; s 
cross ref, 10 

McCune, George H. : bk, 1039 

McCutchen, S. P.: cross ref, 605 

McDaniel, H. B.: exc, 724 

MeDanich J. , J. W.: ref, 661 (32), 672 

McDonald, B David: ref, 15(2), 223 


( 
McDougal, Clyde R.: test. EM 
McElwee, Agnes R.: ref, 225(43) 
McElwee, Edna Willis : Pm e) 
McFarland, Henry J.: exc, 733 
McFarland, Muriel G.: test, 427-37 
McFarland, Ross A.: ref, 470(3- 4) 
MacFarlane, Jean Walker : bk, 1005 
McGarrett, Vincent: test, 590 
McGehee, William: exc, 1177; ref, 
217(117), 250(55), 308(15), 567 
(14), 627(28), 677(34), 683(4) 
McGeoch, John A.: ref, 23(22), 
220(30), 250(1) 
as Kathleen: test, 428, 


Me innies, Elliott: ref, 8 

McHale, Kathryn: test, in i i1 
654 (1-2) 

Machover, Karen: ref, 73(572) 

Machover, Solomon : bk, 1006; exc, 
1104; ref, 208(37-8) 

McHugh, Gelolo: bk, 1007; ref, 

Mclntre, J, Thomas: ref, 284(3) 
cIntire, omas: ref, 2! 

Men Keith: ref, 290(1) 3 

McKay, B. Elizabeth: ref, 107(8, 


17) 

Mckee, J. H.: test, 134 

Mackenzie, Gorvon N., Executive 
Officer of the Horace Mann- 
Lincoln Institute of School Ex- 
perimentation, Head of the De- 
partment of Curriculum and 
Teaching, and Professor of Edu- 
cation; Teachers College, Colum- 


sity of Cincinnati, Cincinnati, 


Ohio: rev, 476, 503; test, 509; 
cross ref, 18 
McKinlay, M.: bk, 873; ref, 43(6), 


a: 55(9), 73(559), 239(3), 


Mckinley, 1 Tanani: test, 60; 
s 6o (1-3, 8, 12) 

McKinstry, Eleanore: ref, 217 (98), 
292 (241), 298(20) 

McKown, Harry C.: bk, 1008 

McMahan, Frederick Joseph: 
470(11) 

se ty Maurice: bk, 1009 


ref, 


McManus, R. Louise: bk, 1010 
Macmecken, A. M.: bk, 1011 
Macmillan, jJ. W.: exc, 37 


McMullen, Harold G.: 
ref, sas (11) 


text, 5555 


MacMullen, Marjorie R.: rcf, 49 
(11), 298(111) 

McMullen, T. E.: test, 132 

McMurry, Robert N. : fest, 261; ref, 
rus 269(7), 663(18), 683 
IO 

pe reste vee : test, 61; ref, 
61(4-5), 397(32) 

MacNaughton Dorothy: ref, 73 

McNaughton, Francis L.: ref, 73 
(398, 457), 95(12) 
cNemar, Quinn: fest, 263; bk, 


203, 1012; exc, 301, 1102, 1219; 

ref, 24(3), 292(164, 264) 
Macomber, Lois: ref, 202(219) 
MacPhail, Andrew H.: ref, 

(86, 105), 567 (12-3) 
MacPhee, H. M. : ref, 208(84, 117) 


MacQuarrie, T. W.: test, 661, 690; 
ref, 661(2) 


urrrv, Jonn V. University 

Examiner, University of Florida, 
Gainesville, Florida: rev, 3; 
cross ref, 316, 548, 596 

McRae, Hugh : ref, 263(47) 

dg puris Gilbert S.: ref, 220(62, 
4 

Madigan, Marian E. : ref, 217 (108), 
489(14) 

Madison, Thurber H.: bk, 1013 

Madonick, M. J.: ref, 298(118) 

Madow, Leo W.: ref, 73(287) 

Madsen, T. N.: ref, 220(5, 12-3) 

Magaret, Ann: exc, 301; ref, 202 
(351), 205 4 25, 39) 

Maher, Howard: ref, 265(1), 627 
(40), 661(43), 665(3) 

Mahler, Walter R.: bk, 1014 
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Mahler-Schoenberger, Margaret : 
exc, 769 

Maiti, H.: exc, 1026 

Maizlish, I. Leon: ref, 292(221, 
262), 208 (26) 


Malamud, Daniel : ref, 73(442, 50). 

Malamud, Rachel F.: ref, 49(8) 
73(442, 528) 

Malamud, William : ref, 292(163) 

Malcove, Lillian: exc, 1126 

Mallary, Benjamin E.: erc, 1008 
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ERE Julius B.: test, 53; ref, 
M Edith B.: test, 157 

Malloy, Helga Tait: ref 73(471) 

Mandell, Milton M.: ref, 99(56), 
217 (134), 535 (13) 

Mangan, Martha R.: ref, 176(12) 

Mann, Edna Brand: exe, 78, 84 

Manning, Imogene: test, 556 

Mn iE test, 642; ref, 
220(34), 042(2 

Manson, Morse P.: ref, 60(46), 
95(24) 

MauwuzL, HerscHeL T., Professor 
of Educational Psychology and 
Director of Testing and Guidance 
Bureau, The University of 
Texas, "Austin, Texas: rev, 634; 
test, 50, 542; bk, 1218; ref, 172 


Cama 217 (75), 292 (187); cross 
re 

Manzer, Tiie W.: ref, 176 (10) 
Marcus, J. J.: test, 563 


Marcuse, Irene: bk, 1015 
“Margulies, Helen: bk, 1016; ref, 
73(177, 214, 216, 234, 277, 326) 
Maile LeClaire Robert: ref 647 


91) 
Mie, Oscar B.: ref, 73(529) 
Markt, Anna R.: ref, 405(2) 
Marowitz, Samuel: ref, 176(10) 
Marseille, Walter W.: test, 73; exc, 


986 
Marsh, Charles J.: ref, 67(3), 110 
Marshall, Frances Botkin; ref, 276 


(12) 
Marshall, Helen: ref, 292(263) 
Marshall, M. V.: bk, 1017; ref, 136 
(1), 2120 123) 
Marshall, Margaret: exc, 997 
Marshall, Thomas O. : exc, 402 
Marston, William M. : bk, 1018 


Martin, A. H.: ref, 220(41) ; cross 


ref, 202 
Martin, C. S.: test, 563 
Martin, F. D.: test, 557, 559 


Martin, Howard G.: test, 43-5; ref, 
43 (5), 44(2), 55(4) 
artin, L. C.: exc, 1 
Martin, Lureata Ross: ref, 176(27) 
Martin’ M. Frances: ref, 26(2) 
Martin, Margaret E.: test, 509 
Martin, Margaret P.: test, 3, 303, 
316, 321-2, 357 
Medio Carlos: ref, 103 
70) 
Martinson, Betty : ref, 292(222) 
Martinson, Melvin P.: ref, 217 (66) 
Marum, O.: bk, 873; ref, 43(6), 
HOT 55(9), 73(559), 239(3), 
19) 


ManzoLr, Stantey S. Professor 
of Psychology, Illinois State 
Normal University, Normal, Illi- 
A rev, 236, 255; ref, 67 (13, 


ae Wayne E.: bk, 1019 
Maskin, Meyer: test, 28 
Maslow, A. EL: test, 97; ref 97(3- 


bk, 86-8, 1020-2; 
443, 573) 
ref, 120(7), 217(87) 
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Mason, Charles W.: test, 695; bk, 
807; ref, 695(4) 

Masserman, Jules H.: 
ref, 1o3(3, 6, 26) 

Mateer, Florence: a 292(173) 

Mather, M. E.: ref, 647(132) 

Mauker, J. William: test, 12 

Maurer, Katharine M.: test, 286; 
bk, 287-8, 909, 1023; ref, 286(4, 
5) 

Maxfield, Francis N.: exc, 203-4, 
799. 851, 997, IOII, 1091, 1123, 
1168; ref, 292(132) ; cross ref, 
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Maxfield, Kathryn Erroll: ref, 107 


exc, 1123; 


MAxwELL, James, Principal Lec- 
turer in Psychology, Moray 
House, Edinburgh, Scotland: 
rev, 8, 59; cross ref, 114, 318, 
356, 546 

MS Miriam: test, 120 

Mayer, Barbara A.: ref, 292(153) 

Mayman, Martin: ref, 27(19), 41 
(26), 71(31), 73(574), 103(71, 
90), 202(325, 339), 208(71, 93) 

Mead, Leonard C. : ref, 467 (35) 

Mead, Margaret: ref, 276(28) 

Meadow, Arnold: ref, 97(8) 

Meeut, PauL E., Assistant Profes- 
sor of Psychology, University of 
Minnesota, Minneapolis, Minne- 
sota: rev, 53, 99, 112; bk, 1024; 
exc, 80, 1237; ref, 48(38), 49 
(9), 60(27-8, 47-8) 

Meenes, Max: exc, 1244 

Meier, Norman Charles: test, 172 

Meister, Ralph K.: ref, 292(191, 
219, 230, 261) 

MsrbpRUM, WiLLIAM B., Professor 
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562-3; cross ref, S61, 570 
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662, 670; ref, 662(1) 

Mellone, Margaret A.: 
ref, 240(1-2) 
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cal Service Center, St. Louis, 
Missouri: rev, d 57; exc, 956; 
ref, 73(150, 386) 

Mendel, Altred O.: bk, 1025 

Meininger Karl; oe, 1105; ref 27 

Ae PK OE a 54 

Menzel, Emil W.: bk, 1026-; 
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103(97), G49 (26) 

Merrill, Í, Maud test, 292; ref, 24 


test, 240; 


3) 

Merrill, Ruth ron ba) ref, 131 
(10), 570(1), 699(2, 5 

Metcalfe, Zaida F.: AR sro (8) 

an Life Insurance Com- 

: bk, 1028 

Meuh lenbein, J. M.: ref, 285(4) 
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Miale, F. R.: ref, 73( 
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Miles, Eunice C.: ref, 13(8) 
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Miller, Dorothy A.: ref, 125(1) 
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131, 566; ref, 115(8), 131(8), 178 
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(6), 566(11), 567 (10) 

Miller, Leo R.: ref, 202(291) 

Miller, Neal E.: bk, 1029 

proe 5 29a (270) bk, 1143; ref, 
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44), AUT 508(15), erar 
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ton University, Princeton, New 
Jersey: rev, 313, 315; test, 303 
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cisco, California: rev, 522-3, 525; 
test, 481 
Monroe, Marion: 
ref, 467, 470 
Monroe, Paul: bk, 1030 
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test, 519; cross 
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Montagn, Ashley M. F.: exc, 82, 


IL 

Montalto, Fannie D.: ref, 73(530) 

Mooney, Elizabeth: test, 52; ref, 
s2(1 

Mooney, Ross L.: test, 67-9; exc, 
1166; ref, 67 (4, 6, 8-10) 

Moore, Bruce Victor : test, 331, 608; 
bk, 766; exc, 1038, 1082; ref, 647 
(112), 677 (22), 683(1), 691 (2) 
oore, Clarence Lee: ref, 10(4), 
18(16) 

Moore, Herbert : test, 628; bk, 1031- 
2; ref, 220(51). 95625) 

Moore, Josepu E. Head, Depart- 
ment of Psychology, Georgia 
School of Technology, Atlanta, 
Georgia: rev, 656, 667; test, 538; 
ref, 538(1) ; cross ref, 9 

Moore, Marjorie E. : ref, 566(15) 

Moore, P. J.: fest, 588 

Morales, Noemi: ref, 107 (45) 

Mordy, Francis E.: test, 618-9 

More, Grace VanDyke: ref, 176 


(6) 
Moreno, J. L.: bk, 1033-7; exc, 37 
Morgan, C. L.: ref, Kos 

Morgan, Christiana D.: ref, 103(1, 


4) 
Morgan, Frank: ref, 263(43) 
Morgan, Howard K.: test, 63; bk, 


103 

Morgan, W. J.: ref, 220(71), 661 
(33), 677 (42-3) 

Morgenthaler, W.: bk, 90, 1127; 
ref, 73(285) 

Morison, Luella J.: test, 69; ref, 


69(1) 

Morris, Woodrow W.: ref, 60(49, 
67), 73(341) ~ 

Morrison, NATRAN, Principal Ac- 
tuary, New York State Division 
of Placement and Unemployment 
Insurance, New York, New 
York: rev, 304, 329 

Morrison, Nellie C.: ref, 242(2) 

Morrison, Thomas F.: fest, 550 

Morrow, Robert S.: ref, ye). 
177(91), 217(88), 627(23), 
(18), 677 (23) 
orse, H. T., Dean, The General 
College, University of Minnesota, 
Minneapolis, Minnesota: rev, 2, 
4; bk, 1039 

Morton, N. W.: test, 259; cross ref, 
635, 647, 653 y 

Moster, Cartes L, Director of 
Research, Personnel | Researci 
Section, The Adjutant General's 
Office, Washington, D. C.: rev, 
675, 687; ref, 99(31) 
oe F. p eG » 405, 705; ref, 

21-2), 405(3 

Moss, W. G.: bk, 822 

Motter, Mary Elizabeth: ref, 217 
(98), 202 (241), 298(20) 

Moutoux, A. C.: test, 675 

Moutoux, Ruth Davis: test, 431, 


436 
Mowat, Alexander S.: test, 241, 346 
Moynihan, James F. : bk, 1040 
Muehlenbein, Josephine: ref, 285 


See 
Mueller, Wilbert J.; test, 30 


Muench, George A.: ref, 73(576) 

Mullen, Frances A.: ref, 236(32), 
284(14, 17), 292(266) 

Mulligan, J. Kenneth: exc, 1135 

Muncie, Wendell: exc, 28, 75 

Munroe, Ruth Learned: bk, 89, 
dm ref, UM 342, 387- 
, 445-9, 530a, 531 

Munson, Grace: ref, 292(174, 226) 

Munson, Helen W.: exc, 1010 

Murphy, Florence E.: ref, 635(9) 

Murphy, Gardner: bk, 89, 956, 
1041 ; ref, 73(449) 

Murphy, Lois Barclay : ref, 73(216, 
258, 389) 

Murphy, Matthew J.: bk, 1163 

Murphy, Paul: exc, 1098 

Murphy, William F.: exc, 81, 87 

Murra, Wilbur F.: cross ref, 610 

Murray, C. L.: exc, 1112 

Murray, Elsie: exc, 462; ref, 473 

I 


(1) 

Murray, Henry A.: test, 103; bk, 
1042; ref, 103(1-2, 4, 32 

MunszLL, James L., Professor of 
Education, Teachers College, Co- 
lumbia_ University, New York, 
New York: rev, 174, 177; bk, 
1043, 1228; ref, 177(78) ; cross 


ref, 175 
Musselman, John W.: ref, 110(6), 


540(7) 
Myer, Lester N.: ref, 467(39) 
iat C ur exc, T. Gens) 
yers, C. Roger: ref, 292 (29; 
Myers, Charles S.: exc, 652, 870 
Myers, M. Claire: exc, 84; ref, 73 


Mila d dj ud t e 
, 271 (24), 664 (28), 671(11), 
677 (54), $868)" 


NADEL, Aaron B.: ref, 41(5), 
73(155, 158) 

Nagle, Joyce: bk, 1044 

Napoli, Peter J.: bk, 1045 

Nardi, Noah : test, 195-8, 452a, 453- 
4; ref, 196(1) 

National Ássociation of Teachers 
of Educational Measurements : 
bk, 10459 

National Council of Geography 
Teachers, Testing Committee: 
ref,Soo(1) 

National Council on Measurements 
Used in Education: bk, 1046-9 

National Education Association, 
American Educational Research 
Association, Committee on Coun- 
seling, Guidance, and Personnel 
Work: bk, 1050 

National Education Association, 
American Educational Research 
Association, Committee on 
Growth and Development: bk, 
1051 : 

National Education Association, 
American Educational Research 
Association, Committee on Men- 
tal and Physical Development: 
bk, 1052-3 

National Education Association, 
American Educational Research 
Association, Committee on Meth- 


ods of Research and Appraisal 
in Education: bk, 1054-5 — 
National Education Association, 
American Educational Research 
Association, Committee on Psy- 
chological Tests: bk, 1056-8 7 
National Education Association, 
Ameri Educational Research 
Association, Committee on Pupil 
Personnel, Guidance and Coun- “ 
seling: bk, 1059 
National _ Industrial 
Board, Inc.: bk, 1060 
National Opinion Research Center, 
Interview Department: bk, 1061 
National Society for the Study of 
Education, Committee on "Intel- 
ligence: Its Nature and Nur- 
ture”: bk, 1062 
National Society for the Study of 
Education, Committee on the 
Measurement of Understanding : 
bk, 1063 
Neeb, Marie Marguerite: ref, 518 


Conference 


I 
Needham, Norma R.: ref, 41(23), 
202(307) . 
Neher, Gerwin Charles: test, 422; 
ref, 422 (1) 
Nelson, A. Gordon: exc, 635; ref. 


223(9) 

Nelson, Charles W.: ref, 217(106) 

Netson, Crarence H., Examiner 
in Biological Science, Board of 
Examiners, Michigan State Col- 
lege, East Lansing, Michigan: 
rev, 551. 554 

Nelson, M. J.: test, 492; bk, 1064 

Nelson, Virginia Lafayette: ref, 
276(14-5. 18-9, 21) 

Nemzek, Claude L.: ref, 18(23), 
24(15), 163(2), 220(47), 236 
(17), 250(21) 

Nepove, Fay: bk, 1065 

Neuhof, Mark: ref, s35(11) 

Neumann, H: exc, 37 

Nevill, E. M.: cross ref, 292 

Nevins, F. A.: ref, 2902(267) 

Newburger, Maurice: ref, 107 (31) 

Newburn, Harry K.: ref, 48(25) 

Newhart, Horace: bk, 1069 

Newman, Frances Burks: bk, 1070 

Nene Joseph, bk, 1071; ref, 660 

Ig 

Newton, R. H.: test, 559 

New York Public Schools, Bureau 
of Reference, Research and Sta- 
tistics, Division of Tests and 
Measurements: bk, 1066-8; ref, 
292 (227) 

Nichols, Helen: exc, 721 

Nielson, J. R.: test, 178 

Niemela, Weston A. : ref, 217 (6) 

Niemi, George M. : ref, 250(61) 

Noble, Ellis L.: ref, 220(14) 

Noble, M. C. S., Jr.: bk, 1072 

Noble, Mary Alfred: bk, 1073 

Nott, Victor H., Professor of Ed- 
ucation, Michigan State College, 
East Lansing, Michigan: rev, 564, 
569, 573; test, 561 

Norman, Ralph D.: ref, 630(2) 

Northrop, Paul A.: cross ref, 582 
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Northway, Mary L.: bk, 1074-5; 
ref, 73(577) 

Norton, B. M.: test, 559 

Norton, J. K.: ref, 292(135) 

Norwood, Cyril : b&, 911 

Nosik, William A.: ref, 73(294), 
95(6) T 

Noyes, Epwanp S., Associate Pro- 
fessor of English, Yale Univer- 

© sity, New Haven, Connecticut: 
rev, 120; bk, 1076; cross ref, 141, 


497 
Nutting, Mary L.: ref, 308(8) 
OAK, Lura: ref, 467(19-20), 468 


2) 

OAKLEY, C. A., Lecturer in Indus- 
trial. Psychology, Can eee Uni- 
versity, Glasgow, Scotland: rev, 
635, 670 

Oakey, C. O., Professor of Math- 
ematics, Haverford College, Hav- 
erford, Pennsylvania: rev, 357-8 

Oberheim, Grace M.: ref, 217 

* (107), 628(3-4, 6) 

Oberholzer, Emil: bk, 90, 1127; 
ref, 73(179, 248, 285, 390) 

Oblinger, Barbara: ref, 292(350) 

O'Brien, Cyril C.: ref, 176(29) 

O'Brien, F. P.: ref, 2603(29) 

O'Brien, F. W.: exc, 75 

O'Brien, James C.: exc, 1082 

O'Connor, Johnson: bk, 1077-81 

Oddy, Harold G.: test, 556 

Oneri, C. W., Professor of Educa- 
tion, University of Illinois, Ur- 
bana, Illinois: rev, 205, 207; ref, 
250(7) ; cross ref, 10, 13, 15, 334, 
345a, 490a, 501, $45 

Odoroff, M. E.: ref, 220(58), 236 
(31), 292(258) 

Oehler, Chester A. : ref, 9(4) 

Oeschler, Hazel: ref, 242(2) 

Oexle, Helen M.: ref, 202(228) 

Ogden, Jean: ref, 467(24) 

O'Gorman, William D.: ref, 
(68) 

Older, Harry J.: ref, 73(409), 250 
(56), 284(19) 

Oldfield, Richard C.: bk, 1082 

Oliver, Richard A. C.: ref, 177 (61) 

Olson, Willard C.: exc, 287 

Omwake, Katharine T.: test, 96; 
ref, 177(65) 

Omwake, L.: test, 238 

Onarheim, James : ref, 223(15), 640 


(55) 
Oppenheim, Sadi: ref, 298(68) 
Oppermann, Alice F.: ref, 242(2) 
Orata, Pedro T.: exc, 788 
Orbach, Charles E.: test, 446 
Ordahl, George: ref, 107 (50) 
Ordahl, Vida Ellison: ref, 223(13), 
250(66) 
Ordway, Samuel H., Jr.: bk, 801 
O'Reilly, Virginia M.: bk, 783 
Orlansky, Jesse: ref, 73(377) 
Orteans, Jacos S., Professor of 
Education, The City ollege of 


do York, New York, New 
ork: rev, 331, ; tesi, 207, 
361, 399; ref, ao à 

Orleans, Joseph B.: test, 361 


Ronke, L. J.: test, 167, 246, 629, 

2 

On Clarence: test, 593 

Ortleb, Ruth V.: ref, 177(67) 

Osborn, Dortha Williams : exc, 37 

OsBoRNE, AGNES E., Assistant Pro- 
fessor, The City College of New 
York, uel d n York: 
rev, 377, 381; OR, 1053 

Osborne Herbert F. : bk, 1166; ref, 
661(22), 663(6), 664(14), 677 


(17) 
Osburn, Penh J.: cross ref, 478, 
y 5I 

Oseretsky, N.: test, 472 

Oswalt, Jay H.: ref, 18(18) 

Otis, us S.: test, 247-50; ref, 
292(135) 

Oris, JAY L., Director, Personnel 
Research Institute, and Profes- 
sor of Psychology, Western 
Reserve University, Cleveland, 
Ohio: rev, 672, 676, 680; bk, 
1166; ref, 661(22), 663(6), 664 
(14), 677 (17) 

Otness, H. Robert: exc, 1093; ref, 
107 (32) 

O'Toole, Charles E.: test, 674 

Otterstrom, Ruth E.: test, 351 

Outcalt, Louisa C. : ref, 107 (31, 43) 

Outland, Richard W.: ref, 67(5) 

Overn, Orlando E.: test, 330, 362 

Owen, H. F.: test, 702 

Owen, Reba: ref, 242(2) 

Owens, William À.: ref, 220(35) 

Oxlade, M. N.: ref, 663(23) 


PACE, C. Rosert, Associate Di- 
rector, Evaluation Service Cen- 
ter, Syracuse. University, Syra- 
cuse, New York: rev 528, 619; 
bk, 983, 1206; cross ref, 20 

Pacella, Bernard L.: ref, 60(68a), 
73(440, 578) : 

PACKARD, ALBERT GIBSON, Assistant 
Director in Charge of Aptitude 
Testing, Public Schools, Balti- 
more, Maryland: rev, 665, 666, 

Page, James D.: ref, 24(5), 292 


I 
Page, Marjorie Lou: ref, (11) 
Palm, Rose: exc, 86 dii 
Palmer, Carroll E.: bk, 800; ref, 
475(8) 
Palmer, Eleanor Margaret : ref, 292 
(157, 163) 
almer, Frederic, Jr. : test, 582 
Palmer, Maurice: test, 559 
Pankaskie, Margaret : exc, 489; ref, 
m. 225(31), 308(17), 489 
12 


Park, Robert E.: exc, 716 
Parke, L. A.: test, 385 
Parkhurst, Nelson M.: test, 366 
Parks, Elizabeth Lou: ref, 18(10) 
Parks, J. Allen: ref, 18(29) 
Parkyn, G. W.: ref, 292(326) 
Parmenter, M. D.: test, 106, 267, 
Party, Do las F.: ref, 647 
arry, Douglas F.: ref, 134- 
Parsek, Anna: cross ref, S a 
Partington, J. Edwin: test, 597; 
ref, 298(21) 


Phillips 
Parvis, Jeannette Orem: test, 430- 


I, 436-7 

Paso, Arlin A.: ref, 3365) 

Paster, Samuel: ref, 73(450) 

Paterson, Donan G, Professor 
of Psychology, University of 
Minnesota, Minneapolis, Minne- 
gig Eon P dro, 627, 

71, 689; ref, 29), 

(40), 642(4), 647 (168), iQ. 
678(5), 679(3) ; cross ref, 

Patrick, James R.: ref, ) 

Patry, Frederick L.: exc, 7 

Patterson, Dale O.: test, 328 

Patterson [Peterson], C. H.: exc, 
1104; ref, 107 (47), 298 (04-5 

Patterson, R. Melcher : ref, 271(32) 

Paul, G.: test, 556 

Paulsen, Alma: ref, 73(221) 

Peak, Philip: test, 370 

Peake, Nellie L.: ref, 18(19) 

Pearce, Jane: ref, 73(563) 

ee US A.: test, 3, 548, 550-1, 

I, 571, 551 

Peatman, John Gray: exc, 837 

Pechstein, L. A.: ref, 176(18) 

Pedersen, Ruth A.: test, , 388c, 
9884; bk, 819; ref, 2(10), 4(6), 
120(9. 

Pegram, Edna Lee: ref, 292(316) 

Peiser, Walter Gilbert: ref, 217 


(57) 
Peixotto, Helen E.: ref, 497 (13) 
dye William H.: ref, 73 
213, 

PED S., Galton Professor 
of Eugenics, University College, 
London, England: rev, 60, : 
bk, 1186 

Pepinsky, Abraham: exc, 1228 

Perkins, Elizabeth : test, 555 

Perkins, Keith J. : test, 403; bk, 832, 
966; ref, 403(1-2) 

Perl, Ruth E.: ref, 217(60) 

Perry, W. D.: exc, 1199 

Pescor, M. J.: ref, 73(222) 

Peters, Edwin F.: ref, 640(12), 
647 (136) 

Peters, Richarda: bk, 1084; ref, 99 


(43) 
Peterson, B. H.: ref, 217(89) 
Peterson, Bertha M. : ref, 647 (114), 
649(20a) 
RD C. H. See Patterson, C. 


Peterson, George C.: test, 569, 577 
po H. A.: test, 238; ref, 220 
25 

Peterson, Harvey : test, 577 

Peterson, Julia: test, 12 

PETERSON, SHAILER, Director of Ed- 
ucational Measurements, Coun- 
cil on Dental Education, Ameri- 
can Dental Association; and 
Research Associate, The Univer- 
sity of Chicago, Chicago, Illinois: 
rev, 558, 680-1; fest, 170; ref, 
170(1-2), sg) A 
Petrie, A.: bk, 873; ref, nk 
44(6), 55(9), 73(559), 239 (3), 

258(19) 


5 9 
Phillips, Alexander J.: ref, 231 (1), 
242 (8 
Phillips, E. L.: ref, 60(64) 


Phillips 


~ Phillips, George H.: test, 149 
Phillips, Gilbert E.: bk, ro85 
Phillips, Leslie: exc, 79; ref, 73 


(579) 

Phillips, M. J. W.: test, 589 

Philpott, S. J. F.: exc, 875 

Phoenix, Joseph G.: exc, 915 

Physical Society Colour Group 

mmittee: bk, 1086 
Piıcxerr, Hare C., Chairman, De- 
riment di Mathematics, State 
eachers College, West Chester, 
Pennsylvania: rev, 359 

Pickford, A. S.: exc, 28 

Pickford, R. W.: exc, 28 

Piel, Gerard: ref, E 

Pierce, Helen Oexle: ref, 292(338) 

Pignatelli, Myrtle Luneau: bk, 
1087; ref, 202(268, 294) 

Pike, H.: ref, 176(11) 

Pintner, Rudolf: test, 254-6; bk, 
294, 907, 1088; ref, 24(18), 97 
(10), 99(9, 28-9, 48), 250(48), 

2356), 3), 256(1, 3, 4), 26 » 

290(1), 202(260, 308), 298(10), 

664 (13, 671(7), 077 (13) ; cross 
ref, 223, 244, 249 

Piotrowski, Zvgmunt,A:: bk, 713; 
mG 77, 356 th ; ef cota), 
73 (154, 180-2, 223, 279-81, 344-6, 
359, 391, 451-2, 580, 578), 640 


41 

Piper, A, H.: test, 327 

Plomer, William : exc, 953 

PLUMLEE, L.B., Head of Mathemat- 
ics Section, Test Construction 
Department, College Entrance 
Examination Board, Princeton, 
New Jersey: rev, 322, 325 

Poland, Roberta G.: test, 581 

Pollack, Benjamin: ref, 95(5) 

Pond, Frederick L. test, 511; ref, 
511 Ga 

Pond, Millicent: ref, 220(48) 

Poole, F. E.: ref, 48(27) 

Poorey, Rosert C., Professor of 
English, The University of Wis- 
consin, Madison, Wisconsin: rev, 
115, 120, 131 ; cross ref, 497 

Portenier, Lillian G.: ref, 668(4) 

Porter, Florence Swan: ref, 103 


42 
Porter, JAMES M., Jg, Associate 
Professor of Psychology, Rens- 
selacr Polytechnic institute, 
Un New York: rev, 669, 678, 


Porter, James P.: ref, 244(8) 

Porter, M. Powell: ref, 255(6) 

Porteus, Stanley D.: bk, 1089-91; 
ref, 73 (224), 107 (33), 202 (229) ; 
cross ref, 259 

e E UE e res ) 
'ostman, : ref, 99 E 

Poston, Freda L.: ref, 486(4, 6), 


494(1 
Potashin, Reva: bk, 1075 
Potter, Elmer H. : ref, 73(581, 586) 
Potter, Mary A.: test, 344 
Potthoff, Edward F.: bk, 1092 
Powell, Lee: ref, 107(34) 
Powell, Norman J.: exc, 1008 
Powell, Virginia M.: ref, 423(2) 
Power, Lloyd: ref, 18(17) 
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Powers, S. Ralph: test, 550, 561, 
570; bk, 970 

Prados, Miguel: ref, 73(392, 582) 

Prager, Daniel: ref, 292(190), 298 


(5) 
PH Karl C.: ref, 217 (62) 
Pratt, Norman T., Jr.: cross ref, 


204 

Pratt, Willis E.: test, 1, 513; ref, 
513(1) 

ce Ae ref, 249(2), 

19), 668(3 

Basset Daniel A.: bk, 721, 871; 
cross gE 48, 110 

Pressey, Sidney L.: test, 70, 125, 
2. 

cer Rarrn C., Associate Pro- 
fessor of Education, University 
of Pennsylvania, Philadelphia, 
ennsylvania: rev, 10, 18; cross 
ref, 116, 334, 501, 503, 545, 595 

Preu, Paul W.: bk, 1093-4 

Price, Dennis H.: test, 314 

Price, Helen Gertrude: ref, 298 


72) 
Price, Ray G., Associate Professor 
of Education, University of Min- 
nesota, Minneapolis, Minnesota: 


rev, 368, 385 
Price, Roy A., Professor of Social 
Science and of Education, Syra- 


cuse University, Syracuse, New 
York: rev, 616; test, 501; ref, 
591 (1) 

Price, Vernon: test, 305 

Prieur, Marjorie: ref, 67(11, 14) 

Pritham, Gordon H. : test, 556, 558, 
563; ref, 556(1) 

Probst, John B.: fest, 700; bk, 1095; 
ref, 700(1-2) 

Probst, Kathryn A.: ref, 60(34) 

Proshansky, Harold M.: ref, 103 


33 
Prudden, George H.: ref, 48(20a) 
Pruitt, Clarence M. : test, 37 
Pryor, Patrica M.: bk, 1096-7 
Pugh, Gladys S.: ref, 249(8), 489 


(24) 
Pullias, Earl V. : ref, 18(11) ; cross 
ref, 13, 345a, 490a 
Purnell, Russell T.: bk, 1098 
Pusey, Harriet Clough: ref, 26(4) 
Puskin, Ruth: ref, 25(1) 


QUAID, Thomas Dell Doma: ref, 
217 (58, 63), 244(7, 11) 

Quasha, Willam H.: test, 677 

Quesnel, Lillian J.: bk, 1099 

Quimby, Neal F. : ref, 640(25) 


RABIN, Albert I. : exc, 76, 79; ref, 
71(35), 73(453, 532, 583), 220 
(73), 292(309), 208(13-4, 28-9, 
40, 52-3, 73-4, 96) 

Rabinovitz, A. : ref, 49(10), 95(21), 
298(107) 

Rados, William: bk, 1100 

Raimey, Victor: bk, 871 

Rainier, Ruth Neely : ref, 217 (108), 


498 (14) 
Ralph, H. Thoburn: ref, 34(1) 
Ramzy, I.: ref, 73(533) 
Rand, Gertrude: ref, 473(5, 8:9) 


Rand, Henry A.: bk, 1101; ref, 99 


Ralik, Jane Marie: ref, 96(25) 

RawporPH, Jonn F., Professor of 
Mathematics, Oberlin College, 
Oberlin, Ohio: rev, 316, 355; 
cross ref, 3 

Ranson, Frances C. : test, 495 

Rapaport, David: bk, 713, 1102-6; 
ref, 27(9, 12a, 13-4, 17-9), 41(10,* 
15, 18-9, 24-7), 71(25, 27, 31-2), 
73(182, 282, 332, 454, 534-5), 100 
(1-3), 103(19-20, 29, 34, 71-3), 
292 (325), 208 (9, 34, 41, 54, 56, 71, 


75 

n Theophile: exc, 1243 

Rarick, Harold J.: ref, 217(76) 

Raschig, Frances A.: ref, 467(9) 

Rashkis, Harold A.: ref, 60(50), 
73(584), 298 (97) 

Rasmussen, Otho M. : test, 153, 164, 
315, 326; ref, 315(1) 

Ratcliffe, Gleela : test, 436 

Raths, Louis : test, 63; ref, 63(2-3, 
5-7), 402 (1) L 

Raubenheimer, A. S.: ref, 661(6), 
672 (1), 678(6), 670 (4) 

Rautman, Arthur L.: ref, 71(20), 
284(22), 292 (270) 

Raven, John C.: test, 20, 239, 258; 
bk, 1107; ref, 239(1), 258(9-10) 

Read, James Morgan: ref, 613(13) 

Ream, M. J.: ref, 220(23) 

Reavis, William C.: bk, 1108-9 

Recktenwald, Lester Nicholas: ref, 
635(14-5, 18) 

Redfearn, Lena: ref, 678(13) 

Redlich, Frederich C.: test, 291; 


exc, 75 
Redmond, Mary : bk, 1110; ref, 250 


(44) 

Reed, Homer B.: exc, 1168; ref, 
166 (6), 653 (6, 0), 678(16) 

Reed, Paul H.: ref, 48(28) 

Reed, Robert: bk, 942 

Reed, Rufus D.: test, 562; ref, 
562 (1) 

Reeder, C. W.: ref, 540(1) 

Reener, Epwin H., Professor of 
Education, University of Illinois, 
Urbana, Illinois: rev, 598, 603 

Reem, Linda E.: ref, 103(54) 

Rees, W. Linford : bk, 873; ref, 43 
(6), 44(6), 55(9), 73(550), 239 
(3), 258(19) 

Reese, Ellsworth: ref, 467 (18) 

Reese, Thomas Whelan: bk, 1111 

Reger, Scott N.: bk, 1069 

Rehfeld, Florence Washburn: ref, 
217 (108). 489(14) 

Reichard, Suzanne: ref, 27 (14), 41 
(18-9, 25), 73(301), 159(3), 298 
(41-2), 677 (44) 

Reid, E. B.: test, 558 

Reiff, Fayrene: test, 153 

Reilley, Albert G.: test, 404 

Reining, Henry, Jr.: ref, 225(45) 

Reitan, Ralph M.: ref, 49(10), 73 
(552), o5(21). 208(107) M 
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Walther, E. Curt: test, 602 

Wang, Charles K. A.: bk, 1221-3; 
ref, 23(21), 06(23) 

Ward, Lewis B.: bk, 818 

Ward, Raymond S.: bk, 1166; ref, 
Eae 663(6), 664(14), 677 
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War Department: bk, 1224-7; ref, 
219(5, 14) 

Warkentin, John: ref, 24(5) 

War Manpower Commission, Divi- 

sion of Occupational Analysis, 
Staff: ref, 167 (1), 663(19), 664 
(27), 677(53) 
ARREN, NEIL D., Associate Pro- 
fessor of Psychology and Dean 
of Men, The University of 
Southern California, Los Ange- 
les, California: rev, 659, 665 
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Washburn, Alfred A. : exc, 277 

Washburn, M. R.: ref, 459(5) 

WasnusunN, Rurs W., Consultant 
in Child Development, Shady 
Hill School, Cambridge, Massa- 
chusctts: rev, 226, 274, 

Washburne, John N.: test, 110; ref, 
110(5) 

Wasson, Margaret M. : ref, 23(27) 

e Waterman, Alan T.: test, 581; cross 

ref, 582 

Waterman, John: ref, 73(229) 

Watkins, John Goodrich : bk, 1228; 
ref, 397 (28-9) 

Watkins, Ralph K.: cross ref, 581 

Watson, Bernard B.: ref, 582(5) 

Watson, GoopwiN, Professor of 
Education, Teachers College, Co- 
lumbia University, New York, 
New York: rev, 111, 406, 456; 
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Watson, H. Boyes: ref, 352(1) 

Watson, K. Brantley: bk, 1229 
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fessor of Medical Psychology, 
Department of Neuropsychiatry, 
Washington University School of 
Medicine, St. Louis, Missouri: 
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Watson, Ruth: ref, 107(30) 
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Weaver, Wilbur E: bk, dus 
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Webster, Barbara Simmons: ref, 
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Webster, Edward C.: bk, 1230 

Wecuster, Davin, Chief Psycholo- 
gist, Psychiatric Division, Belle- 
vue Hospital, New York, New 
York: rev, 228, 239, 258; test, 
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3; ref, 220(42), 298(16-8, 58, 
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Weidemann, Charles C.: cross ref, 
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Weider, Arthur: bk, 713; ref, 292 
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Weidner, Bruce V.: test, 563 
Weigh Egon: ref. 41(13 
Weil, Andre A :. ref, 73(232) 
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Weisman, Leah: ref, 535(11) 
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Weiss, Leo: ref, 23(25) 
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EITZ, HENRY, Director, Psycho- 
logical Services Center, Univer- 
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Weitzman, Ellis : test, 111; ref, 111 
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Welborn, E. L.: ref, 217 (120) 
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Acting Supervisor, The Reading 
Clinic, Baltimore, Maryland: rev, 


509, 512 _ 
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WELLMAN, BETH L., Professor of 
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Welfare Research Station, The 
State University of Iowa, Iowa 
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1234-5; ref, 202 (316, 333, 345) 

Wes, F. L., Psychologist, De- 
partment of Hygiene, Harvard 
University, Cambridge, Massa- 
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p Visas bk, 1239; ref, 
, 7 
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This index serves as an expanded table of contents for the section “Tests and Reviews" oy 
. presenting a complete list of tests, books devoted exclusively to single tests, and reviewers of 


these tests and books. Numbers preceding titles indicate the ent 


numbers to be consulted for 


e been 


full bibliographic entries, test de fide and reviews. Stars indicate new tests not previously 


listed in this series; asterisks ù 


icate tests which have been revised in some way or 


issued in new form since they were last listed in this series. 


ACHIEVEMENT BATTERIES 


1. kAmerican School Achievement Tests. Reviews 
by Walter W. Cook, Glen Hass, and Gordon N. Mac- 

enzie. 

2. *Cooperative Contemporary Affairs Test for Col- 
lege Students. Review by H. T. Morse. 

3. *Cooperative General Achievement Tests, Re- 
vised Series. Review by John V. McQuitty. See also 
316 for review by John F. Randolph of Test 3, 548 
for review by Palmer O. Johnson of Test 2, and 596 
for review by Harry D. Berg of Test 1. 

.4. *Cooperative General Culture Test, Revised Se- 
ries, Reviews by Benjamin S. Bloom and H. T. Morse. 

5. XCooperative Test on Recent Social and Scientific 
Developments. Reviews by Ernest W. Tiegs, Ralph W. 
Tyler, and Edgar B. Wesley. 

6. *Coordinated Scales of Attainment. Reviews by 
Roland L. Beck, Lavone A. Hanna, Glen Hass, Gordon 
N. Mackenzie, and C. C. Ross. 

7. Every Pupil Primary Achievement Test. Review 

y C. C. Ross. 

3 XFife Tests of Ability. Review by James Max- 
well. 

.9. *Gray-Votaw General Achievement Tests. Re- 
view by Roland L. Beck. 

10. *lowa Every-Pupil Tests of Basic Skills, New 
Edition. Reviews by Frederic L. Ayer, Gustav J. Froeh- 
lich, and Ralph C. Preston. See also 334 for reviews by 
William A. Brownell and Leroy H. Schnell of Test D, 
m for reviews by James R. Hobson and Constance 

. McCullough of Test A, and 545 for review by 
J. Wayne Wrightstone of Test B. 

Ir. *Iowa High School Content Examination, 1943 
Edition. Review by Maurice E. Troyer. 

.12. klowa Tests of Educational Development. Re- 
views by Henry Chauncey, Gustav J. Froehlich, and 
Lavone A. Hanna. 

k n *Metropolitan Achievement Tests [1947 Edi- 
on]. 

14. *Ohio General Scholarship Test for High School 
Seniors. Review by C. C. Ross. 


15. *Progressive Achievement Tests, 1943 Edition. 
Review by Paul A. Witty. " 

16. Progressive Tests in Social and Related Sci- 
ences. 

17. *Public School Attainment Tests for High 
School Entrance: Examinations of Abilities in Read- 
ing, English, and Mathematics. Review by Benjamin S. 

loom. 

18. *Stanford Achievement Test. Reviews by Walter 
W. Cook and Ralph C. Preston. 

19. *Tests for High School Entrants. Review by 
Benjamin S. Bloom; and an excerpt from a review. 

20. kTests of General Educational Development. 
Reviews by Herbert S. Conrad and Warren G. Find- 
ley. See also 122 for review by Charlotte W. Croon of 
Test 1, and 528 for review by W. E. Hall and C. Robert 
Pace of Test 2. 

21. XWide Range Achievement Test: Reading, 
Spelling, Arithmetic From Kindergarten to College, 
1046 Edition. Reviews by Paul Douglas Courtney, 
Verner M. Sims, and Louis P. Thorpe. 

22. kWisconsin Composite Achievement Test. Re- 
views by Ernest W. Tiegs and Maurice E. Troyer. 


CHARACTER & PERSONALITY 


23. A-S Reaction Study: A Scale for Measuring 
Ascendance-Submission in Personality. Review by 
William U. Snyder. 

24. Attitude-Interest Analysis Test. Review by 
Starke R. Hathaway. 

25. Behavior Cards: A Test-Interview for Delin- 
quent Children. Reviews by W. C. Kvaraceus and 
Simon: Tulchin. 

26. *California Test of Personality: A Profile of 
Personal and Social Adjustment. Reviews by Laurence 
F. Shaffer and Douglas Spencer ; and an excerpt from 
a review by Earl R. Gabler. 

af Concept Formation Test. Review by O. L. Zang- 


wt 
28. k[Re Concept Formation Test] Conceptual 
Thinking in Schizophrenia by Eugenia Hanfa and 
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E Kasanin. Excerpts from reviews by Oskar Diet- 
elm, Thomas M. French, W. H. Gillespie, K. Gold- 
stein, George W. Hartmann, Meyer Maskin, Wendell 
Muncie, A. S. Pickford, R. W. Pickford, Clara Thomp- 
son, David G. Wright, and two others. 

29. *Controlled rojection (1944): A Standard Ex- 
perimental Procedure. Reviews by Arthur L. Benton 
and Percival M. Symonds; and excerpts from reviews 
by M. B. Brody, John Cohen, and three others. 

30. *Cowan Adolescent Adjustment Analyzer: An 
Instrument of Clinical Psychology. Reviews by Harold 
H. Abelson and William U. Snyder. 

31. XDetroit Adjustment VU RD RAS Form of 
CES What I Do." Review by All Ellis. 

32. Detroit Scale of Behavior Factors. r 

33. [Re Detroit Scale of Behavior Factors.] Diag- 
nosis and Treatment of Behavior-Problem Children 
by Harry J. Baker and Virginia Traphagen. Excerpt 
from a review by Edw. H. Stullken. . 

34. Diagnosis and Treatment of Pupil Maladjust- 
ment. Review by Laurence F. Shaffer. 

35. *Dunlap Academic Preference Blank. Reviews 
by Lee J. Cronbach, W. C. Kvaraceus, and Edith I. M. 
Thomson; and an excerpt from a review by E. L. 


36. *Euphorimeter. i 

37. *[Re Euphorimeter.] Chart for Happiness by 
Hornell Hart. Excerpts from reviews by Emory S. 
Bogardus, J. W. Macmillan, J. L. Moreno, H. Neu- 
mann, Dortha Williams Osborn, Clarence M. Pruitt, 
and Howard Davis Spoerl. 

38. kEvery-Day Life: A Scale for the Measurement 
Lue Varieties of Self-Reliance. Review by Albert 

is. 

39. XExamining for Aphasia and Related Disturb- 
ances, Review by C. R. Strother; and an excerpt from 
a review, 

40. *Furbay-Schrammel Social Comprehension Test. 

41. *Goldstein-Scheerer Tests of Abstract and Con- 
crete Thinking. Reviews by Ludwig Immergluck, Kate 
Levine Kogan, C. R. Strother, and O. L. Zangwill. 

42.k[Re Goldstein-Scheerer Tests of Abstract and 
Concrete Thinking.] Abstract and Concrete Behavior: 
An Experimental Study With Special Tests by Kurt 
Goldstein and Martin Scheerer. Excerpts from a re- 
view by Austin G. Schmidt. f 

43. kGuilford-Martin Inventory of Factors 
GAMIN, Abridged Edition. Review by H. J. Eysenck. 

44. kGuilford-Martin Personnel Inventory I. Re- 
view m Benjamin Shimberg. 

45. kGwuilford-Martin Temperament Profile Chart. 
Review by R. A. Brotemarkle. 

46. High School Attitude Scale. Review by Lee J. 
Cronbach. 

A7. *H-T-P p House, Tree, and Person: A Measure 
of Adult Intelligence and a Projective Device: Pre- 
liminary Edition. Reviews by Morris Krugman and 
Katherine W. Wilcox. 

48. *Humm-Wadsworth Temperament Scale, 1940 
Edition. Reviews by H. J. Eysenck, H. Meltzer, and 
Lorenz Misbach. 

49. *Hunt-Minnesota Test for Organic Brain Dam- 
age. Reviews by Margaret Ives and O. L. Zangwill. 

50. Information Blank EA: A Questionnaire on 
Emotional Adjustment, Provisional Form. Review by 
Percival M. Symonds. 

51. XInterest. Analysis, 1942 Revision. Review by 
Edward B. Greene. 

52. Interest. Inventory for Elementary Grades: 
George Washington University Series. Reviews by 
Harold D. Carter and Lee J. Cronbach. 

53. Interest-Values Inventory. Reviews by E. Lowell 
Kelly and Paul E. Meehl. 

54. Inventory of Affective Tolerance. Reviews by 


i R. Farnsworth, E. Lowell Kelly, and William U. 
nyder, 
55. Inventory of Factors STDCR. Review by -H. J. 


ysenck. 

56. *JNB Psychograph. - 

57. XJohnson Temperament Analysis. Review by H. 
Meltzer. 

58. *Jurgensen Classification Inventory. 

59. *Mental Health Analysis, Reviews by William 
E. Coffman, Henry E. Garrett, C. M. Louttit, Jamese — 
Maxwell, and Douglas Spencer; and an excerpt from _ 
a review. 

60. *Minnesota Multiphasic Personality Inventory, 
Revised Edition. Reviews by Arthur L. Benton, H. J. 
Eysenck, L. S. Penrose, and Julian B. Rotter; and an 
excerpt from a review. 

61, *Minnesota Personality Scale, Reviews by Philip 
Eisenberg and John W. French, 

62. kMinnesota T-S-E Inventory. Reviews by Philip 
Eisenberg and John W. French. 

63. *Ohio Guidance Tests for Elementary Grades. 
Reviews by M. H. Elliott and John W. M. Rothney. 

64. *Personal Audit. Review by Percival M. Sy- 
monds. 

65. *Personality Index. Review by Benjamin Shim: 


rg. 

66. *Philo-Phobe. Review by Parker Davis, Jr. 

67. kProblem Check List. Reviews by Ralph C. 
Bedell and Theodore F. Lentz. 

68. *Problem Check List: Form for Rural Young 
People. 

69. kProblem Check List: Form for Schools of 
PEE 5 " dori 

70. *Recreation Inquiry. Reviews by Theodore F. 
Lentz and Louis Long. 

71. Revision of the Babcock Examination for Meas- 
uring Efficiency of Mental Functioning. 

72. *[Re Revision of the Babcock Examination for 
Measuring Efficiency of Mental Functioning.] Time 
and the Mind: Personal Tempo—The Key to Normal 
and Pathological Mental Conditions by Harriet Bab- 
cock. Excerpts from reviews by Arthur L. Benton, 
Irving Lorge, G. K. Yacorzynski, and one other. 

73. *Rorschach. Reviews by Morris Krugman and 
J. R. Wittenborn. 

74. [Re Rorschach.] Personality Structure in Schiz- 
ophrenia: A Rorschach Investigation in 81 Patients 
and 64 Controls by Samuel J. Beck, Excerpt from a 
review by D. Shakow. 

75:77. *[Re Rorschach.] Rorschach’s Test, Volumes 
I and II, by Samuel J. Beck. Excerpts from reviews 
by Philip Lawrence Harriman, M. R. Harrower- 
Erickson, Robert M. Lindner, Lawson G. Lowrey, 
Wendell Muncie, F. W. O'Brien, Zygmunt A. Pio- 
trowski, Albert I. Rabin, F. C. Redlich, Saul Rosen- 
zweig, Donald E. Super, W. D. Wall, Frederic S. 
Weil, and six others. 

78. k[Re Rorschach.] The Clinical Application of 
the Rorschach Test by Ruth Bochner and Florence 
Halpern. Excerpts from reviews by S. J. Reck, Ralph 
R. Brown, C. J. Earl, R. M. Finney, Arthur N. Foxe, 
Martin Grotjahn, M. R. Harrower-Erickson, Webb 
Haymaker, Morris Krugman, Lawson G. Lowrey, 
Edna Mann, Ernest G. Schachtel, W. D. Wall, and 
four others. 

79. k[Re Rorschach.] The Clinical Application of 
the Rorschach Test, Second Edition by Ruth Bochner 
and Florence Halpern. Excerpts from reviews by Ver- 
non Clark, Isabelle V. Kendig, Leslie Phillips, Albert 
L Rabin, and two others. 

8o. *[Re Rorschach.] The Application of the Ror- 
schach Test to Young Children by Mary Ford. Ex- 
cerpts from reviews by Lee J. Cronbach, C. M. Lout- 


1037 ] CLASSIFIED INDEX OF TESTS 


tit, L. Joseph Stone, Adolph G. Woltmann, and one 
other. : " 

8r. *[Re Rorschach.] Manual for Psychodiagnostic 
Inkblots (A Series Parallel to the Rorschach Blots) 
by M. R. Harrower and M. E. Steiner. Excerpts from 
reviews by Gotthard Booth, William F. Murphy, Zyg- 
munt A. Piotrowski, and Adolf G. Woltmann. 

82. *[Re Rorschach.] Large Scale Rorschach Tech- 
niques: A Manual for the Group Rorschach and Mul- 
«iple Choice Test by M. R. Harrower-Erickson and 
M. E. Steiner. Excerpts from reviews by Robert C. 
Challman, Bruno Klopfer, Louis A. Lurie, Ashley 
Montagu, Z. A. Piotrowski, Fred V. Rockwell, Don- 
ald E. Super, and six others. A 

82a. *[Re Rorschach.] Vocational Personalities: An 
Application of the Rorschach Group Method by Ruth 
Goldie Kaback. Excerpts from reviews by Max Hertz- 
man and Joseph Zubin. = 

83. *[Re Rorschach.) The Rorschach Technique: 
1946 Supplement by Bruno Klopfer and Helen H. 
Davidson. Excerpt from a review. 

84-5. *[Re Rorschach.] The Rorschach Technique: 
A Manual for a Projective Method of Personality 
Diagnosis by Bruno Klopfer. Excerpts from reviews 
by S. J. Beck, R. Milton Finney, Philip Lawrence Har- 
riman, Robert E. Harris, M. R. Harrower-Erickson, 
Morris Krugman, Edna Brand Mann, Claire Meyers, 
Z. A. Piotrowski, Ernest G. Schachtel, Ira S. Wile, and 
two others. 

86. *[Re Rorschach.] The Analysis and Control o; 
Human Experiences: The Individual Seen Through 
Rorschach, Volumes t and 2 by Paul Maslow, Ex- 
cerpts from reviews by Rose Palm and one other. 

87. *(Re Rorschach.] Rorschach Psychology ci 
Paul Maslow. Excerpts from reviews by Milton H. 
aud D. R. Miller, William F. Murphy, and one 
other. 

88. *[Re Rorschach.] Rorschach Theory by Paul 
Miuow. Excerpt from a review by Milton H. Erick- 


. *[Re Rorschach.] Prediction of the Adjustment 
and Academic Performance of College Students 
a Modification of the Rorschach Method Rut! 
Learned Munroe. Excerpts from reviews by Marion 
Cowin, Paul E. Meehl, and Donald E. Super. 
89a. *[Re Rorschach.] Rorschach Scoring Samples: 
Compiled From Various Sources for Private Circu- 
lation [Revised Edition] by Maria Rickers-Ovsiankina. 
EET from a review is unt A. Piotrowski 
90. X[Re Rorschach.] Psychodiagnostics: A Diag- 
hostie Bi ao » erception, Edition by 
ermann Rorschach. Ex from reviews by Bruno 
Klopfer and thrce Qu d y 
pt *[Re Rorschach.] Einführung in den Behn-Ror- 
schach Test by Hans Zulliger. Excerpt from a review. 
92. *Schrammel-Gorbutt Personality Adjustment 
Scale. Reviews by Raleigh M. Drake and Nelson G. 
Ex 
3. *Selective Vocabulary Test. Reviews ack 
w Dunlap and Starke R. Hathaway. E 
94. XS, ense of Humor Test, Second Edition. 
. 95. XShipley-Institute of Living Scale for Measur- 
ing Intellectual Impairment. Reviews by E. J. G. Brad- 
ford, William A. Hunt, and Margaret Ives. 


96. Social Intelli Test: ashington 
University Series, Eoen cx Glen U. Hide and 
oward R. Taylor. 


.kSocial Personality Inventory 
Maus. Review by Nelson G. n “ta 
rue Questionnaire. Review by Simon H. 
99. Study of Values: A i 
uet Interests ei Sedi qs pubs d 


English 


100. *Szondi Test. Review by Susan K. Deri. 

101. Test of Etiquette: George Washington Uni- 
versity Series. L 

102. *Test of Knowledge of Social Usage, 1942 
Edition. i P 

103. *Thematic Apperception Test. Reviews by 
Arthur L. Benton, Julian B. Rotter, and J. R. Wit- 
tenborn; and an excerpt from a review. 

104. *[Re Thematic Apperception Test.] 4 Method 
of Administering and Evaluating the Thematic Ap- 
perception Test m Group Situations by Ruth Millburn 
Clark. Excerpt from a review by Percival M. S: monds. 

104a. *[Re Thematic Toe ion Test.] The The- 
matic Apperception Test: hi and Technique 
of Interpretation by Silvan S. Tomkins with the col- 
laboration of Elizabeth J. Tomkins. Excerpt from a 
revicw. 

105. *[Torgerson's Inventories and Record Forms.) 
Review by Harold H. Abelson. ^ 

106. *V.. G. C. Personality Adjustment Indicator or 


Inventory, A 
107. *]'ineland Social esi Scale, Reviews by 
C. M. Louttit and John W. M. Rothney; and an 


excerpt from a review, 

108. *Virual Motor Gestalt Test. è 

109. [Re Visual Motor Gestalt Test] A Visual 
Motor Gestalt Test and Its Clinical Use by Lauretta 
Bender. Excerpts from reviews by W. Line and M. A. 
Rickers-Ovsiankina. 

110. Washburne Social-Adjustment SACRA] 
Thaspic Edition. Excerpts from reviews by 
Postel.and one other. ; 

111, ®Weitzman’s Inventory of Social Behavior. 
Reviews by Louis Long, and win Watson. 

112, kWilson Scales o Sneed and Instability. 


Reviews by Paul E. Meehl and Kai ne W. Wilcox. 
*Work Preference Inventory. Reviews by Ed- 
win W. Davis, John C. Flanagan, and Gilbert J. Rich. 


ENGLISH 


114. XAbility for English (Lan €): Fife Tests 
of Ability, Test 1. See 8 for review ty James Max- 
well of total battery. 

115. lowa Placement Examinations: English Apti- 
—— oe gue m eld obert C. 

ley. 
116. *Basic Language Skills: lowa Every-Pupil 
Tests of Basic Skills, Test C, New Edition. See 10 
for reviews by Frederic L. Ayer, Gustav Froehlich, 
and ean Young English Tests Clapp: Young Self 

117. Clapp-Young Eng est: ‘oun: - 
Marking ests. Review by Gerald V. holm. 

118. *College Placement Test in English. 

119. XCollege Preparatory Test in English. 

120. *Cooperative English Test: Lower and Higher 
Levels. Reviews by J. Paul Leonard, Ed 
Noyes, and C. Pooley. See also 497 for re- 
views by Robert Murray Bear and J. B. ud of 
Tests Ct and C2. 

121. *Correctness and Effectiveness of Expression: 
Tests of General Educational Development: High 
School Level, Test 1. See 20 for reviews by Herbert 
S. Conrad and Warren G. Findley of the total bat- 


122. XCorrectness and E ffectiv Ex ion: 
Tess of General Rámcations] Developsant" College 
Fre aedi hen etel rime E Croon. See 20 

ews by Her 
Findley of Ba f ; onrad and Warren G. 

123. *Cotswold Measurement of Ability 11: English. 

pe Hoe Test in English Usage. Review by 


125. *English Survey Test: Ohio Senior Survey 


Fine Arts 


Tests. Reviews by Charlotte W. Croon and J. Paul 
Leonard. 

126. English Test: National Achievement Tests. Re- 
view by Harry A. Greene. 

127. *English Usage: Every Pupil Test. Review by 
J. R. Gerberich. : 

128. *Essentials of English Tests. Reviews by Char- 
lotte W. Croon and Gerald V. Lannholm; and an 
excerpt from a review by William J. Jones. 

129. *Examination in English: College Level. Re- 
view by John S. Diekhoff. 

130. *Examination in English: High School Level. 
Reviews by Holland Roberts and Louis C. Zahner. 

131. *Iowa Placement Examinations: English Train- 
ing. Review by Robert C. Pooley. . 

132. *Kentucky English Test, [Revised]. 

133. kLanguage Essentials Tests. Review by Harry 

Greene. y 
134. kMechanics of Written English: State High 
School Tests for Indiana. 

135. Schonell Diagnostic English Tests. Review by 
John Cohen and Robert H. Thouless. 

136. Shepherd English Test: Clapp-Young Self- 
Marking Tests. Review by Ruth D. Churchill. 

137. XTools of Written English: State High School 
Tests for Indiana. 

138. kWisconsin Language Usage Test. 


LITERATURE 

139. Barrett-Ryan Literature Test. Review by Ches- 
ter W. Harris. 

140. Carroll Prose Appreciation Test. Review by 
Chester W. Harris. 

141. *Cooperative Literary Acquaintance Test. 

142. XCooperative Literary Comprehension and Ap- 
breciation Test. Review by Holland Roberts. 

143. *English: Understanding and Appreciation of 
Poetry: State High School Tests for Indiana. Review 
by Chester W. Harris. 

144. *Interpretation of Literary Materials: Tests of 
General Educational Development : College Level, Test 
4. See 20 for reviews by Herbert S. Conrad and War- 
ren G. Findley of the total battery. 

145. XInterpretation of Literary Materials: Tests of 
General Educational Development : High School Level, 
Test 4. See 20 for reviews by Herbert S. Conrad and 
Warren G. Findley of the total battery. 

146. *Rigg Poetry Judgment Test. Reviews by John 
S. Diekhoff and Louis C. Zahner. 

147. Survey Test in American Literature. 

148. *Survey Test In English Literature. 


MISCELLANEOUS 
149. XP-L-S Journalism Test. 


SPEECH 
150. kGuidance Questionnaire for Students of 
Speech. 
ISI. *Speech Attitude Scale. 


152. *Speech Experience Inventory. 


SPELLING 

153. Kansas Spelling Test, Reviews by Henry D. 
Rinsland and Guy M. Wilson. 

154. *Lincoln Diagnostic S, pelling Test. 

155. *Lincoln Intermediate Spelling Test. 

156. New Standard High School Spelling Scale. 

157. *Wellesley Spelling Scale. Reviews by Henry 
D. Rinsland and Guy M. Wilson. 


VOCABULARY 


158. Clinton General Vocabulary Test for High 
Schools and Colleges. Review by Harold H. Bixler. 
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159. Columbia Vocabulary Test. Reviews by Verner 
M. Sims and Clifford Woody. : 

160. kCooperative Vocabulary Test. Reviews by 
Edgar Dale and Henry D. Rinsland. s 

161. English Recognition Vocabulary Tesi. Reviews 
by Paul S. Burnham and Edgar Dale. 

162. kHigh School Vocabulary Test. Review by 
Harold H. Bixler. ; 

163. Inglis Tests of English Vocabulary. Review by 
Henry D. Rinsland. 


2 
164. kKansas Vocabulary Test. Review by Harold 


H. Bixler. < 

165. *Kennon Test of Literary Vocabulary, 1941 
Edition. Review by H. H. Remmers, x 

166. Michigan Vocabulary Profile. Review by Jo- 
seph E. King. y 

167. Survey Test of Vocabulary. Reviews by Verner 
M. Sims and Clifford Woody. | 

168. kVocabulary Test: National 
Tests. Review by Clifford Woody. " 

169. *Wide Range Vocabulary Test. Review by Paul 
S. Burnham. ^ 

170. kWord Dexterity Test. 


FINE ARTS > 
ART 
171. kHorn Art Aptitude Inventory: Preliminary 
Form, 1944 Revision, Review by Edwin Ziegfeld, 
172. kMeier Art Tests: I, Art Judgment. Review 
by Edwin Ziegfeld. a 
173. Selective Art Aptitude Test, Second Edition. 
Review by Edwin Ziegfeld. 


MUSIC 
^ *Beach Music Test. Review by James L. Mur- 
ell. 


Achievement 


s 

175. Drake Musical Memory Test: A Test of Mu- 
sical Talent. Review by William S. Larson. 

176. Kwalwasser-Dykema Music Tests. Review by 
William S. Larson. 

177. Seashore Measures of Musical Talents. Re- 
views by Paul R. Farnsworth, William S. Larson, and 
James L. Mursell. 


FOREIGN LANGUAGES 


178. *Iowa Placement Examinations: Foreign Lan- 
guage Aptitude. Review by H. E. Brogden. 

179. Language Aptitude Test: George Washington 
University Series. Review by H. E. Brogden. 


FRENCH 

180. *[Cooperative French Comprehension Test.) 
Reviews by Joseph F. Jackson and Clarence E. Turner. 
See also 182 for reviews by John H. Meyer and Roland 
Vinette of the total test. 

181. *Cooperative French Test: Elementary and Ad- 
vanced Forms. 

182. XCooperative French Test: Lower and Higher 
Levels. Reviews by John H. Mayer and Roland Vi- 
nette. See also 180 for reviews by Joseph F. Jackson 
and Clarence E. Turner of Part 1. 

183. *Examination in French Grammar. Review by 
Nelson Brooks. 

„184. kEzamination in French Reading C. omprehen- 
sion. Reviews by Joseph F. Jackson and Clarence E. 
Turner. 

185. *Examination in French Vocabulary. Review 
by Nelson Brooks. 

. 186. kPrench Grammar Test: Dominion Tests. Re- 
view by John H. Meyer. 

187. *French Reading: Dominion Tests. Review by 
Geraldine Spaulding. 


| 
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188. XFrench Vocabulary Test: Dominion Tests. 
Review by Roland Vinette. | 

189. Iowa Placement Examinations: French Train- 
ing: Series FT1, Revised. Review by Geraldine 
Spaulding. 

GERMAN 

190. Cooperative German Tesi: Advanced Form. 
Review by Harold B. Dunkel. 
e. 191. kExamination in German Grammar: Lower 
Level. Review by Herbert Schueler. 

192. kExamination in German Reading Compre- 
hension: Lower Level. Review by Herbert Schueler. 

193. *Examination in German Vocabulary: Lower 
Level. Review by Herbert Schueler. 

194. Lundeberg-Tharp Audition Test in German. 
Review by Harold B. Dunkel. 


HEBREW 


195. *Group Test in Siddur Reading. 
196. *Hebrew Aptitude Test. 

197. *Hebrew Intermediate Test. 
198. *Hebrew Primary Test. 


° ITALIAN 

199. XCooperative Italian Test. Review by Elton 
Hocking. 

200. Examination in Italian Grammar: Lower 
Level. 

201. XExamination in Italian Reading Comprehen- 
sion: Lower Level. 

202, kExamination in Italian Vocabulary: Lower 
Level. 

LATIN 

203. *[Cooperative Latin Comprehension Test.] Re- 
view by Hazel M. Toliver. See also 205 for review 
by C. D. Odell of the total test. 

204. *Cooperative Latin Test: Elementary and Ad- 
vanced Forms. 

205. kCooperative Latin Test: Lower and Higher 
Levels. Review by C. W. Odell. See also 203 for re- 
view by Hazel M. Toliver of Part 1. 

206. Kansas First Year Latin Test. Review by 
Hazel M. Toliver. 

207. Orleans-Solomon Latin Prognosis Test, Review 
by C. W. Odell, 


SPANISH 

208. kExamination in Spanish Grammar: Lower 
Level. Review by Frederick B. Agard. —. 

209. *Examination in Spanish Reading Compre- 
hension: Lower Level, Review by Harry J. Russell. 
E 21 *Examination in Spanish Vocabulary: Lower 
Level. 

211. Lundeberg-Tharp Audition Test in Spanish. 
Reviews by Frederick B. Agard and Walter V. Kaul- 
i and an excerpt from a review by Winthrop H. 

ice. 

212, Iowa Placement Examinations: Spanish Train- 
ings Series STr, Revised. Review by Harry J. Rus- 
sell. 

213. kFurness Test of Aural Comprehension in 
Spanish. Reviews by Frederick B. Agard and Walter 
V. Kaulfers. 


INTELLIGENCE 


GROUP 
F 214. kAcademic Aptitude Test: Non-Verbal Intel- 
igence. 
215. Academic Aptitude Test: Verbal Intelligence. 
Review by Marion A. Bills. d 
216. xAdaptability Test, Reviews by Anne Anatasi 
and Marion A. Bills. 


217. *American Council on Education Psychological 
Examination for College Freshmen. Reviews by W. D. 
Commins and J. P. Guilford, 

218. *American Council on Education Psychological 
Examination for High School Students. Review by 
Carl I. Hovland. 

219. kArmy General Classification Test: First Ci- 
vilian Edition. Excerpt from a review. 

„220. *Army Group Examination Alpha and revi- 
sions. 

221. k[Benge Employment Tests] Reviews by 
Brent Baxter and Marion A. Bills. 

222. kCalifornia Capacity Questionnaire. Reviews 
by Anne Anastasi and Emily T. Burr. 

223. *California Test of Mental Maturity. Review 
by Henry E. Garrett; and an excerpt from a review 


‘by W. Line and one other. 


224. Carnegie Mental Ability Tests: Clapp-Young 
Self-Marking Tests. Reviews by W. D. Commins and 
Robert L. Thorndike. 

225. kChicago Tests of Primary Mental Abilities. 
Bre by Cyril Burt, James R. Hobson, and F. L. 

ells. 

226. Cole-Vincent Group Intelligence Tests for 
School Entrants. Review by Ruth W. Washburn. 

227. kCotswold Mental Ability Test. 

228. *kCulture-Free Test. Reviews by L. S. Penrose, 
Walter C. Shipley, and David Wechsler. 

229. kDuplex Series of Ability Tests. 

230. *Essential Intelligence Test. 

231. *Group Test of Learning Capacity: The Do- 
minion Tests. Review by F. T. Tyler. 

232. *Inductive Reasoning Test. Review by Charles 
R. Langmuir. 

233. XJunior Scholastic Aptitude Test. 

234. *Kentucky General Ability Test. 

235. Kingsway Intelligence Tests. Excerpt from a 
review. k 

236. *Kuhlmann-Anderson Intelligence Tests, Fifth 
Edition. Reviews by W. G. Emmett and Stanley S. 
Marzolf. 

237. Laycock Mental Ability Test. Reviews by 
George A. Ferguson and F. T. Tyler. 

238. Mental Alertness Test: George Washington 
Pu Series, Revised Form. Review by J. P. Guil- 
ord. 

239. XMill Hill Vocabulary Scale. Review by David 
Wechsler. 

240. kMoray House Picture Intelligence Test r. 

241. [Moray House Tests of Intelligence.] Review 
by Patrick Slater. 

242. Multi-Mental Scale, Review by D. A. Worces- 
ter. 

243. *Non-Language Multi-Mental Test. Review by 
Carroll A. Whitmer. 

244. *Ohio State University Psychological Test, Re- 
view by J. P. Guilford. 

245. kOral Directions Test: An Intelligence Test 
on Phonograph Records. Reviews by Charles D. Flory, 
Irving Lorge, and William W. Turnbull. 

246. *O'Rourke General Classification Test, Senior 
Grade, 1942 Edition. Review by Marion A. Bills. 

247. *Otis Classification Test, Revised. Excerpt 
from a review by William J. Jones. i 

248. Otis General Intelligence Examination: De- 
signed Especially for Business Institutions. Review 
by Frederic Kuder, 

249. Otis Quick-Scoring Mental Ability Tests. Re- 
view by Frederic Kuder. 

250. Otis Self-Administering Tests of Mental Abil- 
ity. Review by Frederic Kuder. 

251. kPerception of Relations Scales. Review by 
Charles D. Flory. 


Intelligence 
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252. kPersonnel Classification Test [Henderson]. 
Reviews by Brent Baxter and John C. Flanagan. 
253. Personnel Classification Test [Wesman]. Ex- 
cerpt from a review. 
254. *Pininer General Ability Test: Non-Language 
Series. Review by Carroll A. Whitmer. 
255. XPintner General Ability Tests: Verbal Series. 
Reviews by Stanley S. Marzolf and D. A. Worcester. 
256. Pintner Non-Language Primary Mental Test. 
Reviews by Psyche Cattell and Carroll A. Whitmer. 
257. kProfion Dealltwriaeth Cyfaddasiad Cymbraeg. 
258. *Progressive Matrices: A Perceptual Test of 
Intelligence, 1938. Reviews by Walter C. Shipley and 
David Wechsler. 
259. *Revised Beta Examination. Reviews by Ra- 
leigh M. Drake and Walter C. Shipley. 
260. *Scale of Nonverbal Mental Ability. Excerpt 
from a review. 
261. XSRA Non-Verbal Classification Form. 
262. KSRA Verbal Classification Form. 
263. *Terman-McNemar Test of Mental Ability. 
Reviews by Carl I. Hovland and Robert L. Thorndike. 
264. kTests of Primary Mental Abilities for Ages 
5 and 6. Review by Florence L. Goodenough. 
265. kThurstone Test of Mental Alertness. Reviews 
by Anne Anastasi and Emily T. Burr, 
266. *Time Appreciation Test. Reviews by E. J. G. 
Bradford and Charles N. Cofer. 
267. kV.G.C. Intelligence Indicator. 
268. XWillis-Smith Advanced Mental Test. Reviews 
by Harold H. Bixler and F. T. Tyler. 
269. *Wonderlic Personnel Test. Reviews by H. E. 
Brogden and Charles D. Flory, and Irving Lorge. 


INDIVIDUAL 
270. *Alexander Performance Scale: A Perform- 
ance Scale for the Measurement of Practical Ability. 
Review by John Cohen; and excerpts from two re- 


views. 

271. *[Arthur] Point Scale of Performance Tests. 
Excerpt from a review. 

272. kCanadian Intelligence Examination. Excerpt 
from a review by Clara Blacklock and one other. 

273. *Carl Hollow Square Scale. Review by Grace 
H. Kent. 

274. Detroit Kindergarten Test. Reviews by Psyche 
Cattell and Ruth W. Washburn. 

275. *Detroit Tests of Learning Aptitude. Review 
by F. L. Wells. 

276. *Gesell Li Capena Schedules. Reviews by 
Nancy Bayley and Florence M. Teagarden. 

277. k[Re Gesell Developmental Schedules.] Devel- 
opmental Diagnosis: Normal and Abnormal Child De- 
velopment: Clinical Methods and Practical Applica- 
tions by Arnold Gesell and Catherine S. Amatruda. 
Excerpts from reviews by Edgar A. Doll, Milton 
H. Firestone, Arthur N. Foxe, Warren H. Gardner, 
Carter V. Good, Helen L. Koch, Alfred A. Washburn, 
Margaret T. Wilson, and three others. 

278. [Re Gesell Developmental Schedules.] Biog- 
raphies of Child Development: The Mental Growth 
Careers of Eighty-Four Infants and Children: by 
Arnold Gesell, Catherine S. Amatruda, Burton M. 
Castner, and Helen Thompson. Excerpts from reviews 
by W. Line, Elizabeth E. Lord, Livingston Welch, and 
one other. 

279. k[Re Gesell Developmental Schedules.] The 
First Five Years of Life: A Guide to the Study of 
the Preschool Child by Arnold Gesell, Henry M. Hal- 
verson, Helen Thompson, Frances L. Ilg, Burton M. 
Castner, Louise Bates Ames, and Catherine S. Ama- 
truda. P from reviews by Richard A. Bolt, 
Ives Hendrick, Aleš Hrdlička, J. W. Jenkins, C. W. 
Valentine, and three others. 


280. [Re Gesell Developmental Schedules.] The Psy- 
chology of Early Growth: Including Norms of In- 
fant Behavior and a Method of Genetic Analysis by 
Arnold Gesell and Helen Thompson assisted by Cath- 
erine Strunk Amatruda. Excerpt from a review. 

281. XInfant Intelligence Scale, Reviews by Flor- 
ence M. Teagarden and Beth L. Wellman; and an 
excerpt from a review by Norma V. Scheidemann. 

282. *[Re Infant Intelligence Scale.] The Measure- 
ment of Intelligence of Infants and Young Children + 
by Psyche Cattell Excerpts from reviews by Rachel 
Stutsman Ball, C. M. Louttit, T. L. McCulloch, and 
Helen Speyer. 

283. Intelligence Tests for Young Children. Ex- 
cerpt from a review by C. Burt and one other. 

284. *Kent Series of Emergency Scales. Review by 
Charles N. Cofer. 

285. Linfert-Hierholzer Scale for Measuring the 
Mental Development of Infants During the First Year 
of Life. Review by Nancy Bayley. 

286. *Minnesota Preschool Scale. Review by Beth 
L. Wellman. 

287. *[Re Minnesota Preschool Scale.] The Mental 
Growth of Children From Two to Fourteen Years: 
A Study of the Predictive Value of the Minnesota" 
Preschool Scales by Florence L. Goodenough and 
Katharine M. Maurer. Excerpts from reviews by 
Nancy Bayley, Gustav J. Froehlich, Willard C. Olson, 
C. W. Valentine, and one other. 

288. *[Re Minnesota Preschool Scale.] Intellectual 
Status at Maturity as a Criterion for Selecting Items 
in Preschool Tests by Katharine M. Maurer. Excerpts 
from two reviews. 

289. kNebraska Test of Learning Aptitude for 
Young Deaf Children: A Non-Verbal Individual Scale 
Especially Devised for Deaf and Hard of Hearing 
Children From Four to Ten Years of Age. 

290. *Performance Tests of Intelligence: A Series 
of Nonlinguistic Tests for Deaf and Normal Children, 
Third Edition. 

291. XPreliminary Test of Intelligence: A Brief 
Test of Adult Intelligence Designed for Psychiatric 
Examiners. 

292. Revised Stanford-Binet Scale. 

.293. *[Re Revised Stanford-Binet Scale.] The Re- 
vision of the Stanford-Binet Scale: An Analysis of 
the Standardization Data by Quinn McNemar. Ex- 
cerpts from reviews by Cyril Burt, Ethel L. Cornell, 
Henry E. Garrett, Forrest H. Kirkpatrick, Francis N. 
Maxfield, and one other. 

294. k[Re Revised Stanford-Binet Scale.] Supple- 
mentary Guide for the Revised Stanford-Binet Scale 
(Form L) by Rudolf Pintner, Anna Dragositz, and 
Rose Kushner. Excerpts from reviews by Earl R. 
Gabler and Francis N. Maxfield. 

295. *Stencil Design Test. Review by James M. 
Anderson. 

296. Van Alstyne Picture Vocabulary Test for Pre- 
school Children. Review by Ruth W. Washburn. 

297. kVisual Retention Test for Clinical Use. Ex- 
cerpt from a review. 

208. *Wechsler-Bellevue Intelligence Scale. Review 
by Robert I. Watson, 

299. [Re Wechsler-Bellevue Intelligence Scale.] The 
Measurement of Adult Intelligence by David Wechs- 
ler. Excerpts from reviews by Alfred L. Baldwin, 
Edward E. Cureton, Clinton P. McCord, Emily L. 
Stogdill, Charles L. Vaughn, and one other. 

300. k[Re Wechsler-Bellevue Intelligence Scale.] 
The Measurement of Adult Intelligence, Second Edi- 
tion by David Wechsler. Excerpts from reviews by 
Anne Anastasi, Irving Lorge, B. D. Misselbrook, Win- 
ifred Richmond, Henriette Woolf, J. Wayne Wright- 
stone, and one other. 
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301. *[Re Wechsler-Bellevue Intelligence Scale.] 
The Measurement of Adult Intelligence, Third Edi- 
tion by David Wechsler. Excerpts from reviews by 
Quinn McNemar, Ann Magaret, Irma Stark, C. Hart 
Westbrook, and one other. E 

302. kWechsler Memory Scale. Review by Kate 
Levine Kogan. 


MATHEMATICS 


° 

303. KCooperative Test in Secondary School Mathe- 
matics: Higher Level. Review by E. H. C. Hilde- 
brandt. ^ 

304. XCooperative College Mathematics Test for 
First-Year Courses. Reviews by Albert A. Bennett and 
Nathan Morrison. 

305. *Cooperative Mathematics Test for Grades 7, 
8, and 9. Review by M. L. Hartung. e 

306. kFoust-Schorling Test of Functional Thinking 
in Mathematics. Reviews by William Betz and M. L. 
Hartung; and excerpts from reviews by Harold P. 
Fawcett and Earl R. Gabler. 

307. General Mathematical Ability: Tests of Gen- 
eral Educational Development: High School Level, 
Test 5. See 20 for reviews by Herbert S. Conrad and 
Warren G. Findley of the total battery. 

308. *Iowa Placement Examinations: Mathematics 
Aptitude, Reviews by Edmund P. Churchill and Paul 
L. Dressel. 

309. Iowa Placement Examinations: Mathematics 
Training: Series MTr, Revised. Reviews by Edmund 
P. Churchill and Paul L. Dressel. 

310. *Junior High School Mathematics Test: Acorn 
Achievement Tests, Review by William Betz. 

311. *Kentucky Mathematics Test. 

312. Mathematics for High School Seniors: A 
Screening Test in Basic Skills. 

313. Mathematics Test: Ohio Senior Survey Tests. 
Review by William G. Mollenkopf. 

314. *Purdue Industrial Mathematics Test: Purdue 
Personnel Tests. Reviews by Clyde H. Coombs and 
C. C. Upshall. 

.315. *Rasmussen General Mathematics Test. Re- 
view by William G. Mollenkopf. 

316. *Test of General Pro EY in the Field of 
Mathematics: Cooperative General Achievement Tests, 
Revised Series, Test 3. Review by John F. Randolph. 
See 3 for review by John V. McQuitty of the total 
battery. 

317. kTest of Mathematical Fundamentals for 
Grades 7-12. 


ALGEBRA 

318. *Ability for Algebra: Fife Tesis of Ability, 
Test 3. See 8 for review by James Maxwell of the 
total battery. 

319. kAlgebra Readiness Test. 

320. XCalifornia Algebra Aptitude Test. Review by 
David Segel. 

321. *Cooperative Algebra Test: Elementary Alge- 
bra Through Quadratics, Revised Series. Review by 
W. C. Brenke. 

322. *Cooperative Intermediate Algebra Test: Quad- 
ratics and Beyond, Review by L. B. Plumlee. 

323. kExamination in College Algebra. Reviews by 
Albert A. Bennett and Edmund P. Churchill. 

324. kExamination in Elementary Algebra—High- 
School Level. Reviews by Richard M. Drake and John 
A. Long. 

325. Examination in Second-Y ear Algebra—High- 
S Level. Reviews by L. B. Plumlee and Daniel W. 

nader. 

326. *Garman-Schrammel Algebra Test. Reviews 
by Paul Blommers and E. H. C. Hildebrandt. 


327. *lowa Algebra Aptitude Test, Revised Edition. 
Review by David Segel. d 

328. *Survey Test in Elementary Algebra. Reviews 
by John A. Long and Daniel W. Snader. — 

329. Votaw Algebra Test. Reviews by Richard M. 
Drake and Nathan Morrison. 

330. Wisconsin Algebra Test. 


ARITHMETIC 

331. XArithmetic Reasoning. Review by Jacob S. 
Orleans. . 

332. *Arithmetic Test: Fundamental Operations: 
Dominion Tests. Review by C. L. Thiele. 

333. Arithmetic Test (Fundamentals and Reason- 
ing). 

E *Basic Arithmetic Skills: Iowa Every-Pupil 
Test of Basic Skills, Test D, New Edition. Reviews 
by William A. Brownell and Leroy H. Schnell. See 
10 for reviews by Frederic L. Ayer, Gustav Froehlich, 
and Ralph C. Preston of the total battery. 

335. XBasic Skills in Arithmetic Test. Reviews by 
Jacob S. Orleans and F. Lynwood Wren. 

336. XBrief Survey of Arithmetic Skills. 

337. Chicago Arithmetic Readiness Test. Review by 
Foster E. Grossnickle. 

338. *Chicago Arithmetic Survey Tests. Reviews 
by William A. Brownell and Foster E. Grossnickle. 

339. Clapp-Young Arithmetic Test: Clapp-Young 
Self-Marking Tests. Review by Leroy H. Schnell. 

340. *Cotswold Measurement of Ability I: Arith- 
metic. ` 

341. kDiagnostic Tests in Arithmetic Fundamentals: 
Dominion Tests. Review by Leo J. Brueckner. 

342. k[Essential Arithmetic Tests.) Reviews by 
John Cohen and Stephen Wiseman. 

343. kExamination in Advanced Arithmetic—High- 
School Level. Reviews by Monica M. Hoye and F. 
Lynwood Wren. 

344. kHundred-Problem Arithmetic Test. Review 
by William Betz; and an excerpt from a review by 
Earl R. Gabler. 

345. *Lee-Clark Arithmetic Fundamentals Reais d 
Test: High School Edition. Reviews by Monica M. 
Hoye and F. Lynwood Wren. 

2e *Metropolitan Achievement Tests [Arithme- 
tic]. 
346. Moray House Arithmetic Test. Review by John 
Cohen. 

347. Number Fact Check. 

348. *Oral Diagnostic Test in Addition: Analysis 
of Errors in Addition: Dominion Tests. Review by 
Leo J. Brueckner. 

340. kRenfrow Survey Tests of Mathematical Skills 
and Concepts. Review by C. L. Thiele. 

350. *Schonell Diaanostic Arithmetic Tests. 

351. XSchrammel-Otterstrom. Arithmetic Test. Re- 
view bv William A. Brownell. 

352. Southend Attainment Test in Mechanical Arith- 
metic. Review by Stephen Wiseman. 

353. Test on Arithmetic Meanings and Vocabulary. 
Review by Foster E. Grossnickle. 


CALCULUS 
354. kExamination in Differential Calculus. 
355. *Examination in Calculus II: Integral Cal- = 
culus „(Following Initiai Course Containing No In- 
tegration). Review by John F. Randolph. 


GEOMETRY 
nom *Ability for Geometry: Fife Tests of Ability, 
est 4. 
357. *Cooperative Plane Geometry Test. Reviews 
by Harold P. Fawcett and C. O. Oakley. 


Miscellaneous 
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358. kExamination in Analytic Geometry. Review 
by C. O. Oakley. ¢ 

359. kExamination in Plane Geometry—High- 
School Level. Review by Hale C. Pickett. 

360. *Iowa Plane Geometry Aptitude Test, Revised 
Edition. Review by Philip H. DuBois. 

361. Orleans Plane Geometry Achievement Test. Re- 
view by Harold P. Fawcett. 

362. Wisconsin. Geometry Test. 


TRIGONOMETRY 
363. *Cooperative Triognometry Test. 
364. XExamination in Plane Trigonometry. 


MISCELLANEOUS 


AGRICULTURE 
365. Animal Husbandry Test: State High School 
Tests for Indiana. 
366. kFarm Shop Tools: Recognition and Use: 
State High School Tests for Indiana. 


b BUSINESS EDUCATION 

367. kBookkeeping Test: State High School Tests 
for Indiana. 1 ; 

368. *Bookkeeping Test: United-NOMA Business 
Entrance Tests. Reviews by Harvey A. Andruss and 
Ray G. Price. 

369. *Business Fundamentals and General Infor- 
mation Test; United-NOMA Business Entrance Tests. 
Reviews by Vera M. Amerson and C. C. Upshall. 

370. kCommercial Arithmetic Test: State High 
School Tests for Indiana. j 

371. kCooperative Commercial Arithmetic Test. 

372. kE.R.C. Stenographic Aptitude Test. Reviews 
by Philip H. DuBois and Edward A. Rundquist. 

373. kExamination in Bookkeeping and Accounting. 
Review by Harvey A. Andruss. 

374. kExamination in Business Arithmetic. 

375. kExamination in Business English—High- 
School Level. Review by Orrel E. Little. 

376. kExamination in Commercial Correspondence— 
College Level. Reviews by Orrel E. Little and Her- 
bert A. Tonne. 

377. kExamination in Gregg Shorthand. Review 
by Agnes E. Osborne. 

378. *Examination in Typewriting. Review by E. G. 
Blackstone. 

379. *Filing Test: United-NOMA Business En- 
trance Tests. Reviews by Arnold E. Schneider and 
C. C. Upshall, 

„380. *General Test of Business Information. Re- 
views by Vera M. Amerson and Herbert A. Tonne. 

381. Hiett Stenography Test (Gregg). Review by 
Agnes E. Osborne. 

382. Kauzer Typewriting Test. Review by E. G. 
Blackstone. 

.383. XKimberly-Clark Typing Ability Analysis. Re- 
view by E. G. Blackstone. 

384. *Machine Calculation Test: United-NOMA 
Business Entrance Tests. Review by Elizabeth Fehrer. 

385. Parke Commercial Law Test. Review by Ray 
G. Price. 

386. *Seashore-Bennett Stenographic Proficiency 
Tests: A Standard Recorded Stenographic Work- 

^ sample. Review by Ann Brewington. 

387. Shemwell-W hitcraft Bookkeeping Test. Review 
by Arnold E. Schneider. 

388. XShorthand Test: State High School Tests 
for Indiana. 

388a. *SRA Clerical Aptitudes. 

388b. XSRA Dictation Skills. 

388c. XSRA Language Skills. 

388d, *SRA Typing Skills. 


389. Stenogauge. Review by Beatrice J. Dvorak. 

390. Stenographic Aptitude Test. Reviews by Philip 
H. DuBois and Edward A. Rundquist. 

391. *Stenographic Test: United-NOMA Business 
Entrance Tests. Reviews by Ann Brewington and 
Elizabeth Fehrer. a 

392. kTurse-Durost Shorthand Achievement Test 
(Gregg). Excerpt from a review. 

393. XTurse Shorthand Aptitude Test. Review by 
Philip H. DuBois. S 

394. *Typing Test: United-NOMA Business En- 
trance Tests. Reviews by E. G. Blackstone and Bea- 
trice J. Dvorak. 

395. kTypewriting Test: State High School Tests 
for Indiana. 

396. *United-NOMA Business Entrance Tests. Re- 
view by Paul S. Lomax. 


COMPUTATIONAL & SCORING DEVICES 


397. International Test Scoring Machine, Review 
by Arthur E. Traxler. 
398. kThurstone Scoring Board. 


EDUCATION 

399. Coxe-Orleans Prognosis Test of Teaching Abit- 
ity. Reviews by Harl R. Douglass and David G. 
Ryans. 

400. Educational Aptitude Test: George Washing- 
ton University Series. Reviews by A. S. Barr and 
Harl R. Douglass, 

401. kExamination in Elementary Psychology— 
College Level. 

402. kOhio Teaching Record: Anecdotal Observa- 
tion Form, Second Revised Edition. Excerpt from 
a review by Thomas O. Marshall. 

403. kPurdue Teacher's Examination: 
Teach. Review by David G. Ryans. 

404. Stanford Educational Aptitude Test. Reviews 
by A. S. Barr and David G. Ryans. 

405. Teaching Aptitude Test: George Washington 
University Series. Review by A. S. Barr. 

406. kTests of Human Growth and Development. 
Reviews by Harold E. Jones and Goodwin Watson. 


ENGINEERING 

407. kExamination in Advanced Engineering Elec- 
tronics. 

408. *Examination in Diesel Engineering. 

409. *Examination in Engineering Drawing. 

410. *Examination in Engineering Electronics. 

411. XExamination in Engineering Mechanics. 

412. kExamination in Fluid Mechanics. 

413. kExamination in Machine Design. 

414. kExamination in Radio Engineering. 

415. *Examination in Strength of Materials. 

416. kExamination in Surveying. 


ENVIRONMENT 


417. kAmerican Home Scale. Reviews by Henry S. 
Maas and Verner M. Sims. 


HEALTH 


418. XByrd Health Attitude Scale. Reviews by May- 
hew Derryberry and H. H. Remmers; and an excerpt 
from a review by William J. Jones. 

410. Gates-Strang Health Knowledge Tests. Ex- 
cerpt from a review by W. W. Bauer. 

420. kHealth and Safety Education Test: State 
High School Tests for Indiana. 

421. XHealth Education Test: Knowledge and Ap- 
plication. Reviews by H. H. Remmers and Mabel E. 
Rugen. 

422. kHealth Inventory for High School Students. 
Reviews by Mayhew Derryberry and Mabel E. Rugen. 


How I 
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423. Health Practice Inventory. Review by Thomas 
Kirk Cureton. t 

424. *Indiana University Motor Fitness Index. 

425. *Kilander Nutrition Information Test. 

426. kTrusler-Arnett Health Knowledge Test. Re- 
view by Thomas Kirk Cureton. 


HOME ECONOMICS 

427. kAssisting With Care and Play of Children: 
State High School Tests for Indiana. Review by 
Helen C. Dawe. 

428. *Assisting With Clothing Problems: State 
High School Tests for Indiana. 

429. *Child Development: State High School Tests 
for Indiana. Review by Helen C. Dawe. 

430. kClothing I: State High School Tests for 
Indiana. 

431. Clothing II: State High School Tests for 
Indiana, 

432. kFoods I, Food Selection and Preparation: 
State High School Tests for Indiana. 

433: kFoods II, Planning for Family Food Needs: 
State High School Tests for Indiana. 

434. kHelping With Food in the Home: State High 
School Tests for Indiana. 

435. *Helping With the Housekeeping: State High 
School Tests for Indiana. 

436. kHome Care of the Sick Test: State High 
School Tests for Indiana. 

437. *Housing the Family: State High School Tests 
for Indiana. 

438. *Minnesota Check List for Food Preparation 
and Serving, Revised Edition. 

439. *Minnesota Food Score Cards, Revised Edition. 


INDUSTRIAL ARTS 
440. *Examination in Mechanical Drawing. 
441. XMechanical Drawing Test: State High School 
Tests for Indiana. 


RECORD & REPORT FORMS 
œ 442. kAmerican Council on Education Cumulative 
Record Card for Grades 1, 2, and 3. 

443. kAmerican Council on. Education Cumulative 
Record Card for Grades 4, 5, and 6. 

444. *Amertcan Council on Education Cumulative 
Record Card for Junior and Senior High Schools, 1941 
Revision, Review by Herbert A. Toops. 

445. *American Council on 'Education Cumulative 
Record Card for Colleges, 1942 Revision. Review by 
Herbert A. Toops. 

. 446. *Guidance Summary Form for Use in Voca- 
tional and Educational Counseling. Review by Norman 
Frederiksen; and an excerpt from a review. 

447. *Hamilton Cumulative Record Folder, Revised 
Edition. 

448. *Standard Profile Chart. Review by Charles R. 
Langmuir, 

449. XV.G.C. Anecdotal Record Form. 

450. *V.G.C. Cumulative Record Folder. 

451. XV.G.C. Interview Record Form. 

452. kV.G.C. Student Information Form, Revised. 


RELIGIOUS EDUCATION 
452a. *Hebrew School Attitude Test. 
453. *Jewish Home Environment Test. 
454. XJewish Information Test. 
455. *Religion Essentials Test. 
456. XWilson Tests of Religious Aptitude. Review 
by Goodwin Watson. 


SAFETY | 
457. Instructional Test on Safety. 
458. *Siebrecht Attitude Scale. 


SENSORY-MOTOR 

459. XA-B-C Vision Test for Ocular Dominance. 

460. KAO Sight-Screener. Reviews by Henry A. 
Imus and F. Nowell Jones. 

461. *Binocular Reading Test. Review by Albert J. 
Harris. B 
aoe: *Dvorine Color Discrimination Screening 

est. 

462. kDvorine Color Perception Testing Charts, 
Vol. 1. Excerpts from reviews by Knight Dunlap, 
Carel C. Koch, Elsie Murray, and Miles A. Tinker and 
a reply by Israel Dvorine. 

463. *Eames Eye Test. 

464. kFarnsworth Dichotomous Test for Color 
Blindness: Panel D-15. Excerpt from a review, 

465. kFarnsworth-Munsell roo-Hue Test for Anom- 
alous Color Vision. 

466. *Harris Tests of Lateral Dominance. Excerpt 
from a review. 

467. *Keystone Telebinocular. Review by F. Nowell 
Jones; and an excerpt from a review. 

468. *Massachusetts Vision Test. Excerpt from a 
review by the Council on Physical Therapy. 

469. *New Test for the Detection of Colorblindness. 
Excerpts from two reviews. 

470. *Ophthalmograph. 

471. XOrtho-Rater. Reviews by Henry A. Imus and 
F. Nowell Jones. 

472. kOseretsky Tests of Motor Proficiency: A 
Translation from the Portuguese Adaptation. Excerpt 
from a review. 

473. kPseudo-Isochromatic Plates for Testing Color 
Vision. Excerpt from a review. 

474. XTest of Color Blindness. 

475. XWestern Electric Audiometer. 


READING 


476. kAchievement Tests in Silent Reading: Do- 
minion Tests. Review by Margaret G. McKim. 

477. *Buffalo Reading Test for Speed and Com- 
prehension, Revised. Reviews by Holland Roberts and 
William W. Turnbull. 

478. Chicago Reading Tests. Excerpt from a re- 
view by William J. Jones. 

479. DeVault Primary Reading Test. Review by 
Alice N. Jameson. 

480. Diagnostic Examination of Silent Reading Abil- 
ities. Reviews by Frederick B. Davis, W. E. Hall, 
and J. B. Stroud. 

481. *Diagnostic Reading Tests. 

482. *Elementary Reading: Every Pupil Test. 

483. Garrison First Year Reading Test. Review by 
Ruth Lowes. 

484. *Gates Advanced Primary Reading Tests. Re- 
views by Virginia Seavey and George Spache. 

485. *Gates Basic Reading Tests. Reviews by George 
Spache, Herbert F, Spitzer, and T. L. Torgerson. 

486. *Gates Primary Reading Tests. Reviews by 
William S. Gray and George Spache. 

487. Gates Reading Survey for Grades 3 to 10: Vo- 
cabulary, Level of Comprehension, Speed, and Accu- 
racy. Reviews by Dorothy E. Holberg. and Herbert | 
P. MIRA School Readiig Test: National Achieve: 

4 ‘igh School Reading Test: National Achieve: 
ment Tests. Review by Robert. meer ES nl 

489. *Iowa Silent Reading 4 Se iton Re- 
vised. Reviews by Frederick B. is and William 
W. Turnbull; and excerpts from 3 by Earl R. 
Gabler and Margaret Pankaskie. — -— — 

.490. *Lee-Clark Reading Test—Primer, 1943 Edi- 
tion. Review by Ruth Lowes. 

490a. *Metropolitan Achievement Tests [Reading]. 


Reading s 
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491. Minnesota Reading Examination for College 
Students. Review by James M. McCallister. 

492. Nelson Silent Reading Test: Vocabulary and 
Paragraph: Clapp-Young Self-Marking Tests. Re- 
view by Constance M. McCullough. | i 

493. *Primary Reading: Every Pupil Test. Reviews 
by William S. Gray, and Virginia Seavey. 

494. Primary Reading Test. Review by Ruth Lowes. 

495. kPrimary Reading Test: Acorn Achievement 
Tests. Review by Alice N. Jameson. " 
ni Purdue Reading Test. Review by Albert J. 

arris. . 


497. *Reading Comprehension: Cooperative English . 


Test: Lower and Higher Levels, Tests Cr and Ca. 
Reviews by Robert Murray Bear and J. B. Stroud. 
See 120 for reviews by J. Paul Leonard, Edward S. 
Noyes, and Robert C. Pooley of the total battery. 

498. Reading Comprehension Test: National 
Achievement Tests. Review by James R. Hobson. 

499. XSchonell Reading Tests. Review by Edith I. 
M. Thomson. h 

500. *Schrammel-Gray High School and College 
Reading Test. Reviews by James M. McCallister and 
Robert L. McCaul; and an excerpt from a review by 
William J. Jones. 

501. *Silent Reading Comprehension: Iowa Every- 
Pupil Tests of Basic Skills, Test A. Reviews by 
James R. Hobson and Constance M. McCullough. See 
10 for reviews by Frederic L. Ayer, Gustav J. Froeh- 
lich, and Ralph C. Preston of the total batay: 

502. XSRÁ. Reading Record. Review by Frances 
Oralind i riggs: and an excerpt from a review. 

503. *Stanford Achievement Test [Reading]. Re- 
view by Margaret G..McKim. See 18 for reviews by 
MERI C. Cook and Ralph C. Preston of the total 

ittery. 

504. Stone-Webster Test in Beginning Reading. Re- 
view by Ruth Lowes. 

505. *Techniques in Reading Comprehension for 
Junior-Senior High School: Every Pupil Test. Re- 
views by Ivan A. Booker and James M. McCallister. 

506. *Thorndike-Lorge Reading Test for Grades 7 
to 9. Review by Robert L. McCaul; and an excerpt 
from a review. 

507. Whipple’s High-School and College Reading 
Test. Review by Frederick B. Davis. 

508. Williams Primary Reading Test. Review by 
Alice N. Jameson. 


MISCELLANEOUS 

509. Examiner's Reading Diagnostic Record for 
High School and College Students. Reviews by Rob- 
ert Murray Bear and Carolyn M. Welch. 
_ 510. *Gates Reading Diagnostic Tests, Revised Edi- 
tion. Review by T. L. Torgerson. i 

sit. Inventory of Reading Experiences. Review by 
Albert J. Harris. 

512. Record for Reading Diagnosis. Review by Caro- 
lyn M. Welch. 


READINESS 

, 513. *American School Reading Readiness Test; Re- 
views by David H. Russell and Paul A. Witty. 

514. *Binion-Beck Reading Readiness Test for Kin- 

» dergarten and First Grade. Reviews by Irving H. 
Anderson and Paul A. Witty. 

515. Classification Test for Beginners in Reading. 
car Marion Monroe Cox and David H. Rus- 
sell. 

516. Gates Reading Readiness Tests. Reviews by 
Marion Monroe Cox and Paul A. Witty. 

517. *Lee-Clark Reading Readiness Test, 1943 Re- 
vision. Reviews by Marion Monroe Cox and David 
H. Russell, 


518. Metropolitan Readiness Tests. Review by Irv- 
ing H. Anderson. 

519. Reading Aptitude Tests. Review by Irving H. 

derson. 

520. Reading Readiness Test. Review by David H. 
Russell. 


521. *Stevens Reading Readiness Test. Reviews by 
Irving H. Anderson and Marion Monroe Cox, 


SPEED a 

522. Chapman-Cook Speed of Reading Test. Review 
by Eason Atari. 

523. Michigan Speed of Reading Test. Review by 
Eason Monroe, 

524. “Reading Speed and Comprehension: Ohio 
Senior Survey Tests. Reviews by J. B. Stroud and 
Miles A. Tinker. 

525. Reading dis Test: National Achievement 
Test. Review by Eason Monroe. 


SPECIAL SUBJECTS 

526. *Interpretation of Reading Materials in the 
Natural Sciences: Tests of General Educational De- 
velopment: College Level, Test 3. See 20 for reviews 
by Herbert S. Conrad and Warren G. Findley of the 
total battery. 

527. kInterpretation of Reading Materials in the 
Natural Sciences: Tests of General Educational De- 
velopment: High School Level, Test 3. See 20 for re- 
views by Herbert S. Conrad and Warren G. Findley 
of the total battery. 

528. *Interpretation of Reading Materials in the 
Social Studies: Tests of General dicational Devel- 
opment: College Level, Test 2. Reviews by W. E. Hall 
and C. Robert Pace, See 20 for reviews by Herbert 
S. Conrad and Warren G. Findley of the total bat- 
tery. 

529. kInterpretation of Reading Materials in the 
Social Studies: Tests of General Educational Devel- 
opment: High School Level, Test 2, See 20 for re- 
views by Herbert S. Conrad and Warren G. Findley 
of the total battery. 

530. *Reading Scales in History. Reviews by Paul" 
Blommers and Albert J. Harris. 

531. Reading Scales in Literature. 

532. *Reading Scales if Science. Review by Ivan 
A. Booker. 

:533. XTest of Reading Ability for. " 


STUDY SKILLS 

534. kApplied Reading for Junior-Senior High 
School: Every Pupil Test. Review by Ivan A. Booker. 

535. "Interpretation of Data. Review by J. Wayne 
Wrightstone. 

536. *Library Test for Junior High Schools. Re- 
views by Robert A. Davis and Ethel M. Feagley. 

537. kLibrary Usage Test. Review by J. Wayne 
Wrightstone. 

538. Peabody Library Information Test. Review by 
Douglas E. Scates; and excerpts from reviews by 
Ruth E. Seeger. 

539. *Special Reading Test: Ohio Senior Survey 
Tests. Review bi Miles A. Tinker. 

540. *Study-Habits Inventory, Revised Edition, 
1941. Review by Douglas E. Scates. 

_ 541. Survey of Study Habits, Experimental Edi- 
tion. 

542. Test of Study Skills, Review by Douglas E. 
Scates. 

543. Use of Library and Study Materials, Review 
by Ethel M. Feagley. 

544. XWatson-Glaser Tests of Critical Thinking. 
Review by Robert H. Thouless; and an excerpt from 
a review by Harold P. Fawcett. 


1045 ] CLASSIFIED INDEX OF TESTS 


* -— Social Studies 


545. *Work-Study Skills: Iowa Every-Pupil Tests 
of Basic Skills, Test B, New Edition. Review by J. 
Wayne Wrightstone. See 10 for reviews by Frederic 
L. Ayer, Gustav J. Froehlich, and Ralph C. Preston 
of the total battery. 


SCIENCE 


546. * Ability for Science: Fife Tests of Ability, Test 
" See 8 for review by James Maxwell of the total 
attery. 

.547. Physical Science Aptitude Examination. Re- 
views by Jack W. Dunlap and John C. Flanagan. 


548. *Test of General Proficiency in the Field of 


Natural Sciences: Cooperative General Achievement 
Tests, Revised Series, Test 2. Review by Palmer O. 
Johnson. See 3 for review by John V. McQuitty of 
the total battery. 


BIOLOGY 

549. *Biology: Every Pupil Test. 

550. *Cooperative Biology Test. Review by C. W. 
Horton. 

551. XCooperative College Biology Test. Reviews 
by Clarence H, Nelson and Joseph J. Schwab. 

552. XExamination in Biology—College Level. Re- 
view by C. W. Horton. 

553. XExamination in Biology—High-School Level. 
Reviews by C. W. Horton and. Richard H. Jordan. 

554. Examination in Botany—College Level. Re- 
views by: C. W. Horton and Clarence H. Nelson. 

555. Wisconsin Biology Test. 


CHEMISTRY 

556, XA.C.S. Cooperative Biochemistry Test. 

557. XA.C.S. Cooperative General Chemistry Test 
for College Students. Reviews by Sidney J. French 
and Florence E. Hoopér, 

558. XA.C.S. Cooperative Organic Chemistry Test. 
Review by Shailer Peterson. 

559. XÁ.C.S. Cooperative Physical Chemistry Test. 
Review by Alfred S. Brown. 


= 560. *Chemistry: Every Pupil Test. 


561. *Cooperative Chemistry Test. Review by John 
H. Daugherty. 

562. kCooperative Chemistry Test in Qualitative 
Analysis. Reviews by William B. Meldrum and Wil- 
liam Rieman III. 

563. *Cooperative Chemistry Test in Quantitative 
Analysis. Reviews by William B. Meldrum and Wil- 
liam Rieman III. f 

564. kExamination in Chemistry—High-School 
Level. Review by Victor H. Noll. " 

565. XExamination in General Chemistry—College 
Level. Reviews by John H. Daugherty and Florence 
E. Hooper. 

566. *Iowa Placement Examinations: Chemistry Ap- 
titude. Review by Theodore A. Ashford. ? 

567. Iowa Placement. Examinations: Chemistry 
Training: Series CT1, Revised. Review by Theodore 
A. Ashford. 3 

568. *Kirkpatrick Chemistry Test. Review by Theo- 
dore A. Ashford. > 

560. XWisconsin Achievement Tests: Chemistry. 
Review by Victor H. Noll. 


GENERAL SCIENCE 

570. *Cooperative General Science Test. 

571. *Cooperative Science Test for Grades 7, 8, 
and 9. Reviews by Hans C. Gordon and Herbert A. 
Thelen. 3 
Te *Elementary Science and Health: Every Pupil 

est. 


573. kExamination in General Science—High- 


School Level. Reviews by Hans C. Gordon and Vic- 
tor H. Noll. 

574. kExamination in Senior S. cience—High-Scliool 
Level Review by Richard H. Jordan. 

575. *General Science: Every Pupil Test. 

576. *McDougal General Science Test. Reviews by 
Hans C. Gordon and Herbert A. Thelen. 

577. *Wisconsin General Science Test. 


£ MISCELLANEOUS 


578. *Examinalion in Astronomy—College Level. 
579. *Cooperative Geology Test. 

m *Examination in Meteorology—High-School 
vel. 


PHYSICS 


581. *Cooperative Physics Test. Review by G. P. 
Cahoon. 

582. *Cooperative Physics Tests for College Stu- 
dents. Review by Edgar P. Slack. 

583. XExamination in Electricity and Magnetism— 
College Level. 

584. kExamination in Electron Tubes and Circuits, 

585. XExamination in Physics—High-School Level. 
Reviews by G. P. Cahoon and Palmer O. Johnson. 

586. kExamination in Physics: College Level. 

587. *Iowa Placement Examinations: Physics Ap- 
titude. Review by Robert M. W. Travers. 

588. *Physics: Every Pupil Test. 

589. Wisconsin Physics Test. 


SOCIAL STUDIES» 


590. *American History—Government—Problems 
of Democracy: Acorn Achievement Tests. Review by 
Howard R. Anderson. 

501. XCooferative Community Affairs Test, Reviews 
by W. H. Cartwright, J. R. Gerberich, and Lavone 
A. Hanna. ; 

592. Cooperative Social Studies Test for Grades 
7, 8, and 9. Reviews by Robert A. Davis and Edgar 
B. Wesley. > 1 

503. *lllinois Teachers College Cooperative Social 
Science Test, 1942 Edition, Review by Harry D. 


Berg. 
e *Social Studies Test: National Achievement 
Tests. Review by Ray G. Wood. y 1 
505. *Stanford Achievement Test: Social Studies 
Tests. Review by Ray G. Wood. See 18 for reviews 
by Walter W. Cook and Ralph C. Preston of the 


total battery. f 3 [ 

506. *Test of General Proficiency in the Field of 
Social Studies: Cooperative General Achievement 
Tests, Revised Series, Test 1. Review by Har j 
Berg. See 3 for review by John V. McQuitty of the 
total battery. 


ECONOMICS 
507. *Cooperative Economics Test. 


GEOGRAPHY 
598. Emporia Geography Test. Reviews by Edwin 


H. Reeder and Agatha Townsend. - 
509. Fourth Grade Geography Test. Reviews by 


Elaine Forsyth and Agatha Townsend. 
(s Te Test: National Achievement Tests, 


Review by Elaine Forsyth. > 
bor. XTate Economic Geography Test. Review by 


Marguerite Uttley. 2 
b02. Wiedefeld-Walther Geography Test. Review 


by Marguerite Uttley. i 
(8 World Geography Test: Dominion Tests. Re- 


view by Edwin H. Reeder. 
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HISTORY 

604. *Cooperative American History Test. 

605. *Cooperative Mo European History Test. 

606. *Cooperative World History Test. 

607. *Examination in American History. Review 
by Howard R. Anderson. " : 

608. xExamination in Modern European History. 
Review by Frederick H. Stutz. 

609. kExamination in World History—High-School 
Level, Reviews by Dorothy C. Adkins and Wallace 
W. Taylor. 

610. Kansas American History Test. Review by 
W. H. Cartwright. y 

611. *Kansas Modern European History: Test. Re- 
view by Frederick H. Stutz. 

612. kKniss World History Test. Reviews by Doro- 
thy C. Adkins and Wallace W. Taylor; and excerpts 
from reviews by William J. Jones and Waldo F. 
Mitchell. R 4 

613. XSurvey Test in United States History. 


POLITICAL SCIENCE 

614-5. XCooperative Test in American Government. 
ne X*Examination in Civics. Review by Roy A. 

rice. 

617. kExamination. in Problems of Democracy— 
High-School! Level, Review by Lavone A. Hanna. 

18, *Mordy-Schrammel Constitution Test. Review 

by W. H. Cartwright. " 

619. Mordy-Schrammel Elementary Civics Test. Re- 
view by C. Robert Pace. 


VOCATIONS 


620. *Differential Aptitude Tests. Excerpt from a 
review, 

621. *Guilford-Zimmerman Aptitude Survey. 

622. kYale Educational Aptitude Tests. 


CLERICAL 

623. XClerical Aptitude Test: Acorn National A; 
titude Tests, Reviews by Marion A. Bills, Donald G. 
Paterson, Henry Weitz, and E. F. Wonderlic. 

624. Clerical Perception Test. Reviews by Ed- 
ward N. Hay, Raymond A. Katzell, Erwin K. Taylor, 
and E. F. Wonderlic; and an excerpt from a review. 

625. Clerical Test D: Extending— Verifying— 
Checking—Classifying. Reviews by Donald G. Pat- 
erson and John M. Willits. 

626. *Detroit Clerical Aptitudes Examination, Re- 
view by E. F. Wonderlic. 

627. *Minnesota Clerical Test. Reviews by Thelma 
Hunt, R. B. Selover, Erwin K. Taylor, and E. F. 
Wonderlic. " 

628. N.LI.P. Clerical Test: American Revision. 
Review by R. B. Selover. 

629. O'Rourke Clerical Aptitude Test, Junior Grade. 
Review by Raymond A. Katzell. 

630. Psychological Corporation General Clerical 
Test. Reviews by Edward N. Hay, Thelma Hunt, Ray- 
mond A. Katzell, and E. F. Wonderlic. 

631. kSurvey of Working Speed and Accuracy. 
Reviews by Edward N. Hay, Donald G. Paterson, 
and Erwin K. Taylor. 

632. Thurstone Examination in Clerical Work: 
Thurstone Employment Tests. Reviews by John M. 
Willits and E. F. Wonderlic. 


INTERESTS 
633. Basic Interest Questionnaire: For Selecting 
Your Vocation or Avocation. Excerpt from a review 
by Donald E. Super. 
634. *Brainard Occupational Preference Inventory. 
Reviews by Edwin W. Davis and Herschel T. Manuel. 


635. *Cleeton Vocational Interest Inventory, Re- 
vised Edition. Reviews by Edward B. Greene, C. A. 
Oakley, and Arthur E, Traxler; and an excerpt from 
a review by A. Gordon Nelson. 

636. *Gregory Academic Interest Inventory. Re- 
views by Paul S. Burnham, Lysle W. Croft, and 
Herbert A. Toops. t 

637. *Interest Questionnaire for High School Stu- 
dents, 1942 Edition. Review by Lysle W. Croft. 

638. *Inventory of Vocational Interests: Acorn Nas 
tional Aptitude Tests. Reviews by Marion A. Bills, 
Edward S. Bordin, Harold D. Carter, and Patrick 
Slater. 

639. XJob Qualification Inventory. Reviews by Ralph 
F. Berdie and Stanley G. Dulsky; and excerpts from 
reviews by Donald E. Gorseline and Robert C. Woell- 


ner, 

640. *Kuder Preference Record. Reviews by Ralph 
F. Berdie, E. G. Chambers, and Donald E. Super ; and 
an excerpt from a review by Arthur H. Brayfield. 

641. Motivation Indicator. Reviews by Norman 
Frederiksen and Arthur E. Traxler; and an excerpt 
Írom a review. 

642. Occupational Interest Blank for Women. Re- 
views by Gwendolen Schneidler Dickson and Frances 
Oralind Triggs. a 

643. kOccupational Interest Inventory. Reviews by 
Edward S. Bordin and Stanley G. Dulsky. 

644. kOccupational Interests: Self Analysis Scale. 
Review by Stanley G. Doiy: 

645. kPrimary Business Interests Test. Reviews 
by George K. Bennett, Glen U. Cleeton, and George 
A. Ferguson. 

646. kThurstone Interest Schedule, 

647. Vocational Interest Blank for Men, Revised. 

648. k[Re Vocational Interest Blank for Men]. 
Clinical Aspects and Interpretation of the Strong Vo- 
cational Interest Blank by John G. Darley. Excerpts 
from reviews by Bertha Peterson Harper and Donald 
E. Super. 

649. *V ocational Interest Blank for Women, Revised. 
Review by Gwendolen Schneidler Dickson. 

650. *[Re Vocational Interest Blanks for Men and 
Women.] Vocational Interests and Job Orientation: 
A Ten-Year Review by Harold D. Carter, Excerpt 
from a review by Edward B. Greene. 

651. *[Re Vocational Interest Blanks for Men and 
Women.] Hankes’ Answer Sheet for Vocational In- 
terest Blank. 

652. *[Re Vocational Interest Blanks for Men and 
Women.] Vocational Interest of Men and Women 
by Edward K. Strong, Jr. Excerpts from reviews by 
Harold D. Carter, Stephen M. Corey, John G, Darley, 
Robert J. Havighurst, Frederic Kuder, Charles S. 
Meyers, C. L. Shartle, Donald E. Super, and William 
H. Stone. 

653. Vocational Interest Schedule. Review by Don- 
ald E. Super. 

654. Vocational Interest Test for College Women. 
Review by Frances Oralind Triggs. 

655. Vocational Inventory. Reviews by Edward S. 
Bordin, Harold D. Carter, and Donald E. Super; and 
an excerpt from a review by Arthur H. Brayfield. 


MANUAL DEXTERITY 

656. *Benge Han-Dexterity Test. Reviews by C. H. 
Lawshe, Jr., and Joseph E. Moore. 

657. *Benge Two Hand Coordination Test, Review 
by Milton L. Blum. 

658. *Crawford Spatial Relations Test. Review by 
William R. Grove. 

659. *Hand-Tool Dexterity Test. Reviews by C. H. 
Lawshe, Jr., and Neil D. Warren. 
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660. Kent-Shakow Formboard: Industrial Model. 
Review by Milton L. Blum. S M 

661. *MacQuarrie Test for Mechanical Ability. Re- 
views by John R. Kinzer, C. H. Lawshe, Jr, and 
Alec Rodger. - / . 

662. XMellenbruch Curve-Block Series. Reviews by 
William R. Grove and Willard A. Kerr. 

663. *Minnesota Rate of Manipulation. Reviews by 
Edwin E, Ghiselli and John R. Kinzer; and am ex- 
eerpt from a review. 1 > 

664. Minnesota Spatial Relations Test. Review by 
Milton L. Blum. 

655. kPennsylvania Bi-Manual Worksample. Re- 
views by Edwin E. Ghiselli, Thomas W. Harrell, Al- 
bert Gibson Packard, and Neil D. Warren, ^ 

666. *Purdue Pegboard. Reviews by Edwin E. Ghi- 
selli, Thomas W. Harrell, and Albert Gibson Packard. 
a in *Small Parts Dexterity Test. Review by Joseph 

^ oore, 


MECHANICAL ABILITY 

668. Detroit Mechanical Aptitudes Examination, Re- 
um Reviews by Lloyd G. Humphreys and Dewey 

. Stuit. 

« 669. Mechanical Aptitude Tests: Acorn National 
Aptitude Tests. Reviews by Reign H. Bittner, James 
M. Porter, Jr., and Alec Rodger. 

670. *Mellenbruch Mechanical Aptitude Test for 
Men and Women. Reviews by Lloyd G. Humphreys 
and C. A. Oakley. 

671. Minnesota Assembly Test. Review by William 
R. Grove. 

672. *O'Rourke Mechanical Aptitude Test, imr 
Grade, Reviews by Jay L. Otis and George A. Satter, 

v *Perceptual Mechanics Test. Review by Charles 
M. Harsh. 

674. kPrognostic Test of Mechanical Abilities. Ex- 
cerpt from a review. 

675. Purdue Industrial Training Classification 
ae Reviews by D. Welty Lefever and Charles I. 

osier, 

.676. kPurdue Mechanical Adaptability Test. Re» 
views by Jay L. Otis and Dewey B. Stuit. 

677. * Revised Minnesota Paper Formboard Test. Re- 
view by Dewey B. Stuit. 

677a. *SRA Mechanical Aptitudes. 

678. Stenquist Mechanical Aptitude Tests. Review 
by James M. Porter, Jr. 

679. Stenquist Mechanical Assembling Test. Review 
j8o. "kSurvey o echanical Insight. Reviews 
pega H. Bittner, Jay L. Otis, and Shailer Pe y 

1. Survey of Object Visualization. Reviews by 
Charles M. Harsh, Clifford E. Jurgensen, Shailer 
Peterson, and Patrick Slater. 

682. *Survey of Space Relations Ability. Reviews 


Vocations 


by E. G. Chambers, Clifford E. Jurgensen, and James 
M. Porter, Jr. ; 
683. XTest of Mechanicab Com 
D M. Harsh, Lloyd 
rge A. Satter. 


MISCELLANEOUS 


684. XCancellation Test. Review by J 

685. cred Interviewers Guide. 
Clyde H. mbs and Douglas H. Fryer. " 

686. xEmployee Evaluation Form for Interviewers. 
Reviews by Douglas.H. Fryer and C. H. Ruedisili; 
and excerpts from reviews by Kenneth Byers and 
Martin L. Reymert. 3 

687. kHow Supervise? Reviews by D. Welty Le- 
fever, Charles I. Mosier, and C. H. Ruedisili. 

688. x/nteraction Chronograph. 

689. kMinnesota Occupational Rating Scales and 
Counseling Profile. Review by M. H. Elliott, 

690. Personnel Selection and Classification Test. 
Reviews by Dorothy C. Adkins, George K. Bennett, 
and George A. Ferguson. 

691. Rating Form for Use of Interviewers and Oral 
Examiners. Review by Douglas H. Fryer. | 

_ 692. *Self-Rating Scale for Leadership Qualifica- 


tions. 
693- *Survey of Company Morale: a Satisfaction 
Blank No. 12. Review by William W. Waite. 

604. XTest of Practical Judgment. Reviews by Glen 
U. Cleeton and Howard R. Taylor. 

695. *Vocational Aptiude Examination, Type E-A, 
Revised 1946. Reviews by D. Welty Lefever and Ben- 
jamin Shimberg. " 


SPECIFIC VOCATIONS 

.696. [American Transit Association Tests.] Re- 
views by Harold G. Seashore, Morris S. Viteles, and 
J. V. Waits. 

697. *Detroit Retail ers Inventory. Reviews by 
Milton E. Hahn and Floyd L. Ruch; and an excerpt 
from a review by William zi Jones and one other 

608. **Engineering and Physical Science Aptitude 
Test. Reviews by Norman Frederiksen and Robert 
M. W. Travers. 

699. *George Washingion University Series of 
Nursing Tests. 

oo. *Probst Service Report. 

701. kPurdue Test for Electricians. Review by John 
W. French. 

702. *Purdue Test for Machinists and Machine Op- 
erators. Review by William W. Waite. 

703. kSales Questionnaire. Review by Robert G. 
Bernreuter. 

704. *Steward Selection. System. Reviews by Mil- 
ton E. Pat par iii: L rae 

705. Test for Ability to Sell: rge Washington 
University Series. Review by Floyd L. Ruch. - 


rehension. Reviews 
. Humphreys, and 


E. King. 
iews by 
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